DPGLEAN16796 in OGS1.0

New model in OGS2.0DPOGS205472 
Genomic Positionscaffold224:+ 2947-17520
See gene structure
CDS Length3675
Paired RNAseq reads  9953
Single RNAseq reads  25153
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008442 (0.0)
Best Drosophila hit  CG1516, isoform M (0.0)
Best Human hitpyruvate carboxylase, mitochondrial precursor (0.0)
Best NR hit (blastp)  AGAP004742-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP004742-PA [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms






  
GO:0004736 pyruvate carboxylase activity
GO:0005759 mitochondrial matrix
GO:0006090 pyruvate metabolic process
GO:0005524 ATP binding
GO:0009374 biotin binding
GO:0006094 gluconeogenesis
GO:0005811 lipid particle
GO:0005875 microtubule associated complex
InterPro families















  
IPR016185 PreATP-grasp-like fold
IPR011054 Rudiment single hybrid motif
IPR011053 Single hybrid motif
IPR005482 Biotin carboxylase, C-terminal
IPR005930 Pyruvate carboxylase
IPR013817 Pre-ATP-grasp fold
IPR013815 ATP-grasp fold, subdomain 1
IPR013816 ATP-grasp fold, subdomain 2
IPR013785 Aldolase-type TIM barrel
IPR001882 Biotin-binding site
IPR005479 Carbamoyl-phosphate synthetase, large subunit, ATP-binding
IPR000089 Biotin/lipoyl attachment
IPR011761 ATP-grasp fold
IPR011764 Biotin carboxylation domain
IPR000891 Pyruvate carboxyltransferase
IPR003379 Carboxylase, conserved domain
IPR005481 Carbamoyl-phosphate synthase, large subunit, N-terminal
Orthology groupMCL14813

Nucleotide sequence:

ATGCAGATACTTAAAGCAAGGTATGCCATAAGGGCGACAACCTCACACTTACAAGCATGG
AATTCTGCGAGAAACAGGAATGCAACTACACAATCAAAAACCGTAGACTACAAGCCCATC
CGTAGCGTGCTGGTTGCTAATAGAGGTGAAATAGCCATACGGGTGTTCAGAGCATGCACG
GAGTTGGGGATTCGATCAGTCGCTATATACAGCGAACAGGATCGACTACAAATGCACAGA
CAAAAAGCCGACGAGTCCTACCTCGTTGGCAAAGGTCTGCCCCCGGTGGAAGCATACCTG
AGCATACCTGAAATCATCAGGGTGGCCAAAGAAAACGACGTAGATGCTGTACATCCAGGA
TATGGTCTGCTCTCAGAGAGATCAGACTTCGCGGAGGCCGTCATTAAGGCAGGGCTTCGT
TTCATCGGGCCCTCGCCGTTCGTTGTTCAGCAGATGGGGGATAAAGTTGCCGCTAGAAAG
GCCGCCATTGAAGCCAAGGTCCCAATCGTTCCTGGTACCGACGGTCCCATTACGACGAAG
GAAGAAGCTCTCGAATTTTGCAAACAACATGGTCTCCCTGTTATATTTAAGGCAGCCTAC
GGTGGTGGCGGCCGTGGTATGCGAGTGGTCCGCGAAATGAGTGAGGTGGCGTCGTCTTTC
GAGCGAGCCTCCTCTGAAGCCTTAGGGGCCTTCGGGAACGGATCCATGTTCATAGAAAGA
TTCATAGAGAGACCCAGACATATTGAAGTGCAGCTTTTGGGCGACAAAGCCGGTAACGTA
GTGCACTTGTACGAACGAGACTGCTCGGTACAACGGCGACATCAGAAGGTTGTTGAAATA
GCGCCCGCCCCTGGACTAGACCCTGAGATTCGTAATCGCATGACGGATTGTGCCGTGCAT
CTCGCACGCCACGTGGGCTACGAGAACGCTGGCACGGTGGAGTTCCTCTTGGACGAGAAA
GGAAACTTCTACTTCATAGAAGTCAACGCTAGATTGCAAGTAGAGCACACAATAACAGAG
GAAGTTACCGGGATAGATCTCGTCCAATCCCAGATCAGAGTCGCCGAAGGTATGACTCTA
CCAGAGATGGGATTGACCCAAGATAACATTAAGGCTCAAGGATACGCCATACAATGCAGA
GTCACTACCGAAGACCCCGCCAATAACTTCCAGCCTAGCACTGGCAGGATTGAAGTATTC
AGATCTGGAGAGGGTATGGGCATCCGTTTAGACTCAGCGTCGACCTACGCTGGCGCCATA
ATATCACCATACTACGACTCGCTTCTTGTTAAGGTCATCTCCCACGCCCAAGACCTGTCT
TCATCAGCCGCTAAGATGAATCGAGCGTTACGAGAGTTCCGTATACGAGGGGTCAAGACC
AACATACCGTTCCTGCTGAATGTGCTCGAAAACCAAAAGTTCTTGAACGAACGTACTTCG
AAGGCAGCCCTAGTGGGTTCGTCTCAAATTTGGTTAGGTAGAGAGAGGAGCTTGGACGGT
GGAGGTGATTTGGACACGTACTTCATAGACGAACACCCTCGTCTCTTCATGTTCAAGGCG
TCACAGAACAGAGCTCAGAAGATATTGAACTACTTGGGATATGTCCTCGTTAACGGCCCG
GCCACACCACTCGCAACTAAGATACCACCATCGGACGTCAAGCCATACATACCACCGGTA
CCGTTGGACCTTTCACCCGAGGCTATTAAAAAACAAGAATTGACCGGCGAGAACGTAGCG
GTCCAGCCCCCAAAGGGCTTTAAGGCGATCCTGAACGAAGGCGGTCCGGAAGCCTTCGCT
AAAGCGGTTCGAGAGCACAAGGGTCTATTATTAATGGACACTACATACAGAGACGCTCAT
CAGTCCCTCTTGGCCACCAGAGTTAGATCCCACGATCTTCTTACAGTGTCGCCATATGTG
GCCCATAACTTCAGCAATTTATACTCCCTTGAGAACTGGGGCGGCGCTACCTTCGACGTG
GCTTTGCGATTCCTTCATGAATGTCCTTGGGAACGTCTCGAAGACATGCGTCGGTTGATA
CCAAACATTCCCTTCCAAATGTTACTCCGCGGAGCCAACGCGGTCGGTTACACCAACTAT
CCAGATAATGTCGTCTTCAAGTTTTGTGAAATGGCTGTGAAATCCGGAATGGACATCTTC
CGTGTCTTTGACTCCTTGAACTATCTGCCGAATCTGATCCTGGGTATGGACGCGGCGGGC
AAGGCCGGGGGGGTGGTGGAAGCTGCCATATCATACACCGGAGACGTCTCCGATCCGAAC
AAAACGAAATATAACCTGAAGTACTACTGCGATCTAGCTGACGAACTCGTCAAGGCGGGG
ACACACGTCCTCGGCATTAAAGATATGGCTGGACTTTTAAAACCGCAGGCTGCTAAACTT
CTGATAACCGCTATCCGTGATAAGCACCCATCCGTGCCGATCCACGTCCACACCCACGAC
ACTTCCGGTGCGGGCGTCGCGGCCATGTTGGCGTGCGCTGAGGCCGGTGCTGACGTGGTC
GACTGCGCCGTAGACTCAATGTCCGGCCTCACCAGCCAGCCCAGTATGGGCGCACTTGTC
GCGTCCCTACAAGGAACCAAACTGGATACAGGTATACCTCTGCAGACCGTATCCGAATAT
TCAGCTTACTGGGAACAGGCTCGCACTCTGTACGGGCCGTTCGAGTGCACCGCTACCATG
AAATCAGGAAATGCTGATGTTTACATCAACGAGATTCCCGGCGGTCAATACACGAACCTG
CAGTTCCAGGCCTTCTCGTTGGGCCTGGGAAGTCAATTCGAGGAAGTGAAGAAGGCCTAT
AGGGAAGCGAATCTGCTCCTGGGGGACATTATTAAAGTGACTCCATCATCGAAGGTAGTG
GGTGATCTGGCTCAATTCATGGTTCAGAACAAACTGACCGCTGACGACATCAGGGCGAGG
GCTGAAGAATTATCCTTCCCCAAATCAGTGGTCGAGTTCTTCCAAGGAGCCATTGGCATC
CCTTACGGAGGTTTCCCAGAACCCTTAAGGTCCAAAATCCTCAAGGACATGCCAAGGATA
GAAGGCCGCCCGGGACAGGAACTGCCGCCGCTAGATTTTGACAAACTAAAGGAGGAGTTA
AAGGAGTCTTACCCTGAGATCACAGACCAGGACGTGATGTCATCGGCGATGTATCCTCAA
GTGGCGTCAGACTTCTTCCGTATCCGGGATAAGTACGGCCCAGTCAAACACCTCGACACG
AAGACTTTCCTCGTTGGTCCGGCGGTCGGTGAAACCATTGAAGTTAAAATCGAGAGAGGC
AAAACACTGGATATAAAAACATTAGCAGTATCCGAGGAAATGACAGCGGCCGGTGAGAGG
GAAGTGTTCTTTGAACTCAACGGACAACTGAGATCTGTGTTCATCAGAGATGACAACGCT
AGCAAGGAAATGAAAATACATCCGAAGGCTGTTAAAGGAGATAAGAACCAAGTCGGCGCA
CCCATGCCTGGGACAGTGCTAACTCTTAAAGTTAAAGAAGGCGACCACGTGGAGAAAGGC
CAACCAATAGCCGTTCTGTCTGCCATGAAAATGGAGATGATAGTACAAGCGCCCCGCGCT
GGCACTGTGGCCAATGTGGCCATCACTAATGGACAGAAACTGGAGGGCGATGACCTCATC
TGCACCCTAGAGTAA

Protein sequence:

MQILKARYAIRATTSHLQAWNSARNRNATTQSKTVDYKPIRSVLVANRGEIAIRVFRACT
ELGIRSVAIYSEQDRLQMHRQKADESYLVGKGLPPVEAYLSIPEIIRVAKENDVDAVHPG
YGLLSERSDFAEAVIKAGLRFIGPSPFVVQQMGDKVAARKAAIEAKVPIVPGTDGPITTK
EEALEFCKQHGLPVIFKAAYGGGGRGMRVVREMSEVASSFERASSEALGAFGNGSMFIER
FIERPRHIEVQLLGDKAGNVVHLYERDCSVQRRHQKVVEIAPAPGLDPEIRNRMTDCAVH
LARHVGYENAGTVEFLLDEKGNFYFIEVNARLQVEHTITEEVTGIDLVQSQIRVAEGMTL
PEMGLTQDNIKAQGYAIQCRVTTEDPANNFQPSTGRIEVFRSGEGMGIRLDSASTYAGAI
ISPYYDSLLVKVISHAQDLSSSAAKMNRALREFRIRGVKTNIPFLLNVLENQKFLNERTS
KAALVGSSQIWLGRERSLDGGGDLDTYFIDEHPRLFMFKASQNRAQKILNYLGYVLVNGP
ATPLATKIPPSDVKPYIPPVPLDLSPEAIKKQELTGENVAVQPPKGFKAILNEGGPEAFA
KAVREHKGLLLMDTTYRDAHQSLLATRVRSHDLLTVSPYVAHNFSNLYSLENWGGATFDV
ALRFLHECPWERLEDMRRLIPNIPFQMLLRGANAVGYTNYPDNVVFKFCEMAVKSGMDIF
RVFDSLNYLPNLILGMDAAGKAGGVVEAAISYTGDVSDPNKTKYNLKYYCDLADELVKAG
THVLGIKDMAGLLKPQAAKLLITAIRDKHPSVPIHVHTHDTSGAGVAAMLACAEAGADVV
DCAVDSMSGLTSQPSMGALVASLQGTKLDTGIPLQTVSEYSAYWEQARTLYGPFECTATM
KSGNADVYINEIPGGQYTNLQFQAFSLGLGSQFEEVKKAYREANLLLGDIIKVTPSSKVV
GDLAQFMVQNKLTADDIRARAEELSFPKSVVEFFQGAIGIPYGGFPEPLRSKILKDMPRI
EGRPGQELPPLDFDKLKEELKESYPEITDQDVMSSAMYPQVASDFFRIRDKYGPVKHLDT
KTFLVGPAVGETIEVKIERGKTLDIKTLAVSEEMTAAGEREVFFELNGQLRSVFIRDDNA
SKEMKIHPKAVKGDKNQVGAPMPGTVLTLKVKEGDHVEKGQPIAVLSAMKMEMIVQAPRA
GTVANVAITNGQKLEGDDLICTLE