New model in OGS2.0 | DPOGS205472  |
---|---|
Genomic Position | scaffold224:+ 2947-17520 |
See gene structure | |
CDS Length | 3675 |
Paired RNAseq reads   | 9953 |
Single RNAseq reads   | 25153 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008442 (0.0) |
Best Drosophila hit   | CG1516, isoform M (0.0) |
Best Human hit | pyruvate carboxylase, mitochondrial precursor (0.0) |
Best NR hit (blastp)   | AGAP004742-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP004742-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0004736 pyruvate carboxylase activity GO:0005759 mitochondrial matrix GO:0006090 pyruvate metabolic process GO:0005524 ATP binding GO:0009374 biotin binding GO:0006094 gluconeogenesis GO:0005811 lipid particle GO:0005875 microtubule associated complex |
InterPro families    | IPR016185 PreATP-grasp-like fold IPR011054 Rudiment single hybrid motif IPR011053 Single hybrid motif IPR005482 Biotin carboxylase, C-terminal IPR005930 Pyruvate carboxylase IPR013817 Pre-ATP-grasp fold IPR013815 ATP-grasp fold, subdomain 1 IPR013816 ATP-grasp fold, subdomain 2 IPR013785 Aldolase-type TIM barrel IPR001882 Biotin-binding site IPR005479 Carbamoyl-phosphate synthetase, large subunit, ATP-binding IPR000089 Biotin/lipoyl attachment IPR011761 ATP-grasp fold IPR011764 Biotin carboxylation domain IPR000891 Pyruvate carboxyltransferase IPR003379 Carboxylase, conserved domain IPR005481 Carbamoyl-phosphate synthase, large subunit, N-terminal |
Orthology group | MCL14813 |
Nucleotide sequence:
ATGCAGATACTTAAAGCAAGGTATGCCATAAGGGCGACAACCTCACACTTACAAGCATGG
AATTCTGCGAGAAACAGGAATGCAACTACACAATCAAAAACCGTAGACTACAAGCCCATC
CGTAGCGTGCTGGTTGCTAATAGAGGTGAAATAGCCATACGGGTGTTCAGAGCATGCACG
GAGTTGGGGATTCGATCAGTCGCTATATACAGCGAACAGGATCGACTACAAATGCACAGA
CAAAAAGCCGACGAGTCCTACCTCGTTGGCAAAGGTCTGCCCCCGGTGGAAGCATACCTG
AGCATACCTGAAATCATCAGGGTGGCCAAAGAAAACGACGTAGATGCTGTACATCCAGGA
TATGGTCTGCTCTCAGAGAGATCAGACTTCGCGGAGGCCGTCATTAAGGCAGGGCTTCGT
TTCATCGGGCCCTCGCCGTTCGTTGTTCAGCAGATGGGGGATAAAGTTGCCGCTAGAAAG
GCCGCCATTGAAGCCAAGGTCCCAATCGTTCCTGGTACCGACGGTCCCATTACGACGAAG
GAAGAAGCTCTCGAATTTTGCAAACAACATGGTCTCCCTGTTATATTTAAGGCAGCCTAC
GGTGGTGGCGGCCGTGGTATGCGAGTGGTCCGCGAAATGAGTGAGGTGGCGTCGTCTTTC
GAGCGAGCCTCCTCTGAAGCCTTAGGGGCCTTCGGGAACGGATCCATGTTCATAGAAAGA
TTCATAGAGAGACCCAGACATATTGAAGTGCAGCTTTTGGGCGACAAAGCCGGTAACGTA
GTGCACTTGTACGAACGAGACTGCTCGGTACAACGGCGACATCAGAAGGTTGTTGAAATA
GCGCCCGCCCCTGGACTAGACCCTGAGATTCGTAATCGCATGACGGATTGTGCCGTGCAT
CTCGCACGCCACGTGGGCTACGAGAACGCTGGCACGGTGGAGTTCCTCTTGGACGAGAAA
GGAAACTTCTACTTCATAGAAGTCAACGCTAGATTGCAAGTAGAGCACACAATAACAGAG
GAAGTTACCGGGATAGATCTCGTCCAATCCCAGATCAGAGTCGCCGAAGGTATGACTCTA
CCAGAGATGGGATTGACCCAAGATAACATTAAGGCTCAAGGATACGCCATACAATGCAGA
GTCACTACCGAAGACCCCGCCAATAACTTCCAGCCTAGCACTGGCAGGATTGAAGTATTC
AGATCTGGAGAGGGTATGGGCATCCGTTTAGACTCAGCGTCGACCTACGCTGGCGCCATA
ATATCACCATACTACGACTCGCTTCTTGTTAAGGTCATCTCCCACGCCCAAGACCTGTCT
TCATCAGCCGCTAAGATGAATCGAGCGTTACGAGAGTTCCGTATACGAGGGGTCAAGACC
AACATACCGTTCCTGCTGAATGTGCTCGAAAACCAAAAGTTCTTGAACGAACGTACTTCG
AAGGCAGCCCTAGTGGGTTCGTCTCAAATTTGGTTAGGTAGAGAGAGGAGCTTGGACGGT
GGAGGTGATTTGGACACGTACTTCATAGACGAACACCCTCGTCTCTTCATGTTCAAGGCG
TCACAGAACAGAGCTCAGAAGATATTGAACTACTTGGGATATGTCCTCGTTAACGGCCCG
GCCACACCACTCGCAACTAAGATACCACCATCGGACGTCAAGCCATACATACCACCGGTA
CCGTTGGACCTTTCACCCGAGGCTATTAAAAAACAAGAATTGACCGGCGAGAACGTAGCG
GTCCAGCCCCCAAAGGGCTTTAAGGCGATCCTGAACGAAGGCGGTCCGGAAGCCTTCGCT
AAAGCGGTTCGAGAGCACAAGGGTCTATTATTAATGGACACTACATACAGAGACGCTCAT
CAGTCCCTCTTGGCCACCAGAGTTAGATCCCACGATCTTCTTACAGTGTCGCCATATGTG
GCCCATAACTTCAGCAATTTATACTCCCTTGAGAACTGGGGCGGCGCTACCTTCGACGTG
GCTTTGCGATTCCTTCATGAATGTCCTTGGGAACGTCTCGAAGACATGCGTCGGTTGATA
CCAAACATTCCCTTCCAAATGTTACTCCGCGGAGCCAACGCGGTCGGTTACACCAACTAT
CCAGATAATGTCGTCTTCAAGTTTTGTGAAATGGCTGTGAAATCCGGAATGGACATCTTC
CGTGTCTTTGACTCCTTGAACTATCTGCCGAATCTGATCCTGGGTATGGACGCGGCGGGC
AAGGCCGGGGGGGTGGTGGAAGCTGCCATATCATACACCGGAGACGTCTCCGATCCGAAC
AAAACGAAATATAACCTGAAGTACTACTGCGATCTAGCTGACGAACTCGTCAAGGCGGGG
ACACACGTCCTCGGCATTAAAGATATGGCTGGACTTTTAAAACCGCAGGCTGCTAAACTT
CTGATAACCGCTATCCGTGATAAGCACCCATCCGTGCCGATCCACGTCCACACCCACGAC
ACTTCCGGTGCGGGCGTCGCGGCCATGTTGGCGTGCGCTGAGGCCGGTGCTGACGTGGTC
GACTGCGCCGTAGACTCAATGTCCGGCCTCACCAGCCAGCCCAGTATGGGCGCACTTGTC
GCGTCCCTACAAGGAACCAAACTGGATACAGGTATACCTCTGCAGACCGTATCCGAATAT
TCAGCTTACTGGGAACAGGCTCGCACTCTGTACGGGCCGTTCGAGTGCACCGCTACCATG
AAATCAGGAAATGCTGATGTTTACATCAACGAGATTCCCGGCGGTCAATACACGAACCTG
CAGTTCCAGGCCTTCTCGTTGGGCCTGGGAAGTCAATTCGAGGAAGTGAAGAAGGCCTAT
AGGGAAGCGAATCTGCTCCTGGGGGACATTATTAAAGTGACTCCATCATCGAAGGTAGTG
GGTGATCTGGCTCAATTCATGGTTCAGAACAAACTGACCGCTGACGACATCAGGGCGAGG
GCTGAAGAATTATCCTTCCCCAAATCAGTGGTCGAGTTCTTCCAAGGAGCCATTGGCATC
CCTTACGGAGGTTTCCCAGAACCCTTAAGGTCCAAAATCCTCAAGGACATGCCAAGGATA
GAAGGCCGCCCGGGACAGGAACTGCCGCCGCTAGATTTTGACAAACTAAAGGAGGAGTTA
AAGGAGTCTTACCCTGAGATCACAGACCAGGACGTGATGTCATCGGCGATGTATCCTCAA
GTGGCGTCAGACTTCTTCCGTATCCGGGATAAGTACGGCCCAGTCAAACACCTCGACACG
AAGACTTTCCTCGTTGGTCCGGCGGTCGGTGAAACCATTGAAGTTAAAATCGAGAGAGGC
AAAACACTGGATATAAAAACATTAGCAGTATCCGAGGAAATGACAGCGGCCGGTGAGAGG
GAAGTGTTCTTTGAACTCAACGGACAACTGAGATCTGTGTTCATCAGAGATGACAACGCT
AGCAAGGAAATGAAAATACATCCGAAGGCTGTTAAAGGAGATAAGAACCAAGTCGGCGCA
CCCATGCCTGGGACAGTGCTAACTCTTAAAGTTAAAGAAGGCGACCACGTGGAGAAAGGC
CAACCAATAGCCGTTCTGTCTGCCATGAAAATGGAGATGATAGTACAAGCGCCCCGCGCT
GGCACTGTGGCCAATGTGGCCATCACTAATGGACAGAAACTGGAGGGCGATGACCTCATC
TGCACCCTAGAGTAA
Protein sequence:
MQILKARYAIRATTSHLQAWNSARNRNATTQSKTVDYKPIRSVLVANRGEIAIRVFRACT
ELGIRSVAIYSEQDRLQMHRQKADESYLVGKGLPPVEAYLSIPEIIRVAKENDVDAVHPG
YGLLSERSDFAEAVIKAGLRFIGPSPFVVQQMGDKVAARKAAIEAKVPIVPGTDGPITTK
EEALEFCKQHGLPVIFKAAYGGGGRGMRVVREMSEVASSFERASSEALGAFGNGSMFIER
FIERPRHIEVQLLGDKAGNVVHLYERDCSVQRRHQKVVEIAPAPGLDPEIRNRMTDCAVH
LARHVGYENAGTVEFLLDEKGNFYFIEVNARLQVEHTITEEVTGIDLVQSQIRVAEGMTL
PEMGLTQDNIKAQGYAIQCRVTTEDPANNFQPSTGRIEVFRSGEGMGIRLDSASTYAGAI
ISPYYDSLLVKVISHAQDLSSSAAKMNRALREFRIRGVKTNIPFLLNVLENQKFLNERTS
KAALVGSSQIWLGRERSLDGGGDLDTYFIDEHPRLFMFKASQNRAQKILNYLGYVLVNGP
ATPLATKIPPSDVKPYIPPVPLDLSPEAIKKQELTGENVAVQPPKGFKAILNEGGPEAFA
KAVREHKGLLLMDTTYRDAHQSLLATRVRSHDLLTVSPYVAHNFSNLYSLENWGGATFDV
ALRFLHECPWERLEDMRRLIPNIPFQMLLRGANAVGYTNYPDNVVFKFCEMAVKSGMDIF
RVFDSLNYLPNLILGMDAAGKAGGVVEAAISYTGDVSDPNKTKYNLKYYCDLADELVKAG
THVLGIKDMAGLLKPQAAKLLITAIRDKHPSVPIHVHTHDTSGAGVAAMLACAEAGADVV
DCAVDSMSGLTSQPSMGALVASLQGTKLDTGIPLQTVSEYSAYWEQARTLYGPFECTATM
KSGNADVYINEIPGGQYTNLQFQAFSLGLGSQFEEVKKAYREANLLLGDIIKVTPSSKVV
GDLAQFMVQNKLTADDIRARAEELSFPKSVVEFFQGAIGIPYGGFPEPLRSKILKDMPRI
EGRPGQELPPLDFDKLKEELKESYPEITDQDVMSSAMYPQVASDFFRIRDKYGPVKHLDT
KTFLVGPAVGETIEVKIERGKTLDIKTLAVSEEMTAAGEREVFFELNGQLRSVFIRDDNA
SKEMKIHPKAVKGDKNQVGAPMPGTVLTLKVKEGDHVEKGQPIAVLSAMKMEMIVQAPRA
GTVANVAITNGQKLEGDDLICTLE