Monarch geneset OGS2.0

DPOGS205472
TranscriptDPOGS205472-TA3723 bp
ProteinDPOGS205472-PA1240 aa
Genomic positionDPSCF300166 + 38447-53020
RNAseq coverage1884x (Rank: top 7%)
Annotation
HeliconiusHMEL0175240.090.49% 
BombyxBGIBMGA008442-TA0.085.47% 
DrosophilaCG1516-PJ0.077.75% 
EBI UniRef50UniRef50_P114980.065.13%Pyruvate carboxylase, mitochondrial n=557 Tax=root RepID=PYC_HUMAN
NCBI RefSeqXP_001689096.10.077.54%AGAP004742-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582979620.077.54%AGAP004742-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|5504860.077.05%pyruvate carboxylase [Aedes aegypti]
Group
Gene OntologyGO:00047360pyruvate carboxylase activity
GO:00040750biotin carboxylase activity
GO:00060940gluconeogenesis
GO:00081521.5e-131metabolic process
GO:00038241.5e-131catalytic activity
GO:00055241.4e-99ATP binding
GO:00168741.4e-99ligase activity
KEGG pathwayaga:AgaP_AGAP0047420.0 
 K01958 (PC, pyc)maps-> Citrate cycle (TCA cycle)
    Pyruvate metabolism
InterPro domain[42-1022] IPR0059300Pyruvate carboxylase
[581-869] IPR0137851.5e-131Aldolase-type TIM barrel
[242-492] IPR0138161.4e-99ATP-grasp fold, subdomain 2
[154-362] IPR0054791.2e-79Carbamoyl-phosphate synthetase, large subunit, ATP-binding
[378-485] IPR0054822.3e-51Biotin carboxylase, C-terminal
[882-1019] IPR0033792.4e-49Carboxylase, conserved domain
[40-167] IPR0138174.4e-45Pre-ATP-grasp fold
[5-152] IPR0161858.7e-44PreATP-grasp-like fold
[373-490] IPR0110542.4e-41Rudiment single hybrid motif
[40-146] IPR0054811.9e-36Carbamoyl-phosphate synthase, large subunit, N-terminal
[624-841] IPR0008913.9e-28Pyruvate carboxyltransferase
[168-241] IPR0138158.4e-24ATP-grasp fold, subdomain 1
[1157-1240] IPR0110531.8e-22Single hybrid motif
[1172-1239] IPR0000891.1e-20Biotin/lipoyl attachment
Orthology groupMCL13808 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205472-TA
ATGCAGATACTTAAAGCAAGGTATGCCATAAGGGCGACAACCTCACACTTACAAGCATGGAATTCTGCGAGAAACAGGAATGCAACTACACAATCAAAAACCGTAGACTACAAGCCCATCCGTAGCGTGCTGGTTGCTAATAGAGGTGAAATAGCCATACGGGTGTTCAGAGCATGCACGGAGTTGGGGATTCGATCAGTCGCTATATACAGCGAACAGGATCGACTACAAATGCACAGACAAAAAGCCGACGAGTCCTACCTCGTTGGCAAAGGTCTGCCCCCGGTGGAAGCATACCTGAGCATACCTGAAATCATCAGGGTGGCCAAAGAAAACGACGTAGATGCTGTACATCCAGGATATGGTCTGCTCTCAGAGAGATCAGACTTCGCGGAGGCCGTCATTAAGGCAGGGCTTCGTTTCATCGGGCCCTCGCCGTTCGTTGTTCAGCAGATGGGGGATAAAGTTGCCGCTAGAAAGGCCGCCATTGAAGCCAAGGTCCCAATCGTTCCTGGTACCGACGGTCCCATTACGACGAAGGAAGAAGCTCTCGAATTTTGCAAACAACATGGTCTCCCTGTTATATTTAAGGCAGCCTACGGTGGTGGCGGCCGTGGTATGCGAGTGGTCCGCGAAATGAGTGAGGTGGCGTCGTCTTTCGAGCGAGCCTCCTCTGAAGCCTTAGGGGCCTTCGGGAACGGATCCATGTTCATAGAAAGATTCATAGAGAGACCCAGACATATTGAAGTGCAGCTTTTGGGCGACAAAGCCGGTAACGTAGTGCACTTGTACGAACGAGACTGCTCGGTACAACGGCGACATCAGAAGGTTGTTGAAATAGCGCCCGCCCCTGGACTAGACCCTGAGATTCGTAATCGCATGACGGATTGTGCCGTGCATCTCGCACGCCACGTGGGCTACGAGAACGCTGGCACGGTGGAGTTCCTCTTGGACGAGAAAGGAAACTTCTACTTCATAGAAGTCAACGCTAGATTGCAAGTAGAGCACACAATAACAGAGGAAGTTACCGGGATAGATCTCGTCCAATCCCAGATCAGAGTCGCCGAAGGTATGACTCTACCAGAGATGGGATTGACCCAAGATAACATTAAGGCTCAAGGATACGCCATACAATGCAGAGTCACTACCGAAGACCCCGCCAATAACTTCCAGCCTAGCACTGGCAGGATTGAAGTATTCAGATCTGGAGAGGGTATGGGCATCCGTTTAGACTCAGCGTCGACCTACGCTGGCGCCATAATATCACCATACTACGACTCGCTTCTTGTTAAGGTCATCTCCCACGCCCAAGACCTGTCTTCATCAGCCGCTAAGATGAATCGAGCGTTACGAGAGTTCCGTATACGAGGGGTCAAGACCAACATACCGTTCCTGCTGAATGTGCTCGAAAACCAAAAGTTCTTGAACGGTGATTTGGACACGTACTTCATAGACGAACACCCTCGTCTCTTCATGTTCAAGGCGTCACAGAACAGAGCTCAGAAGATATTGAACTACTTGGGATATGTCCTCGTTAACGGCCCGGCCACACCACTCGCAACTAAGATACCACCATCGGACGTCAAGCCATACATACCACCGGTACCGTTGGACCTTTCACCCGAGGCTATTAAAAAACAAGAATTGACCGGCGAGAACGTAGCGGTCCAGCCCCCAAAGGGCTTTAAGGCGATCCTGAACGAAGGCGGTCCGGAAGCCTTCGCTAAAGCGGTTCGAGAGCACAAGGGTCTATTATTAATGGACACTACATACAGAGACGCTCATCAGTCCCTCTTGGCCACCAGAGTTAGATCCCACGATCTTCTTACAGTGTCGCCATATGTGGCCCATAACTTCAGCAATTTATACTCCCTTGAGAACTGGGGCGGCGCTACCTTCGACGTGGCTTTGCGATTCCTTCATGAATGTCCTTGGGAACGTCTCGAAGACATGCGTCGGTTGATACCAAACATTCCCTTCCAAATGTTACTCCGCGGAGCCAACGCGGTCGGTTACACCAACTATCCAGATAATGTCGTCTTCAAGTTTTGTGAAATGGCTGTGAAATCCGGAATGGACATCTTCCGTGTCTTTGACTCCTTGAACTATCTGCCGAATCTGATCCTGGGTATGGACGCGGCGGGCAAGGCCGGGGGGGTGGTGGAAGCTGCCATATCATACACCGGAGACGTCTCCGATCCGAACAAAACGAAATATAACCTGAAGTACTACTGCGATCTAGCTGACGAACTCGTCAAGGCGGGGACACACGTCCTCGGCATTAAAGATATGGCTGGACTTTTAAAACCGCAGGCTGCTAAACTTCTGATAACCGCTATCCGTGATAAGCACCCATCCGTGCCGATCCACGTCCACACCCACGACACTTCCGGTGCGGGCGTCGCGGCCATGTTGGCGTGCGCTGAGGCCGGTGCTGACGTGGTCGACTGCGCCGTAGACTCAATGTCCGGCCTCACCAGCCAGCCCAGTATGGGCGCACTTGTCGCGTCCCTACAAGGAACCAAACTGGATACAGGTATACCTCTGCAGACCGTATCCGAATATTCAGCTTACTGGGAACAGGCTCGCACTCTGTACGGGCCGTTCGAGTGCACCGCTACCATGAAATCAGGAAATGCTGATGTTTACATCAACGAGATTCCCGGCGGTCAATACACGAACCTGCAGTTCCAGGCCTTCTCGTTGGGCCTGGGAAGTCAATTCGAGGAAGTGAAGAAGGCCTATAGGGAAGCGAATCTGCTCCTGGGGGACATTATTAAAGTGACTCCATCATCGAAGGTAGTGGGTGATCTGGCTCAATTCATGGTTCAGAACAAACTGACCGCTGACGACATCAGGGCGAGGGCTGAAGAATTATCCTTCCCCAAATCAGTGGTCGAGTTCTTCCAAGGAGCCATTGGCATCCCTTACGGAGGTTTCCCAGAACCCTTAAGGTCCAAAATCCTCAAGGACATGCCAAGGATAGAAGGCCGCCCGGGACAGGAACTGCCGCCGCTAGATTTTGACAAACTAAAGGAGGAGTTAAAGGAGTCTTACCCTGAGATAACATATTTCAGGTCCAAAATCCTCAAGGACATGCCAAGGATAGAAGGCCGCCCGGGACAGGAACTGCCGCCGCTAGATTTTGACAAACTAAAGGAGGAGTTAAAGGAGTCTTACCCTGAGATCACAGACCAGGACGTGATGTCATCGGCGATGTATCCTCAAGTGGCGTCAGACTTCTTCCGTATCCGGGATAAGTACGGCCCAGTCAAACACCTCGACACGAAGACTTTCCTCGTTGGTCCGGCGGTCGGTGAAACCATTGAAGTTAAAATCGAGAGAGGCAAAACACTGGATATAAAAACATTAGCAGTATCCGAGGAAATGACAGCGGCCGGTGAGAGGGAAGTGTTCTTTGAACTCAACGGACAACTGAGATCTGTGTTCATCAGAGATGACAACGCTAGCAAGGAAATGAAAATACATCCGAAGGCTGTTAAAGGAGATAAGAACCAAGTCGGCGCACCCATGCCTGGGACAGTGCTAACTCTTAAAGTTAAAGAAGGCGACCACGTGGAGAAAGGCCAACCAATAGCCGTTCTGTCTGCCATGAAAATGGAGATGATAGTACAAGCGCCCCGCGCTGGCACTGTGGCCAATGTGGCCATCACTAATGGACAGAAACTGGAGGGCGATGACCTCATCTGCACCCTAGAGTAA

Protein sequence:

>DPOGS205472-PA
MQILKARYAIRATTSHLQAWNSARNRNATTQSKTVDYKPIRSVLVANRGEIAIRVFRACTELGIRSVAIYSEQDRLQMHRQKADESYLVGKGLPPVEAYLSIPEIIRVAKENDVDAVHPGYGLLSERSDFAEAVIKAGLRFIGPSPFVVQQMGDKVAARKAAIEAKVPIVPGTDGPITTKEEALEFCKQHGLPVIFKAAYGGGGRGMRVVREMSEVASSFERASSEALGAFGNGSMFIERFIERPRHIEVQLLGDKAGNVVHLYERDCSVQRRHQKVVEIAPAPGLDPEIRNRMTDCAVHLARHVGYENAGTVEFLLDEKGNFYFIEVNARLQVEHTITEEVTGIDLVQSQIRVAEGMTLPEMGLTQDNIKAQGYAIQCRVTTEDPANNFQPSTGRIEVFRSGEGMGIRLDSASTYAGAIISPYYDSLLVKVISHAQDLSSSAAKMNRALREFRIRGVKTNIPFLLNVLENQKFLNGDLDTYFIDEHPRLFMFKASQNRAQKILNYLGYVLVNGPATPLATKIPPSDVKPYIPPVPLDLSPEAIKKQELTGENVAVQPPKGFKAILNEGGPEAFAKAVREHKGLLLMDTTYRDAHQSLLATRVRSHDLLTVSPYVAHNFSNLYSLENWGGATFDVALRFLHECPWERLEDMRRLIPNIPFQMLLRGANAVGYTNYPDNVVFKFCEMAVKSGMDIFRVFDSLNYLPNLILGMDAAGKAGGVVEAAISYTGDVSDPNKTKYNLKYYCDLADELVKAGTHVLGIKDMAGLLKPQAAKLLITAIRDKHPSVPIHVHTHDTSGAGVAAMLACAEAGADVVDCAVDSMSGLTSQPSMGALVASLQGTKLDTGIPLQTVSEYSAYWEQARTLYGPFECTATMKSGNADVYINEIPGGQYTNLQFQAFSLGLGSQFEEVKKAYREANLLLGDIIKVTPSSKVVGDLAQFMVQNKLTADDIRARAEELSFPKSVVEFFQGAIGIPYGGFPEPLRSKILKDMPRIEGRPGQELPPLDFDKLKEELKESYPEITYFRSKILKDMPRIEGRPGQELPPLDFDKLKEELKESYPEITDQDVMSSAMYPQVASDFFRIRDKYGPVKHLDTKTFLVGPAVGETIEVKIERGKTLDIKTLAVSEEMTAAGEREVFFELNGQLRSVFIRDDNASKEMKIHPKAVKGDKNQVGAPMPGTVLTLKVKEGDHVEKGQPIAVLSAMKMEMIVQAPRAGTVANVAITNGQKLEGDDLICTLE-