Monarch geneset OGS2.0

DPOGS201094
TranscriptDPOGS201094-TA1068 bp
ProteinDPOGS201094-PA355 aa
Genomic positionDPSCF300185 + 337428-345771
RNAseq coverage2979x (Rank: top 4%)
Annotation
HeliconiusHMEL0079985e-17394.14% 
BombyxBGIBMGA007160-TA2e-8088.46% 
Drosophilal(1)G0156-PB8e-16473.74% 
EBI UniRef50UniRef50_B4GW561e-14670.51%GL14780 n=4 Tax=Endopterygota RepID=B4GW56_DROPE
NCBI RefSeqXP_001656452.15e-16778.99%isocitrate dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|3323756305e-16979.66%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323756301e-16280.57%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00002874.7e-220magnesium ion binding
GO:00166164.7e-220oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00512874.7e-220NAD binding
GO:00551144.7e-220oxidation-reduction process
GO:00060996e-148tricarboxylic acid cycle
GO:00044496e-148isocitrate dehydrogenase (NAD+) activity
KEGG pathwaydme:Dmel_CG122331e-165 
 K00030 (IDH3)maps-> Citrate cycle (TCA cycle)
InterPro domain[2-354] IPR0018044.7e-220Isocitrate/isopropylmalate dehydrogenase
[23-354] IPR0044346e-148Isocitrate dehydrogenase NAD-dependent
[24-354] IPR0240842.3e-135Isopropylmalate dehydrogenase-like domain
Orthology groupMCL13179 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201094-TA
ATGGCAGCGAGAATAATAAGAAAAATCATCCCCGCGGGCCGGGCTGGTTCCGCTCAATACAGCACGGGGGTCCGTAAGGTGACACTCATCCCTGGACACGGGATCGGCCCCGAGATCACGGTAGCTGTACAGAAAATATTCGAAGCCGCCAAAGTACCCATTGAATGGGACGAGGTGGACGTGACAGCTGTTAGAGGTCCGGATGGCAAATTCGGTATTCCCCAACGTGCCATTGATTCTGTCAATGCCAACAAGATAGGCCTCAAAGGTCCCCTGATGACCCCAGTGGGTAAAGGATACAGGTCGTTGAACTTGGCTTTAAGAAAAGAGTTTGATCTGTATGCCAATGTGAGGCCTTGCAAGAGTTTAGACGGCATCAAGACTCTGTACGACAACGTGGACGTGGTGACCATCAGGGAGAACACGGAGGGAGAGTACTCCGGCATAGAACACGAGATAGTGGACGGCGTGGTTCAGTCCATCAAGCTCATCACAGAGGAGGCCAGCAAGAGAGTGGCGGAGTTTGCGTTCACCTTCGCCAGGGACAACAAGCGGAAGAAGGTCACCGCCGTGCACAAAGCCAACATCATGCGTATGTCGGACGGCCTGTTCCTGCGCTGCTGCCGCGAGCTGGCCACGCAGTTCCCCGACATCAAGTTCGAGGAGCGCTACCTGGACACCGTCTGCCTCAACATGGTCCAGGACCCGTCCAAATTTGACGTACTCGTGATGCCGAATCTGTACGGAGACATCATGTCGGACATGTGCTCCGGCCTGGTGGGGGGCCTCGGCCTCACGCCGTCCGGGAACATCGGCAAGAACGGAGCCCTGTTTGAATCGGTTCATGGCACAGCCCCGGCCATAGCCGGCCAGGATAAGGCCAATCCCACAGCACTCCTCCTGTCCGGAGTCATGATGCTGAGATACATGAAGCTGGAAGACATCGCCGACAGAATAGAAACAGCCTGCTTCACCGTCCTGAAGGAAGGGAGAGTCCTCACCGAGGACCTCGGGGGAAAGAGCTCATGCACGGAGTACACCAACGAGATCATCAAACATCTGTACTAG

Protein sequence:

>DPOGS201094-PA
MAARIIRKIIPAGRAGSAQYSTGVRKVTLIPGHGIGPEITVAVQKIFEAAKVPIEWDEVDVTAVRGPDGKFGIPQRAIDSVNANKIGLKGPLMTPVGKGYRSLNLALRKEFDLYANVRPCKSLDGIKTLYDNVDVVTIRENTEGEYSGIEHEIVDGVVQSIKLITEEASKRVAEFAFTFARDNKRKKVTAVHKANIMRMSDGLFLRCCRELATQFPDIKFEERYLDTVCLNMVQDPSKFDVLVMPNLYGDIMSDMCSGLVGGLGLTPSGNIGKNGALFESVHGTAPAIAGQDKANPTALLLSGVMMLRYMKLEDIADRIETACFTVLKEGRVLTEDLGGKSSCTEYTNEIIKHLY-