Monarch geneset OGS2.0

DPOGS202452
TranscriptDPOGS202452-TA4152 bp
ProteinDPOGS202452-PA1383 aa
Genomic positionDPSCF300174 - 162888-186058
RNAseq coverage2336x (Rank: top 5%)
Annotation
HeliconiusHMEL0101250.091.04% 
BombyxBGIBMGA009974-TA0.091.69% 
DrosophilaNc73EF-PI0.071.38% 
EBI UniRef50UniRef50_Q9ULD00.066.20%2-oxoglutarate dehydrogenase-like, mitochondrial n=87 Tax=Bilateria RepID=OGDHL_HUMAN
NCBI RefSeqXP_973425.20.081.23%PREDICTED: similar to 2-oxoglutarate dehydrogenase [Tribolium castaneum]
NCBI nr blastpgi|1892371410.081.23%PREDICTED: similar to 2-oxoglutarate dehydrogenase [Tribolium castaneum]
NCBI nr blastxgi|1892371410.081.93%PREDICTED: similar to 2-oxoglutarate dehydrogenase [Tribolium castaneum]
Group
Gene OntologyGO:00060962.3e-270glycolysis
GO:00045912.3e-270oxoglutarate dehydrogenase (succinyl-transferring) activity
GO:00551142.3e-270oxidation-reduction process
GO:00309762.3e-270thiamine pyrophosphate binding
GO:00081524.7e-75metabolic process
GO:00166244.7e-75oxidoreductase activity, acting on the aldehyde or oxo group of donors, disulfide as acceptor
KEGG pathwaytca:6622190.0 
 K00164 (OGDH, sucA)maps-> Citrate cycle (TCA cycle)
    Tryptophan metabolism
    Lysine degradation
InterPro domain[5-1379] IPR01160302-oxoglutarate dehydrogenase, E1 component
[133-453] IPR0010174.7e-75Dehydrogenase, E1 component
[984-1199] IPR0054751.5e-66Transketolase-like, pyrimidine-binding domain
Orthology groupMCL10198 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202452-TA
ATGCCTCTTTGCTTACTAATCACTCGTTCTCGGGGTCACCTGGTTGCTCGTCTCGATCCTCTCGGGATATCCACCGGCGATCCTGTATCGAGCGGGGATTGGCACGGCGGTCTCCGAGCGTTCGCCAGCGAGGCGGTTATTAGGCAACACGTGCGATTCGATGAAGCTGATATGGATAGGGTTTTCAAACTACCCTCGACCACTTTCATCGGCGAGAAGGAAAAGGCGCTACCGTTGAGAGAGATCTTGAATCGTCTGGAACAAGCGTATTGCAATAATATTGGTATTGAATTCATGTTCATAAATTCTTTGGAGCAATGCAATTGGATCAGACAACGTATGGAGCCGCCGAATGTAACGAAGATGAGCAACGACCAGAAGAGATTGATCCTCGCCCGTCTCACTAGATCCACTGGGTTCGAGAATTTCCTGGCCAAGAAATGGTCGTCCGAGAAACGTTTCGGTCTCGAGGGCTGTGAGATTTTGATACCAGCTATGAAGCAGGTCATCGACACCTCCACACGGCTCGGAGTAGAATCCATCATTATGGGAATGCCCCATAGAGGTCGTCTCAATGTGTTGGCCAACGTGTGTCGCAAACCGCTGCATCAGCTGTTCACTCAGTTTGCTGGTTTGGAAGCTGAAGATGACGGTTCCGGCGACGTGAAGTATCACCTGGGTACTTATATCGAGCGTCTGAATCGCGTCACAAACAAGAATATCCGTCTGGCTGTATGTGCTAATCCTTCGCATCTGGAGGCCGTTGACCCCGTGGTGCAGGGCAAGACCAGGGCTGAACAGTTCTACCGAGGAGACAACGAAGGAAAGAAGGTGATGTCAATTCTACTTCACGGTGACGCAGCGTTCGTAGGACAGGGCGTTGTGTTCGAGACAATGCATCTCTCGGACTTGCCGGCCTACACCACACACGGCACCATACACATCGTGGTCAACAACCAGATAGGATTCACCACCGACCCGAGACACTCGCGGTCATCAGCTTACTGTACAGACGTCGCCCGTGTGGTGAACGCTCCTATATTCCACGTGAACAGCGACAACCCCGAGGCGGTGATGCACGTTTGTAACGTGGCGGCTGAATGGAGAGCCACCTTCCACAAGGACGTGGTCATTGACATTGTCAGCTACAGGCGGAACGGACACAACGAGGTCGACGAACCCATGTTCACACAGCCCCTCATGTACCAAAAGATTAGGAAAACTAAACCAGTTTTGGAGAAATACGCCGACCAGCTGATCGTTGAGGGCGTCGTGACCGCTGAGGAGGTGAAAGATGTAAAGGACAAATACGAGAAGATCTGCGAGGACGCTTACAACCAGGCTAAGCAGGAAACTCACATCAAATACAAGGACTGGCTCGACTCGCCCTGGTCTGGCTTCTTTGAAGGCAAAGACCCACTCAAGATGTCTCCGACCGGTGTTGTAGAGGAGACTCTAGTGCATATTGGCAAGCGTTTCTCGTCGCCACCACCCAACGCGGCTGAATTCGAAATACACAAGGGTCTGCTTCGTATTCTAAAAGCCAGGATGGAGATGGTTGAGAATAGAACCGTCGACTGGGCTCTGGCCGAGGCTATGGCGTTTGGCTCGCTGTTGAAGGAGGGCATACATGTGAGACTATCGGGAGAGGACGTTGAGAGAGGAACATTCTCTCATAGGCACCACGTGCTGCATCACCAGAAGGTCGACAAGGCCACTTATTGTCCATTGGCTCATCTTTATCCAGACCAAGCTCCTTACACCGTTTGCAACAGTTCATTGTCTGAATACGGTAAGTTCGAGAATTTCCTGGCCAAGAAATGGTCGTCCGAGAAACGTTTCGGTCTCGAGGGCTGTGAGATTTTGATACCAGCTATGAAGCAGGTCATCGACACCTCCACACGGCTCGGAGTAGAATCCATCATTATGGGAATGCCCCATAGAGGTCGTCTCAATGTGTTGGCCAACGTGTGTCGCAAACCGCTGCATCAGCTGTTCACTCAGTTTGCTGGTTTGGAAGCTGAAGATGACGGTTCCGGCGACGTGAAGTATCACCTGGGTACTTATATCGAGCGTCTGAATCGCGTCACAAACAAGAATATCCGTCTGGCTGTATGTGCTAATCCTTCGCATCTGGAGGCCGTTGACCCCGTGGTGCAGGGCAAGACCAGGGCTGAACAGTTCTACCGAGGAGACAACGAAGGAAAGAAGGTGATGTCAATTCTACTTCACGGTGACGCAGCGTTCGTAGGACAGGGCGTTGTGTTCGAGACAATGCATCTCTCGGACTTGCCGGCCTACACCACACACGGCACCATACACATCGTGGTCAACAACCAGATAGGATTCACCACCGACCCGAGACACTCGCGGTCATCAGCTTACTGTACAGACGTCGCCCGTGTGGTGAACGCTCCTATATTCCACGTGAACAGCGACAACCCCGAGGCGGTGATGCACGTTTGTAACGTGGCGGCTGAATGGAGAGCCACCTTCCACAAGGACGTGGTCATTGACATTGTCAGCTACAGGCGGAACGGACACAACGAGGTCGACGAACCCATGTTCACACAGCCCCTCATGTACCAAAAGATTAGGAAAACTAAACCAGTTTTGGAGAAATACGCCGACCAGCTGATCGTTGAGGGCGTCGTGACCGCTGAGGAGGTGAAAGATGTAAAGGACAAATACGAGAAGATCTGCGAGGACGCTTACAACCAGGCTAAGCAGGAAACTCACATCAAATACAAGGACTGGCTCGACTCGCCCTGGTCTGGCTTCTTTGAAGGCAAAGACCCACTCAAGATGTCTCCGACCGGTGTTGTAGAGGAGACTCTAGTGCATATTGGCAAGCGTTTCTCGTCGCCACCACCCAACGCGGCTGAATTCGAAATACACAAGGGTCTGCTTCGTATTCTAAAAGCCAGGATGGAGATGGTTGAGAATAGAACCGTCGACTGGGCTCTGGCCGAGGCTATGGCGTTTGGCTCGCTGTTGAAGGAGGGCATACATGTGAGACTATCGGGAGAGGACGTTGAGAGAGGAACATTCTCTCATAGGCACCACGTGCTGCATCACCAGAAGGTCGACAAGGCCACTTATTGTCCATTGGCTCATCTTTATCCAGACCAAGCTCCCTACACCGTTTGCAACAGTTCATTGTCTGAATACGGTGTGTTGGGCTTCGAAGTGGGTTACTCAGTAACAAACCCGAACGCTCTAGTGCTATGGGAGGCTCAGTTTGGTGACTTCAACAATGTGGCTCAGTGTATCATTGACCAGTTCATATCAAGCGGGCAGGCCAAGTGGGTCAGGCAGTCAGGCATAGTACTGCTTCAACCTCATGGAATGGAGGGCATGGGCCCCGAACATTCTTCAGCTCGTTTGGAGCGCTTCTTGCAAATGAGCTCCGACGACCCCGACTACATGCCGCCGGAGAGTCCTGACTACGAAGTCCGTCAGCTCCACGACTGCAACTGGATAGTAGCGAACTGTTCGACCCCGGCCTCCTTGTTCCACATCCTCCGTCGGCAGATCGCTCTGCCTTTCCGTAAACCTCTAATATTGATGACTCCCAAATCACTACTGAGACATCCCGAATGCAAATCTTCGTTCGACGATATGGTCGATGGAACCACATTCAAAAGATTGATTCCCGAAGAGGGTCCGGCGTCCGAGAACCCGTCTAACGTCCGCAAGCTTGCTTTCTGCTCCGGACGTGTTTACTACGACCTGCTGAAACAGAGGAGAGACCGCGGACTGGAGAAGGATATAGCCATAGCCAGACTGGAGCAGATCTCGCCGTTCCCATACGATCTGATCAAGGCTGAGATCGCAAAGTATCCGAACGCTCAGCTGGTCTGGAGTCAGGAGGAACACAAGAACATGGGCTCCTGGAGTTACATCGAGCCGAGGTTCCGTACACTGCTGCACAACCAGAAACAGATCTGGCCAATATCCAACTCTCGCGGCGGTTGGTTTAGTCAACTTTTCGGCAAACCTGAGCCGGCTCAGACGGACACACAGGCGGAAACAGTACCGCGTACTATTAGCTACAACGGTCGCGCCACGGCCGCGTCGCCGGCCACCGGCTCTAAGGCCGCCCACAACAAGGAACTCCGGAACCTGCTGGAAGAGTTCTGCGTCCTATAA

Protein sequence:

>DPOGS202452-PA
MPLCLLITRSRGHLVARLDPLGISTGDPVSSGDWHGGLRAFASEAVIRQHVRFDEADMDRVFKLPSTTFIGEKEKALPLREILNRLEQAYCNNIGIEFMFINSLEQCNWIRQRMEPPNVTKMSNDQKRLILARLTRSTGFENFLAKKWSSEKRFGLEGCEILIPAMKQVIDTSTRLGVESIIMGMPHRGRLNVLANVCRKPLHQLFTQFAGLEAEDDGSGDVKYHLGTYIERLNRVTNKNIRLAVCANPSHLEAVDPVVQGKTRAEQFYRGDNEGKKVMSILLHGDAAFVGQGVVFETMHLSDLPAYTTHGTIHIVVNNQIGFTTDPRHSRSSAYCTDVARVVNAPIFHVNSDNPEAVMHVCNVAAEWRATFHKDVVIDIVSYRRNGHNEVDEPMFTQPLMYQKIRKTKPVLEKYADQLIVEGVVTAEEVKDVKDKYEKICEDAYNQAKQETHIKYKDWLDSPWSGFFEGKDPLKMSPTGVVEETLVHIGKRFSSPPPNAAEFEIHKGLLRILKARMEMVENRTVDWALAEAMAFGSLLKEGIHVRLSGEDVERGTFSHRHHVLHHQKVDKATYCPLAHLYPDQAPYTVCNSSLSEYGKFENFLAKKWSSEKRFGLEGCEILIPAMKQVIDTSTRLGVESIIMGMPHRGRLNVLANVCRKPLHQLFTQFAGLEAEDDGSGDVKYHLGTYIERLNRVTNKNIRLAVCANPSHLEAVDPVVQGKTRAEQFYRGDNEGKKVMSILLHGDAAFVGQGVVFETMHLSDLPAYTTHGTIHIVVNNQIGFTTDPRHSRSSAYCTDVARVVNAPIFHVNSDNPEAVMHVCNVAAEWRATFHKDVVIDIVSYRRNGHNEVDEPMFTQPLMYQKIRKTKPVLEKYADQLIVEGVVTAEEVKDVKDKYEKICEDAYNQAKQETHIKYKDWLDSPWSGFFEGKDPLKMSPTGVVEETLVHIGKRFSSPPPNAAEFEIHKGLLRILKARMEMVENRTVDWALAEAMAFGSLLKEGIHVRLSGEDVERGTFSHRHHVLHHQKVDKATYCPLAHLYPDQAPYTVCNSSLSEYGVLGFEVGYSVTNPNALVLWEAQFGDFNNVAQCIIDQFISSGQAKWVRQSGIVLLQPHGMEGMGPEHSSARLERFLQMSSDDPDYMPPESPDYEVRQLHDCNWIVANCSTPASLFHILRRQIALPFRKPLILMTPKSLLRHPECKSSFDDMVDGTTFKRLIPEEGPASENPSNVRKLAFCSGRVYYDLLKQRRDRGLEKDIAIARLEQISPFPYDLIKAEIAKYPNAQLVWSQEEHKNMGSWSYIEPRFRTLLHNQKQIWPISNSRGGWFSQLFGKPEPAQTDTQAETVPRTISYNGRATAASPATGSKAAHNKELRNLLEEFCVL-