Monarch geneset OGS2.0

DPOGS210696
TranscriptDPOGS210696-TA2751 bp
ProteinDPOGS210696-PA916 aa
Genomic positionDPSCF300013 - 535151-546986
RNAseq coverage91x (Rank: top 63%)
Annotation
HeliconiusHMEL0220360.075.61% 
BombyxBGIBMGA006311-TA0.069.87% 
DrosophilaCG1544-PA0.052.79% 
EBI UniRef50UniRef50_Q7QFL50.053.41%AGAP000551-PA n=5 Tax=Endopterygota RepID=Q7QFL5_ANOGA
NCBI RefSeqXP_001650884.10.054.48%2-oxoglutarate dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|1571099310.054.48%2-oxoglutarate dehydrogenase [Aedes aegypti]
NCBI nr blastxgi|1571099310.054.37%2-oxoglutarate dehydrogenase [Aedes aegypti]
Group
Gene OntologyGO:00060961.5e-256glycolysis
GO:00045911.5e-256oxoglutarate dehydrogenase (succinyl-transferring) activity
GO:00551141.5e-256oxidation-reduction process
GO:00309761.5e-256thiamine pyrophosphate binding
GO:00081527.1e-39metabolic process
GO:00166247.1e-39oxidoreductase activity, acting on the aldehyde or oxo group of donors, disulfide as acceptor
KEGG pathway 
InterPro domain[27-917] IPR01160302-oxoglutarate dehydrogenase, E1 component
[566-769] IPR0054758.1e-57Transketolase-like, pyrimidine-binding domain
[218-497] IPR0010177.1e-39Dehydrogenase, E1 component
Orthology groupMCL11990 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210696-TA
ATGTTCGCGTTCAATAAAGTTAAACCGTTGATAAGATGGAAGCAGTTGAGGTTAGAGCGAGCGAAATACAACTCCGGAGTGGGAGTGTTCGGTCATCGACCGCGTCAAACTAACGATATTGATGTACCTCAAGAAATAATCTCAAGACGAAACGAAAACTGTCGGGCGCAACAACTAGTCGATGCTTACCGGAAGTACGGCCATCTCCGAGCTACCATAGATAATGTCGACTATGAAAATAAAAACCGGGATATCAAAGAGCTTCATCTATCAAGATATGGTTTATCAGGCTCCGACACGGTTGACTTGGGACTGTTATATGGCCACAATGGAAAACAATTTGCGAACGATCTAGTGGAACAATTGGAGAAAATTTATTGCGGTCCAATATCTTACGAATTCAGTCATCTGGAGACGGAGGCCGAAAGAGAATGGTTCTCACAGAGAGTCGAAAGCGGTTCAGATGTTGTGAGTAAGGAACGTCAGATCGAAATAATAAAAGAACTCCTACACTCACAAGCGTTGGACAAATTCCTATCAACAAAATTGCCATCGGTTAAGAGGTACTGCGGGGAAGGCGCGGAGTCTTTATTAACTTTCCTGTCGACTTTGTTCCGACTGACAGCCTCAGAACAAATACAGCATGTAGTAGTAGCAATGGCACACAGAGGTAAACTCAACGCACTGGGTTGTCTTCTAAAAGTTCCACCAGTGAAGATATTCCACAAGCTAGCCGGCAACCCTGAGTTTCCGGACGAAGCCAACGCGGCTTGCGACATTGCCACTCATTTAAGCGTTTCCAACGACATAACAGTGAATGGCAATACAGTTAGATTCTCTTTGATAAACAATCCATCACATCTCGAGGCCGCCAATTCTGTGTCGATGGGCAAAACGAGGTCGAAGCAATTAAAGTTACGAGAAGGCGACTATTCTGAAAACAGTACTTCACGGTTTGGCGACAAAGTTTTAAATGTTCAGATACACGGGGATGCAGCTTTTGTTGGACAAGGAGTGAATCAGGAGAGCCTTATGTTTTCACAATCACCACACTTCGACGTGGGTGGAAGTTTACATGTCGTAGTTAACAATCAATTAGGATTCACACTCCCAGCGAGCCGAGGACGTTCGAGTCGCTACGTTACTGATTTGGCTAAATCAATAGCTGTCCCGGTTATTCACGTCAATGGAGACTATCCTGAGCTTGTAGAAAAGGCAACGAACATAGCGTTTGAGTATCAGAGGAAATTCCGCAAGGACGTTTTCATAGATTACAACTGTTTCCGGAAATGGGGTCACAATGAGCTCGACGATCCGACCGTCACAAACCCTCTCATATACAAAATTATTAATAAAAAACAATCAATACCTAATCACTATGCAAATAAACTTGTATCAGAAGGGATTCTAACTGGAGATGAAGTCGAGAGCATAACAACAGAATTCACTAAATATTTGCAATCGCAATTCGAACAACATAATTCCTACAAACCCGAGGGATCATATTACCAAGATCAGTGGTCAGAAATGAGTGCTGCACCTCGAGCTGTGGAACTCTGGGATACTGGCGTTGACACTGAAATCTTGAAGCAGGTCGGACGAGCCTCCGTCATTGTACCTGATGACTTTGTCATACATCCCCATTTGGCGAAAACCCATGTAAAAAATCGATTGAACAAATTGTCCGAGGAAAAAGGACTCGACTGGGCTACAGCGGAGGCTCTCGCTTTTGGATCATTACTAATGGAAGGCAGGAACGTTCGCATCAGTGGAGAAGATGTTGGTAGAGGTACCTTCGCCCACAGGCATGTTATGTTCGTAGACCAGGAGAAAGAGAATATACACATCCCACTGAACCATATACACAAGGAACAGAAAGCGTTTTTAGAGGTGGCAAATTCAATTCTATCTGAGGAAGCCGTGTTGGGATTCGAATACGGCATGGCATTCGATTCGCCCGAAAATCTTTGTTTATGGGAAGCACAGTTCGGCGATTTTTACACGGGAGCACAGATTATAGTCGATAATTTCATTGCTTCTGGCGAATCGAAATGGGTTCGCAGCAACGGTCTAGTGATGTTGCTTCCACACGGATTCGATGGCGCAGCATCCGAACATTCCTCCTGCAGGATGGAGCGTTTTTTGCAGCTAACAGACAGTTCGGAGATAAGCCCCGACTCTGAGGCCGTGAACATGAACGTAGCGAATGCCACAACACCAGCACAGTACTTCCATTTGTTAAGGAGACAGATGGTTCGCAACTACAGAAAACCGTTGGTTGTGGTCTCTCCAAAAACTCTTCTGCGTTTGGCCGAAGCGACATCCAACTTGTCAGAATTCGCACCCGGGACACACTTTAAGCCTGTTATTGGTGATCAAATCGCTGATCCATTGAAAGTTAAAAGGGTGTTTTTCGTCAGTGGTAAACATTATTACGAATTGCACAACGAACGGATGAAAAGTAAAATTGATGACGTCGCTATTGTGAGGGTAGAATTGCTTTGTCCTTTCCCCGTGCAAACAATACAAGCGGAATTGCAAAAATATACTAACGCTAGAAAATTCATATGGTGTCAGGAAGAACATAGAAATATGGGTGCGTGGAGTTTTGTAAAGCCACGATTTGAAAATCTGGTCGGGAGAAAGCTCCTATACGCAGGTCGTCCTGAAGCACCAACTACAGCAGTAGGAGCACCTAAACTTCACAAACTGGAAGTTGATTATATACTACGCCAACCATTTTTGACATAA

Protein sequence:

>DPOGS210696-PA
MFAFNKVKPLIRWKQLRLERAKYNSGVGVFGHRPRQTNDIDVPQEIISRRNENCRAQQLVDAYRKYGHLRATIDNVDYENKNRDIKELHLSRYGLSGSDTVDLGLLYGHNGKQFANDLVEQLEKIYCGPISYEFSHLETEAEREWFSQRVESGSDVVSKERQIEIIKELLHSQALDKFLSTKLPSVKRYCGEGAESLLTFLSTLFRLTASEQIQHVVVAMAHRGKLNALGCLLKVPPVKIFHKLAGNPEFPDEANAACDIATHLSVSNDITVNGNTVRFSLINNPSHLEAANSVSMGKTRSKQLKLREGDYSENSTSRFGDKVLNVQIHGDAAFVGQGVNQESLMFSQSPHFDVGGSLHVVVNNQLGFTLPASRGRSSRYVTDLAKSIAVPVIHVNGDYPELVEKATNIAFEYQRKFRKDVFIDYNCFRKWGHNELDDPTVTNPLIYKIINKKQSIPNHYANKLVSEGILTGDEVESITTEFTKYLQSQFEQHNSYKPEGSYYQDQWSEMSAAPRAVELWDTGVDTEILKQVGRASVIVPDDFVIHPHLAKTHVKNRLNKLSEEKGLDWATAEALAFGSLLMEGRNVRISGEDVGRGTFAHRHVMFVDQEKENIHIPLNHIHKEQKAFLEVANSILSEEAVLGFEYGMAFDSPENLCLWEAQFGDFYTGAQIIVDNFIASGESKWVRSNGLVMLLPHGFDGAASEHSSCRMERFLQLTDSSEISPDSEAVNMNVANATTPAQYFHLLRRQMVRNYRKPLVVVSPKTLLRLAEATSNLSEFAPGTHFKPVIGDQIADPLKVKRVFFVSGKHYYELHNERMKSKIDDVAIVRVELLCPFPVQTIQAELQKYTNARKFIWCQEEHRNMGAWSFVKPRFENLVGRKLLYAGRPEAPTTAVGAPKLHKLEVDYILRQPFLT-