Monarch geneset OGS2.0

DPOGS208941
TranscriptDPOGS208941-TA3012 bp
ProteinDPOGS208941-PA1003 aa
Genomic positionDPSCF300009 + 246508-254463
RNAseq coverage161x (Rank: top 52%)
Annotation
HeliconiusHMEL0210320.084.43% 
BombyxBGIBMGA002883-TA0.079.42% 
DrosophilaNc73EF-PI0.051.32% 
EBI UniRef50UniRef50_Q022180.048.87%2-oxoglutarate dehydrogenase, mitochondrial n=152 Tax=cellular organisms RepID=ODO1_HUMAN
NCBI RefSeqXP_001652168.10.050.81%2-oxoglutarate dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|1571141190.050.81%2-oxoglutarate dehydrogenase [Aedes aegypti]
NCBI nr blastxgi|1571141190.050.45%2-oxoglutarate dehydrogenase [Aedes aegypti]
Group
Gene OntologyGO:00060960glycolysis
GO:00045910oxoglutarate dehydrogenase (succinyl-transferring) activity
GO:00551140oxidation-reduction process
GO:00309760thiamine pyrophosphate binding
GO:00081521.4e-55metabolic process
GO:00166241.4e-55oxidoreductase activity, acting on the aldehyde or oxo group of donors, disulfide as acceptor
KEGG pathwayaag:AaeL_AAEL0067210.0 
 K00164 (OGDH, sucA)maps-> Citrate cycle (TCA cycle)
    Tryptophan metabolism
    Lysine degradation
InterPro domain[22-1004] IPR01160302-oxoglutarate dehydrogenase, E1 component
[257-570] IPR0010171.4e-55Dehydrogenase, E1 component
[640-854] IPR0054752.3e-53Transketolase-like, pyrimidine-binding domain
Orthology groupMCL10198 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208941-TA
ATGGCTTTATTTCGTACAATTCGGAAACTAACTCTTCCAGTACACACATCGAATTCGACGCATTTAATGCTATTAGGGAGAGTCGTGTCATCCAAAGCGGAAACCTTTTTAAGCGGCACCAATGCGAATTATTTAGAAGCCCAATATTTAAACTGGTCGAAATCCCGTGGATCAGTGGACCCTTCATGGGACAGATTCTACGAGAGCGTCAGTGGTCCAACTTTAAAAATAGAAGAGTTTGAAGACGAGTTCAGTCCTAAGACCATCGGTCAAAATAAAACGTCGCGATCTGAAAAAGATGACAAGGAAGCCAAAGAACGAATCAAACTTCATCTGGCTGTACAGAATCTCATTCGAAGTTATCAGGCTCGCGGTCACCTCCTGGCGAAAACAGATCCTTTGAACCTGTCTATAGGAGCGGGTTGGTTGCTTAAAACTCAAAAGCCTACAATGGAAATAAAAGGAGTAGACAGTGTTGTAGTTGCAAGAGAAATTGGTACTATATTGCGCGAAGAGCATATGGATACTGTTTTTGAACTACCTGAAAGAACTTGTATCGGGGGAACGGAAACTGCTTTGACTCTTAGGGAAATCATAAGACGCTTGGAGATAGTGTACTGCGGTCCTATAGGTGTGGAGTACATGCATCTATTTGACATCGACTGTCTGCAGTTTATGAGGGAAAAAATGGAAACACCAGGATGTTTACAGAGAACTGTTGAAGAGAAGAAACTCATCATGAGACGTTTAACGAAAGCGGTCTACTTAGAAAAGTACTTCGCAACGAAATGGCCGGCTGAAAAGCGTTTTGGTCTGGAAGGTGGAGAGAGTATGATCGTGATGCTGGAGGAGATTGTTGACAGCAGCACTCAATTGGGAGTTGAGTCTATAGTGATGGCCATGCAGCATAGAGGTCGTCTAAATATGTTAGTGAATGTGTGTCGGAAACAGCTGACTGATATTTTTGCTCAATTTAAACCTATGGAACCTAAAGAGCCGGGGTCAGGGGATATAAAGTACCATTTGGGTACATTCATCCACCGCTTCATCCGGAAGACGAATAAATATTTGAAGGTGTCCATGAGCTGCAATCCATCTCATTTGGAGGTTGTGTCTCCGGTTGTAGTCGGAAAAGCTCGTGCTGAACAACATTGGAAGGGGGACAATCAAGGCGATAAGGTGATGGCGATCATAATGCACGGTGACGCGGCGTTCTCCGGTCAAGGTGTAGTGTATGAGACCCTTCAGCTCGGCAACCTACCCAACTTCACAACTCACGGCTCCATACACATAGTATGCAATAATCAGATCGGGTACACCACGGACCCGAGGTTCGCGAGGAGCTCGCCTTATTGTTCCGATGTAGCGAAATGCATGGACGCGCCAGTGTTACATGTAAACGGCGATGATGCGGAGGCTGTCGCGCACGTGGCGAAGGTCGCCATCGAGTTCAGGTGCAAATTTAAGAAGGATGTGGTCTTGGACCTGGTCTGCTATCGAAGGTTCGGACACAGCGAGGAAGACGAACCGATGTTCACGCAGCCATTTATGTACAAGAAAATACGGAGTATGGAAACTGTTGATAAGATCTACGCGAAAAAAATACTGGCAGAGGGCGTCGTGACGCAAGCAGATATCAATCGATGGGAAAAGGAATACAACGATACTTTGAACAAACATTTCGAACTGGCAAAGAAGGTTACCAAACTGAGCATAATGGATTGGATTGACACGCCATGGACCGGATTCTTCGAATCCTGTGACCCGAAAAAGGTAAAAGAGACTGGAATTTGCGAAACATCGTTATCAACGATAGCACACCATTTTTGTAAAGCACCGGAACCGTGGGCTTTCGAAGTCCATAAAGGCATCCACAAGATTTTGGAAAAGAGGGCGAAAATGGTAAAAGAAGGCGTTGCAGATTGGGCGATGGGAGAAGCTCTCGCCTACGGTTCGCTATTAAGAGATAAAGTACACATAAGACTCACAGGGGAAGACGTCGAAAGAGGGACTATGGCACATAGACATCATGTTTACCATCACCAAGGTGTAGACGGGGCGACGCACCGAGTACTGGATACCTTGTACGCTGATCAATCACTGTATAGCCTTCACAACAGCTCGTTATGCGAATTTGGTATCTTGGGTTTCGAGGTCGGCTACTCCTATTCTAGTCCGAATCTACTTACTATATGGGAAGCTCAGTATGGTGACTTTGCGGACACGGCTCAGCCGGTTTTCGACACCTTCATCGTGAACGGAGAAAGCAAGTGGGTCTGTCAGTCGGGCCTTGTAGTGCAACTACCGCACGGCATTGACGGAGCGGGTCCTGAGCATTCTTCAGCAAGGCCAGAAAGATATTTGCAGCAGGCAGATGATGATGAAGATGTCATACCGGATCTCGATGACAAAAATATGCCTCTAAACCAACTTCGTGCCGCAAACTGGATTGTTTGCAACTTGACTACACCAGCCAACTATTTCCATATGATACGAAGACAAATCGCGTTGCCCTTCCGCAAGCCCCTCATCCTCATGACTCCTAAAGTCGGCCTCAAACATCCTTATTACACTTCACCATTTAAAGACTTCTTGCTTGGAACTCAGTTTCAAAGGGTAATAAGAGAAACCGGTCCTGCCAGTAAGGACCCTAAAAATGTAAAGAAGCTAATATTCTGCTCCGGCAAAGTCGCAATTATAATTGATGAATTACGGAAAGAAAAGAAGCTACAAGACAAGATCGCATTGTGTCGCATCGAACAACTATACCCATTCCCATACGACCTAATACTTAAAGAGTTTTGCTTTTACCCCAGCGCTAAAGTAGCATTTTGTCAAGAAGAACATAAAAATCAGGGACCGTGGACATTCGTTAGAAATAGACTCGAAAATCTCTTTGGGAAGAAAGTGGAGTGCATTTCAAGACCACCCAGTGCTGCGTCAGCTACGGGTATTAAATGGATACACGCAAAGGAACTTAAAGAATTGAAAGAAAAAATTATCGCAATGTGA

Protein sequence:

>DPOGS208941-PA
MALFRTIRKLTLPVHTSNSTHLMLLGRVVSSKAETFLSGTNANYLEAQYLNWSKSRGSVDPSWDRFYESVSGPTLKIEEFEDEFSPKTIGQNKTSRSEKDDKEAKERIKLHLAVQNLIRSYQARGHLLAKTDPLNLSIGAGWLLKTQKPTMEIKGVDSVVVAREIGTILREEHMDTVFELPERTCIGGTETALTLREIIRRLEIVYCGPIGVEYMHLFDIDCLQFMREKMETPGCLQRTVEEKKLIMRRLTKAVYLEKYFATKWPAEKRFGLEGGESMIVMLEEIVDSSTQLGVESIVMAMQHRGRLNMLVNVCRKQLTDIFAQFKPMEPKEPGSGDIKYHLGTFIHRFIRKTNKYLKVSMSCNPSHLEVVSPVVVGKARAEQHWKGDNQGDKVMAIIMHGDAAFSGQGVVYETLQLGNLPNFTTHGSIHIVCNNQIGYTTDPRFARSSPYCSDVAKCMDAPVLHVNGDDAEAVAHVAKVAIEFRCKFKKDVVLDLVCYRRFGHSEEDEPMFTQPFMYKKIRSMETVDKIYAKKILAEGVVTQADINRWEKEYNDTLNKHFELAKKVTKLSIMDWIDTPWTGFFESCDPKKVKETGICETSLSTIAHHFCKAPEPWAFEVHKGIHKILEKRAKMVKEGVADWAMGEALAYGSLLRDKVHIRLTGEDVERGTMAHRHHVYHHQGVDGATHRVLDTLYADQSLYSLHNSSLCEFGILGFEVGYSYSSPNLLTIWEAQYGDFADTAQPVFDTFIVNGESKWVCQSGLVVQLPHGIDGAGPEHSSARPERYLQQADDDEDVIPDLDDKNMPLNQLRAANWIVCNLTTPANYFHMIRRQIALPFRKPLILMTPKVGLKHPYYTSPFKDFLLGTQFQRVIRETGPASKDPKNVKKLIFCSGKVAIIIDELRKEKKLQDKIALCRIEQLYPFPYDLILKEFCFYPSAKVAFCQEEHKNQGPWTFVRNRLENLFGKKVECISRPPSAASATGIKWIHAKELKELKEKIIAM-