Monarch geneset OGS2.0

DPOGS202451
TranscriptDPOGS202451-TA690 bp
ProteinDPOGS202451-PA229 aa
Genomic positionDPSCF300174 - 186180-193512
RNAseq coverage2670x (Rank: top 5%)
Annotation
HeliconiusHMEL0101254e-8477.27% 
BombyxBGIBMGA009974-TA2e-6884.46% 
DrosophilaNc73EF-PI4e-3545.50% 
EBI UniRef50UniRef50_E2AHW44e-4051.69%2-oxoglutarate dehydrogenase E1 component, mitochondrial n=1 Tax=Camponotus floridanus RepID=E2AHW4_CAMFO
NCBI RefSeqXP_391838.22e-4753.09%PREDICTED: similar to Neural conserved at 73EF CG11661-PF, isoform F isoform 1 [Apis mellifera]
NCBI nr blastpgi|3287864553e-4653.09%PREDICTED: 2-oxoglutarate dehydrogenase, mitochondrial-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|2420247949e-4556.73%2-oxoglutarate dehydrogenase, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00060965.6e-43glycolysis
GO:00045915.6e-43oxoglutarate dehydrogenase (succinyl-transferring) activity
GO:00551145.6e-43oxidation-reduction process
GO:00309765.6e-43thiamine pyrophosphate binding
KEGG pathwayame:4082864e-47 
 K00164 (OGDH, sucA)maps-> Citrate cycle (TCA cycle)
    Tryptophan metabolism
    Lysine degradation
InterPro domain[22-160] IPR0116035.6e-432-oxoglutarate dehydrogenase, E1 component
Orthology groupMCL34447 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202451-TA
ATGCATCGTGCAAAGTTAGTGCTTCAGGCTACAAATAAAATCGGCAACAGTGAAAGATTTGCATCATGGCTCCTCAACAAACCCCAGACGGCGGCCGTGATGGTGAACAACCAAAGGCTGAAGAGTTCAACGGCAGCAGAGCCTTTCTTGAACGGTTCAAGTTCAGCATATGTGGAGACTATGTACAATGCCTGGCTGTCAGATCCTAACTCGGTGCATGCGTCTTGGGACGCGTTCTTCCGCAACGCCACAAATGGCGCCCAGCCCGGGGTCGCATACACGTCTCCACCGAATCTAGCCCCGTACTCCAAGAACGAAGTCCCCTTGACCTCCCTGGTACCAGCGGCTGGCATGCCCTCGATATCAGCAGGTTCACCCATCAACGAGAAAATCATCGACGACCATCTGGCGGTGCAGGCTATCATCAGAAGCTACCAGGCTCGAGGTCACCTGGCGGCGGATGTGGACCCGCTCGGCATCACCACGGCCAACCTGCCCGCCCTCGGCATGCGCGCGCCACGCTCGGAGCTCATCATGAGGAAATATTTCAATTTCGGTACGATTGGAAACAATCGCCAGGACGAAAGATCCGTTGGCGGAACTATTGGCCAGCTTGGCCCGACGTGTGGCCGTGGTTACGACACACTCGCCCCACGTCGGGCGCCACTATCGAATAGTTTCGCCAGCTGA

Protein sequence:

>DPOGS202451-PA
MHRAKLVLQATNKIGNSERFASWLLNKPQTAAVMVNNQRLKSSTAAEPFLNGSSSAYVETMYNAWLSDPNSVHASWDAFFRNATNGAQPGVAYTSPPNLAPYSKNEVPLTSLVPAAGMPSISAGSPINEKIIDDHLAVQAIIRSYQARGHLAADVDPLGITTANLPALGMRAPRSELIMRKYFNFGTIGNNRQDERSVGGTIGQLGPTCGRGYDTLAPRRAPLSNSFAS-