Monarch geneset OGS2.0

DPOGS214110
Transcript: DPOGS214110-TA, 1011 bp
Protein: DPOGS214110-PA, 336 aa
Genomic position: DPSCF300014 - 1897168-1899755
RNAseq coverage: 0x (Rank: top 99%)
Annotation
Heliconius       HMEL011408         2e-170   89.42%
Bombyx           BGIBMGA006160-TA   6e-161   83.44%
Drosophila       CG5955-PA          1e-148   76.53%
EBI UniRef50     UniRef50_Q9VPE8    2e-146   76.53%   CG5955 n=46 Tax=Opisthokonta RepID=Q9VPE8_DROME
NCBI RefSeq      NP_001037542.1     8e-166   85.41%   L-threonine dehydrogenase [Bombyx mori]
NCBI nr blastp   gi|112982820       2e-164   85.41%   L-threonine dehydrogenase [Bombyx mori]
NCBI nr blastx   gi|112982820       9e-163   85.41%   L-threonine dehydrogenase [Bombyx mori]
Group
Gene Ontology    GO:0005488   1.1e-31   binding
                 GO:0044237   2.7e-16   cellular metabolic process
                 GO:0003824   2.7e-16   catalytic activity
                 GO:0050662   2.7e-16   coenzyme binding
KEGG pathway     bfo:BRAFLDRAFT_116658   3e-111
                 K00060 (E1.1.1.103, tdh) maps-> Glycine, serine and threonine metabolism
InterPro domain  [29-238]   IPR016040   1.1e-31   NAD(P)-binding domain
                 [31-258]   IPR001509   2.7e-16   NAD-dependent epimerase/dehydratase
Orthology group  MCL14550   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214110-TA
ATGGTAGCTATTGCTAGATTTGTAGCTAGATTTGTAGCTATACTATTGCTAGATTTACGGCCAATTACTTTTAAAATTGCTCGTGGACTTGGACAACTTGGTGTGGAGTGTGCGAAATATTTACGAGGAAAATATGGAAGAGAAAATGTTATACTGTCTGATATTATTAAACCAACGACTGAAGTGTTCAATGATGGACCATATATTTTCGCAGATATTCTAGATTTTAAAGGTCTCCAGAAAATTGTTGTCGATCACAGAGTGGACTGGTTGATACATTTTTCCGCTCTACTTAGTGCTATTGGCGAACAAAACGTACCATTAGCTGTCAGAGTTAATATAGAAGGAATGCACAATGTTATAGAGCTAGCAAAACAATATCGTCTCAGAATTTTCGTGCCAAGTACGATTGGAGCTTTCGGACCTGACTCACCGAGAAATCCTACACCTAATATAACTGTGCAGAGACCACGAACGATATATGGTGTTTCTAAAGTGCATGCGGAATTACTTGGTGAATATTATTACTATAAGTTTGGACTGGATTTCCGTTGCCTGAGATTCCCTGGAGTTATTTCCAGTGATCCTCCCGGTGGAGGTACTACAGACTATGCCATCGCAATATTCCATGATGTTCTTCGGAAGGGTCGCTACGAGTGTTACCTGAGGCCCGACACACGTCTACCAATGATGCATGTCAAGGATGCACTGAGAGCTCTCTCGAACTTTCTGGAAGCCCCCAACAAGATGTTACACAGACGAGTATACAACGTTACCTCAATGAGTTTCACCCCAGAAGAATTGGCTGATCATATGTTCAAATACATACCTGATTTTAGTATTTCGTATAAACCGGACAGTCGGCAGGATATCGCCGACTCCTGGCCTCAGGTTTTCGACGACAGCGAAGCCAGACGAGACTGGAACTGGAAGCCGGAAGTAGACTTGGATAATTTAGTTAAATTAATGCTGAAAGAAGTTAAGGAAAAGATAAATGATTACGACTATTGA

Protein sequence:

>DPOGS214110-PA
MVAIARFVARFVAILLLDLRPITFKIARGLGQLGVECAKYLRGKYGRENVILSDIIKPTTEVFNDGPYIFADILDFKGLQKIVVDHRVDWLIHFSALLSAIGEQNVPLAVRVNIEGMHNVIELAKQYRLRIFVPSTIGAFGPDSPRNPTPNITVQRPRTIYGVSKVHAELLGEYYYYKFGLDFRCLRFPGVISSDPPGGGTTDYAIAIFHDVLRKGRYECYLRPDTRLPMMHVKDALRALSNFLEAPNKMLHRRVYNVTSMSFTPEELADHMFKYIPDFSISYKPDSRQDIADSWPQVFDDSEARRDWNWKPEVDLDNLVKLMLKEVKEKINDYDY-