Monarch geneset OGS2.0

DPOGS214109
Transcript: DPOGS214109-TA, 1029 bp
Protein: DPOGS214109-PA, 342 aa
Genomic position: DPSCF300014 - 1910230-1916384
RNAseq coverage: 10x (Rank: top 84%)
Annotation
Heliconius: HMEL011408, 2e-180, 88.43%
Bombyx: BGIBMGA006160-TA, 9e-162, 83.49%
Drosophila: CG5955-PA, 2e-152, 75.23%
EBI UniRef50: UniRef50_Q9VPE8, 3e-150, 75.23%, CG5955 n=46 Tax=Opisthokonta RepID=Q9VPE8_DROME
NCBI RefSeq: NP_001037542.1, 1e-169, 86.02%, L-threonine dehydrogenase [Bombyx mori]
NCBI nr blastp: gi|112982820, 2e-168, 86.02%, L-threonine dehydrogenase [Bombyx mori]
NCBI nr blastx: gi|112982820, 1e-166, 86.28%, L-threonine dehydrogenase [Bombyx mori]
Group
Gene Ontology: GO:0005488, 1.9e-36, binding
    GO:0044237, 9.4e-20, cellular metabolic process
    GO:0003824, 9.4e-20, catalytic activity
    GO:0050662, 9.4e-20, coenzyme binding
KEGG pathway: bfo:BRAFLDRAFT_116658, 1e-113
    K00060 (E1.1.1.103, tdh) maps-> Glycine, serine and threonine metabolism
InterPro domain: [29-244] IPR016040, 1.9e-36, NAD(P)-binding domain
    [30-264] IPR001509, 9.4e-20, NAD-dependent epimerase/dehydratase
Orthology group: MCL14550, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS214109-TA
ATGACGTTATTAAGAAAATTGAACTGCGCTCTGGCACTCAACACAAGGAGGTACAGTTCGGCAATTAAAAATAAGAATTCACCAAAAATTCTAATAACTGGTGGACTTGGACAACTTGGTGTGGAGTGTGCGAAATATTTACGAGGAAAATATGGAAGAGAAAATGTTATACTGTCTGATATTATTAAACCAACGACTGAAGTGTTCAATGATGGACCATATATTTTCGCAGATATTCTAGATTTTAAAGGTCTCCAGAAAATTGTTGTCGATCACAGAGTGGACTGGTTGATACATTTTTCCGCTCTACTTAGTGCTATTGGCGAACAAAACGTACCATTAGCTGTCAGAGTTAATATAGAAGGAATGCACAATGTTATAGAGCTAGCAAAGCAATATCGTCTCAGAATTTTCGTGCCAAGTACGATTGGAGCTTTCGGACCTGACTCACCGAGAAATCCTACACCTAATATAACTGTGCAGAGACCACGAACAATATATGGTGTTTCTAAAGTGCATGCGGAATTACTTGGTGAATATTATTACTATAAGTTTGGACTGGATTTCCGTTGCCTGAGATTCCCTGGAGTTATTTCCAGTGATCCTCCCGGTGGAGGTACTACAGACTATGCCATCGCTATATTCCATGATGTTCTTCGGAAGGGACGCTACGAGTGTTACCTGAGGCCCGACACACGTCTACCAATGATGCATGTCAAGGATGCACTGAGAGCTCTCTCGAACTTTCTGGAAGCCCCCAACAACATGTTACACAGACGAGTATACAACGTTACCTCAATGAGTTTCACCCCAGAAGAATTGGCTGATCATATGTTCAAATACATACCTGATTTTAGTATTTCGTATAAACCGGACAGTCGGCAGGATATCGCCGACTCCTGGCCTCAGGTTTTCGACGACAGCGAAGCCAGACGAGACTGGAACTGGAAGCCGGAAGTAGACTTGGATAATTTAGTTAAATTAATGCTGAAAGAAGTTAAGGAAAAGATAAATGATTACGACTATTGA

Protein sequence:

>DPOGS214109-PA
MTLLRKLNCALALNTRRYSSAIKNKNSPKILITGGLGQLGVECAKYLRGKYGRENVILSDIIKPTTEVFNDGPYIFADILDFKGLQKIVVDHRVDWLIHFSALLSAIGEQNVPLAVRVNIEGMHNVIELAKQYRLRIFVPSTIGAFGPDSPRNPTPNITVQRPRTIYGVSKVHAELLGEYYYYKFGLDFRCLRFPGVISSDPPGGGTTDYAIAIFHDVLRKGRYECYLRPDTRLPMMHVKDALRALSNFLEAPNNMLHRRVYNVTSMSFTPEELADHMFKYIPDFSISYKPDSRQDIADSWPQVFDDSEARRDWNWKPEVDLDNLVKLMLKEVKEKINDYDY-
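
Length check:

The transcript and protein lengths listed in this record (1029 bp and 342 aa) can be cross-checked with simple arithmetic. The sketch below assumes the transcript is a complete coding sequence that ends in a stop codon, which fits the nucleotide sequence above (it begins with ATG and ends with TGA) and the trailing "-" in the protein sequence. The function name cds_matches_protein is hypothetical and is not part of any database tool.

```python
def cds_matches_protein(cds_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a coding sequence of cds_bp bases can encode protein_aa residues.

    Assumption: the sequence is a complete CDS with no UTRs. When has_stop is
    True, one extra codon is counted for the terminal stop codon.
    """
    # A coding sequence must consist of whole codons.
    if cds_bp % 3 != 0:
        return False
    codons = cds_bp // 3
    expected_codons = protein_aa + (1 if has_stop else 0)
    return codons == expected_codons


# Values taken from the record: 1029 bp transcript, 342 aa protein.
print(cds_matches_protein(1029, 342))
```

With these values, 1029 / 3 gives 343 codons, that is 342 residues plus one stop codon, so the two lengths in the record agree under the stated assumption.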