Monarch geneset OGS2.0

DPOGS202329
TranscriptDPOGS202329-TA1257 bp
ProteinDPOGS202329-PA418 aa
Genomic positionDPSCF300032 + 597660-601253
RNAseq coverage48x (Rank: top 71%)
Annotation
HeliconiusHMEL0051000.078.12% 
BombyxBGIBMGA004906-TA1e-7541.28% 
DrosophilaCG8129-PB2e-6137.68% 
EBI UniRef50UniRef50_F0WTS32e-7341.06%Threonine dehydratase catabolic putative n=6 Tax=root RepID=F0WTS3_9STRA
NCBI RefSeqXP_624902.19e-7440.10%PREDICTED: similar to CG8129-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3407141572e-7439.61%PREDICTED: threonine dehydratase catabolic-like isoform 1 [Bombus terrestris]
NCBI nr blastxgi|3407141591e-7439.61%PREDICTED: threonine dehydratase catabolic-like isoform 2 [Bombus terrestris]
Group
Gene OntologyGO:00081521.9e-73metabolic process
GO:00038241.9e-73catalytic activity
GO:00301701.9e-73pyridoxal phosphate binding
KEGG pathwayame:5525233e-73 
 K01754 (E4.3.1.19, ilvA, tdcB)maps-> Valine, leucine and isoleucine biosynthesis
    Glycine, serine and threonine metabolism
InterPro domain[11-360] IPR0019261.9e-73Pyridoxal phosphate-dependent enzyme, beta subunit
Orthology groupMCL34438 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202329-TA
ATGGAACCGGAAAGGCCACACGTATTAACATTTAATGAAATAAAACAAGCGGCGGAGAGAATTGAAGACGGCATCATTCACACTCCTCTTTGTGAGGCAAAAATAAGTAAATACATGGATTATAACATTTATTTGAAATGTGATAATTTACAATACACTGGAAGCTGCTCTGAAAGAGGTATCCGGAACGCCTTTGTCGCTCTTAACGATGAAATGTCTGAAAGAGGAGTGATTGTACCGTCTAACGGAAACATGGCTCTGGGAGCTGCATATCAAGGACACTTGCTAAATATACCAGTGACAGCGGTACTTCCGGAGAGATGTCCCCCGTCACTCTCGCAGCGGTGTGCTGAACTAGGCGCTCACGTGGTTCTGGTTGGTGAAACAGTCGAAGATGCTGTCTGTTACGCCAACAAGACCAACCGGGATGGATCACAGATTATATTAACTTCGGACGACCCGTTAGTAATGGCTGGCCTGGGTACTGTTGGCGTTGAAATCATAACTCAATTGCCAGAGACAGATGCGGTTATTGTACCTGTAGCGTCAGGCGGTCTTCTAGCTGCAACGCTTGTAGCCTGCAAGAAGCTAAAGTGCACCTGTCTCGTTTATGGAGCAGAATGCGCCAAAGTCCCAAAAATGATGAAGGCCTTACAATCAGGATATCCGGTTTCGGTGCCCGTTATACCCAATATAGCACAGGGTTTAAGTTCTGCGGTTGTCGGGGAGAACGCCTTCGCGACTATAAAGGGTCGTTTGGATAGAATGTTAGTCGTCGACGAAGCCTACATTGCTCGCGCCGTAATAAATGTATTAGAACGTGAGAGACTTGTGGCGGACGGCGCCGGCGTGTGCGCGTTGGCTGCTGTCATGCAAGGTCTAGTTCCAGAGTTGAGAGGAAAACGAGCTGTGTGCGTTATAAGCGGAGGCAATATCGACTCGGGACGTCTCTCCCGGACTATTCACCGCGGGCTGGGTGTGAGTGGGAGACTTATGAGGTTTGCAGTGCCAGTCCCTGATCACTGCAAGGGACTTGAAGTTCTAGCCGAAGCCATCGCTGATAAAAGAGCGGTCATTAAGAGCTTCGGCACCGAACAGATATGGGTTCACAGTGATATCGGATCTACTTGGGCAAATGTTGTAGTCGAAACGGCAAATGAAGAAAATGCTCTAGCGCTCAAAGAACATTTGAGGAATTTGTACCCAACTGTCAAATTTGCAGTTTTTGATGTGGATGAAAAACATAAGATGAGATAA

Protein sequence:

>DPOGS202329-PA
MEPERPHVLTFNEIKQAAERIEDGIIHTPLCEAKISKYMDYNIYLKCDNLQYTGSCSERGIRNAFVALNDEMSERGVIVPSNGNMALGAAYQGHLLNIPVTAVLPERCPPSLSQRCAELGAHVVLVGETVEDAVCYANKTNRDGSQIILTSDDPLVMAGLGTVGVEIITQLPETDAVIVPVASGGLLAATLVACKKLKCTCLVYGAECAKVPKMMKALQSGYPVSVPVIPNIAQGLSSAVVGENAFATIKGRLDRMLVVDEAYIARAVINVLERERLVADGAGVCALAAVMQGLVPELRGKRAVCVISGGNIDSGRLSRTIHRGLGVSGRLMRFAVPVPDHCKGLEVLAEAIADKRAVIKSFGTEQIWVHSDIGSTWANVVVETANEENALALKEHLRNLYPTVKFAVFDVDEKHKMR-