Monarch geneset OGS2.0

DPOGS201038
TranscriptDPOGS201038-TA1104 bp
ProteinDPOGS201038-PA367 aa
Genomic positionDPSCF300299 - 167274-171038
RNAseq coverage327x (Rank: top 35%)
Annotation
HeliconiusHMEL0121560.082.79% 
BombyxBGIBMGA008076-TA3e-15778.96% 
DrosophilaCG17691-PE4e-14766.48% 
EBI UniRef50UniRef50_P219538e-14566.57%2-oxoisovalerate dehydrogenase subunit beta, mitochondrial n=230 Tax=root RepID=ODBB_HUMAN
NCBI RefSeqXP_974707.12e-16075.36%PREDICTED: similar to AGAP007531-PA [Tribolium castaneum]
NCBI nr blastpgi|910768364e-15975.36%PREDICTED: similar to AGAP007531-PA [Tribolium castaneum]
NCBI nr blastxgi|910768363e-15676.18%PREDICTED: similar to AGAP007531-PA [Tribolium castaneum]
Group
Gene OntologyGO:00081521.4e-42metabolic process
GO:00038241.4e-42catalytic activity
KEGG pathwaytca:6635756e-160 
 K00167 (E1.2.4.4B, bkdA2)maps-> Valine, leucine and isoleucine degradation
InterPro domain[46-221] IPR0054751.5e-57Transketolase-like, pyrimidine-binding domain
[240-363] IPR0159411.4e-42Transketolase-like, C-terminal
[230-367] IPR0090145e-42Transketolase, C-terminal/Pyruvate-ferredoxin oxidoreductase, domain II
[236-344] IPR0054763e-36Transketolase, C-terminal
Orthology groupMCL15091 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201038-TA
ATGAGTTTCTTAGCTCAAAAACTTTTCAGTTTCGGTGGTGCTATTAAAAATGTAAACAAAAATTCTATTAGACTTTCTTCTCATTTCATATATCATCCAGATAATGAAAAACCCATTGAAGGAGAAACAAAAAAGATGAACATGATGCAAGCTATAAACGATGCAATGGACATCACACTCAAAAACGATCCAACGGCTGTTTTATTCGGAGAAGATGTCGGCTTTGGAGGTGTTTTTAGATGTGCCTTGGGATTACAGGAAAAGTATGGCAAAGACAGAGTATTTAACACACCATTGTGTGAGCAGGGTATTGCAGGGTTTGGTATTGGATTAGCGACGGCCGGTGCTACTGCCATAGCTGAAATACAGTTTGCAGATTATATATTCCCAGCCTTTGATCAGCTTGTAAATGAAGCAGCTAAGGCTCGATACAGATCGGGCGGTCAGTTTGACTGCGGCGCGTTGACGGTTCGCGCTCCGTGTGGTGCCGTGGGCCACGGAGGGTTGTACCACTCACAGAGCCCTGAGGCATTCTTCGCTCATGCAGCTGGGCTCAAGGTGATAGTACCAAGAGGTCCAATTGCTGCGAAAGGTCTTTTATTGGCGTGCATCCAAGAAAGGGACCCCTGTATTTTCTTAGAACCAAAAATTTTATACAGATCTGCCAATGAAGAAGTCCCTATTGATAGTTATACTTTACCCATCGGAAAGGCTCAAATTTTAAGAGAAGGTAATCAAGTCACTTTAATAGCGTGGGGTACACAAGTACACGTTTTACTGGAAGTTGCTAAACTAGCAAAGGAGCAGTTTGACGTTAGTTGTGAGGTCATAGATCTCATGTCAATACAACCGTGGGACGAAGTGACTGTTTGTGATTCAGTGAAAAAAACCGGAAGATGTCTAATAGCGCATGAAGCTCCACTCACTTGCGGTTTCGGCGCTGAATTGGCAGCCACTATTCAGGAGGAATGCTTTCTTCACCTGGAGGCACCTATATCACGTGTGACAGGCTGGGATGCGCCCTTCCCTCATGTGTTCGAACCCTTCTACTTACCAGACCGTTGGCGATGTCTAGAAGCCATCAAACAATTGGTGCAGTACTAG

Protein sequence:

>DPOGS201038-PA
MSFLAQKLFSFGGAIKNVNKNSIRLSSHFIYHPDNEKPIEGETKKMNMMQAINDAMDITLKNDPTAVLFGEDVGFGGVFRCALGLQEKYGKDRVFNTPLCEQGIAGFGIGLATAGATAIAEIQFADYIFPAFDQLVNEAAKARYRSGGQFDCGALTVRAPCGAVGHGGLYHSQSPEAFFAHAAGLKVIVPRGPIAAKGLLLACIQERDPCIFLEPKILYRSANEEVPIDSYTLPIGKAQILREGNQVTLIAWGTQVHVLLEVAKLAKEQFDVSCEVIDLMSIQPWDEVTVCDSVKKTGRCLIAHEAPLTCGFGAELAATIQEECFLHLEAPISRVTGWDAPFPHVFEPFYLPDRWRCLEAIKQLVQY-