Monarch geneset OGS2.0

DPOGS215949
Transcript: DPOGS215949-TA  1137 bp
Protein: DPOGS215949-PA  378 aa
Genomic position: DPSCF300308 + 162744-170385
RNAseq coverage: 5x (Rank: top 88%)
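
The genomic position and transcript length above can be compared with a little arithmetic. The sketch below is illustrative only: `parse_position` and `span_length` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into (scaffold, strand, start, end)."""
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1


scaffold, strand, start, end = parse_position("DPSCF300308 + 162744-170385")
locus_bp = span_length(start, end)   # length of the genomic span
transcript_bp = 1137                 # transcript length listed in the record

print(scaffold, strand, locus_bp, locus_bp - transcript_bp)
```

Under that assumption the locus spans 7642 bp, so about 6505 bp of the span is not represented in the 1137 bp transcript, which would be consistent with intronic sequence.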
Annotation
Heliconius: HMEL020686  1e-105  57.28%
Bombyx: BGIBMGA001856-TA  2e-75  47.04%
Drosophila: CG10175-PC  2e-18  27.45%
EBI UniRef50: UniRef50_Q7PY29  3e-22  25.61%  AGAP001722-PA n=3 Tax=Anopheles RepID=Q7PY29_ANOGA
NCBI RefSeq: XP_002072946.1  1e-24  26.61%  GK13419 [Drosophila willistoni]
NCBI nr blastp: gi|195451491  3e-23  26.61%  GK13419 [Drosophila willistoni]
NCBI nr blastx: gi|156619502  2e-21  27.93%  carboxylesterase-like protein [Helicoverpa armigera]
Group
KEGG pathway: dpo:Dpse_GA15379  4e-12
    K01044 (E3.1.1.1) maps-> Drug metabolism - other enzymes
    Tropane, piperidine and pyridine alkaloid biosynthesis
InterPro domain: [1-118]  IPR002018  8.6e-23  Carboxylesterase, type B
Orthology group: MCL21140  Lepidoptera specific
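
The similarity hits listed under Annotation can be ranked by e-value to see which source gives the strongest match. This is a minimal sketch with the values copied from the entries above; the tuple layout and the variable names are assumptions, and the percentage column is unlabeled in the record (it is presumably percent identity).

```python
# (source, hit ID, e-value as listed, percentage as listed)
hits = [
    ("Heliconius", "HMEL020686", "1e-105", 57.28),
    ("Bombyx", "BGIBMGA001856-TA", "2e-75", 47.04),
    ("Drosophila", "CG10175-PC", "2e-18", 27.45),
    ("EBI UniRef50", "UniRef50_Q7PY29", "3e-22", 25.61),
    ("NCBI RefSeq", "XP_002072946.1", "1e-24", 26.61),
    ("NCBI nr blastp", "gi|195451491", "3e-23", 26.61),
    ("NCBI nr blastx", "gi|156619502", "2e-21", 27.93),
]

# A smaller e-value means a more significant match, so sort ascending.
ranked = sorted(hits, key=lambda h: float(h[2]))

for source, hit_id, evalue, percent in ranked:
    print(f"{source:15s} {hit_id:18s} {evalue:>7s} {percent:6.2f}%")
```

The two lepidopteran hits (Heliconius, then Bombyx) sort first and the Drosophila CG10175-PC hit sorts last, in line with the "Lepidoptera specific" orthology group.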
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215949-TA
ATGGGAGCCAAAGATATTTTACTCGCCCTTAAGTGGTTGCTTGATAACATTAGTATCTTTAATGGTGATCCCAGAAGAATAACTATAATGGGTTCAGGTGAAGCAACGACGTCCGTGGCATCTATGCTTTTGACGAGAATGGCTGGAACTTTATTCCAAAGAGCGATAATCATGGACGGAAGTGCGCTATCCCCTGCTGACTATAGACCATATAATTTTGAAGTTGCCAACAAATTATTTTGGAAATTTAATGGGAGTTGGGATCAGTTCAATCGCAAAGAATTGTATAATATACTTAAAAATTCCTCAGCTGCCAAATTAAAACTAGCCTCTCGCGAACTCTATGACAGCACGGAAGAAAAAATGTCAGCTCAATTGGACGGCATTTCTAAAAACAGAAAACTGTTGAAATATTTAAATTACAATTTTCAATATCTGTTGCCGTTCGAGGGGGAATGTGATGAATACGGATCCAAGCGCTATAGAAGAATAAGAAAGGAAATCATGGATTTTTATTTCATAAATGGAACTATAACGGATGTTAGTTTCAAAAGGTATGCAAAATATATTTCCGACCAGACAATATATCCATTGCTTCGGCAAGCTGTATTACAGTCCAAAGTATCAAAAGGGAGCGTTTACTTGTATCGACTTCCTCTCAACGGACCGTTGAATATGAGGTGGAGAAAATTTGTACCCAATTTGAAATGGAAAGGTGCTACTTCAGGCGACGAAATCTGCTACCTATTCAAATGCAAATCGGAATTAGAAACTTATAATAGAATTGGGAAGGTGAATGAAACACGGTTTATTAGCAGGATTGTTAGACTATTTGGAAACTTTGTCAAATTTGGGCTCCCAACACCTCGAGGTGTTGATAACATTTTTGGATCATTCAAATGGTTGCCGTTAGAAGAAAGAAGACCTATAAGAGCTTTGAATATGGGACATCACTTAAAAATGACAAATTATATCAGCGGAATAGTTTATCACTGGTGTGATAAATCTATGAGCCTACTCTCCCTCACAGCGATTTCCAGCGGCGGCCATTTTGGATGCATAGAGCAAAGAGACGCGTCTATACAAAACAAAGTGCTCAGATTTAAAATTTTAGTGTCAATGAAATTTATTCAGTGA

Protein sequence:

>DPOGS215949-PA
MGAKDILLALKWLLDNISIFNGDPRRITIMGSGEATTSVASMLLTRMAGTLFQRAIIMDGSALSPADYRPYNFEVANKLFWKFNGSWDQFNRKELYNILKNSSAAKLKLASRELYDSTEEKMSAQLDGISKNRKLLKYLNYNFQYLLPFEGECDEYGSKRYRRIRKEIMDFYFINGTITDVSFKRYAKYISDQTIYPLLRQAVLQSKVSKGSVYLYRLPLNGPLNMRWRKFVPNLKWKGATSGDEICYLFKCKSELETYNRIGKVNETRFISRIVRLFGNFVKFGLPTPRGVDNIFGSFKWLPLEERRPIRALNMGHHLKMTNYISGIVYHWCDKSMSLLSLTAISSGGHFGCIEQRDASIQNKVLRFKILVSMKFIQ-
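
The transcript and protein entries can be checked against each other by translating the nucleotide sequence. The sketch below assumes the standard genetic code and that the listed transcript is a complete coding sequence ending in a stop codon; `translate` and `expected_protein_length` are hypothetical helper names. It writes the stop codon as `*`, whereas this record writes it as `-`.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third base varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence; unknown codons become 'X', stops '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def expected_protein_length(cds_length):
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


# First five codons of DPOGS215949-TA followed by its stop codon (TGA).
print(translate("ATGGGAGCCAAAGATTGA"))
print(expected_protein_length(1137))
```

The first five residues, MGAKD, match the start of DPOGS215949-PA, and a 1137 bp coding sequence gives 379 codons, that is 378 residues plus the stop, matching the listed 378 aa.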