Monarch geneset OGS2.0

DPOGS211395
TranscriptDPOGS211395-TA4755 bp
ProteinDPOGS211395-PA187 aa
Genomic positionDPSCF300115 - 491908-507733
RNAseq coverage1030x (Rank: top 12%)
Annotation
HeliconiusHMEL0170924e-4548.85% 
BombyxBGIBMGA004683-TA3e-4552.60% 
Drosophilaalpha-Est2-PA3e-3042.77% 
EBI UniRef50UniRef50_D5G3F41e-4247.49%Carboxyl/choline esterase CCE017a n=4 Tax=Obtectomera RepID=D5G3F4_HELAM
NCBI RefSeqNP_001165227.14e-3951.28%alpha-esterase 48 isoform s1 [Bombyx mori]
NCBI nr blastpgi|3388327391e-4757.59%carboxylesterase [Melitaea cinxia]
NCBI nr blastxgi|1892364670.049.61%PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
Group
KEGG pathwayapi:1001633769e-29 
 K03927 (PNBA)maps-> Drug metabolism - other enzymes
InterPro domain[26-187] IPR0020187.5e-49Carboxylesterase, type B
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211395-TA
ATGAATTATTATAATAAGTATATTAATACAGGGCTAAAAATAATTGACTGTTTAAGTCTTAACACGAGAAATCTCGCTACTTACTCAAGTAACATGCATTCACCCATTTTGGATCTGAAGGAAGGAAAGGTCCGGGGAAGAATAAGAAAACTTGATAATGGCAAGGAGTTTTATAGCTTCAAAGGAATCCCATACGCACAGCCACCAGTGGGAAGCCTTCGATTCAAGGTAGTATATGGATCTGATGAATTAGTGAAAGCCCCGCTTCCTCCCAAACCCTGGTCGCATGTCCTAGATGCTGCAGAACACGGAGCTACTTGCCCACAATGGGATATGGTGGCTTTGAAATTTAGAAAAGGGGAAGAGAACTGTTTATTCATTAATGTTTACACTCCCACCTTACAAACTGACTCCAAATTACCTGTGATGGTGTATATCCACGAGGGAGGCTTTCAATCAGGATCTGGAAATGAAAGAATGTATGGCCCAAATTTTCTGATAAGGCATGATGTTATTCTTGTTTTGTTTAATTGTAGATTGGATCTATTGGGATTTTTGTCTTGACAGCCCTGAAGTTCCTGGCAACGCAGGTATGAAAGATATGGTAGCAGTCCTGAGATGGGTTCAAAATAACACTAAACAGATAATTTAATCATTATTGGAGTAAGTGGTGGAGCAGTTTCCGTGACTTGTCTTATGATATCTCCGATGGCTCAAGGCTTGTTTCATAAAGCTATTGCACAAAGTGGATCATGTCTTGCCGATTATGCAATAGATGTTAATCCTGTACAGAGAGCTTTTCGAGTTGGAAAGGTCCTTGGGAAGGATACGACAGATCCTTATGAGTTACTCACATATTTACAAAGCTTGCCTGCAGAAAAACTTCTGTATTTGACGTTAAAAACTGCAACTAAGAGAGAAAAACTTGGTGGTCTACCCATTGTTTTTACTCCCGTCGTAGAAAAGAGGTTTGGTGATAATGAAGTATTTTTGCCGGAAGACCCGTTAGATATAATGTTATCTGGAAAATATAATCCTGTTCCTCTTATAACCGGATATACTTCATCAGAAGGAGCAGTAACATTAAACGACGTGATACCAAAGCTGGACTTTATCAATAATTGTCCGTCCCTGTTGGTGCCTAAAGACATTTATATAAAAATTACAGAAGACAAGGCTTTGGAATTTGGTGAAAGAATCAAGCAATTCTATTGGGGTGACAGGGATTTAACAGCAGATGATGTTGAAATAATTGCTGATTTGCAGTCAGATATTCATTTTGTGATGGCAACATACAGGACAGCAGATTATTTTTCAAAATACTATTGCGAAGCCATGTCTCACATACCGAATCCGTATAACAATGCTCCCGGGACCTACAACTCAAATCCTGCTATGTTTCAACAATTCAATCAAACGCTGACGAGCCAACTGCCTCCGAAACAAGACACGAAACCAATCATTCCTACAAGCATTCAAGGGTTCAACCACCAAACGAATCCATATAATGACGAAAGTGCCAGCAGTTCGCCAGCTTTTAATAATACTGGAATGATGCCCCTTCCAACAAGCTTACAGTCATCCCCCATGAGGCCCGTTAAGCCAGTGCCAAGTCAACAGCAGGCTCAAAGCATTCAGAACATTCCACTGAGCTCATCCAGCCCACATTCAATGACCAACACTAACCAATCAAACTCTCCATATCATCCTCAATACCATCAAATCAATAATAATGCCCCACCTCTAATTCCTTCTAGTCTAAACAACCAGCCAAACATGAATGGTCCAATGAGTGCCCCAAATGCACTCCCTAATCAATATCCAGCAGCATCCAATTCATTCAGTCCAGCTCCTATGAATCGCCCTCCGACAATTAATACTTTCAGTACAAATCAAAGTTATAGAACAAATCACACATCCCAAACTTCATCTCCTCATGCTGTAAACCATCAGCCTAACGTTCCATTTAATGGACCATCTGGTCCCAGAAATAGTCCTCTAACTACAGGACAGCAAGTTAATTCCCTCACATCTAGTATGGCAAAGTTACCGCCTATTTCTGGACCACCTGGGCCACCGAGAAGTCAAAACAGTTTAATTAATGGACCGATGCAAGGCCAAGCTCCGCTTAATAACATGGGACCCCCAAAAAATATGTCCCATGGGCCGTCTTCTCTACCCATAGGTCAACCGGGAGTCCCATTACGACCTACGGGACAAACAAACCAACAGCCACCCCTTATTAATAGTTCCCAACCAAACTTCATCACTGGACCACCTAGCGGCCCTCAAAGTATGCCAATCGGGCCAGGACAAGCTCCTCCTATGGTACAACAGCCTAGACCTGGGCCAGCAAGCGGCCCATCTTCCATGACGAGTAGATATCCACAGATGCCTTACACGAATCTTAGCCCACAGCAACAAATGCAAGTCCAACAGAACATTGCTAAACAGTTTCCCACACATAACTTGTACGACGTGAACCAGCAAGGAGGACAGCTCAGTGTGACTAAGCAGGGCTTCAACCAGCTTTGGGGTCATCAGATGGTGGACTTGATGCAGTGTAAGCACATTCTGCCAGAATATCCCGAAGATCCACCGGAAATAAGGCTCGGACAACAGTTTGCTGAGGCCAATAACTGCAGCCCAGAAATATTCCGTTGCACTGTCAACCGTATCCCGGAGACGAATTCTTTATTGCAGAAGTCTAGACTGCCTCTCGGTATCCTGATCCATCCGTTCAAAGACCTCAATCATCTCCCCGTAATACAATGTACTACTATAGTCCGCTGCCGGGCGTGTCGCACCTACATCAACCCCTTCGTTCACTTCGTCGACTCCAAGAGATGGAAATGCAACCTTTGCTACCGGGTCAATGAATTGCCCGAGGAGTTCCAGTATGACCCCGTGAGCAAGTCCTACGGCGACCCGTCCCGGAGGCCGGAGGTGAAGTCGGCGACGATAGAGTTCATCGCGCCCAGCGAGTACATGCTGCGACCGCCGCAGCCGGCCGTCTACCTGTTCCTGTTCGACGTCTCGCAGAACGCCAGGGAGTCGGGATACCTGCAGGTGGTGTGTGATACTCTGAAGTCTAACCTGGAACAGCTGCCGGGGGACGCCCGCACTCAAGTCGGCTTCATCTGTTACGACGAGCACATTCACTACTACCTCATGAGTGACGGACTCTCCAAACCAAGGGAAATGACCGTTTTGGACGTTGAAGAGGTGTTCCTCCCGTCGCCGGAGTCGCTGCTGGTGAACCTACTGGAGCATCGTGCGATCGTCGTGGAGCTGCTGTCGGTGCTGCCCCGCCGGTACAGCTCGCCCACCGCCCCCGCCAGCGCCCTCGGCCCGGCGCTCCAGGCCGCTTACAAGCTCATGGCCCCGACCGGAGGCAGGGTGACCGTGTTCCAAACGTGTCTGCCTAACGTGGGCCCCGGCGCATTAGAGTCCAGGGAGGACCCCAACGCCCGCTCGGCCCAGGACGTGCGTCACCTGTCCCCGGCCACGGACTTCTACAAGCGGCTGGCGTTAGACTGTTCGGGCGTGCAGGTGGCCGTGGATCTGTTCCTGCTCAGCTCGCAGTACTGCGACCTCGCTACCATCAGCGGTATGAGTAAGTTCAGCGCGGGTACGGTGCACCACATCCCGCTGTTCCGCGCCAGTCGGCAATGGCAGGCGGAGCTGCTGACGCGCATGCTGTGTCGCTACCTCACCCGCAAGATAGGCTTCGAGGCCGTCATGAGGGTCCGCTGCACGAGAGGGATATCCATCCACACGTTCCACGGCAACTTCTTCGTGCGCTCCACGGACCTGTTGTCTCTGGCCAACGTGTCTCCGGACGCTGGGTTCGCTATGCAGCTCAGCATCGACGAGTCGCTGACGGAGCTGCAGCACGTGTGCTTCCAGGCCGCTCTGCTGTACACCAGCAGCAAGGGTGAGCGTCGTATCCGCGTGCACACGCTGTCGCTCCCGGTGGCCAGTACGCTGCCGGATGTGTTGCACTCCGCGGACCAGCACGCCGTCATCGGTCTACTCGCTAAGATGGCCGTGGACCGCTGCGTGTCGGCGTCCATGTCGGAAGCGAAGGAGGCTCTGATGAACGCCGCAGTGGATATGTTGAGCGCCCACCGCCTCGCCCACAGTTTGCCCACGGGTGACCAGAGCGCCTCCCTGCACGCGCCCTGGTGCGAGTCTGATGACGGCGAACTGATACTGGCGCTGCTTAAACGGAAAGCATTCCGCACGGGCACGTCGACTCGCCTGGACGAGCGCGTCTCCGACATGCTGTTCCTGAAGACCGCGCCGCTGGCGAGTCTCCTGCGGGCCGTGCACCCCGACCTGTACGAGCTGCACACGCTCTCCAGCCAGCACCAGCCGCCGCGGCTCCAGCTGTCCGCCGAGAGACTGAGCCTGGACGGCGCGTACCTCCTGGACGAGGGGGAGACCATGGTGATATACGTGTGTCGGGGGGTCAGCGCCGCCTGGCTGTCCGAGGCGCTGGGGGTCAACTCGTTCGCGGAGCTGCCGGCCGAGGGTCGCGACCTGCCGCACATAGACACCGGCCTCAACGACCTGCTGCACGGGTTCATCGACCGCCTCAACGAGGACCGCCCTTACGCCGCCGGCCTGCTACTGCTCAGAGACGACTCGCCGTCCCGCCAACTGTTCACGGAGCGCCTGGTGGAGGACCGCGTGGAATCCGCCTTCTCCTACTACGAGTTCCTGCAACACCTCAAGAGCCAAGTGAAATGA

Protein sequence:

>DPOGS211395-PA
MNYYNKYINTGLKIIDCLSLNTRNLATYSSNMHSPILDLKEGKVRGRIRKLDNGKEFYSFKGIPYAQPPVGSLRFKVVYGSDELVKAPLPPKPWSHVLDAAEHGATCPQWDMVALKFRKGEENCLFINVYTPTLQTDSKLPVMVYIHEGGFQSGSGNERMYGPNFLIRHDVILVLFNCRLDLLGFLS-