Monarch geneset OGS2.0

DPOGS201956
TranscriptDPOGS201956-TA1755 bp
ProteinDPOGS201956-PA584 aa
Genomic positionDPSCF300384 - 13746-17937
RNAseq coverage133x (Rank: top 56%)
Annotation
HeliconiusHMEL0020604e-15555.50% 
BombyxBGIBMGA014599-TA3e-16355.79% 
Drosophilaalpha-Est2-PA7e-6534.57% 
EBI UniRef50UniRef50_B5AK612e-8838.45%Carboxylesterase 4 variant 1 n=25 Tax=Endopterygota RepID=B5AK61_BOMMA
NCBI RefSeqNP_001165227.14e-8938.25%alpha-esterase 48 isoform s1 [Bombyx mori]
NCBI nr blastpgi|3388327392e-8838.82%carboxylesterase [Melitaea cinxia]
NCBI nr blastxgi|3388327391e-8638.79%carboxylesterase [Melitaea cinxia]
Group
KEGG pathwaydpo:Dpse_GA108431e-63 
 K01066 (E3.1.1.-)maps-> 2,4-Dichlorobenzoate degradation
    Tropane, piperidine and pyridine alkaloid biosynthesis
InterPro domain[4-509] IPR0020188.5e-131Carboxylesterase, type B
Orthology groupMCL26208 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201956-TA
ATGTCCTGTCTCGTAAGAGTTTCGGAGGGAATCCTGGAAGGGAAACTGTGTAACACATACTATGGAAAACAATACTACAGTTTTGAGGGCATCCCATATGCGAAACCGCCGATAGGAGACCTCAGGTTTAAGGCTCCAGTGCCACCAGAAAGTTGGACAGGCATCAGGGACGCTAAGAAGCCGGGAGAGAAATGTCCTCAGATGAATCCATATGGAAAAGCAGTTGTAGAAGGTTCTGAAGACTGTTTGTACCTAAACGTGTACACGCCGAGCTTGCCAGATGAAAAAAATCAAAATTTACCGGTAATATTTTTTGTGCACGGCGGCAGATTGGTCCTGGGCTATGGAGACTATTACAAACCGGACTATTTAATAAGAAACGACGTCATTCTGGTGACGATAAACTATAGACTAAACGTCTTTGGATTCCTGTGCTTAGATATCCCCGAGGTCCCGGGGAACGCCGGGCTTAAAGATACAATAATGGCCTTGAAGTGGGTCAAGAGGAATATCAGACATTTTAACGGAGATGATAACAATATTACAGCGTACGGAGAGAGCGCTGGCGCCGCCGTAGTGTCATCGTATCTGACGTCCAAAATGGCGGGAAATTTGTTCAATAAAGTCATCTGTCAATCGGGCGTGTCTGTGTCGGACCTGTTTATAATGATGAGCGACGATCCGGTGAGCAAAGCCAGCGAAATAGCAAAACATTTGGGGCATAGTGTGTCAGATAAAGTGGCCTTATATGAAATATTTAGAAGTACGCCAGTGGACGATCTCGTCAGCGCCTTTGTGAGCGCCGAGATGAGTCGCCCGCCGGCTGTGATTCACGCGTTCCTGATGCCGGTCGTCGAAAGACACTACGATGGTGTCGAGCGATTCTTCGACGAGCTGCCGCTGGTTGCGTTCCGCGAGAACCGGTTCCGCAAAGTGCCGATCATAGTCACCATTAATTCTATGGAGTCCGCTTTATTTGTGAACAAAGATGGCGACGGCGGTATAATTTATGAAGACCTCAAATATTACATACCGAGTTTCTTGCAAATGGATCACGGGACTGAGAGAGCGTCGCGGTTCGTAACCAAGCTACGGGATTATTACTTCGGGGACAGGAGTCTTGATGAGAACGTCAAAATTGACTACCTTAATTTAACATCGGACCATTATTTCGCAAGGGACACGATGTTGTTCGTGGAACTGGTGTCCAAGTACCACAAAGACATATACATGTGTAGATTCGATTATAATGGAAACGTAAATACAAGTACGATCAGAAATATGGGCCTCCAGGGAGCGTCGCACGGAGACCTCGTGCAGTACGTATTCTATAGGAGGAGTAAGTTGAAAGTGGCCAGTGATAGTGATGTGAAAATAATCGATATGTTGACGGAGGCTTGGTGTAATTTCGTTAAATCAGGCCAACCACATTGGAAAACTCAGCAAACAAAATGGCTTCCATATAATAATGAAGAGAAGTTATGCCTCAATATAGACAACCAGCAAATAGAGGTCAAAAAATATCCCAACTTCGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGTGATATTTGTAGGGTCCAGGTTCTCCTTGAAGATCTTCTATCAGCCTCAAATTCCTACAGAACCCTCGTTATCTTCGGATGTAATCAGCTGTTCCTCGAATCTTCCACAGATAATGCTGAATATGGTGACGTGGTCACATCTTGGGGTATACAAAGATTGTGCTAA

Protein sequence:

>DPOGS201956-PA
MSCLVRVSEGILEGKLCNTYYGKQYYSFEGIPYAKPPIGDLRFKAPVPPESWTGIRDAKKPGEKCPQMNPYGKAVVEGSEDCLYLNVYTPSLPDEKNQNLPVIFFVHGGRLVLGYGDYYKPDYLIRNDVILVTINYRLNVFGFLCLDIPEVPGNAGLKDTIMALKWVKRNIRHFNGDDNNITAYGESAGAAVVSSYLTSKMAGNLFNKVICQSGVSVSDLFIMMSDDPVSKASEIAKHLGHSVSDKVALYEIFRSTPVDDLVSAFVSAEMSRPPAVIHAFLMPVVERHYDGVERFFDELPLVAFRENRFRKVPIIVTINSMESALFVNKDGDGGIIYEDLKYYIPSFLQMDHGTERASRFVTKLRDYYFGDRSLDENVKIDYLNLTSDHYFARDTMLFVELVSKYHKDIYMCRFDYNGNVNTSTIRNMGLQGASHGDLVQYVFYRRSKLKVASDSDVKIIDMLTEAWCNFVKSGQPHWKTQQTKWLPYNNEEKLCLNIDNQQIEVKKYPNFXXXXXXXXXXXXXXXXXXCDICRVQVLLEDLLSASNSYRTLVIFGCNQLFLESSTDNAEYGDVVTSWGIQRLC-