Monarch geneset OGS2.0

DPOGS211843
TranscriptDPOGS211843-TA1857 bp
ProteinDPOGS211843-PA618 aa
Genomic positionDPSCF300011 - 1380758-1386627
RNAseq coverage591x (Rank: top 22%)
Annotation
HeliconiusHMEL0058820.065.54% 
BombyxBGIBMGA013812-TA0.057.35% 
DrosophilaCG4757-PA2e-11742.37% 
EBI UniRef50UniRef50_B2ZDZ00.061.51%Carboxylesterase CarE-12 n=13 Tax=Obtectomera RepID=B2ZDZ0_BOMMO
NCBI RefSeqNP_001121191.10.061.51%integument esterase 2 [Bombyx mori]
NCBI nr blastpgi|1891816800.061.51%integument esterase 2 precursor [Bombyx mori]
NCBI nr blastxgi|1891816800.061.82%integument esterase 2 precursor [Bombyx mori]
Group
KEGG pathwaydse:Dsec_GM175584e-76 
 K03927 (PNBA)maps-> Drug metabolism - other enzymes
InterPro domain[82-610] IPR0020182.7e-130Carboxylesterase, type B
Orthology groupMCL11854 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211843-TA
ATGTTAAAGACAATTGCGAAACCAAGACTTAATAAAACACTCGCCAACCGCAACGAGCGGAACAGGAAAAAAGTCAATAAATCCCTATCGGTGACGTGTTGTCAGCAACATGTTAGTGTATCGAGTCCAGCGGTCAGGAGCGAGCACAGCGCCGCGCGCTCTCTTGTCGCGATGAAGTCTCCCGGAGCTGTTCTGGTCTTCCTCTCCGCCGTGTTCCCCGGACTGTACTGTGAGGACCGGCCGAGGACGGATGTGAAACAACCGGTTGTCAAATGTGAAGCCGGTGAGTTCTTCGGTCGTTACGAAGTGTCGCGGAGAGGTCGCCGATTCGAGTCGTATCGCGGCATGAGGTACGCAGAGCCGCCCGTGGGAGAGTTACGTTTCCAGCCTCCAGTTGCTATCGCGCAGTACTTGGATCCAGTGAATGCCAGCGTGGACGGTCCGGCGTGTCCTCACAGGACTCGCCCCGGATCATACGTCAGTGAAGATTGTCTTAGGATTAACGTGTACAGTCCTCTCAGAGAGGACAGAAGCAGTCCCCTGCCCGTGGTCGTGTACATTCACGCCGGAGGTCTGTACTCCATGACGGCCCGCAGTGACCTGGCGGGACCCCACGTGCTGCTGGACAGAGACCTGGTGCTCGTCACCTTTAACTACCGACTTGGATCATTAGGTTTCTTGAGCACCGGCGATGCTCTGGCACCAGGTAACAACGGCTTCAAGGACCAGGTGGTGGCACTCCGCTGGGTCCAAAGAAACATAGCCGCCTTCGGTGGAGACCCGACCAGCGTCACAATCGCTGGCTGTAGCGCCGGCTCTCTCAGTGTACTTCTTCATATGGTGTCACCGATGTCTAAAGGCCTGTTTCACCGTGCAATATCAATGAGCGGATCTCCGATTAGCAAAGCTCCTCTGACGACCGATTTATATCAGCTCGCCGTTAAACAAGCTCAGCTTCTCGACTGTCCGACAAACAACTCCAAAATTATTATTGACTGTTTGAAGACTAAGTCCTGGCGGGACCTTGGAACTTCGGTTCTTGGTTTCTATGAATTTGCTTTTGATCCAGTACGTATTTGGAATCCGGTTGTCGAAAAAGACTTCGGTCAAGAACGCTTCTTGGACATGCAGCCGACTGATTCTATCCGCGAAAACAAGATCCATGCTGTTCCGTACATCGTGAGCCAGACACAAGGAGAGTTCTTCTGGATGGCATTTACCGTTCTTAAAAACAAAACATTAACGAACCGAATGAATTCGGAGTGGGAATCGATCGCTCCTATATCTTTCTTACTTCCGAGAGAAAACAAAACTTACGCAAAATACGCAGCAAGAAGACTTCGCACCCAATATCTAAAAGATAAAGATTTATCCAATGAAGAAGAGAGTGTTCAAAACTTAGGTTTGTTATATGCCGATGCGATCGTAAGTTTTCCAGTGCACAGAATGTTTAATCTCATGGTGCGTCACTCGCCGCAGCCTGTCTGGTACTACGAGTTCAGCTTCGTCGGCAATCACAGTCACTATGAGGACCCCGTCACTAAAAAGGCCATCGCGGCCGCACACCATGACGACCTTATATACCTCTTCTCTCTGAGCTACCGTTTCCCCGCCCTGGGGACGGACGGCGAAGACGCTGCCATGGCCGACCGCCTCACCGCCATCTGGTACAACTTTGCCAAATACGGTGACCCCAACCCCCGCGGAGACAGCCGCGAGTTGTCAGGTCTGCGCTGGCCGCAAGCCACCCCTGGACGTCGCACCTACTTACGAATCGACAACCAACTGTCCCTCCACGAAAATTTGAAAGAAGATCGAATGAATATTTGGGAGGAATTGTACCCCTTAGAGTACTGA

Protein sequence:

>DPOGS211843-PA
MLKTIAKPRLNKTLANRNERNRKKVNKSLSVTCCQQHVSVSSPAVRSEHSAARSLVAMKSPGAVLVFLSAVFPGLYCEDRPRTDVKQPVVKCEAGEFFGRYEVSRRGRRFESYRGMRYAEPPVGELRFQPPVAIAQYLDPVNASVDGPACPHRTRPGSYVSEDCLRINVYSPLREDRSSPLPVVVYIHAGGLYSMTARSDLAGPHVLLDRDLVLVTFNYRLGSLGFLSTGDALAPGNNGFKDQVVALRWVQRNIAAFGGDPTSVTIAGCSAGSLSVLLHMVSPMSKGLFHRAISMSGSPISKAPLTTDLYQLAVKQAQLLDCPTNNSKIIIDCLKTKSWRDLGTSVLGFYEFAFDPVRIWNPVVEKDFGQERFLDMQPTDSIRENKIHAVPYIVSQTQGEFFWMAFTVLKNKTLTNRMNSEWESIAPISFLLPRENKTYAKYAARRLRTQYLKDKDLSNEEESVQNLGLLYADAIVSFPVHRMFNLMVRHSPQPVWYYEFSFVGNHSHYEDPVTKKAIAAAHHDDLIYLFSLSYRFPALGTDGEDAAMADRLTAIWYNFAKYGDPNPRGDSRELSGLRWPQATPGRRTYLRIDNQLSLHENLKEDRMNIWEELYPLEY-