Monarch geneset OGS2.0

DPOGS209884
TranscriptDPOGS209884-TA1350 bp
ProteinDPOGS209884-PA449 aa
Genomic positionDPSCF300049 - 496693-498804
RNAseq coverage380x (Rank: top 32%)
Annotation
HeliconiusHMEL0035363e-11260.49% 
BombyxBGIBMGA004205-TA5e-10354.32% 
DrosophilaCG10175-PC7e-3130.86% 
EBI UniRef50UniRef50_D3GDN11e-9451.99%Antennal esterase CXE20 n=2 Tax=Spodoptera RepID=D3GDN1_SPOLI
NCBI RefSeqNP_001124352.11e-4432.93%alpha-esterase 41 [Bombyx mori]
NCBI nr blastpgi|3135062482e-9452.62%antennal esterase CXE20 [Spodoptera exigua]
NCBI nr blastxgi|3135062484e-9352.62%antennal esterase CXE20 [Spodoptera exigua]
Group
KEGG pathwaycfa:4919474e-26 
 K07378 (NLGN)maps-> Cell adhesion molecules (CAMs)
InterPro domain[124-422] IPR0020182.6e-46Carboxylesterase, type B
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209884-TA
ATGAAGTCTTACTGGCTGATATTATGGTCGCTATGGGCAGCGCGACTTGTACGGCAACCAACATTGCCTGTGCGAATATCTGGTGGCCTGGTCCGTGGTAGCCTGGCAGCTGATGGATCATATGTCAGCTATCTAGGTCTACCCTATGCTTCATATCAAAATAGGTTTCAGCCTTCTCAGCCATCTGCAGAATGGAACGGCGTATTCCAAGCCACCGAGGAGCATGTACGATGCTCTCAGAGATTCTCTACCACTTGGATAAACGGCCAGGAAGACTGTCTCACTCTCAACGTATATACTCCGATACCATATGTGACAACAGAGAAACTGCTGCCAGTTATGGTTTTCATTCACGGAGGAGGATACAGAGATGGATTATTTCACAGAGCAATAATGCAAAGCGGTTCCGCCATTTCACCCTGGAGTTTACAATTCGAGCCATTTAAAACGGCTTCATATTTAGCTCATCAAATTGGATATAAATTAGAAGATCCTACAGAAATTCTTAAATTACTATCAAAAATATCCGTCAAAGAATTACTATCCACTCGTATTCCCAGAGCGAAGGGTGACGTCATATTGTCAGAAAATATATTTGTCCCGTGTATTGAAAAAAATAATCTCGATGAAGAAAAATTCCTAACCGATTCCCCGTATAATTTAATTTCGAAAGGTCAATTTAACAAAGTTCCAATAATAATGGGTTATAACAACGCCGAAGGCTATATGTTTATTGGAAAAGAAAATAAGACTACTTTGGAGACATTCAATTTTTACGACGCTTTACCGAGAGATTTAGAATTTCCGTTAGCATCAGAAAAACATGTCGTTGCAAAAAAAATGGAAAACCAATATTACAAGAGAAAAGATTTTGGAAATGACAAACTGATTAGAGTAGCGAAATACGAAGGTGACTCTGGAATCGGATATCCAGTGACTGCGACAATAGAATTACTTTCAGCCAGTATGGAGTATCCGGTGTATGCTTATAAGTTTTGTTATGATGGATGGATGAATCTTATCAAAATGTTGTTCAGGCTTTGGAAGTATCCCGGAGCGACGCATGCTGATGATTTATTTTACATATTTAAAGTGAAGGCGACGCTTCCACAATCATTTATAGAAAAGAATATTATTGACAGAATGGCATTAATGTGGACTAACTTCGCTAAATACGGCAATCCTACCCCTGGGACTCAAGAGCAACTACCCAAAAAATGGTATTCAATAAATAGAAAAAATCCTCAACTCTATGTCATAGACAAGGAGTTCTCCTCGGCAGGTCTGTGGGATGACGAAGATTTAAGATTATGGAATGAAACCTACTCCAAATACAGAAGAAGAAAATAA

Protein sequence:

>DPOGS209884-PA
MKSYWLILWSLWAARLVRQPTLPVRISGGLVRGSLAADGSYVSYLGLPYASYQNRFQPSQPSAEWNGVFQATEEHVRCSQRFSTTWINGQEDCLTLNVYTPIPYVTTEKLLPVMVFIHGGGYRDGLFHRAIMQSGSAISPWSLQFEPFKTASYLAHQIGYKLEDPTEILKLLSKISVKELLSTRIPRAKGDVILSENIFVPCIEKNNLDEEKFLTDSPYNLISKGQFNKVPIIMGYNNAEGYMFIGKENKTTLETFNFYDALPRDLEFPLASEKHVVAKKMENQYYKRKDFGNDKLIRVAKYEGDSGIGYPVTATIELLSASMEYPVYAYKFCYDGWMNLIKMLFRLWKYPGATHADDLFYIFKVKATLPQSFIEKNIIDRMALMWTNFAKYGNPTPGTQEQLPKKWYSINRKNPQLYVIDKEFSSAGLWDDEDLRLWNETYSKYRRRK-