Monarch geneset OGS2.0

DPOGS212827
Transcript | DPOGS212827-TA | 3234 bp
Protein | DPOGS212827-PA | 1077 aa
Genomic position | DPSCF300086 - 189377-194808
RNAseq coverage | 34x (Rank: top 74%)
Annotation
Heliconius | HMEL004479 | 0.0 | 53.63%
Bombyx | BGIBMGA000773-TA | 2e-141 | 52.65%
Drosophila | alpha-Est9-PD | 2e-46 | 27.37%
EBI UniRef50 | UniRef50_G3CM71 | 1e-134 | 53.50% | Carboxylesterase CXE27 (Fragment) n=1 Tax=Spodoptera littoralis RepID=G3CM71_SPOLI
NCBI RefSeq | NP_001121786.1 | 3e-58 | 33.14% | alpha-esterase 3 [Bombyx mori]
NCBI nr blastp | gi|342731432 | 5e-134 | 53.50% | carboxylesterase CXE27 [Spodoptera littoralis]
NCBI nr blastx | gi|342731432 | 2e-129 | 54.11% | carboxylesterase CXE27 [Spodoptera littoralis]
Group
KEGG pathway | dvi:Dvir_GJ10232 | 4e-48
  K03927 (PNBA) maps-> Drug metabolism - other enzymes
InterPro domain | [3-471] | IPR002018 | 2.1e-82 | Carboxylesterase, type B
Orthology group | MCL22693 | Lepidoptera specific

Nucleotide sequence:

>DPOGS212827-TA
ATGATGCTGTTACTTGTGTTGCTCGTTGTTTTCATGACGATCGATGAGTCCGAACAAAACGTAGTGAGCATAGCGCAGGGACGAATACGGGGCGTTCAGAAACGAGGGTACATCACATACGGCGGGATTCCGTATACCACGGTCAGCGGCATGCCAGGGAGATTCAAGAGTGGCGGGCTAGCGCCGGTGTGGCCAGACATAAGGGGATCTCAGATGGGTGATTGTACAACTACGTCACCAGTTGAAGACTGTTTACAACTAGATGTCAACGTACCCCCCGGATCACTTTTACCTGTCATGGTGTGGGTAACTGGTGGTAGTGGACACTACGATCCCACCATGCTTGTCGAACAACAAATTTTAGTAGTGATAGTGCGACACAGATTAGGTCCAACGGGTTTTTTATGTTCGAGGGAAGATAAAATAGCAGGCAATGCTGGTCTCAAGGACGTAGTGTTGGCCTTGCGATGGGTGAGAGATAATATCGTGGCTTTTGATGGAAATCCCAACATGGTGGTGGTCTCTGGGCAGAGCTTTGGGGCCGCTATGGTGCAATCCTTGATATTGTCAAATATGGCCAGGGGGTTGTTCCATGGAGCCATAATACAGAGCGGAAGCGTTTTAGCTCCTTGGGCTTTTAACTTTGATGCCGAACAACGGGCGAAGTTGTTAAAGATGAAATTCAATGAAAGTCGAGATTCTTTTACATTGGTCAGAGCTGGTACAGCGGACCTTGCTAGGAAAGCAGACGAGCTAGACTTCCCATATTTACCCTTCGGTATATGTGTAGAAAACCCCATGAAAAATGAGGAAATTTTTTTACCTGATTCTCCTTTCGACTTACTTTCTAATGGCAAAATAATATCAGTCCCAATTATAATGGGATACACCAACAACGAAGCGTATATATTCTCTTCGATGTTGAAAGAAGTCGGGGTATTGAAAAAGATGGCAAAGGAGATAAACTTTTTAATACCAATAGAACTGCAAACCAGTAAACGGGACGTCCTGCAACTAGCCAAAAAAGTAAAGGAAATGTACTTTCCTGGAAATGTAACTATGAAATCAGTTCTAAAATACCATAGGGATGCGTATTTCTTGAGTCACATCTATAAAAGCATTCGTTTTCTCACGCCGTCGTCGAGTCCTGTATACTTCTACCAGTTCTCCCATAGCGGCTCTGTAGGTGTGGAAGAAGAGCCCGACGTTATCAAGGATGGAGCAGCGCACTCTGATGAACTGGCGTATTTGTTCCCAGACAAGGGACGCAAATTAGATGGTGACGATGGAGACGCCCAGAGAAATGTTGTTAAACTTTGGACAAATTTTGTTAAATATCTAAACCCCACTCCCCAAGGTGAACTGGGCCTGGGATTGGTCTGGGAGCCGTACAGTCCTTACGACAACCGAGTGTTAGCCATTAGCACCGAGTTGGAGATGATCGAATTCCCGTACAATAAGGAGATGCAGATCAGTTTCAGTTCCATTTTGGGGAACGGCACGGCGACGGCGTTTATCGAATCGGAACAACCGAAATATAAGACTTTGATATTGGTGTTCATTTGTGCCTTCGTGGTCTCACGGGCGGTCGGCGAATTATCCGTCATAGACAACGTGGTCACTGTTCAAGATTTAGACGACGATAACAAACACAACAAGGGAAATTATCGACAAGGTATACAAGTTAAAACCGAACAAGGTCTTGTGCGGGGCTATCAGGACGAATCGGGCATCGTGACATTTTTTGACATACCCTATGGAGAATTCAGTAAAGATCAACCATTTCAGGAACCAACTCCGCCGAAACCTTGGGAGGCTGTGCACGACAGAACATCACACTCATCCCGCTGTCCGCAGATACGCAACGGAACCTTCGAAGGCACTCATGAATGTCTCAGTCTGACAGTAATGAAGCTAAACGATGCTGCTAATGCTGATGTCTTATTTCATATCCACGACAGTTCCTTCACTTCCGGAAGTGGCGATCCTCTTGTTTATAA
TCCTAAACATATAGTTCCGAAAGGAGTTATCTTGGTTTTACCGAATTACAGACTTGGACCTCTAGGATTTCTGTGTTTGCAAAATAATACGGTGCCCGGTAATGCTGGCCTTAAGGATCTGACGCTTGCGTTAGATTGGACGAAAAACAATATAGAGTCTTTTGGAGGAAATTCGACTAATATAGTGGTAAGCGGCACCGCAGCTGCAGGGGCATTGGTTGAGTATCTGCTTCTATACAACAGTTCTAATATATATATTTCTAAAGCGATCACTGAAAGCGGATCTGTGCTTTCTCCTTGGGCTGTAGACAGATATCCATTAGATACTGCCAATTTATTTTTTCAAAAGTTAAGGGGACGCCAGATTACAGGAGACCCTCAGCATGTATTGCAAAACGTGGATTTAGAAGTTCTAACGAAAGCTGGTGACGGGATAGACTTTAAACCATGCATTGAATCAGGAGAAGGCTTCATGACCCAATCGCCTTGGGTTTCGTTACAGAATGGAAAGTTTAACTTTTCATACATGACAGGATCAGCGAACCAAGCTGCGATGAGGGAAGTTTTAGAACTTTCAACACAAAGTTTATCACAGATTAATAAAAATTTATCCGAATTGCTACCTAACGATCTCTTCTTCGACAATGATAACGAGAAATCTCGTCTAGCATTAAAAGTGAAAACTATTTACTTCGGCAACAAAGACATCTCTGAAAATAATAGAGAGAAACTTTCGCTATGCTACTCGGATAGCCAATATCTCAATAGTGCAATAAGAACCGCCAGGCTATTAGCTAAGGGAGGTTCTCATGTATATTTTTATGAATTCTCTTTGGATAACCATTCATCGCCTATTAGCGGTTCTGCGCGGGGAGACTCCCTGAATTTTATATTCGGAACAGAAGAAACCACTACGAATAATAACGGAGAAACAAACAAAAATGTAATGAGAGAGATGATGCTACATTTATGGATTAGTTTCATAAAATACGGGAATCCTTCCGCGGAAAATATTACATGGAAGAACTTGAAATATGGGGAACAGAACAGCGAGGAGTGGCTCTCAATTGGCTCGGCGGTGGAGATGAAGAAGGGTCTTCATGTGGAACGGCTGAAGTTATGGGACGACTTATATAACTCTTACTACATTGAGAGGAACAAGGGCTCAGGACACAAACCTGTCAGTTTCTTGGCGATCGTTTCTTGGTCAGTCCTTGCCTTCATTAACCTTTAG

Protein sequence:

>DPOGS212827-PA
MMLLLVLLVVFMTIDESEQNVVSIAQGRIRGVQKRGYITYGGIPYTTVSGMPGRFKSGGLAPVWPDIRGSQMGDCTTTSPVEDCLQLDVNVPPGSLLPVMVWVTGGSGHYDPTMLVEQQILVVIVRHRLGPTGFLCSREDKIAGNAGLKDVVLALRWVRDNIVAFDGNPNMVVVSGQSFGAAMVQSLILSNMARGLFHGAIIQSGSVLAPWAFNFDAEQRAKLLKMKFNESRDSFTLVRAGTADLARKADELDFPYLPFGICVENPMKNEEIFLPDSPFDLLSNGKIISVPIIMGYTNNEAYIFSSMLKEVGVLKKMAKEINFLIPIELQTSKRDVLQLAKKVKEMYFPGNVTMKSVLKYHRDAYFLSHIYKSIRFLTPSSSPVYFYQFSHSGSVGVEEEPDVIKDGAAHSDELAYLFPDKGRKLDGDDGDAQRNVVKLWTNFVKYLNPTPQGELGLGLVWEPYSPYDNRVLAISTELEMIEFPYNKEMQISFSSILGNGTATAFIESEQPKYKTLILVFICAFVVSRAVGELSVIDNVVTVQDLDDDNKHNKGNYRQGIQVKTEQGLVRGYQDESGIVTFFDIPYGEFSKDQPFQEPTPPKPWEAVHDRTSHSSRCPQIRNGTFEGTHECLSLTVMKLNDAANADVLFHIHDSSFTSGSGDPLVYNPKHIVPKGVILVLPNYRLGPLGFLCLQNNTVPGNAGLKDLTLALDWTKNNIESFGGNSTNIVVSGTAAAGALVEYLLLYNSSNIYISKAITESGSVLSPWAVDRYPLDTANLFFQKLRGRQITGDPQHVLQNVDLEVLTKAGDGIDFKPCIESGEGFMTQSPWVSLQNGKFNFSYMTGSANQAAMREVLELSTQSLSQINKNLSELLPNDLFFDNDNEKSRLALKVKTIYFGNKDISENNREKLSLCYSDSQYLNSAIRTARLLAKGGSHVYFYEFSLDNHSSPISGSARGDSLNFIFGTEETTTNNNGETNKNVMREMMLHLWISFIKYGNPSAENITWKNLKYGEQNSEEWLSIGSAVEMKKGLHVERLKLWDDLYNSYYIERNKGSGHKPVSFLAIVSWSVLAFINL-