Monarch geneset OGS2.0

DPOGS214119
TranscriptDPOGS214119-TA1272 bp
ProteinDPOGS214119-PA423 aa
Genomic positionDPSCF300014 - 1665084-1671618
RNAseq coverage642x (Rank: top 20%)
Annotation
HeliconiusHMEL0113890.085.25% 
BombyxBGIBMGA006167-TA4e-11591.63% 
DrosophilaHydr2-PA3e-14560.80% 
EBI UniRef50UniRef50_Q240934e-14360.80%Abhydrolase domain-containing protein 2 n=22 Tax=Endopterygota RepID=ABHD2_DROME
NCBI RefSeqXP_966390.13e-15363.64%PREDICTED: similar to GA17475-PA [Tribolium castaneum]
NCBI nr blastpgi|910866315e-15263.64%PREDICTED: similar to GA17475-PA [Tribolium castaneum]
NCBI nr blastxgi|910866318e-15063.68%PREDICTED: similar to GA17475-PA [Tribolium castaneum]
Group
Gene OntologyGO:00040911.6e-74carboxylesterase activity
KEGG pathway 
InterPro domain[1-393] IPR0120201.6e-74AB-hydrolase YheT, putative
[139-269] IPR0000732.6e-07Alpha/beta hydrolase fold-1
Orthology groupMCL13620 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214119-TA
ATGTCTACTGTTCTTCTTGCTGTCATTGCCGTAATTTTGTGCGTTCTTTTCCGAATACTGAATGTTAACAGCCAACCGCACAAGCCCGTAATATATGGCAGGGATAGAAATTTCATTGAAAATATTCTTTTCGTCGCACCGTTTCTGAGTGAACCATATATACCGACACGATTATGGGGTTTTAGTGGACACATTCAGACAATCCTTCATAGTTTGATAGGAAGGGTGCGTTGTCCTTGGCCTATTGGTGGACGAATTTCTCTCATCTTGCCCGATAAGTCAACACTGACATATGATCTCTATGAACCACTAGGAGCAGAACATGAGGATGATGTCACGGTGGCCATTTGTCCCGGAATCGGTAATACGTCGGAGAGTGTGTACATACGTACTTATGTACATTACTCGCAGCGCCACGGCTACCGGTGCGCTGTACTGAATCATATCGGCGCCCTGAACAGCGTGCCTGTGACAGGCGCTAGAATATTCTCGTACGGTCACACGGATGACTTTAATTATATGATAGATAATTTAATGGAGAGATATCCGAATACGAAGTTGATCCTAGTCGGTTTCAGCCTCGGCGGCAATTTGATTACTAAATATTTAGGTGAAGACAGGAAAAGGCCTGATAATATTATAGGTGGAATATCAATATGCCAGGGATACAATGCCATAGACACGATGGTGTATCTTCTTCAGTGGCAGAACTTCCGCCGCTTCTATTTGTACATAATGACAGACAATTACCGTAACATCATAACCCGCCACAAGAGGACGCTGCTTAGCAACGAAATGAAGAACAGATACTCGCTGGACGAGAACATGATTGTGTCAGCTGGTACATTACCGGATCTTGACGAAGCTTACTCCAGGAGAGTGTATGGATTTAACTCCGTGGCAGAGTTGTATAAATGGAGCTCATCGGCGTTCTATCTTCAGAATATAAAGACGCCTATGATATATCTTACGCCGATATATTACTTAATGTCATTAATTATTAACTTCGCCTTGGCCGTTAGAGTTCAAGGTTCCCACGACAAGGTGATGTACCTGGAACTGTCTCACGGCGGTCACCTGGGTTTCTACGAGGGCGGGCTGTTGTACGCTAACCCGGTCACGTGGCTGGACCGAGCGCTGGCGGCCATCGTCGGCGGTCTGATGATGGCACACCACAAATGTGTACCACAAGAGAGCGTTGACGAACCCGACTTAATCAAATCATCCACCATCATATATAGAGATCCGCTGGACAAGGCACTCGTATTGTAG

Protein sequence:

>DPOGS214119-PA
MSTVLLAVIAVILCVLFRILNVNSQPHKPVIYGRDRNFIENILFVAPFLSEPYIPTRLWGFSGHIQTILHSLIGRVRCPWPIGGRISLILPDKSTLTYDLYEPLGAEHEDDVTVAICPGIGNTSESVYIRTYVHYSQRHGYRCAVLNHIGALNSVPVTGARIFSYGHTDDFNYMIDNLMERYPNTKLILVGFSLGGNLITKYLGEDRKRPDNIIGGISICQGYNAIDTMVYLLQWQNFRRFYLYIMTDNYRNIITRHKRTLLSNEMKNRYSLDENMIVSAGTLPDLDEAYSRRVYGFNSVAELYKWSSSAFYLQNIKTPMIYLTPIYYLMSLIINFALAVRVQGSHDKVMYLELSHGGHLGFYEGGLLYANPVTWLDRALAAIVGGLMMAHHKCVPQESVDEPDLIKSSTIIYRDPLDKALVL-