Monarch geneset OGS2.0

DPOGS207686
TranscriptDPOGS207686-TA1227 bp
ProteinDPOGS207686-PA408 aa
Genomic positionDPSCF300357 + 59890-64445
RNAseq coverage570x (Rank: top 22%)
Annotation
HeliconiusHMEL0149032e-11064.05% 
BombyxBGIBMGA008633-TA1e-15159.40% 
DrosophilaCG5068-PA9e-11351.26% 
EBI UniRef50UniRef50_Q95R981e-11051.26%Protein phosphatase methylesterase 1 n=18 Tax=Neoptera RepID=Q95R98_DROME
NCBI RefSeqXP_001658170.15e-11451.34%hypothetical protein AaeL_AAEL001170 [Aedes aegypti]
NCBI nr blastpgi|1571152631e-11251.34%hypothetical protein AaeL_AAEL001170 [Aedes aegypti]
NCBI nr blastxgi|1571152633e-11051.34%hypothetical protein AaeL_AAEL001170 [Aedes aegypti]
Group
Gene OntologyGO:00040911.3e-139carboxylesterase activity
KEGG pathway 
InterPro domain[1-406] IPR0168121.3e-139Protein phosphatase methylesterase, eukaryotic
Orthology groupMCL13830 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207686-TA
ATGTCTTCATTAGAGAAGAGTATGATGAAAAATAAATTACCACCCCGAATGCCAAGAACGTCAGGTGTTTCTAAACTTGCACCATTCGGAATTTCTCGCCGCAAAGATTACACACCTGTGTCATGGAAGAACTATTTCGAGAAATACGTTGATATTGAGATAGACGCGGGAATCTTCAGAGTGTACCTGTCTCCTGAGCCTGATCATCCACAGAGGCCTAGAATCGTGACATTACACGGCGGAGGATATTCTGCGTTAAGCTGGTCTCTATTTACAGAGGAAATAACGAACATGATTCACTGTCAAGTAGTGTCTATGGACATTCGAGGTCACGGTGAAACAAAGGCCGTTGATCCCGACGACTTAAGCATTGAAACTTTGGTCAAAGACGTGGAACAAGTTCTTCACAAACTGTTTGGTTCCGAGCTCCCGCCTCTAATTCTTCTCGGTCATTCGATGGGTGGTGCTATCGCTGTCCGGACAGGTCACATACCATCGCTGCAACTCGCTGTCCAAGGAGTTGCTGTGATAGATGTGGTCGAAGGTACAGCGATGGAGGCATTGGCTAGTATGCAGAGTTTCCTAAGAAGTCGTCCGACACACTTCAAGAGTATAGAACACGCTATAGAGTGGTGTATAAGGAGCGGTCAAGTGCGGAATGTCGAGTCGGCCCGGGTGTCCATGCCGAGTCAAATTGTGAACTGCGTCACTGGCGAACTAGCGACGAATGAAGTGGAAGACTACAAGGCAGTGGAGTCCCCCCTGGAGCCGACCGCCAGGGGGAGGAGGGATGACGTCATCGCCGAGGAGGGGGAGGGGGAAATGGAGGGAGGGGAGAGCACGGCCGAGCAGAAGGGGGATGGGGGGCCGTTCACTAAGCCATCACCTGTAGATGGGAATATGAGGTATCGTTGGCGAGTGGACCTCTCTCGTTCTGAGCGTCACTGGTCGGGGTGGTTCAGTGGCCTATCGGGTTCGTTCCTCTCAACGCCAGCACCCAGACTGTTACTGCTCGCCTCCGTCGACGGACTGGACAGGGACCTCACAGTCGGACAGATGCAGGGCAAATTCCAGATGCAAGTTCTAACGAGATGTGGTCACGCTGTTCACGAGGACACGCCATCAGAGGTGGCCCGCGTGGTCGCATCGTTCGCGCTACGTCACCGTCTCACAGTCCCGACCGAGACCGGGGAGGACATGAACCTGATCTCCGCCCCCGGATGCTAA

Protein sequence:

>DPOGS207686-PA
MSSLEKSMMKNKLPPRMPRTSGVSKLAPFGISRRKDYTPVSWKNYFEKYVDIEIDAGIFRVYLSPEPDHPQRPRIVTLHGGGYSALSWSLFTEEITNMIHCQVVSMDIRGHGETKAVDPDDLSIETLVKDVEQVLHKLFGSELPPLILLGHSMGGAIAVRTGHIPSLQLAVQGVAVIDVVEGTAMEALASMQSFLRSRPTHFKSIEHAIEWCIRSGQVRNVESARVSMPSQIVNCVTGELATNEVEDYKAVESPLEPTARGRRDDVIAEEGEGEMEGGESTAEQKGDGGPFTKPSPVDGNMRYRWRVDLSRSERHWSGWFSGLSGSFLSTPAPRLLLLASVDGLDRDLTVGQMQGKFQMQVLTRCGHAVHEDTPSEVARVVASFALRHRLTVPTETGEDMNLISAPGC-