Monarch geneset OGS2.0

DPOGS202609
Transcript: DPOGS202609-TA; 1374 bp
Protein: DPOGS202609-PA; 457 aa
Genomic position: DPSCF300140 - 35048-45506
RNAseq coverage: 197x (Rank: top 47%)
Annotation
Heliconius: HMEL010438; 7e-122; 97.20%
Bombyx: BGIBMGA003321-TA; 4e-86; 36.66%
Drosophila: Ace-PA; 3e-152; 58.60%
EBI UniRef50: UniRef50_Q27677; 0.0; 66.38%; Acetylcholinesterase n=88 Tax=Pancrustacea RepID=ACES_LEPDE
NCBI RefSeq: NP_001108113.1; 0.0; 92.34%; acetylcholinesterase type 2 [Bombyx mori]
NCBI nr blastp: gi|55793189; 0.0; 95.19%; acetylcholinesterase [Helicoverpa assulta]
NCBI nr blastx: gi|55793189; 0.0; 95.19%; acetylcholinesterase [Helicoverpa assulta]
Group
Gene Ontology: GO:0004104; 6e-12; cholinesterase activity
KEGG pathway: tca:659368; 0.0
 K01049 (ACHE) maps to Glycerophospholipid metabolism
InterPro domain: [1-388] IPR002018; 3.7e-117; Carboxylesterase, type B
 [215-223] IPR000997; 6e-12; Cholinesterase
Orthology group: MCL16654; Insect specific

Nucleotide sequence:

>DPOGS202609-TA
ATGAGTGGAACAGCTACACTTGACCTGTATAAAGCAGATATAATGGCTTCATCGAGTGATGTTATAGTAGCGTCAATGCAATATAGAGTTGGTGCATTTGGATTTTTATATTTAAATAAATATTTTTCTCCTGGAAGTGAGGAAGCACCCGGAAATATGGGTCTTTGGGATCAACAAATGGCTATTCGCTGGATAAAAGATAATGCTAAAGCATTTGGGGGCGATCCGGAGTTAATAACTCTTTTCGGAGAGTCTGCTGGAGGAAGTAGTGTGAGCTTGCATATGTTATCCCCGGAAATGAAAGGATTATTCAAACGAGGAATCTTGCAATCTGGAACTTTAAATGCTCCATGGAGTTGGATGACTGGAGAGAGAGCTCAAGATATTGGAAAAGTTTTAGTCGATGACTGCAATTGTAACAGCAGTTTATTAAGTGCGGACCCAAGTTTAGTTATGGATTGCATGCGAGGTGTTGATGCCAAGACGATTTCAGTACAGCAATGGAATTCTTATACTGGTATTTTGGGTTTTCCGTCGGCACCAACTGTGGATGGAATATTTTTGCCAAAAGATCCCGATACTTTGATGAAAGAAGGTCATTTCCATAATACAGAAGTGCTTCTCGGAAGCAACCAAGACGAAGGGACATACTTTCTGCTTTACGATTTTCTCGACTACTTCGAAAAAGATGGTCCAAGTTTCCTGCAAAGGGAGAAGTTTTTGGACATAATAGACACCATTTTTAAGGAGTTCTCTAAAATTAAAAGAGAAGCTATTGTATTTCAGTACACCGATTGGGAGGAAATCACCGACGGCTATTTGAACCAGAAGATGGTTGCTGACGTCGTAGGGGATTATTTCTTCGTGTGCCCAACAAATTATTTTGCTGAAATTCTCGCTGATTCTGGTGTTGACGTTTACTATTACTATTTTACACACAGAACTAGTACAAGTCCTTGGGGTGAATGGATGGGAGTGATTCATGGGGATGAAGTGGAATATGTCTTCGGGCATCCCCTGAACATATCTTTACAGTACCATACTCGTGAACGTGACCTGGCTTCTCACATAATGCAATCATTCACCAGATTTGCTTTAACCGGTAAACCTCACAAGCCTGATGAAAAATGGCCACTTTACTCTCGGACAGCCCCTCACTATTACACTTACACAGCTGATGGCCCCAGTGGCCCGGCCGGCCCGCGTGGTCCCCGAGCTTCCGCCTGCGCCTTCTGGAACGACTTCCTCAACAAGCTTAATGAACTGGAGCACGTGCCATGTGATGGAGCCGTCACCGGACCATATAGCAGCGTCGCTGGCACTACGCTACCCATCGTACTACTAACTACTCTTGCGATTTCTGTAGCACTTTAA

Protein sequence:

>DPOGS202609-PA
MSGTATLDLYKADIMASSSDVIVASMQYRVGAFGFLYLNKYFSPGSEEAPGNMGLWDQQMAIRWIKDNAKAFGGDPELITLFGESAGGSSVSLHMLSPEMKGLFKRGILQSGTLNAPWSWMTGERAQDIGKVLVDDCNCNSSLLSADPSLVMDCMRGVDAKTISVQQWNSYTGILGFPSAPTVDGIFLPKDPDTLMKEGHFHNTEVLLGSNQDEGTYFLLYDFLDYFEKDGPSFLQREKFLDIIDTIFKEFSKIKREAIVFQYTDWEEITDGYLNQKMVADVVGDYFFVCPTNYFAEILADSGVDVYYYYFTHRTSTSPWGEWMGVIHGDEVEYVFGHPLNISLQYHTRERDLASHIMQSFTRFALTGKPHKPDEKWPLYSRTAPHYYTYTADGPSGPAGPRGPRASACAFWNDFLNKLNELEHVPCDGAVTGPYSSVAGTTLPIVLLTTLAISVAL-
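
The transcript and protein lengths in this record are consistent with each other: 1374 bp corresponds to 458 codons, that is, 457 residues plus one stop codon, which the protein sequence shows as a trailing "-". The following is a minimal Python sketch of how that relationship could be checked, assuming the standard genetic code applies; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database. Only the first and last few codons of DPOGS202609-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (third base varies fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Assumes the sequence starts in frame and uses the standard genetic code.
    Stop codons are written as stop_symbol, matching the record's "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# First nine codons and last three codons of DPOGS202609-TA.
print(translate("ATGAGTGGAACAGCTACACTTGACCTG"))  # MSGTATLDL
print(translate("GCACTTTAA"))                    # AL-

# Length check using the values stated in the record.
transcript_bp = 1374
protein_aa = 457
print(transcript_bp == 3 * (protein_aa + 1))     # True
```

Passing the full nucleotide sequence to `translate` should give the full protein sequence, including the final "-", if the record is internally consistent.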