Monarch geneset OGS2.0

DPOGS206985
TranscriptDPOGS206985-TA1593 bp
ProteinDPOGS206985-PA530 aa
Genomic positionDPSCF300001 + 449014-457369
RNAseq coverage868x (Rank: top 15%)
Annotation
HeliconiusHMEL0021270.079.77% 
BombyxBGIBMGA014198-TA5e-14651.99% 
DrosophilaCG42249-PC1e-10138.40% 
EBI UniRef50UniRef50_E0A9220.070.19%Apyrase n=3 Tax=Obtectomera RepID=E0A922_HELZE
NCBI RefSeqXP_001648679.11e-11442.05%apyrase, putative [Aedes aegypti]
NCBI nr blastpgi|3020259150.070.19%apyrase [Helicoverpa zea]
NCBI nr blastxgi|3020259150.070.19%apyrase [Helicoverpa zea]
Group
Gene OntologyGO:00167871.8e-180hydrolase activity
GO:00091661.8e-180nucleotide catabolic process
KEGG pathwayaag:AaeL_AAEL0005754e-114 
 K01081 (E3.1.3.5)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
    Pyrimidine metabolism
InterPro domain[13-529] IPR0061791.8e-1805'-Nucleotidase/apyrase
[327-528] IPR0083344.1e-395'-Nucleotidase, C-terminal
[21-233] IPR0048431.2e-15Metallophosphoesterase domain
Orthology groupMCL26180 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206985-TA
ATGCTTGGCGTTGTTTTAACTTTCGCTTTGCCATTCGAAGGATTGTTTCCAGTGGATTTGATACATTACAATGATTTTCATGCGAGGTTCGAAGAAACGTCAGTTGAGACGCCAACTTGTCGATTTAACAACAATTCCTGTATTGGTGGCCTGCCGAGACTCTTCCAGAAGATAGAAGACTTACGAAAGGAGAAACCAGATTCCATCTTACTGAACGCCGGAGATAGCTTTCAGGGGACGTATTGGTATACTCTTCTTAAATGGAATGTTACACAAGAGTTTATGAATCTTTTGCCCCATGACGCTCATGCTATCGGAAACCATGAGTTTGACGATGGACCACAAGGGTTGGCTCCATATCTCCAAGCTCTCAAAGCACCAGTACTTGCTGCCAACATGGACGCCAGTAAGGAACCAATTTTACAAGGTCTCTATAGAGGTCATGTTATCATAGAACGAAGAAAAAGAAGGATTGGACTCATTGGATTAATTACTCCTGATACAAAAATATTATCATCCGCCGGTAATGTAGAGTTCACTGATCCTGGCGAGGCGATGAGACGAGAAGCTAAGTGGCTTAATGAGAAAGGTGTAGACATCATCATTGTGCTCTCACATTGTGGCCTCGAAGTCGACAAGACATTAGCACGCGATTACGGCAAACACGTGGACATAATAGTGGGCGGACATTCCCACTCTTTACTCTGGAACGGTCCCTCTCCTAGCGGGGAAGACGTTGCCGGTCCATATCCCGTTTTTGTTCAATCTACTGCCACGACCAAACATAAGGTTTTAATAGTACAAGCATCAGCCTTCACCAAATATATGGGTAACTTGACAGTGTATTTCAATTATAGAGGTGACTATGTTAAATGGGAGGGAGGACCGGTTTTCCTTGACAGATCTTTACCGGAAGATAAAGAGATAAAAGCAAAGCTAGCGCCTTACGCAGCCATGGTGCATGCAGCTGAAAAGGAGATAGTGGGTGAAACATCTAAAACACTCCACTTTGAGGAGTGTGTGTCTGGGGAATGCGCTTTGGGAGATCTATTGGTTGATGCAATGACAGAATATGGAAAATCTTTGAAACCTGACTTGCATTACGTTGGTTTTATTCAGCGCGGGAACATAAAGTCTTCTATTCCGAGCGGAAATATAACGAAAGGGGTCATATTCGAACTTTTGCCGTTTAACGACCGTATTGAGATTTTCGAGTTACAAGGCAAAGATATATTGAAAGCCTTGGAGAGAAGCTTTTCCGGAGCCTGGAACATTAATCCGTTTAAGGGTCCCTACGTGTTACAACTGTCTGGTCTTCAAGTGACATACAACGTGTCACTACCTGAAGGGCAGCGTGTGAAATCAGTATTCGTTGGCCACAGTAAATCTAATATATCCCTAGATCCACATATATATTATCACGTGATAGCACCAGCATACTTGTCGGACGGAGGAGACGGATTTAACATGTTCAAAGAAGGAAAACGAAATACCGAGATCGTTGGTCGAGATGAGAAAGTTTTGGAACTTTACATAAAGAAGCACTCTCCGTTAAACATCAACACGGACGGACGGATTTTCGTCAATTATTGA

Protein sequence:

>DPOGS206985-PA
MLGVVLTFALPFEGLFPVDLIHYNDFHARFEETSVETPTCRFNNNSCIGGLPRLFQKIEDLRKEKPDSILLNAGDSFQGTYWYTLLKWNVTQEFMNLLPHDAHAIGNHEFDDGPQGLAPYLQALKAPVLAANMDASKEPILQGLYRGHVIIERRKRRIGLIGLITPDTKILSSAGNVEFTDPGEAMRREAKWLNEKGVDIIIVLSHCGLEVDKTLARDYGKHVDIIVGGHSHSLLWNGPSPSGEDVAGPYPVFVQSTATTKHKVLIVQASAFTKYMGNLTVYFNYRGDYVKWEGGPVFLDRSLPEDKEIKAKLAPYAAMVHAAEKEIVGETSKTLHFEECVSGECALGDLLVDAMTEYGKSLKPDLHYVGFIQRGNIKSSIPSGNITKGVIFELLPFNDRIEIFELQGKDILKALERSFSGAWNINPFKGPYVLQLSGLQVTYNVSLPEGQRVKSVFVGHSKSNISLDPHIYYHVIAPAYLSDGGDGFNMFKEGKRNTEIVGRDEKVLELYIKKHSPLNINTDGRIFVNY-