Monarch geneset OGS2.0

DPOGS206986
TranscriptDPOGS206986-TA1422 bp
ProteinDPOGS206986-PA473 aa
Genomic positionDPSCF300001 + 465121-468934
RNAseq coverage293x (Rank: top 38%)
Annotation
HeliconiusHMEL0021280.072.82% 
BombyxBGIBMGA012948-TA9e-15254.01% 
DrosophilaCG42249-PC1e-8735.79% 
EBI UniRef50UniRef50_E0A9224e-13550.95%Apyrase n=3 Tax=Obtectomera RepID=E0A922_HELZE
NCBI RefSeqXP_308620.41e-9739.63%AGAP007140-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3020259151e-13450.95%apyrase [Helicoverpa zea]
NCBI nr blastxgi|3020259157e-13351.06%apyrase [Helicoverpa zea]
Group
Gene OntologyGO:00167872.8e-149hydrolase activity
GO:00091662.8e-149nucleotide catabolic process
KEGG pathwayaag:AaeL_AAEL0005755e-94 
 K01081 (E3.1.3.5)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
    Pyrimidine metabolism
InterPro domain[3-473] IPR0061792.8e-1495'-Nucleotidase/apyrase
[272-473] IPR0083342.2e-385'-Nucleotidase, C-terminal
[5-178] IPR0048431.3e-07Metallophosphoesterase domain
Orthology groupMCL26024 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206986-TA
ATGGTCTTAAAGAAAGAGAAGCCGGGTGCACTTCTATTAAACGCAGGCGACACATTCCAAGGAACGTATTGGTATACTCTATTAAAGTGGAATATCACTCAGAAGTTCATCAACTTACTACCCCACGATGCTCATGCAATATTCGCTGTTGGAAATCACGAATTTGATGATGGAGTAGCGGGATTAGCACCTTATCTTGGCGCACTCAAAGCTCCCGTCTTAATCGCAAATATTGACACCAGCTTAGAGCCTTCCCTAGATGGCTTATATCAACCACACGTCGTCATAGAGAAGAACGGAAGGAAAATAGGAATAATCGGTCTTATTACTACAGAAACACAATCAACGTCTAACCCTGGAGAGGTGAAATTTTTAGATCCTATAAGTGTTGTTGAAAGAGAAGCGCAAATTTTGACAGATCAAGGAATTGATATCATCTTAGTATTGTCACACTGTGGACTTGCTATCGATAAGCAGATAGCGGCAACGGTTGGACAGAATATTGACGTTATTATAGGTGGTCACTCTCATAGCCTGCTATGGAATGGACAGTCACCAAGCCATGAGCACATATCAGGGCCTTATCCTGTTTTAGTAGAATCTGAATCAAAACCTCATCACCAAGTGCTCATAGTGACAGCGAGTTGTTTTACAAAATATATTGGAAATTTAACTGTTTACTTTGATGAGATGGGTGATTTGAAGGACTTTGATGGCGTTCCGATATTTCTTAACAGATCTATCCCTGAGAGTCCAGAGGCGAAAGCGATTTTGCAACCTTATTCCAAGAAGTTACACGAACTGGTCAATGAGGTGGTGGGATACGTTGATAAAGATTTTTTGGCGAAGACTTGCGGCTCACGAGAATGTGCTATCGGGGACTTTTTTGCCGATGCATTCGTGAATGAAACTAAAGAGCAAAATATATCAAATCTCACCCATGTAGCTTTCATTTTGAGGAATATGATCCGAGGGTCGATACCTAAAGGAGATATATCAAGAGGCGACATAATTAATGCTTTACCGTTCACTAATAAGGTGGTAACGTTTTCATTATTAGGAAAATATTTAATAGAGGCTTTTAGGAATTGTATGACAAATTATTGGGTTTATAAGCCTTTTGATGGGCCCTGGATGCCGCAAGTGTCAGGAATACGAGTGGTTCTGAATTTAACAGATGATTTAACAATAAAAGTGTTTATTAAGGAAGGAAATGAGTTTTTGCCCCTAGACCCAGACAAGGCATATCAAGTGTCAACTTTAAGTTTCTTAAGTAGAGGAGGAAATGGATTCGATATGCTGAAAAAATACGGTCTCAATAAAACAATAATTGGAAAGGACACAGATATTCTAGAAAAGTACATAAGAAAACGTACACCAATAACACCTACTTTAGACAATAGATTAACTGTAATAAACTAG

Protein sequence:

>DPOGS206986-PA
MVLKKEKPGALLLNAGDTFQGTYWYTLLKWNITQKFINLLPHDAHAIFAVGNHEFDDGVAGLAPYLGALKAPVLIANIDTSLEPSLDGLYQPHVVIEKNGRKIGIIGLITTETQSTSNPGEVKFLDPISVVEREAQILTDQGIDIILVLSHCGLAIDKQIAATVGQNIDVIIGGHSHSLLWNGQSPSHEHISGPYPVLVESESKPHHQVLIVTASCFTKYIGNLTVYFDEMGDLKDFDGVPIFLNRSIPESPEAKAILQPYSKKLHELVNEVVGYVDKDFLAKTCGSRECAIGDFFADAFVNETKEQNISNLTHVAFILRNMIRGSIPKGDISRGDIINALPFTNKVVTFSLLGKYLIEAFRNCMTNYWVYKPFDGPWMPQVSGIRVVLNLTDDLTIKVFIKEGNEFLPLDPDKAYQVSTLSFLSRGGNGFDMLKKYGLNKTIIGKDTDILEKYIRKRTPITPTLDNRLTVIN-