Monarch geneset OGS2.0

DPOGS209775
Transcript        DPOGS209775-TA  1218 bp
Protein           DPOGS209775-PA  405 aa
Genomic position  DPSCF300397 - 9248-13230
RNAseq coverage   99x (Rank: top 61%)
Annotation
Heliconius      HMEL014972        8e-97   48.17%
Bombyx          BGIBMGA010291-TA  5e-144  60.09%
Drosophila      CG42249-PC        9e-69   34.54%
EBI UniRef50    UniRef50_B3A0N5   1e-82   37.33%  Apyrase n=3 Tax=Tabanidae RepID=APY_TABYA
NCBI RefSeq     XP_001661610.1    4e-84   39.50%  apyrase, putative [Aedes aegypti]
NCBI nr blastp  gi|157129105      8e-83   39.50%  apyrase, putative [Aedes aegypti]
NCBI nr blastx  gi|157129105      2e-82   38.69%  apyrase, putative [Aedes aegypti]
Group
Gene Ontology    GO:0016787  9.3e-114  hydrolase activity
                 GO:0009166  9.3e-114  nucleotide catabolic process
KEGG pathway     aag:AaeL_AAEL000575  2e-81
                 K01081 (E3.1.3.5)  maps-> Purine metabolism
                                           Nicotinate and nicotinamide metabolism
                                           Pyrimidine metabolism
InterPro domain  [1-400]    IPR006179  9.3e-114  5'-Nucleotidase/apyrase
                 [232-400]  IPR008334  2.3e-19   5'-Nucleotidase, C-terminal
                 [5-139]    IPR004843  4.2e-07   Metallophosphoesterase domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209775-TA
ATGAACATGGTCCATCACGATGCTCATGTGTTAGGAAATCATGAGTTTGATAATGGCATTGAAGGTATTGTTCCATACCTTCAACATCTCCAATCTCAAGTTGTTACCGCTAATATTATCGACGACGATGAACCAACGATACAAGGACTGTACAAGCCTAGCATTGTAGTCGAAAAGGGAGGTCGGAAGATTGGTATAATAGGTGTTATAATATCGAGTACTGACGAACTCGCAAGTACAGGAAATTTAAAGTTTACGGACGAAGTTGAAGCTGTTAGACGAGAAGCTGAAAAGTTAAACGAGCAAGGAATAAATATCATTGTGGTTCTATCACACTGTGGGATAGATATAGATCGTAAAATAGCCCTTAATGCTGGTCCCCATATAGACATAATTGTCGGAGGTCATAGTCATACACTTTTATCAAACAGTGACCCTCCAGAAGGTTCGACTTGGACTCCATTGGGACCTTATCCCGTTGTCGTAGAACAGACAGCAAGATCTGTATTGATCGTACAAGCTGGAGCTCATACAGCATTTTTAGGAGAAATTAAACTTAATTTTAATGATAACGGAGATCTTATCAGTTGGGTTGGTGATCCTCATTATATAGGCAACAATGTTCTTCCAGCTCCTGACGTCTTAGAAAAGATTAATGAGTATCTGCCAGTTATAACTGAACAAGCAACTGAATTGATTGGAGTCTCTAAAGTCCACATGTCCTCAAGATGTAATTGCGGAGAATGCAATCTTGGTAGTTTTATTTGCGATGCTTTTATGCACGAACAAGCGTGCTTCTTCTCTAACCTTCTCCATGCGAGAAGAGGCCTGTGCTCAACAGTGGGCTCTGATAGGCTGATGATGATGATGATGATTAAATATTCTATGATAATCAATTGTTTTGTGACAGGTATGCGAGTTATATTTGATGGAGCCCGTCCCGTTAACAATAGAGTTGTAAATGCTACAATAAGGTGTAATTATTGTGATATACCAACATACGCGCCTCTGGATCCGAACAAATATTACAAAGTCGTTTCACAATCCTTCATCGGAGGTGGTGGCGATGGATTTAGTATGATATCGAATAATCGCCAGAACGTAGAAGTGTTGGGTGTGGATTACGACATACTGTTACGTTACGTACGGCATCAGTCGCCGATCATGAAGGACTTGGACGGACGGATACTTATAAAAGATCCATGTTTAGAGAACTGA

Protein sequence:

>DPOGS209775-PA
MNMVHHDAHVLGNHEFDNGIEGIVPYLQHLQSQVVTANIIDDDEPTIQGLYKPSIVVEKGGRKIGIIGVIISSTDELASTGNLKFTDEVEAVRREAEKLNEQGINIIVVLSHCGIDIDRKIALNAGPHIDIIVGGHSHTLLSNSDPPEGSTWTPLGPYPVVVEQTARSVLIVQAGAHTAFLGEIKLNFNDNGDLISWVGDPHYIGNNVLPAPDVLEKINEYLPVITEQATELIGVSKVHMSSRCNCGECNLGSFICDAFMHEQACFFSNLLHARRGLCSTVGSDRLMMMMMIKYSMIINCFVTGMRVIFDGARPVNNRVVNATIRCNYCDIPTYAPLDPNKYYKVVSQSFIGGGGDGFSMISNNRQNVEVLGVDYDILLRYVRHQSPIMKDLDGRILIKDPCLEN-
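
Consistency check (illustrative):

The transcript and protein records above can be cross-checked with a short script. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The function name translate and its stop_symbol parameter are hypothetical names chosen for this example.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: the record's trailing "-" stands for the stop codon.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, "*" = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a CDS in frame 0, ending at the first stop codon.

    Unknown codons (for example, ones containing N) become "X".
    The stop codon is written as stop_symbol.
    """
    cds = cds.upper()
    protein = []
    # Ignore any incomplete codon at the end of the sequence.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            protein.append(stop_symbol)
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of DPOGS209775-TA, copied from the record above.
print(translate("ATGAACATGGTCCATCACGATGCTCAT"))  # MNMVHHDAH

# Length check: 405 residues plus one stop codon should give 1218 bp.
print(1218 == 3 * (405 + 1))  # True
```

Passing the full DPOGS209775-TA sequence to translate and comparing the result with DPOGS209775-PA would extend this check to the whole record.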