Monarch geneset OGS2.0

DPOGS206914
TranscriptDPOGS206914-TA1623 bp
ProteinDPOGS206914-PA540 aa
Genomic positionDPSCF300001 - 1466650-1470548
RNAseq coverage24x (Rank: top 78%)
Annotation
HeliconiusHMEL0156270.071.14% 
BombyxBGIBMGA012871-TA0.060.44% 
DrosophilaCG42249-PC1e-10536.75% 
EBI UniRef50UniRef50_E3X3A64e-12241.06%Putative uncharacterized protein n=4 Tax=Nematocera RepID=E3X3A6_ANODA
NCBI RefSeqXP_001661610.11e-12741.74%apyrase, putative [Aedes aegypti]
NCBI nr blastpgi|1571291053e-12641.74%apyrase, putative [Aedes aegypti]
NCBI nr blastxgi|1571291053e-12441.74%apyrase, putative [Aedes aegypti]
Group
Gene OntologyGO:00167873.7e-188hydrolase activity
GO:00091663.7e-188nucleotide catabolic process
KEGG pathwayaag:AaeL_AAEL0005752e-117 
 K01081 (E3.1.3.5)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
    Pyrimidine metabolism
InterPro domain[17-539] IPR0061793.7e-1885'-Nucleotidase/apyrase
[329-539] IPR0083341.2e-415'-Nucleotidase, C-terminal
[25-237] IPR0048432.6e-16Metallophosphoesterase domain
Orthology groupMCL10268 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206914-TA
ATGTCCACGATTGTTGTTATCCTTTTACTCCTTGGAGTTGCGTGTGCGCAAAACAATGGAACATATGAACTAAATATAATTCATTTCAACGATTTTCATGCACACTTCGACGAAATAGCTCTGAACGGCGCCAGTTGCAATCCCTCAGATGGTGATTGCATTGGTGGTTTTTCACGGCTTTACACAGCCATCAGACAGGCGCGACAACTGTCTCCTGACTCTTTGGTGCTTAATGGAGGAGACACATTTCAGGGAACTATTTGGTACAATTTCCTTCGGTGGAATGTCAGCCAGCATTTCATGAATCTGATACCTCATGATGCGCATGTGCTCGGTAACCACGAGTTTGATCATGGAGTCGATGGACTACTTCCTTATTTGGAGCGACTGAATGCTCCAATGTTGGGCGCCAACGTTAACACAACATTCGAGCCAGAACTTGCGAAATATGTAAAGAACCATATTGTTCTGGAAAGAAGAGGACGAAAAATCGGTATAATCGGCATACTGTTACGGACGTTTTCAGCGCCCATTGGCAATATTATAATGGAAGATGAATTGGAAGCGGTCAATCGGGAGGCGGCTATTTTGTCCAAACAAGGAGTTGACATTATTATTGTTCTGTCACATGTCGGATATGCTTCTGATATGAGGTTGGCGTCACGTATGTCATCGGAAGTCGACATTATAGTTGGCGCACACTCTCACAGTTTCCTGAACAATGGGGACGGACCGGATGGAAGCAGACCGGTTGGGGAGTATCCTTCTATAGTTACCCAAGAAAATGGCCATAGAATTCTTGTCGTCCAAGCTTCGGCGTATACGCGCTATTTGGGAGAAATTAAACTATTTTTCAATTCCGAAGGCAGAATAGAATCTTGGTCTGGACAACCCATTTATCTTGGCACTTCTGTTATAAAAGACCCCCAGATGGAGTTACTGCTGGAGCCTTGGCGACGCAAAGTGCGGGCGGTAGGTGATGAAGTGCTGGGAGAGGCTGTAACTCCTCTTCGAAGGGGTTGCTACCGACAGGAATGCAACCTGGGCAGCTGGCTTTGTGATGGACTGCTGGAACAGGTGATGTCTCGAGCGAGTGGTGGTGCCTGGGGTTATGCTCACGTTTGTATGATAAACGCTGGAGGAATAAGAAATCAAATAAACCCTGGAACAATAACAACGGAAGCATTGCTAATGGCCATGCCGTTTGAGAATATCGTGCAGGTCTATGATCTTCGCGGAGAGTATCTCTTGCAAGCGCTGGAGTTTTCAGTGGGAGTCGCGCAAACCGATCCCACCAACTTTTACAGCTCCAGAATGTTACAACTCGCAGGTTTCCGTGTGGTATATAACGCGACGGCACCTATAGGATCTCGTGTGGTATCAGCAGCTGTGCGTTGTGTGCGCTGTGATGTGCCGCGTTATGAGCCATTAAAAAAGGATTCAGTTTACCGCGTACTGACGCAGAACTACATAGGGGATGGCGGCGGCGGATACACGATGCTGTCAGAAAATAGGGAGAACTTAGTAAATCTCGAGATTGATTACGTCATGATACAGAGGTACCTAAGGAAACAGGGCAGCGTTGTGAAGGACCTCGATGGACGAATCCAAATAGTGTATTAA

Protein sequence:

>DPOGS206914-PA
MSTIVVILLLLGVACAQNNGTYELNIIHFNDFHAHFDEIALNGASCNPSDGDCIGGFSRLYTAIRQARQLSPDSLVLNGGDTFQGTIWYNFLRWNVSQHFMNLIPHDAHVLGNHEFDHGVDGLLPYLERLNAPMLGANVNTTFEPELAKYVKNHIVLERRGRKIGIIGILLRTFSAPIGNIIMEDELEAVNREAAILSKQGVDIIIVLSHVGYASDMRLASRMSSEVDIIVGAHSHSFLNNGDGPDGSRPVGEYPSIVTQENGHRILVVQASAYTRYLGEIKLFFNSEGRIESWSGQPIYLGTSVIKDPQMELLLEPWRRKVRAVGDEVLGEAVTPLRRGCYRQECNLGSWLCDGLLEQVMSRASGGAWGYAHVCMINAGGIRNQINPGTITTEALLMAMPFENIVQVYDLRGEYLLQALEFSVGVAQTDPTNFYSSRMLQLAGFRVVYNATAPIGSRVVSAAVRCVRCDVPRYEPLKKDSVYRVLTQNYIGDGGGGYTMLSENRENLVNLEIDYVMIQRYLRKQGSVVKDLDGRIQIVY-