Monarch geneset OGS2.0

DPOGS201814
TranscriptDPOGS201814-TA1563 bp
ProteinDPOGS201814-PA467 aa
Genomic positionDPSCF300145 + 220051-229631
RNAseq coverage782x (Rank: top 16%)
Annotation
HeliconiusHMEL0035580.078.07% 
BombyxBGIBMGA013245-TA5e-15966.67% 
DrosophilaNTPase-PB1e-8639.91% 
EBI UniRef50UniRef50_Q9VQI82e-8439.91%NTPase, isoform B n=17 Tax=Schizophora RepID=Q9VQI8_DROME
NCBI RefSeqXP_320057.31e-8641.13%AGAP009265-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1187919672e-8541.13%AGAP009265-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1948551852e-8238.59%GG24477 [Drosophila erecta]
Group
Gene OntologyGO:00167876.1e-122hydrolase activity
KEGG pathwayaga:AgaP_AGAP0092653e-86 
 K01511 (ENTPD5_6)maps-> Purine metabolism
    Pyrimidine metabolism
InterPro domain[25-460] IPR0004076.1e-122Nucleoside phosphatase GDA1/CD39
Orthology groupMCL15231 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201814-TA
ATGTCGGAAATAAGGCAAAGAAAGCCCGATATGGGCAAATTCAGTACGTACAGATATTTAGAGTTAAAAGATGAAACACCATCTCCACATATCAACGTAAGGAGCCGTTCTGTGCCACAGCCACGCCGGTTTAAGAAAATATTTATTGTGTTGCTTGTTTTAGTGTTATGTGTCACATATTTTTTGGGTCTTTTCGCGGGCCAGACACAGTGGTTCGATGGGATCGCGAAGAGTTTGGGATATGAGAGCGTCTACCACCACGCCGTCATCATTGACGCTGGCTCCTCCGGTACGAGGGTCCTAGCTTACAAATTCAGAGTACCCTTCACAGTTTTCGGTCAAATCAACTTGGATCTCGTGGACGAGTATTTCGAGGAGACTAAACCGGGGCTCTCGTCGTATGTCGACGAACCGGAAAAGGGTGCTGAGACAATAGTCCATCTTGTGAAACGAGCCGGTCATTTGATACCAGCGGACCGTCGCTCACAAACACCTCTAATAGTCCGAGCCACAGCGGGCCTGAGACTGTTGCCGCGGGATAAAGCTCAAAGACTCATCGACGAGGTCGCCAAGGCTATATCTGGCTCGGGTTACGATGTCGGTCCGAACAGCGTTGAGATCATGGATGGTTCAGACGAGGGTATCTTCATTTGGTATTCGGTTAATTTGTTGCACGATTTGATGGGTGGCGAGACTATGGCGGCGTTGGATCTCGGCGGCGGCTCCACACAGATCACATACCAGTTAAACGAGAAAGACACCAAGTTGTATCCCACAAACGACCAATATTTGGTACCGGCCGGCAAGAACAACATTTCATTATATACGCACAGCTACCTGAAGCTGGGTCTGTTAGCTGCCAGATACGGTGTCTTCAGAATGGAGGCGAACAACGACACGAACCACTTCACGTCGGTGTGTGTCGACCCTATAGTCCAGAAAGAGAAGTGGACATACGCTAACAAACAATACTCTATCGACGGCGCGTCACGTCCGTCCAACATGAAGCGGGAGGGTGTATACAGCCGATGTCAGTCGCTGGTGTCTCGGTACACTCGGGCGACTATCGACTGGGAGCCCTCTCAGCCGCCGCGGGGAGCCGTCGCCGCTATGAGCTACTTCTATGATGTGGCCGCTGATGCTGGGATTATAGATGTGATGCGCGGTGGCACGGTATCTGTGTCTCAGTACAGGGCGAGTGCTCTCCGCGCGTGCTCTGCATCTAACGTAGACCAGCCGTGGGCGTGCATCGACCTGGTGTACGTAGTGACGCTGCTCCAGGAGGTGTACAAGATAAAGGACACCGAGCCAATATCGTTGTTTAAGAAAGTGAACGGTCACGAGGTATCCTGGGCTTTGGGACTCGCATATACAACTGTCATGAACAGGCTAGCCCGAGCATAAGCTTATATACATATATATATATATATGTATATATATCTATATATGTATATATAATAGTTATTAAGATTTTCTGCCTGTTTTATATACAAGTTACAAATTATTATGATTTATGATGTGTTTCGAAGGACAGATATATTTTTTTGTTTGCAAAAATTTTAA

Protein sequence:

>DPOGS201814-PA
MSEIRQRKPDMGKFSTYRYLELKDETPSPHINVRSRSVPQPRRFKKIFIVLLVLVLCVTYFLGLFAGQTQWFDGIAKSLGYESVYHHAVIIDAGSSGTRVLAYKFRVPFTVFGQINLDLVDEYFEETKPGLSSYVDEPEKGAETIVHLVKRAGHLIPADRRSQTPLIVRATAGLRLLPRDKAQRLIDEVAKAISGSGYDVGPNSVEIMDGSDEGIFIWYSVNLLHDLMGGETMAALDLGGGSTQITYQLNEKDTKLYPTNDQYLVPAGKNNISLYTHSYLKLGLLAARYGVFRMEANNDTNHFTSVCVDPIVQKEKWTYANKQYSIDGASRPSNMKREGVYSRCQSLVSRYTRATIDWEPSQPPRGAVAAMSYFYDVAADAGIIDVMRGGTVSVSQYRASALRACSASNVDQPWACIDLVYVVTLLQEVYKIKDTEPISLFKKVNGHEVSWALGLAYTTVMNRLARA-