Monarch geneset OGS2.0

DPOGS209774
TranscriptDPOGS209774-TA1197 bp
ProteinDPOGS209774-PA398 aa
Genomic positionDPSCF300397 - 23956-26894
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0156275e-9440.27% 
BombyxBGIBMGA010291-TA7e-13660.88% 
DrosophilaCG42249-PC3e-7437.33% 
EBI UniRef50UniRef50_E3X3A62e-8439.08%Putative uncharacterized protein n=4 Tax=Nematocera RepID=E3X3A6_ANODA
NCBI RefSeqXP_001868694.15e-8641.65%5' nucleotidase [Culex quinquefasciatus]
NCBI nr blastpgi|1700679879e-8541.65%5' nucleotidase [Culex quinquefasciatus]
NCBI nr blastxgi|1700679873e-8540.88%5' nucleotidase [Culex quinquefasciatus]
Group
Gene OntologyGO:00167876.7e-122hydrolase activity
GO:00091666.7e-122nucleotide catabolic process
KEGG pathwayaag:AaeL_AAEL0005751e-76 
 K01081 (E3.1.3.5)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
    Pyrimidine metabolism
InterPro domain[12-397] IPR0061796.7e-1225'-Nucleotidase/apyrase
[286-398] IPR0083342e-215'-Nucleotidase, C-terminal
Orthology groupMCL10268 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209774-TA
ATGTACGCTCTGTGTGTATTGTTGTTGTGCGGAATTTCTGTACGCAGTGAAAAGTTTTACGAGTTAAACATTATTCATTATAATGACTTTCATGCAAGATTCGTAGAAACTAGTCCGTTTGGATCTGTCTGCAATCCGACGGCAGCTCCTTGTATCGGAGGATTCGCTCGACTCGCAACCCTTATCAGAGATGGACTCGAGAGGGATCCAGAGTCCTTAGTATTAAACGGAGGCGACTCTTTTCAAGGAACTATCTGGTATAATTTATTGAGATGGAATGTCACTCAGGATTTTATGAACATGGTCCATCACGATGCTCATAAATTTGTAGTCGAAAAGGGAGGTCGGAAGATTGGTATAATAGGTGTTATAATATCGAGTACTGACGAACTTGCAAGTACAGGAAATTTAAAGTTTACGGACGAAGTTGAAGCTGTTAGACGAGAAGCTGAAAAGTTAAACGAGCAAGGAATAAATATCATTGTGGTTCTATCACACTGTGGGATAGATATAGATCGTAAAATAGCCCTTAATGCTGGTCCCCATATAGACATAATTGTCGGAGGCCATAGTCATACACTTTTATCAAACAGTGACCCTCCAGAAGGTTCGACTTGGACTCCATTGGGACCTTATCCCGTTGTCGTAGAACAGGCAGCAAGATCTGTATTGATCGTTCAAGCTGGAGCTCATACAGCATTTTTAGGAGAAATTAAACTTAATTTCAATGATAACGGAGATCTTATCAGTTGGGTTGGTGATCCTCATTATATAGGCAACAATGTCCTTCCAGCTCCTGACGTCTTAGAAAAGATTAATGAGTATCTGCCAGTTATAACTGAACAAGCAACTGAATTGATTGGAGTCTCTAAAGTCCACTTGTCCTCAAGATGTAATTGTGGAGAATGCAATCTTGGTAGTTTTATTTGCGATGCTTTTATGCACGAAACTGTATTGAAATCAGGGAAGAATACGTGGAATGAAGCGAATTTTTGCGTCATTAACACTGGTGACGTTCGTTCGGACATCGAAATAGGAAATGTAACATTCGAAAGCGTAATGCTTTCAATTCCGTTTGAGAACAACGTTGAGAAGTATGATTTAAGAGGAGATCATATTTTGGAAATGCTTGAGTATGCTGTGGCTAATCATTCACGGCCTGGCAGAAGGATGGTGCAAGTGTCAGGTTTAAATTAA

Protein sequence:

>DPOGS209774-PA
MYALCVLLLCGISVRSEKFYELNIIHYNDFHARFVETSPFGSVCNPTAAPCIGGFARLATLIRDGLERDPESLVLNGGDSFQGTIWYNLLRWNVTQDFMNMVHHDAHKFVVEKGGRKIGIIGVIISSTDELASTGNLKFTDEVEAVRREAEKLNEQGINIIVVLSHCGIDIDRKIALNAGPHIDIIVGGHSHTLLSNSDPPEGSTWTPLGPYPVVVEQAARSVLIVQAGAHTAFLGEIKLNFNDNGDLISWVGDPHYIGNNVLPAPDVLEKINEYLPVITEQATELIGVSKVHLSSRCNCGECNLGSFICDAFMHETVLKSGKNTWNEANFCVINTGDVRSDIEIGNVTFESVMLSIPFENNVEKYDLRGDHILEMLEYAVANHSRPGRRMVQVSGLN-