Monarch geneset OGS2.0

DPOGS212477
TranscriptDPOGS212477-TA1074 bp
ProteinDPOGS212477-PA357 aa
Genomic positionDPSCF300222 - 383135-389198
RNAseq coverage120x (Rank: top 58%)
Annotation
HeliconiusHMEL0093251e-11869.65% 
BombyxBGIBMGA009781-TA5e-14072.95% 
DrosophilaCG8083-PA8e-10754.67% 
EBI UniRef50UniRef50_E3XDV61e-11457.54%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XDV6_ANODA
NCBI RefSeqXP_001649552.12e-12060.06%sodium/nucleoside cotransporter [Aedes aegypti]
NCBI nr blastpgi|1571069423e-11960.06%sodium/nucleoside cotransporter [Aedes aegypti]
NCBI nr blastxgi|1700504671e-12062.07%sodium/nucleoside cotransporter 1 [Culex quinquefasciatus]
Group
Gene OntologyGO:00160209.2e-173membrane
GO:00068109.2e-173transport
GO:00054159.2e-173nucleoside:sodium symporter activity
KEGG pathway 
InterPro domain[4-353] IPR0082769.2e-173Concentrative nucleoside transporter
[140-351] IPR0116574.5e-73Na dependent nucleoside transporter, C-terminal
Orthology groupMCL10383 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212477-TA
ATGGCACTGCAATGTTTCTCCAACAAGGTGGCAACATTCTTATCCTACGGTGTAGAAGGTGCAGCCTTTGTCTTTGGAGACTTCCTTGTTAAACAGGAACAAGTATTTGCTTTTAACGCCCTCACTGTTATATTCTTCTTTAGTATGCTGGTCGAGGTTTTATTTTATTGGGGTGCTATGCAGTGGTTCTGTCTGAAATTGGGCGGAGTTCTGCAAGCCGCTACCGCAACCACAGTATGCGAGAGCACCATCGCCGTTGGAAACGTCTTCCTTGGCATGTCTGAGTCAGTTCTACTTATAAAACCCTACATTCCCGTACTGACGCCCTCAGAACTTCACGTTGTGATGTCGTCTGGATTCGCAACTGTTTCTGGTACGATACTGGCAGCGTATATTGGTTTTGGTGCTGAGCCGGCCCACCTGGTGACAGCGAGTGTGATGTCAGCGCCAGCCGCCCTCTGCTTCTCCAAGCTGATGTATCCTGAGACGAGACGCTCTCTCACCACCGTAGACAACATACCACCCGTGGAAAGACAGGATCAATCGGCGCTATCCGCAGCAACCCGTGGCGCGACCAATGGCATATCACTGATATTGAATATCATAGCGAATTTGGTGGCTTTCGTAGCCTTTATATCGTTCGTGAATGGCTTCCTCGGTTACTGCGGTGGATTGCTTGGCAACCCAGACATCAACTTGGAATGGATACTTGGCAAGATCTTCATACCATTGTGTTGGCTGATGGGTGTCCCCTGGGAAGAATGTGAGCTGGTGGGATCTCTGATAGGTCTCAAGACAGTCGTCAACGAGTTCGTGGCATACCAGCGGATGGGAGAAATAAAACGAGAGGGACTGCTGTCCCCGCGGTCCGAGCTGATCGCGACCTACTCCCTGTGTGGCTTCACGAATCCTTCATCAGCTGGTATCATGATCGGAGCGATCTCCGCCATGGCTCCCAACCAGAGAGAGACACTATCCAGTTTGGCTGTGAGAGCATTTTTCACGGGTTGCGGAATATGCTTTATGAACGCGTGTTTAGCTGGTATATTAATGCCAGACGGTTCCTTCGCTTAA

Protein sequence:

>DPOGS212477-PA
MALQCFSNKVATFLSYGVEGAAFVFGDFLVKQEQVFAFNALTVIFFFSMLVEVLFYWGAMQWFCLKLGGVLQAATATTVCESTIAVGNVFLGMSESVLLIKPYIPVLTPSELHVVMSSGFATVSGTILAAYIGFGAEPAHLVTASVMSAPAALCFSKLMYPETRRSLTTVDNIPPVERQDQSALSAATRGATNGISLILNIIANLVAFVAFISFVNGFLGYCGGLLGNPDINLEWILGKIFIPLCWLMGVPWEECELVGSLIGLKTVVNEFVAYQRMGEIKREGLLSPRSELIATYSLCGFTNPSSAGIMIGAISAMAPNQRETLSSLAVRAFFTGCGICFMNACLAGILMPDGSFA-