Monarch geneset OGS2.0

DPOGS203710
Transcript: DPOGS203710-TA; 1473 bp
Protein: DPOGS203710-PA; 490 aa
Genomic position: DPSCF300010 - 1440998-1446803
RNAseq coverage: 101x (Rank: top 61%)
Annotation
Heliconius: HMEL012516; 0.0; 80.04%
Bombyx: BGIBMGA003497-TA; 0.0; 74.04%
Drosophila:
EBI UniRef50: UniRef50_Q174K5; 9e-112; 42.86%; Putative uncharacterized protein n=5 Tax=Culicidae RepID=Q174K5_AEDAE
NCBI RefSeq: XP_001652284.1; 2e-112; 42.86%; hypothetical protein AaeL_AAEL006862 [Aedes aegypti]
NCBI nr blastp: gi|383792121; 0.0; 74.04%; Bm-re [Bombyx mori]
NCBI nr blastx: gi|383792121; 0.0; 74.04%; Bm-re [Bombyx mori]
Group:
Gene Ontology: GO:0055085; 9.3e-11; transmembrane transport
Gene Ontology: GO:0016021; 9.3e-11; integral to membrane
KEGG pathway:
InterPro domain: [42-487] IPR016196; 3.9e-23; Major facilitator superfamily domain, general substrate transporter
InterPro domain: [146-428] IPR011701; 9.3e-11; Major facilitator superfamily
Orthology group: MCL17288; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203710-TA
ATGAATACTAAATATGGAGCGACTAATTCCAACGATAGTACGGAAATTGAAGATCAGGAGACTTTAGATAATGAAATCGTTTTAAGTGTGCCATGCAAACGTCGGAGATGCCTCCCATGGCGGATGAACCAGAACCTTATATATGGTCTAGGTCATATATACAACGACTTGTGCGCTGCTATGTGGTTTTCATACATGATGCTGTTCTTCCAAGCTGTGATGGAAATGAGGGCAGTCATCAGCGGTGCCATGCTGCTCTTAGGCCAAGTTGTGGATGCTCTAGCTACACCTGTTGTGGGAATACTGGCCGACAAATATAGCACTAAGAAAATTTGGCATTTAACAGGCAGCGCACTGGTAACATTCACTTTTCCACTTCTCTTCATCCGATGTTGGGGATGTTCCTCTAACAGCACCGCAGAATATCTCACCTGGTGGATTCCATTCTATTATGCTTTTTTGATAATATTCTTTCAAATCGGTTGGGCTATTGTACAAATATCTCATCTAGCGATAATTCCATCTATTACCGAGAGTCTACAAGTGCGATCCGAGCTAACTTCGATAAGATATATGGCTTCGGTCATATCCAGCTTGGCCGTGTACTTTATAACTTGGATAGTATTAAGAGCAACAAACTACAGCACATTCATCGGACCGTCAGACGATTACAAATTTAGGGATGTTTCCCTTATCATAACTGTTATGGGAGTAATATCGTATATTGTATTTCACGTCTTCTTCAACTTGAATCCTTTGAAAGAGGAGAAACCTAAAGCGAATGGGCATGTAATCGAGAGTGGAGAAAATGAACCGCTAAAAATGACAGCGAAGTCCAAAATTATGCATTTCTTACAGATGCCATTGTTATATCAAACAAGCTTATTATATGTTTTCTCCCGTCTATATTGGGCTCTGAGCCTGGTGTACGTCCCATTGTTCCTGGAGGAGCGTCTATCAGTGAATCCGAGTGAGGGATCCGAACTGGTAGCAAGCGTGCCGCTCGTCCTCTATATATCCTCTCTCGTATTTTCCTTTCTTTTGAAAAGCAATATTAACAAAATTGGACACCAGGTGGCGTATTTCATAGGCAGTTCTCTGAGTTTGGTCAGCTGCTTTTGGATAGCACTCGCTATCTCACCGGATGCGCATGTTGCTCAAATATATTTAGTTGCAACATTAATAGGTGCAGGCAGCTCCATAACTCTGGTGTCTAGTCTCTGTGTGACGGCCGATTTAATAGGACCCCATTCTCATCAAGGCGCACTTATATATTCTATTGTGACGTTTGCTGATAAACTAGTAACAGGAATCGCCGTAGTAGCTATTGAAAACTACAAATGCGACGATACTTTGGATTGCCCGCAATATTATAGAGGGGTCCTAACCTACGCTTGTGGGGGCAGTGCGGTTCTTGGCATTCTATCTCTATCAATAACTACATTAGGTTCGAAAAAGAAAACTCCAACATAA

Protein sequence:

>DPOGS203710-PA
MNTKYGATNSNDSTEIEDQETLDNEIVLSVPCKRRRCLPWRMNQNLIYGLGHIYNDLCAAMWFSYMMLFFQAVMEMRAVISGAMLLLGQVVDALATPVVGILADKYSTKKIWHLTGSALVTFTFPLLFIRCWGCSSNSTAEYLTWWIPFYYAFLIIFFQIGWAIVQISHLAIIPSITESLQVRSELTSIRYMASVISSLAVYFITWIVLRATNYSTFIGPSDDYKFRDVSLIITVMGVISYIVFHVFFNLNPLKEEKPKANGHVIESGENEPLKMTAKSKIMHFLQMPLLYQTSLLYVFSRLYWALSLVYVPLFLEERLSVNPSEGSELVASVPLVLYISSLVFSFLLKSNINKIGHQVAYFIGSSLSLVSCFWIALAISPDAHVAQIYLVATLIGAGSSITLVSSLCVTADLIGPHSHQGALIYSIVTFADKLVTGIAVVAIENYKCDDTLDCPQYYRGVLTYACGGSAVLGILSLSITTLGSKKKTPT-
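
Length check:

The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes the transcript is a complete coding sequence with no UTRs and exactly one terminal stop codon, which is consistent with the sequence above starting with ATG and ending with TAA. The function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon.

    Assumption: the sequence is coding only (no UTRs), so its length
    must be a multiple of 3 and the last codon is the stop.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the terminal stop codon


transcript_bp = 1473  # DPOGS203710-TA length as listed in the record
protein_aa = 490      # DPOGS203710-PA length as listed in the record

# True when the two listed lengths agree under the stated assumption.
lengths_agree = expected_protein_length(transcript_bp) == protein_aa
print(lengths_agree)
```

Under that assumption, 1473 bp corresponds to 491 codons, of which 490 encode residues and the last is the stop, shown as the trailing "-" in the protein sequence.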