Monarch geneset OGS2.0

DPOGS203168
Transcript: DPOGS203168-TA  1464 bp
Protein: DPOGS203168-PA  487 aa
Genomic position: DPSCF300035 - 671611-673573
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius: HMEL022284  0.0  67.16%
Bombyx: BGIBMGA011017-TA  1e-163  56.49%
Drosophila: CG8654-PB  4e-82  34.56%
EBI UniRef50: UniRef50_Q7QDB3  2e-87  35.89%  AGAP003039-PA n=4 Tax=Culicidae RepID=Q7QDB3_ANOGA
NCBI RefSeq: XP_311836.4  4e-88  35.89%  AGAP003039-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|347969120  7e-87  35.89%  AGAP003039-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|347969120  1e-85  36.10%  AGAP003039-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology:
  GO:0055085  9.8e-31  transmembrane transport
  GO:0016021  9.8e-31  integral to membrane
  GO:0022857  9.8e-31  transmembrane transporter activity
KEGG pathway:
InterPro domain:
  [1-465]  IPR016196  3e-57  Major facilitator superfamily domain, general substrate transporter
  [66-467]  IPR005828  9.8e-31  General substrate transporter
Orthology group: MCL21158  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203168-TA
ATGTTGGAAGAGGATAAGATTGAAAATATCATCGGTCGTTTTGGAAAATATCAAAGATGGATTTTTATTCTACTGACTATCAGTCGTCTTCCTACTGATTACCAATTAGTGAATGTTGTTTTTTTATTACCTAATGTGCAATACACATGCCTGGATGAAGAGGCCTATAACCAAACCAATTATTGTCCTTGTAAAAATCCACAATACGATCTAAGTGCTATTGGGAATTCTGTGACAAGCACATTTGGACTTATATGTGAAAAACGTCACCTGGCTAGTTTAGCACAATCAATCCTACAAGCTGGAATTTTAATTGGTGGTATCTTTTACGGATACCTGTCTGACAGATATGGTCGTCGTAAAATAGCGGTTTTTCTTGCTCTTATCTGTGAAGTCTTATTTGTATCACTATCCGCGGCTACAACACAATTTTGGATGTTCATTGCCATGCGATTCTTAATTGGCACTGCGGTTGGGGGAACAATGCTTTGTGCTTACATCATGCTAGTTGAACTTAGCGGGAAATCATTCAGGCCATATCTTGCGGGCATGATTGACATATCCCTTATTATTTCATACTTTACACTTCCCATATTGGCCTACTTTTTAAGAGATTGGAGGAAATTGCAACTAGTACTGTCATTACCTTGGTTGTATACTGTTGTCATCTATTTTGTTCTGCCAGAATCACCAAGATGGCTTATTACTACTGGACAGAAAGATAAAGCTGTGGAGGTTTTATCTTATATTGCCAAAAGGAATAACCGTCCAACAGAAAATATACGAGTTACTGTTGAAAATTTGATATATGAAGAAGAGACTAGCAACAGACAGCAAAAACACGGAACATACTTGGACTTATTTAGAACGCCAAAAGTTAGAGCGTACACTTTCATAACAGCTTTTATTTGGTTAAGTATATCACATACCTTTTTTGGTATCAACCAATACATCGGTCGGCTAGAGGGAAATATTTACATTAATGTAATTTTTTCATCAATTGGTCTTATACCTGGAATGACTTTGGTCGTAATAGCTTCTTTGTATTTGAATAGAAGATTAGCGGTTGTAATAAGTTGTGGTGTGGCTGCTATATCACTAATTGTTTTCATATTTATACCCAGTAATATGAATAATCTTATATTAGTTTTTGCTGTTATAGGCCAAACTGGAGCTTTTACAGCGTTTGCGCAAATATATCTGTATTCATCAGAAATTTTTCCAACTATTATTAGAAACTCAGCGATGGGTTTTGCTTCCATGTTTGCTAGATTTGGTGGTTTTATTGCGCCATTTGTAGTAAATATAGGTACAGAGTGGGTTTCCATAGTTATATTTAGTTTATTAGTCACGTTTGCAGGTATTTCCTGCTATTTTTTACCAGAAACAAAAGGTACTGTTCTCTTAAATACAATAGATGAAACTGAGAATTCTATAAAAAAATTGACTGATGATGTAAATTAA

Protein sequence:

>DPOGS203168-PA
MLEEDKIENIIGRFGKYQRWIFILLTISRLPTDYQLVNVVFLLPNVQYTCLDEEAYNQTNYCPCKNPQYDLSAIGNSVTSTFGLICEKRHLASLAQSILQAGILIGGIFYGYLSDRYGRRKIAVFLALICEVLFVSLSAATTQFWMFIAMRFLIGTAVGGTMLCAYIMLVELSGKSFRPYLAGMIDISLIISYFTLPILAYFLRDWRKLQLVLSLPWLYTVVIYFVLPESPRWLITTGQKDKAVEVLSYIAKRNNRPTENIRVTVENLIYEEETSNRQQKHGTYLDLFRTPKVRAYTFITAFIWLSISHTFFGINQYIGRLEGNIYINVIFSSIGLIPGMTLVVIASLYLNRRLAVVISCGVAAISLIVFIFIPSNMNNLILVFAVIGQTGAFTAFAQIYLYSSEIFPTIIRNSAMGFASMFARFGGFIAPFVVNIGTEWVSIVIFSLLVTFAGISCYFLPETKGTVLLNTIDETENSIKKLTDDVN-
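
Length check:

The transcript and protein lengths listed in this record (1464 bp and 487 aa) can be cross-checked with simple arithmetic, assuming the transcript is coding sequence only and ends with a stop codon (the protein sequence above ends in "-"). The sketch below is illustrative; the function name `cds_length_bp` is hypothetical and not part of any database tooling.

```python
def cds_length_bp(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length.

    Assumes three nucleotides per codon and, optionally, one trailing
    stop codon that is not counted in the protein length.
    """
    codons = protein_aa + (1 if has_stop_codon else 0)
    return codons * 3


# Values taken from the record above.
transcript_bp = 1464
protein_aa = 487

# (487 residues + 1 stop codon) * 3 = 1464 bp
print(cds_length_bp(protein_aa) == transcript_bp)  # True
```

If the two numbers disagreed, a likely explanation would be untranslated regions in the transcript or a missing stop codon, neither of which appears to apply here.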