Monarch geneset OGS2.0

DPOGS206281
Transcript: DPOGS206281-TA (1371 bp)
Protein: DPOGS206281-PA (456 aa)
Genomic position: DPSCF300290 + 79280-84412
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Source           Hit                      E-value   Identity   Description
Heliconius       HMEL013168               0.0       72.81%
Bombyx           BGIBMGA010742-TA         1e-79     36.56%
Drosophila       CG1213-PC                7e-69     35.50%
EBI UniRef50     UniRef50_UPI00015B44CF   7e-72     34.72%     UPI00015B44CF related cluster n=1 Tax=unknown RepID=UPI00015B44CF
NCBI RefSeq      XP_309669.1              1e-80     36.92%     AGAP003493-PB [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|31201439              3e-79     36.92%     AGAP003493-PC [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|31201439              8e-82     36.92%     AGAP003493-PC [Anopheles gambiae str. PEST]
Group
Gene Ontology:
  GO:0055085   3.2e-76   transmembrane transport
  GO:0016021   3.2e-76   integral to membrane
  GO:0022857   3.2e-76   transmembrane transporter activity
  GO:0016020   1.8e-74   membrane
  GO:0022891   1.8e-74   substrate-specific transmembrane transporter activity
KEGG pathway
InterPro domain:
  [19-454]   IPR005828   3.2e-76   General substrate transporter
  [10-451]   IPR003663   1.8e-74   Sugar/inositol transporter
  [1-453]    IPR016196   1.6e-52   Major facilitator superfamily domain, general substrate transporter
Orthology group: MCL34653 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species
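
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the genomic coordinates are 1-based and inclusive and that the transcript length includes the stop codon; the record states neither. All variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 1371
protein_aa = 456
genomic_start, genomic_end = 79280, 84412

# A coding transcript should be a whole number of codons.
codons, remainder = divmod(transcript_bp, 3)

# Assumption: one of those codons is the stop codon, so the
# protein should be one residue shorter than the codon count.
expected_protein_aa = codons - 1

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

# Bases in the genomic span that are not part of the transcript
# (plausibly introns, though the record does not say so).
extra_bp = genomic_span - transcript_bp

print(codons, remainder, expected_protein_aa, genomic_span, extra_bp)
```

Under these assumptions the 1371 bp transcript gives 457 codons, which matches the 456 aa protein plus one stop codon, and the genomic span is 5133 bp.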

Nucleotide sequence:

>DPOGS206281-TA
ATGGAGATAAAACCTAAAAATGGCAGCATTAAATATCAACTACTTATTGTTGCTTGCATCAACGTTGGACAGGTCATCGTCGGCTACAGTGTCGGATGGTCAGCTCCCATCCTCCCTAAACTGAAGAACATTGATGAGACACCGTTAACTGATGTCGTAACTGATTTGGAAGCATCATATATTGGGTCCTTACTTTACATCGGCTCTATGATTGGGCCGTATATAACAAGTATCTTTTCAAATGTGGTGGGACGGAAACCTTGTCTTCTGATTGGCGGATTGCTGAACATCCTTGCATATGTTCTGGTTATCACAACCAAGCATATAGCTATGGTGTACGCTGTCAGGATAATATCCGGTCTCGGAATGGGAATTACAATTGTTGGAAATATAGTTTACGTCGGCGAAATAGCGTCCACGAATATCCGGGGTATCCTGCTGACCTCAACGTCGATCATTGGTATATTCGGTACACTCCTTGTTTATGCTGTAGTGCCTTACGTTTCCTACTCTGAGTCCGGTTACATCGCTTTAGTTATAAGTGTAATACACGTGGTCGGTGTCTGCTTCATACCAGAATCTCCGGTCTATTACGCTATAAAAGACAGACCTGTCAGTGTCACAAAGACCCTAGACCTGCTCGGTAGATCGGCGGACGTGGACAAAGTTCTGGAAACATTTAGCAGGAAGAAGGGCGAAACAACCAGCAAGATTCGGGATTGGACGGAAATATTCACAGTTAAATCCAATAGAATGTCTCTTTTCCTGACTTTCACACTGGGGGCCTTCCAGCAAACCAGTGGCGTGGCTGTTGTGTTGTTTTTCGCGACAACCATCTTCGACACAGCTGGTTCATCGATCCGACCAGATTTAGCGACTATTATTATAGGTGTGACGAGACTCTTGTCCAGTTTAATTGCACCGTCCTTCGTTGAAAGATCTGGAAGAAAAATTCTGTTATTAATTTCCATGGCTGCGTGTGCTTTCAGCTTGTTGATTTTAGGTCTATATTTTTATTTGGACAGAACACATGTAGCTTTCATAAAGAACATTGGCTGGCTTCCATTGGTCGCGTTAATAGTTTATTTCTTTTGTTACGAAGCTGGATTCGGTACGATTCCAAACGCGATAGTCGGTGAAATGTTCAGAGCAAATGTTCGTTCAAACGGTTCTGCTTTGGCGATCACTTTAACTTGGCTGGTAGGTTTCGGCCTCACAACCAGCTTCACTACAATGGTAAAAGTTTTGGGCGGTGACGTCACGTTTTGGATATTCGGCGGCTCGTGTGTCCTAGCCTTTCTTTTCACTTTCTTCTTCTTGCCGGAAACAAAAGGAAAAACATTAAATGAAATACAAGATATGTTAAGTTGA

Protein sequence:

>DPOGS206281-PA
MEIKPKNGSIKYQLLIVACINVGQVIVGYSVGWSAPILPKLKNIDETPLTDVVTDLEASYIGSLLYIGSMIGPYITSIFSNVVGRKPCLLIGGLLNILAYVLVITTKHIAMVYAVRIISGLGMGITIVGNIVYVGEIASTNIRGILLTSTSIIGIFGTLLVYAVVPYVSYSESGYIALVISVIHVVGVCFIPESPVYYAIKDRPVSVTKTLDLLGRSADVDKVLETFSRKKGETTSKIRDWTEIFTVKSNRMSLFLTFTLGAFQQTSGVAVVLFFATTIFDTAGSSIRPDLATIIIGVTRLLSSLIAPSFVERSGRKILLLISMAACAFSLLILGLYFYLDRTHVAFIKNIGWLPLVALIVYFFCYEAGFGTIPNAIVGEMFRANVRSNGSALAITLTWLVGFGLTTSFTTMVKVLGGDVTFWIFGGSCVLAFLFTFFFLPETKGKTLNEIQDMLS-
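
The protein record can be checked against the transcript by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein record marks the stop codon. The `translate` helper and its `stop` parameter are hypothetical names, not part of any database tool. Only the first 30 and last 21 bases of DPOGS206281-TA are copied here; the full sequence can be substituted.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq, stop="*"):
    """Translate an in-frame DNA string; stop codons become `stop`."""
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 21 bases of the transcript shown above.
head = "ATGGAGATAAAACCTAAAAATGGCAGCATT"
tail = "ATACAAGATATGTTAAGTTGA"

print(translate(head))            # MEIKPKNGSI
print(translate(tail, stop="-"))  # IQDMLS-
```

Both fragments agree with the start (MEIKPKNGSI) and end (IQDMLS-) of DPOGS206281-PA.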