Monarch geneset OGS2.0

DPOGS203210
TranscriptDPOGS203210-TA1629 bp
ProteinDPOGS203210-PA542 aa
Genomic positionDPSCF300035 + 745279-749155
RNAseq coverage286x (Rank: top 38%)
Annotation
HeliconiusHMEL0109770.067.30% 
BombyxBGIBMGA011093-TA0.071.71% 
DrosophilaCG6006-PC8e-11741.67% 
EBI UniRef50UniRef50_D6W9G41e-12846.40%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9G4_TRICA
NCBI RefSeqXP_972002.13e-12946.40%PREDICTED: similar to CG6006 CG6006-PC [Tribolium castaneum]
NCBI nr blastpgi|910771685e-12846.40%PREDICTED: similar to CG6006 CG6006-PC [Tribolium castaneum]
NCBI nr blastxgi|910771682e-12946.40%PREDICTED: similar to CG6006 CG6006-PC [Tribolium castaneum]
Group
Gene OntologyGO:00550852.5e-28transmembrane transport
GO:00160212.5e-28integral to membrane
GO:00228572.5e-28transmembrane transporter activity
KEGG pathway 
InterPro domain[9-513] IPR0161965.8e-44Major facilitator superfamily domain, general substrate transporter
[129-484] IPR0058282.5e-28General substrate transporter
Orthology groupMCL17853 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203210-TA
GTTATTCCAAGTCAAGGGTCTAAAGAGAAACATCCAGAGGTAACAAACGAAGATGACTCGGATGTTGTGTCCACTATCATTGGAGATTATGGAAGATGGCAGCTCTTAATGACTTTCCTCTTATCTTTGTTCTCCTTTCCATGTACTTTTCACATATATTTACCAACGTTTACGGCGAAAGCAACCAAATTCTGGTGTCAAAGGCCAGAAAATTTGTCGTCTTTACCTGTAGACGATTGGATAAATTACAGTCAGCCTGTAGACGCTTGCTCGATACGCATTTTGCCAGCTGGAATTACAGTAGAAAGTATTTTAAACAACACAGCTCCATTATTGGACTCGTTTCAAAAGTGTACTCAATGGGAATATGATAAATCAGAGGTTGGAGAAACAATAATTTCAGAATGGAATCTCGTTTGCGATAACGCCAACCTGACAAGCCTAGGCGAGGTCGTTTTTTTAGTAGGCGTCGGAGTGGGAGGAGTGGTGGGGGGTTGGATATCCGATAAATTTGGCCGTAAAAAGATTCTTATGGGCATGGTAGTTGCACAAAGCGCACTCGCAATTATTTCGTTGCTTGTAAAATCATACTTACAGTATATGATGGTGAAGCTGGTGATGGGTCTGGTGTCAGTATCTGTCGTTTACGCAGCTTTTGTACTGTCTGTAGAATTAGTTGGAGGCAAATGGGTCACCATAGCAGGGGTGTGCAACTTTTTTCCTCTACCGTTGGCATACATTATTGTATCCCTTCTATCATTGGCCATGCCAAATTGGAGGCAACTGCAATTAGCGTTATCAGTGCCAGGATGTTTTCTACTTTTGATGTGGTTTGTGCTACCTGAATCTCCGAGATGGCTGTTAAATATGGGTAGAACTGAAGAAGCCCGAGAAATACTGGAAAGGGCAGCAAAATTTAATAAAAGAAACACGGTTGCCGATATAGACAAGCTTCTTCTTTTACATAAAGTAGAGGAAGACAGAGAGGAACCTAGTGTCCTCATGTTGTTCAAGGGATATTTATTAAAGAGGACTTTTTGCTTATTTATAGCGTGGTTTTCAATGACAATAGCATACTATGGACTCTTGTTAAATATCGGTAAATTCAATCTCGGCAATTTGCACCTTACATCAATCATACTTGCTGTAGTTGAAGTACCGGCAATTGCATTGAGTATTCCTATTCTATTGAAGGCTGGTAGGCGTATACCGATCTGTATAACCATGTTCGTTTGTGGAGTCGCCTGTGTAACAAGTGAACTACTTTCCATATTATATGATGATGTTTGGATAATCATATTTTGTCTGATGGTCGGGAAGTTTGCTATCGGCGCAACGAACATGATGCTGCCTATTTACACCGCTGAGCTATATCCGACTGTGATAAGGAATCTCGGTGTTGGAGCAAGTCAAATATCTTCTGGACTGGCTCTCATTTGTATCCCATATTTATGGGAACTGACAAAACTAAACGAACATTTGCCGCTGGTGACTATTGCGGCATTGGGTGCGGCAGGCGGCGCCGTCGTCCTTCTACTACCAGACACAGTCAACTCAAAAGATAACAAAAAACCTAAAAACGTGTCTTGTAATGGAACATTCACAATCAGCGACGACAGGAGGTATTGA

Protein sequence:

>DPOGS203210-PA
VIPSQGSKEKHPEVTNEDDSDVVSTIIGDYGRWQLLMTFLLSLFSFPCTFHIYLPTFTAKATKFWCQRPENLSSLPVDDWINYSQPVDACSIRILPAGITVESILNNTAPLLDSFQKCTQWEYDKSEVGETIISEWNLVCDNANLTSLGEVVFLVGVGVGGVVGGWISDKFGRKKILMGMVVAQSALAIISLLVKSYLQYMMVKLVMGLVSVSVVYAAFVLSVELVGGKWVTIAGVCNFFPLPLAYIIVSLLSLAMPNWRQLQLALSVPGCFLLLMWFVLPESPRWLLNMGRTEEAREILERAAKFNKRNTVADIDKLLLLHKVEEDREEPSVLMLFKGYLLKRTFCLFIAWFSMTIAYYGLLLNIGKFNLGNLHLTSIILAVVEVPAIALSIPILLKAGRRIPICITMFVCGVACVTSELLSILYDDVWIIIFCLMVGKFAIGATNMMLPIYTAELYPTVIRNLGVGASQISSGLALICIPYLWELTKLNEHLPLVTIAALGAAGGAVVLLLPDTVNSKDNKKPKNVSCNGTFTISDDRRY-