Monarch geneset OGS2.0

DPOGS206473
Transcript        DPOGS206473-TA  1926 bp
Protein           DPOGS206473-PA  641 aa
Genomic position  DPSCF300070 + 261299-286815
RNAseq coverage   350x (Rank: top 33%)
Annotation
Heliconius      HMEL011939        0.0     79.11%
Bombyx          BGIBMGA005424-TA  0.0     80.92%
Drosophila      CG4797-PB         2e-111  41.55%
EBI UniRef50    UniRef50_D6X016   6e-118  47.44%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X016_TRICA
NCBI RefSeq     XP_975260.1       1e-118  47.44%  PREDICTED: similar to AGAP012218-PA [Tribolium castaneum]
NCBI nr blastp  gi|91091050       2e-117  47.44%  PREDICTED: similar to AGAP012218-PA [Tribolium castaneum]
NCBI nr blastx  gi|312385334      3e-119  47.11%  hypothetical protein AND_00902 [Anopheles darlingi]
Group
Gene Ontology    GO:0055085  3.1e-56  transmembrane transport
                 GO:0016021  3.1e-56  integral to membrane
                 GO:0022857  3.1e-56  transmembrane transporter activity
KEGG pathway 
InterPro domain  [217-620]  IPR005828  3.1e-56  General substrate transporter
                 [173-619]  IPR016196  2.1e-47  Major facilitator superfamily domain, general substrate transporter
Orthology group  MCL16943  Insect specific

Nucleotide sequence:

>DPOGS206473-TA
ATGAGAGACAACAGTACGCGCGATGGCGCTCACGAAAGCATGTTGGGAAAGGCCAAAAAACCCCAGGTCACCAGCAAGAAGGTGCAAAGTCAAGAAGAATACGAAGCACTCCAAAGTTTCCTGCGCAGCATCAGTTCAACAGCAACACTGCCTCCGTATGTTAAAGGAGAGACCTACTCCTCTGTTCGTGGCGTATTCAATCAGTGTTTAATAACATGCGCGGTGTTAATACTCGCTGCTGGTGCCGGTCACCCTATAGGCTTCTCGGCGGTTGCGCTCCCACAACTAAGGACAGAGAACTCGACTATGAGAATCAACGATGACATGGGCTCTTGGATAGGTAAATATTGCTTCTATTGCCGTTCAGATCTCAACGATGGCGCTCACGAAAGCATGTTGGGAAAGGCCAAAAAACCCCAGGTCACCAGCAAGAAGGTGCAAAGTCAAGAAGAATACGAAGCACTCCAAAGTTTCCTGCGCAGCATCAGTTCAACAGCAACACTGCCTCCGTATGTTAAAGGAGAGACCTACTCCTCTGTTCGTGGCGTATTCAATCAGTGTTTAATAACATGCGCGGTGTTAATACTCGCTGCTGGTGCCGGTCACCCTATAGGCTTCTCGGCGGTTGCGCTCCCACAACTAAGGACAGAGAACTCGACTATGAGAATCAACGATGACATGGGCTCTTGGATAGCGAGTATTCACTCGGCGGCGACTCCGCTTGGCTCCATGTTGTCGGGGCCTATTATGGAGGCGATAGGTCGGAAAAGAACCCTGCAAGCGTCGACTCTCCCACTAGTTATAGGTTGGATCCTCATTGGAACATCAACACACCATGCATTGCTTTTACTAGGAAGGATTGTTTGCGGTTTCGCCGTCGGTATTCTAGCAGCGCCTTCACAAGTGTACTTAGGAGAGATATCAGAACCCCGACTGCGAGGGTTGTTGATAGGCACCCCGTTCGTGGCTTATTCCTTAGGAGTGCTATATGTGTACGCATTGGGTGGAGCGCTGTCATGGCGGGCTGTAGCCCTACTCTCTATCGTACTACCCACACTAGCATTCATAGCCTTATGCTTCTCACCCGAGAGCCCCACCTGGCTAGCGCGACGAGGAAGATTCCACGACGCTATGGCCGCCATGGCCAGACTCCGAGGAGATCCTGATACGGCGCAACGTGAGCTCCACGAACTAATATCAGCACGGGAGAAAGAAAAGGCACGCGGTGAAGAAACCATTCGCTTCTTGGCAACAGTGTTGCGGGCTCCTGTACTGAAACCGTTGATCCTCATCAATGCCTTCAACATGCTGCAAATACTCTCCGGCAGCTACGTAGTTATCTTTTACGCCGTCGACATTGTCAAAGACGCTGGAGGATCCTTAAGCCCCACTATGGCAGCAAACGCCAGTGCTTTGGTCCGTTTGTTGGTAACAGTAGTGGCTTGTGTTGCACTGCTGAGGGTAACACGTCGCGCGCTAGTACTCGTCTCTGGTATAGGAACCGCGTTGTTCACACTCGCGCTCTCGGGCTTGCTTTACTATGGACCAGGGACCGGAGTCCTTCCACCAATCCTCATACTAGGATACGTTGCCTTCAACACCCTCGGCTTCTTTTTACTCCCAGGGCTTATGATCGGGGAACTTTTACCTACCAGAGTTCGAGGACTCTGCGGAGGTTATATATTCTGCCTCTTCAATAGCGTCCTGTTTGGTTTCACAAAATTATATCCCGTCATGAAAAATAACATCGGTATGTCCGGCGTATTCGGACTCTTTGGAGCTTCGGCCTCCCTGGCTACTGCAGTTCTATTCCTCCTTCTTCCCGAGACGAAGGGAAAATCTCTCATTCAAATAGAACAGTATTATCAAAAGCCGAACATATTATGGATGACGCGGAAAAAGGCAGCTGACTCTCAAAATGTTTGA

Protein sequence:

>DPOGS206473-PA
MRDNSTRDGAHESMLGKAKKPQVTSKKVQSQEEYEALQSFLRSISSTATLPPYVKGETYSSVRGVFNQCLITCAVLILAAGAGHPIGFSAVALPQLRTENSTMRINDDMGSWIGKYCFYCRSDLNDGAHESMLGKAKKPQVTSKKVQSQEEYEALQSFLRSISSTATLPPYVKGETYSSVRGVFNQCLITCAVLILAAGAGHPIGFSAVALPQLRTENSTMRINDDMGSWIASIHSAATPLGSMLSGPIMEAIGRKRTLQASTLPLVIGWILIGTSTHHALLLLGRIVCGFAVGILAAPSQVYLGEISEPRLRGLLIGTPFVAYSLGVLYVYALGGALSWRAVALLSIVLPTLAFIALCFSPESPTWLARRGRFHDAMAAMARLRGDPDTAQRELHELISAREKEKARGEETIRFLATVLRAPVLKPLILINAFNMLQILSGSYVVIFYAVDIVKDAGGSLSPTMAANASALVRLLVTVVACVALLRVTRRALVLVSGIGTALFTLALSGLLYYGPGTGVLPPILILGYVAFNTLGFFLLPGLMIGELLPTRVRGLCGGYIFCLFNSVLFGFTKLYPVMKNNIGMSGVFGLFGASASLATAVLFLLLPETKGKSLIQIEQYYQKPNILWMTRKKAADSQNV-