Monarch geneset OGS2.0

DPOGS212267
TranscriptDPOGS212267-TA1947 bp
ProteinDPOGS212267-PA648 aa
Genomic positionDPSCF300077 - 454815-472138
RNAseq coverage577x (Rank: top 22%)
Annotation
HeliconiusHMEL0149260.078.96% 
BombyxBGIBMGA011577-TA0.075.88% 
DrosophilaCG8468-PB7e-16445.01% 
EBI UniRef50UniRef50_Q7K1L41e-16145.01%CG8468, isoform A n=28 Tax=Diptera RepID=Q7K1L4_DROME
NCBI RefSeqXP_312909.45e-17646.63%AGAP003205-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582913949e-17546.63%AGAP003205-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582913943e-16946.84%AGAP003205-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550851.7e-20transmembrane transport
GO:00160211.7e-20integral to membrane
KEGG pathway 
InterPro domain[29-629] IPR0161961.2e-60Major facilitator superfamily domain, general substrate transporter
[61-244] IPR0117011.7e-20Major facilitator superfamily
Orthology groupMCL15971 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212267-TA
ATGAAGGGTTACACTATGGTACAAAGGGTAGCCACAGATGAGCGTCCATTTGAGACGCCCTCTTCGGATGAGAGCGGATTGGGGCGGAGCGACTCTCCCAGCGAGGACTTGGAACCAGAGGCAGCTTTGGTGGTTCCACCGGACGGCGGTTGGGGCTGGGTGGTGGTGGTAGCTTCATTTATGTGCAACTTCATCGTTGACGGCATAATTTTCTCTGGAGGAATGCTGTTGACTCCCATACAAAAAGAGTTTAAAGCGACTGACGGTCAAGTGGCACCTGTGAATTCATTATTAGCGGGATTTTATCTTTTAGCGGGGCCTTTCGTATCAGCTTTAGCAAACAAATATGGTTTCCGCGTTGTTACAATAGTGGGCACTCTCATAACAAGCACTGCATTCGCCTTATCATATTTCGCGGAGAGCGTCGAGTATCTTTATCTTGTGTACGGCGTCGTGGGAGGCGTGGGTTTCTGTATGATTTACATGCCGGCTGTACTCACTGTGGGTTTCTACTTCGAACGTTGGCGAGCGTTGGCCACGGGTTTGGCGTTATGCGGCTCGGGGGTTGGCACATTCGTGTTCGCGCCCCTAACTGACATGTTGAACGAACGAGTTGGATGGAAAATGACTATAGTTATACACTCTGGGTTAGTGCTCTTATGCATCATATTCGGAGCTATGTTCCGACCGATAAATCCCGTCCGCGTGACGTTAGCTGACAAACAAGACGAAGACGACGATTCGAGAAGACACGAAGAAGCTGTGGAAAAATTGAATTCTATGTTGAAACTGCAAAGCAAACTCGACTCCGGTATATCGATGCCGGCAGAAATGAGATTCACTAACAAAGTCAGTCCACACACCTGGATGGGCGTAGCAAACAACACTCGCTATCCGACTGCGGAAGAAGTCTTTAAAGGGAGTAATTCCCATATAAGCAGACGATCGTCAGCCACAGCCGGTACCATAAAGCATAACCTGGGAAACAAGCCCATGTTCATAGCTTTACCAGTCGCCGAGAAGGACGAACAGGAAGACTCTAACAGCAACATCGATAACGCTGAACCTCTCATCTCCAACACGATCAAAGTCATCACGAGGGACCAAAGACCGCGTAGGTCTCACATGGATCTCGTCGCGAGGCCGTTCTATCGTGACGACATTTTCTTCAGCGGTAGCTTAGCCAGACTACCTCAGTATACATCGAGGACTTCATTGGGCTACCACTTAGCTGTAACCCATGTACCGACACAGGAGGATGCTCAAGAGGAAGTATCCAAACAATGTCGCCTATGTCCAGAATCGGTCAAACGAGCTCTAGCAACAATGCTGGACGTAAGTCTCTTCAGATCGCCTACATTCATACTTCTAACTATGAGCGGATTTTTCACGATGTTGGGCTTCTTCGTACCATATATGTACGTGAAGCAGAGAGCCGAGAGCAATGGCATAGGTAAAGCAACGAGTGTTATGTTGGTATCTGCTATAGGAATAGCGAATATCGTGGGTCGTATCGCGTGTGGGCTGGTGTCCTCGATGCCTAAAGTGTCTCCGTTATGGCTGAACAACATAGCTTTATCTGCTGGTGGTGTCGCTACTATGATGAGCGGACTCTCCTATGAGGAGTATTTCCAGTTCGGTTACTGCACTATTTTTGGATTGGCTATCGCATGTTTCGCGTCTCTTCGTTCAATACTCGTAGTGGAGTACATCGGTCTAGAACAACTTACTAATTGTTTTGGACTATTCCTACTTTTCCAAGGCCTCGGAGCACTCCTCGGAGCACCAATCGCGGGCGCTCTCATGGACATGACTCACAGTTTCGACATAGCGTTCTACGTGTCCGGCAGCTTCCTATTATTTTCGGCTGTTATAATCTATCCCGTAGATTATGTGAGCAGATGGGAAAAATCTAGAAATAAACTCGCATCACAACCAAATGCGTAA

Protein sequence:

>DPOGS212267-PA
MKGYTMVQRVATDERPFETPSSDESGLGRSDSPSEDLEPEAALVVPPDGGWGWVVVVASFMCNFIVDGIIFSGGMLLTPIQKEFKATDGQVAPVNSLLAGFYLLAGPFVSALANKYGFRVVTIVGTLITSTAFALSYFAESVEYLYLVYGVVGGVGFCMIYMPAVLTVGFYFERWRALATGLALCGSGVGTFVFAPLTDMLNERVGWKMTIVIHSGLVLLCIIFGAMFRPINPVRVTLADKQDEDDDSRRHEEAVEKLNSMLKLQSKLDSGISMPAEMRFTNKVSPHTWMGVANNTRYPTAEEVFKGSNSHISRRSSATAGTIKHNLGNKPMFIALPVAEKDEQEDSNSNIDNAEPLISNTIKVITRDQRPRRSHMDLVARPFYRDDIFFSGSLARLPQYTSRTSLGYHLAVTHVPTQEDAQEEVSKQCRLCPESVKRALATMLDVSLFRSPTFILLTMSGFFTMLGFFVPYMYVKQRAESNGIGKATSVMLVSAIGIANIVGRIACGLVSSMPKVSPLWLNNIALSAGGVATMMSGLSYEEYFQFGYCTIFGLAIACFASLRSILVVEYIGLEQLTNCFGLFLLFQGLGALLGAPIAGALMDMTHSFDIAFYVSGSFLLFSAVIIYPVDYVSRWEKSRNKLASQPNA-