Monarch geneset OGS2.0

DPOGS205894
TranscriptDPOGS205894-TA1434 bp
ProteinDPOGS205894-PA477 aa
Genomic positionDPSCF300089 - 400604-405753
RNAseq coverage362x (Rank: top 33%)
Annotation
HeliconiusHMEL0055761e-17581.59% 
BombyxBGIBMGA007020-TA2e-17272.51% 
Drosophilartet-PA4e-10946.71% 
EBI UniRef50UniRef50_Q7PXG31e-12052.14%AGAP001391-PA n=4 Tax=Anopheles RepID=Q7PXG3_ANOGA
NCBI RefSeqXP_321745.33e-12152.14%AGAP001391-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479658044e-12052.14%AGAP001391-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1954529385e-11949.08%GK14183 [Drosophila willistoni]
Group
Gene OntologyGO:00550855.1e-39transmembrane transport
GO:00160215.1e-39integral to membrane
KEGG pathway 
InterPro domain[40-477] IPR0161967.9e-51Major facilitator superfamily domain, general substrate transporter
[45-442] IPR0117015.1e-39Major facilitator superfamily
Orthology groupMCL15060 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205894-TA
ATGGAATCTTCACACAATGGGGAATTAAGAAATCGAAAAGAGAAATCAAGCCTCTTAACACAAAATGGAGCTGACATAGTTCAAAATAATATAAATAAATGCAAAACTACTAATGCAAGGACAATAGGATTAGTGTTTATATCATTACTCCTAGATTTACTTGCTTTCACTATGATTCTACCACTTTTACCATCATTGCTTGACTACTACAATAGAGAAGAAGGAAATTCCAACACTCTATATACATCTTTGCTTCATGCAGTTCATAGTTTTCAAAAGTTAACGGGAGTTCCGGAGAGATTTTCATCAGTATTATTTGGGGGTGCTCTAGGATCAATGTATAGTTTTCTGCAATTTCTAACAAGCCCCATAGTGGGGAGTCTGTCGGATGCTTATGGCAGAAAACCTATGCTCTTAATTTGTCTAATTGGAATAGCATTATCTCATGCTTTATGGAGTTGTGCCAGCACATTCAGCCTGTTTGTACTGGCGAGATTCATTGGAGGTTTGAGTAAAGCCAATGTCAGCCTCAGTATGGCTGTGGTCACTGATGCCACAGATGAGAAAACTAGAGCCAGGGGAATGGCATTGGTGGGTTTAGCATTTTCAATAGGTTTCATAGTTGGTCCTTTAGCTGGTGCATGGTTTGCTAAGAGCAATATGACATCCGGTCTATGGGGTGAGAAACCAGCGTTGTATGCACTATCACTTTCATTAGCAAACATTACTCTTGTAACAATGTTTATACCTGAGACACTTTCAAAAGGGAAAAGATCACCTTTGTCTCTCTCACTATCAAAAGCCTATGACTTTGTCTCACCATATCATTTGTTAAAGTTCACGGCTGTCAAGAATCTAAACGTTCAACAGAACAAAGTATTATCGAAACTGGGACTGATCTATTTTATTTATCTGTTTATTTATTCTGGTCTAGAATTTACGATAACGTTTCTAACACATCATACTTTTGAGTACACCGCAATGCAACAAGGCAAGATGTTTTTAGTAATAGGTTTGGTAATGGCATTGCTACAAGGAGGCGCGGCTCGTAGGTTGAACGCGCGTGGCGCCGTGAAGGCCGCTCGTGTCGCTCTGTTATTGACGCCGCTTTCTTTCCTGTGCGTGGCGTCAGCGGCTGTGACGTCACCGCCGTTTTTCGCGCCCATCGTCTGGTTGTGGGCTGGTCTCGTGCTGTTTGCGATTTCGACCGCGTTCGCAGTGTCTTGTATGACAGCAATGGCGTCAGCTCAGGCGCCGTCTGAAGCTCGAGGCGCTTTGCTTGGTACGTTACGATCACTTGGAGCGTTAGCGAGAGCTGCCGGGCCACTTCTTGCATCTACAATGTATTGGTACTCCGGAGCCGCTACAACATATACCATTGGATCTATCATTCTGGTGTTGCCAGCTGTGATGTTAATTAGACTGAAGACATGA

Protein sequence:

>DPOGS205894-PA
MESSHNGELRNRKEKSSLLTQNGADIVQNNINKCKTTNARTIGLVFISLLLDLLAFTMILPLLPSLLDYYNREEGNSNTLYTSLLHAVHSFQKLTGVPERFSSVLFGGALGSMYSFLQFLTSPIVGSLSDAYGRKPMLLICLIGIALSHALWSCASTFSLFVLARFIGGLSKANVSLSMAVVTDATDEKTRARGMALVGLAFSIGFIVGPLAGAWFAKSNMTSGLWGEKPALYALSLSLANITLVTMFIPETLSKGKRSPLSLSLSKAYDFVSPYHLLKFTAVKNLNVQQNKVLSKLGLIYFIYLFIYSGLEFTITFLTHHTFEYTAMQQGKMFLVIGLVMALLQGGAARRLNARGAVKAARVALLLTPLSFLCVASAAVTSPPFFAPIVWLWAGLVLFAISTAFAVSCMTAMASAQAPSEARGALLGTLRSLGALARAAGPLLASTMYWYSGAATTYTIGSIILVLPAVMLIRLKT-