Monarch geneset OGS2.0

DPOGS208343
TranscriptDPOGS208343-TA1434 bp
ProteinDPOGS208343-PA477 aa
Genomic positionDPSCF300383 + 88462-95448
RNAseq coverage358x (Rank: top 33%)
Annotation
HeliconiusHMEL0220590.071.93% 
BombyxBGIBMGA014434-TA5e-13548.41% 
DrosophilaCG30344-PA4e-7031.15% 
EBI UniRef50UniRef50_UPI0000D569FE2e-6931.53%UPI0000D569FE related cluster n=1 Tax=unknown RepID=UPI0000D569FE
NCBI RefSeqXP_002004246.12e-7733.26%GI19716 [Drosophila mojavensis]
NCBI nr blastpgi|1951194543e-7633.26%GI19716 [Drosophila mojavensis]
NCBI nr blastxgi|1951194543e-7733.41%GI19716 [Drosophila mojavensis]
Group
Gene OntologyGO:00550856.6e-27transmembrane transport
GO:00160216.6e-27integral to membrane
KEGG pathway 
InterPro domain[5-458] IPR0161963.2e-34Major facilitator superfamily domain, general substrate transporter
[89-422] IPR0117016.6e-27Major facilitator superfamily
Orthology groupMCL14536 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208343-TA
ATGACACAAGAAGATGAAAACCTGTCGGAGAACGAACCGAATATTAATAGTAATGATGAAAATCGGAGTAAATCATACAGACTCACAATGGAATTTCCTCTATTTTTCACCATGTTATCTATGTCTTTATCTGGAGCTGCAATCAGTAATTTAATCCTCTACCGTACTTGCGTGCACTCTTTAAACCATACACAACAGGAGTGCCAGGTTTTTCTTTCGCCAGTGAAAAATAATGGAAGCCAGGCTTTGGAAGAGGAAGTTCAGAAGTACGCAACCTTTGTGTCTATGGTTAGAACGATAATAGAGTCCGTGGCACCTGCAATACTATCTTTATTCCTTGGTGTGTGGTCTGATAGACATGGTCGTAAACCATTGGTTGTTTGGCCGCTGTTGGGTATGACTGTAAGTGGCGCTCTCATAGTTGTATACAGTATGTTAGAGAGTCTGGGTCCATGGTGGTTCATACTGACTGCCATCCCGGTATCTTTGTCCGGAGGGTTCACCGCCATGTTTACTGGATCATTCTGTTACGTAAGCGATATCTCCGCTAGAGAGAAGAGATCACTTAGGATGACGATTGTTGAAGCCTCTGTATCAGCTGGTAGTGTCACAGGAGCTATTCTCAGCTCTTATTTGTTGAGGGCCGTTGGAAGTGTTTATTTGCTAGTCATATGCACAGGATTATCCGCGATCGCGTATCTTTTTACGAATGTTTTTCTCAAAGAATCTCTAGTTGGTGCTGTTCAGGGAGGAATATCGTCCGTTCTGGATTTCAAACTCATAAAAGAGATGATGGGAACTTGTTTCAAACCTCGTCCAAATCACGGACGTGCTCAGATTCTGTTACTCACGATAGCGAACAGCCTCTCTGTGTTCATATTGTTCGGCAATATGAGCTTGCAATATATGTACACGAGGCAGAAACTTCATTGGGCAATAAAGCAGTACACTTTGTATTCGGCTGTTCACACAACAGTGTCGTTCTTCGGATCCTTTTTCGGAGTCATGATAGTACAGAAGTTGTTCAAAGTGACAGATCTGACGTTTTCTACTATAGCTTATGTATCAGATACCATTGAATACGTCATAATAACATTTGCGACCGTTTCCTGGCAAATGTACGCTGCCGCCGGAATATCCCTGTTCCGTGGTCTGTCGTCTCCTCTTATACGTTCTCTCCTGACGAAGATCCTCCCACCTGAGGATATAGCGAAGGTGTTTGCTCTGATGTGCGCCATAGAAGGCGTCAGTCCATTGATATCACCAGCTTTATACAACTCGCTGTACGCTTACACCATATCCACCTTTCCCGGGGCTATATACATGTTGAGTACCGGGATATCTTGTGTATGCGTCATATTCTTAGGGTTTGTGCAATATTATAGATGGAAGGAGTGTTCTATCACATACAATACGCTGACAAGCGAATCTTAA

Protein sequence:

>DPOGS208343-PA
MTQEDENLSENEPNINSNDENRSKSYRLTMEFPLFFTMLSMSLSGAAISNLILYRTCVHSLNHTQQECQVFLSPVKNNGSQALEEEVQKYATFVSMVRTIIESVAPAILSLFLGVWSDRHGRKPLVVWPLLGMTVSGALIVVYSMLESLGPWWFILTAIPVSLSGGFTAMFTGSFCYVSDISAREKRSLRMTIVEASVSAGSVTGAILSSYLLRAVGSVYLLVICTGLSAIAYLFTNVFLKESLVGAVQGGISSVLDFKLIKEMMGTCFKPRPNHGRAQILLLTIANSLSVFILFGNMSLQYMYTRQKLHWAIKQYTLYSAVHTTVSFFGSFFGVMIVQKLFKVTDLTFSTIAYVSDTIEYVIITFATVSWQMYAAAGISLFRGLSSPLIRSLLTKILPPEDIAKVFALMCAIEGVSPLISPALYNSLYAYTISTFPGAIYMLSTGISCVCVIFLGFVQYYRWKECSITYNTLTSES-