Monarch geneset OGS2.0

DPOGS211401
TranscriptDPOGS211401-TA1467 bp
ProteinDPOGS211401-PA488 aa
Genomic positionDPSCF300115 - 169884-177542
RNAseq coverage308x (Rank: top 37%)
Annotation
HeliconiusHMEL0126014e-7937.08% 
BombyxBGIBMGA010861-TA0.074.05% 
DrosophilaCG4288-PB4e-14253.23% 
EBI UniRef50UniRef50_Q7K2N36e-14053.23%CG4288, isoform A n=19 Tax=Pancrustacea RepID=Q7K2N3_DROME
NCBI RefSeqXP_970624.14e-16460.00%PREDICTED: similar to AGAP006594-PA [Tribolium castaneum]
NCBI nr blastpgi|910926007e-16360.00%PREDICTED: similar to AGAP006594-PA [Tribolium castaneum]
NCBI nr blastxgi|910926002e-16460.00%PREDICTED: similar to AGAP006594-PA [Tribolium castaneum]
Group
Gene OntologyGO:00550851.4e-52transmembrane transport
GO:00160211.4e-52integral to membrane
KEGG pathway 
InterPro domain[1-449] IPR0161965.8e-78Major facilitator superfamily domain, general substrate transporter
[36-413] IPR0117011.4e-52Major facilitator superfamily
Orthology groupMCL12648 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211401-TA
ATGCAACAAGATCCCGCGCCGGAAAGGATCGACCCAGCGCCAGAGAGCGATGGTTTCTCGTGGCGGTTTTGGGAGAAGCGTCGGCTGGTTGTGGCGCTGCTGGCTTTCTTCGGCTTCTTCAACGTTTATGCCCTCAGAGTGAACCTGTCAGTAGCTGTGATCGCTATGACCGAACCGATCGAGCTCGAACTTGATAATGGCACCACTGTTTACGTTCCAGAGTTCGACTGGAGTCCGCAGACGAAGGGCTTGGTACTGAGCTCGTTCTTCTACGGCTACCTCCTGACACAGCTGCCCGGCGGTTGGCTTGCCTCTAAAATAGGAGGTCACAGGATGTTTGCTATCGGTGTGGGCGCCACGTCCCTCCTCACATTGTTCACGCCGCCTCTGGCACACATCAGCACCGGTCTACTGGTGACTGTAAGAGTCATCGAAGGACTTTTTGAAGGTGTCACGTACCCATGTATCCACGCCGTGTGGAGCGTGTGGGCGCCGGCCGGAGAGCGCGCCCGGCTCGCTTCCTTCGCTTTCAGCGGCAGCTACGTCGGAACGGTGGTGTCCCTGCCCGTGTCCGCGTACCTCGCCCGCTACACCGGCTGGCCCGGCATCTTCTACGTGTCCGGTATATTCGGTCTTCTGTGGACCACCATATGGTGGCTGGTGGTGAAGGAGTCCCCGGAGAGAGACCCTCACATCACGGCCGCCGAGCTGAAGTACATTCAGGAGTCCCGAGGTTGTACGCGCGGTGTGCGGAGTCACCCGTGGCGCGCGATGCTGTCTTCGGGTCCGGTGTGGGCGATCGTGGCCGCTCACTTCAGTGAGAACTGGGGCTTCTACACGCTACTCACCTTCCTGCCCACCTTCATGCAGGACGCCTTCGGGTTCTCGACGTCGTCGTCGGGCTGGTCGTCGGCCGTCCCCTACCTGGCCATGTCTCTCACCCTGCAGGTATCGGGCGTGTTGGCGGACTGGCTTCTCTCCCTCCGTCGCCTGAGCGTCACGGCTGTGCGGAGGCTCTTTACGTGCGGAGCTTTCGTGGCCCAGACGGTGTTCATGCTGGGCGCGGCCTACTCGTCGTCCCCCTCAGCTTGTATCGCGTGCCTCACCCTGGCCGTGGGTCTCGGAGGGTTCGCCTGGTCGGGTTTCAGTGTGAACCACCTGGACATCGCTCCGCCTCACGCCAGCGTGCTGATGGGCGTGTCCAACACCGTGGCCACGCTGCCCGGGATCGTGTCGCCCGCCCTGGCCGGGGCCATCGTCACCGACAAGTCGCCGGAGCAATGGCGGATCGTGTTTTTTATATCCAGCGCCATCTACCTGTCGGGCGCGGCGCTGTACGGCGCGTTGTGCTCCGGGAACAGACAGGCCTGGGTGGTGGAGGTCGAGGGAGACGCCACCTTCGACACGGACGGCGCCTCCACCACCACCTACGACAACAAAGCCTTGGACAGGAGTTCGGAGATGTGA

Protein sequence:

>DPOGS211401-PA
MQQDPAPERIDPAPESDGFSWRFWEKRRLVVALLAFFGFFNVYALRVNLSVAVIAMTEPIELELDNGTTVYVPEFDWSPQTKGLVLSSFFYGYLLTQLPGGWLASKIGGHRMFAIGVGATSLLTLFTPPLAHISTGLLVTVRVIEGLFEGVTYPCIHAVWSVWAPAGERARLASFAFSGSYVGTVVSLPVSAYLARYTGWPGIFYVSGIFGLLWTTIWWLVVKESPERDPHITAAELKYIQESRGCTRGVRSHPWRAMLSSGPVWAIVAAHFSENWGFYTLLTFLPTFMQDAFGFSTSSSGWSSAVPYLAMSLTLQVSGVLADWLLSLRRLSVTAVRRLFTCGAFVAQTVFMLGAAYSSSPSACIACLTLAVGLGGFAWSGFSVNHLDIAPPHASVLMGVSNTVATLPGIVSPALAGAIVTDKSPEQWRIVFFISSAIYLSGAALYGALCSGNRQAWVVEVEGDATFDTDGASTTTYDNKALDRSSEM-