Monarch geneset OGS2.0

DPOGS209145
Transcript: DPOGS209145-TA, 1503 bp
Protein: DPOGS209145-PA, 500 aa
Genomic position: DPSCF300061 - 621504-625067
RNAseq coverage: 115x (Rank: top 58%)
Annotation
Heliconius      HMEL014795        0.0     76.99%
Bombyx          BGIBMGA009202-TA  0.0     76.66%
Drosophila      CG7458-PA         1e-90   34.19%
EBI UniRef50    UniRef50_D2CG57   1e-101  39.39%  Putative uncharacterized protein GLEAN_10997 n=1 Tax=Tribolium castaneum RepID=D2CG57_TRICA
NCBI RefSeq     XP_970562.1       2e-102  39.39%  PREDICTED: similar to AGAP012383-PA [Tribolium castaneum]
NCBI nr blastp  gi|91094721       4e-101  39.39%  PREDICTED: similar to AGAP012383-PA [Tribolium castaneum]
NCBI nr blastx  gi|91094721       2e-100  39.47%  PREDICTED: similar to AGAP012383-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0055085  8.5e-30  transmembrane transport
                 GO:0016021  8.5e-30  integral to membrane
KEGG pathway
InterPro domain  [98-486]   IPR016196  2.5e-50  Major facilitator superfamily domain, general substrate transporter
                 [108-445]  IPR011701  8.5e-30  Major facilitator superfamily
Orthology group  MCL18023  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209145-TA
ATGTCAGCGTTTATGAGTGAATTCATCTTTTCGGCTGCCGCTATACCGCACAGATGTCGTATTCCGGAATGTGGTGAAGATGGTAAACTTCACGAGATTGAACCCGAGTGGATAGGAAAAGCAATTCCAGAAGCTAATTCTGGATATTCACGCTGTGAAAGATATGCACCAACCAATATCGGCATCAACGGCTCATTAGACTACTGCCCAGCGGAACTTTTCAATACTGCAGAAACGATAAGCTGCGATGGCTTTGTTTACGCAAGAGATAATTCAGTAGTGTATGAGTTTGATCTCGGGTGTCAAGACTTTTTACGAGCATTTGCTGGTACATTGAACAGTATAGGAACACTTTTGGTTCTGCCAATAACCGGTTTCTTCTCTGATCGTTTTGGACGAAGATGGGCCCTCGTTATTAGCGTATTTAACCTTGCTCTTATTGGTCTTATACGAGCGTTTTCTGTAAATTATCCAATGTACCTGGCCCTGCAAATTCTACAAACAACTCTCGGCGCTGGTACTTTCAGTTCCGCGTACATTTTTGCTGCTGAGCTCGTTGGTCCTAAATATCGGGTGATGACTAGTGCTTTCTCGTCTTCAATGTTTTCTGTAGGCCAAGTAATATTGGGTGGTGTAGCTTGGCTAGTACAGCCTTGGAGGTACATGATCATGGCCCTTCACATCCCTTGCTTCTTAATCATCTCTTACTACTGGTTACTGCCTGAGAGCATCCGCTGGTTACTTTCGAAAAATAAGAATGAAGAAGCACGAAAAGTATTGGAAAATGTGGCGAGAGTAAATAAAAAGTCTATAAGCGAAAAATCAATACAAGCCTTAATGTTCACTCCAGAGGGGACTGACAATAACATAAACGCTGATAAACCTGGTCTAATCAAACAAATCATAAGTTCTCCTGTTCTACTGAGACGAGTCTGCACAACTCCTATTTGGTGGATCACAACGACATTCGTCTACTACGGACTGTCTATCAATTCCACCGGATTGTCCGACTCCATATATTTGCAATATATACTTACATGTGCCATCGAAATCCCTGGGTACTTCACCGCAGTGTTGGTGTTAGACAGGATTGGAAGAAAAATAACTCTCTCCAGCGGTTTCTTCTTCAGTGCAGCCTGTAACATCGCCTTCGTTTTTATACCAAGTGATTTATCCGTTCTCCGGCTAATTATTTTCCTTTTGGGTAAATTTGGAATCTCGGTGGTGATGACGTCACTATATCTGTTCACATCTGAGCTCTACCCCACGGAATACCGCCACAGTCTCCTGGCCTTTTCATCCATGGTCGGCCGTCTTGGATCTATAACGGCTCCACTGACACCCGTTTTGGTGAACTACTGGCACGGTATACCGTCCATGATGTTTGGGGCTATGGGTATACTGGCTGGACTCCTGGTACTCACACAACCTGAAACACTTGGCACCACAATGCCCGATACGTTGGCGGAAGCTGAAGCCATCGGCAGGAAACCAACTGTATAA

Protein sequence:

>DPOGS209145-PA
MSAFMSEFIFSAAAIPHRCRIPECGEDGKLHEIEPEWIGKAIPEANSGYSRCERYAPTNIGINGSLDYCPAELFNTAETISCDGFVYARDNSVVYEFDLGCQDFLRAFAGTLNSIGTLLVLPITGFFSDRFGRRWALVISVFNLALIGLIRAFSVNYPMYLALQILQTTLGAGTFSSAYIFAAELVGPKYRVMTSAFSSSMFSVGQVILGGVAWLVQPWRYMIMALHIPCFLIISYYWLLPESIRWLLSKNKNEEARKVLENVARVNKKSISEKSIQALMFTPEGTDNNINADKPGLIKQIISSPVLLRRVCTTPIWWITTTFVYYGLSINSTGLSDSIYLQYILTCAIEIPGYFTAVLVLDRIGRKITLSSGFFFSAACNIAFVFIPSDLSVLRLIIFLLGKFGISVVMTSLYLFTSELYPTEYRHSLLAFSSMVGRLGSITAPLTPVLVNYWHGIPSMMFGAMGILAGLLVLTQPETLGTTMPDTLAEAEAIGRKPTV-