Monarch geneset OGS2.0

DPOGS212187
Transcript: DPOGS212187-TA; 1419 bp
Protein: DPOGS212187-PA; 472 aa
Genomic position: DPSCF300344 - 12969-20312
RNAseq coverage: 54x (Rank: top 69%)
Annotation
Heliconius: HMEL013150; 2e-74; 55.25%
Bombyx: BGIBMGA010728-TA; 4e-36; 38.36%
Drosophila: CG10960-PB; 3e-24; 27.84%
EBI UniRef50: UniRef50_D6WYD7; 3e-30; 34.53%; Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WYD7_TRICA
NCBI RefSeq: XP_972187.1; 6e-31; 34.53%; PREDICTED: similar to CG1213 CG1213-PA [Tribolium castaneum]
NCBI nr blastp: gi|91089321; 1e-29; 34.53%; PREDICTED: similar to CG1213 CG1213-PA [Tribolium castaneum]
NCBI nr blastx: gi|91089765; 3e-31; 33.86%; PREDICTED: similar to AGAP003493-PC [Tribolium castaneum]
Group
Gene Ontology:
  GO:0055085; 9.4e-32; transmembrane transport
  GO:0016021; 9.4e-32; integral to membrane
  GO:0022857; 9.4e-32; transmembrane transporter activity
KEGG pathway 
InterPro domain:
  [211-458] IPR005828; 9.4e-32; General substrate transporter
  [222-457] IPR016196; 3e-20; Major facilitator superfamily domain, general substrate transporter
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212187-TA
ATGTTACACTTCCAGATGTTACACTTTTATATGTGTTGGATACAGATATTTGATAATATCACACGAAGTTTGTCAGAAGTGACTCGTCCGGGTGGTATCGGCGGGTCCTGCACATTCCCGGAACCAGCCGTGTGTCCTCCGTGTGAGTCTAAGCCCCAGCCCGCTTGTCGTCCTGCCCCTCCTACCAGCGCCACTCGCGGAGTCAAGCAAGTGTGTTGCGCACCACCTATTGGTCAGAGACCCCCGTGCTCCCCTTCACCTCCAAACTGCTCCAAGACTTGCCGACATGCGAGTGAAAAAAAGACCAGTGTTTCCCACTGTGAAACGGGTGAATGTAAACATAAACAACATGAAAAATGTGGTGATAACTTAGATCGCGTAGCTAGGCGCCTTGCAGCAGCCGAATTCTCTCGACGTTTAGACTACGAACGTGAAGTAGCAGAAAAAAAACCAGCACCTAAGTATGAGACCGCTGTTTATCAACTTGTTGGAGAGCGGAAGTGCATGCGCGCCACAGATGGTTGTGATTACGCTGTTTCTAAACAGAAAAGCTCACAACTTAAATGCGGACCATGTCGAATACCACCTCCGCCCTGTCCTGTAGATCTAGAGCCAGCTGACAATTGTCGTACGGATGATGCCATAAATGTCCTGTCTAAAGTCCGTGAAACTGATGTACAGGTCAAACAAGAGATTGAAGACTATAAATCTAGCAAGAAAGAAAAAATAGCTAAGTTGGCCTTAATGAAGGATAAAACATTTCTGAAGTCACTTTCATTGGGCATTCTGGTCTGTGGCGGCTGCAATGCCGTAGGATACAATGCAGTGGAATTTTATTTGCAAACGATTTTGGAAGCAACGCACTCCAGCTTAATGCCTGAGGTAGCATCAGTTATAGTTGGCTGCATCCAACTGAGCGCTGCTGTTTGCACATCATTTTTTACAAAAATGTTTGGGAGACGACCTATTTTAATTTATTCCCTCATTGGCATGTTGTGTGGTATGCTCGGATTGGGAGTTTTCTTCACATATAGTACCAGGGAAGGTTATGTTATCTCTGGCTTCCTTAACTACCTTCCTATCATATCTTTAATACTTGCTACATACAGTTTCAATATTGGAATTGGCTGTCTTCTTTTCATAGTTACTGTTGAATTATTTGAAGGTAGAACGCGAGCATTTGGATATACAATATGTTTTACATTCTTCTTACTAAGCGCGTTCATAACAACTAAATATTTAGAAGCTATGTTCCACGCCTTCGGTTATTCTGGGACGTACTGGTTCTTCAGTGCAATGTGTTTAATTATTTGTACTTTAATTGTCTTATTCGTGCCCGAAACTAAAGGGAAAACGATAACTGAGATTCAAATTGCATTAGGCAACAAGAAAATTGAGGCTGAGGTTGGGAATAAGTAA

Protein sequence:

>DPOGS212187-PA
MLHFQMLHFYMCWIQIFDNITRSLSEVTRPGGIGGSCTFPEPAVCPPCESKPQPACRPAPPTSATRGVKQVCCAPPIGQRPPCSPSPPNCSKTCRHASEKKTSVSHCETGECKHKQHEKCGDNLDRVARRLAAAEFSRRLDYEREVAEKKPAPKYETAVYQLVGERKCMRATDGCDYAVSKQKSSQLKCGPCRIPPPPCPVDLEPADNCRTDDAINVLSKVRETDVQVKQEIEDYKSSKKEKIAKLALMKDKTFLKSLSLGILVCGGCNAVGYNAVEFYLQTILEATHSSLMPEVASVIVGCIQLSAAVCTSFFTKMFGRRPILIYSLIGMLCGMLGLGVFFTYSTREGYVISGFLNYLPIISLILATYSFNIGIGCLLFIVTVELFEGRTRAFGYTICFTFFLLSAFITTKYLEAMFHAFGYSGTYWFFSAMCLIICTLIVLFVPETKGKTITEIQIALGNKKIEAEVGNK-