Monarch geneset OGS2.0

DPOGS201264
TranscriptDPOGS201264-TA1488 bp
ProteinDPOGS201264-PA495 aa
Genomic positionDPSCF300037 + 630848-632586
RNAseq coverage8x (Rank: top 85%)
Annotation
HeliconiusHMEL0121503e-10652.66% 
BombyxBGIBMGA011016-TA6e-7933.54% 
DrosophilaCG8654-PB2e-5929.06% 
EBI UniRef50UniRef50_D6WKJ13e-7031.24%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKJ1_TRICA
NCBI RefSeqXP_973659.11e-7634.16%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
NCBI nr blastpgi|910825352e-7534.16%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
NCBI nr blastxgi|910825351e-7634.16%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
Group
Gene OntologyGO:00550854.2e-32transmembrane transport
GO:00160214.2e-32integral to membrane
GO:00228574.2e-32transmembrane transporter activity
KEGG pathway 
InterPro domain[7-492] IPR0161968.9e-54Major facilitator superfamily domain, general substrate transporter
[119-491] IPR0058284.2e-32General substrate transporter
Orthology groupMCL34392 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201264-TA
ATGGGTAAAAAGAACCGTGACAGTAATAATCATAATAACAAGGAAAATGGAAAAAACGACAATGAAAGTGATTACTTGGAAGATTCTATCGGTGTAATGGGTATCTGGCAAATATACGTTTGCGTTGTAGCAGCGTCAACTAGATTTATAGGTATGGGAAACATGTCAATAATAGTGTTTCTAACACCAATAACAGATTTTATTTGCATTGAATTCGAAGAAAATACAAATATAACAGCTGAGAAAATGGTCTGCTATAATAATTGTGTGAAATATGAATATAAATCCGTCTCCATGGACCAAACTATTGTAACCGAATTCGATTTGATTTGTGAAAGAGAATGGATGGCGAGTTTTGCTCAATCTGTTATAATGTTTGGGCTTGTTATCGGAGTATCACTTTTTGGTTGGATATCTGACAGGTTCGGACGACGCGTCGCTCTTTTGTCATCCACAATTCTCAACATATTTTCTATGGTGTCTTCGGCATTTTCTCCAGATTATTGGATCTATAACGGTCTAAGGTTGATTATGGGAATTGCGTCTGGTGGATCTATTATAGTATGTGTGCCATACGTAATAGAAATTTGCAGTAAAAAATATAGAGAAATAGCTGGAGCTCTCATAACTTTACCCGACGGTCTGTCAGAATTATCTTTAGCCTTATTTGCTTATTATTTTCCTACATGGAAATCTCTTACACTCGGATTCAGTTCTGTGTCCGTGATAATTCTGTTGCTAGTTCTCACTCTGCCAGAATCTCCAAGATGGCTGGTTTGTAACGGGAGAACAGAAGAAGCAATTTCTACCATTATGAAGGCTGCTGAATGGAATAAACTAGAAACAGATCATGTCAGAGACAGAGTAAATAAGAGTGTTGGAGCCATAGTTCAAAAAATTAACCCTGATGTATCCGTATCATATTTCGATTTATTTAAGAAACGTCTAGGAACAGTTACCTGGTCCACAATATTTATTTGGGCCGTTGTTGGTACTAATTATTTCGGTTTATATCAATACATGACATTTTTGGGAACTACTGTTCACACGACCGTGGTAATGTTAGCATTGCTTCAGTTTCCACTCTGTGTGTTTGAGGTATTATTGACCAAACACTTCAGTAGAAAGGCAACCCTGATCGGTGTACTGGTAGCATGTGGAGTTCCAATGCTTATTCTCATTTTTACTCCTAAAAACCATTGGGTTACTAGTACCTTAGGGGTCATAGGATTTTCTGCATGTTATATCGCCTTTGGTGTAATATACGTTTATCATGGAGAATTATATCCCACTTGCCTAAGAAGTATGGCTTATGGAATAACGTCGGGTACAAATAAAGTGGGTGCAATGGCGTCCCCTTTTATAGCTCGAATTTCACCATCTTGGATTCCGTCATCAATATTTGCATCACTATCATTTCTAGCAGCCGCCTTTTGTTTTCTGCTGCCTGAAACCAAGGGAAGTAATTTAAAGGACACTGTGGATTAA

Protein sequence:

>DPOGS201264-PA
MGKKNRDSNNHNNKENGKNDNESDYLEDSIGVMGIWQIYVCVVAASTRFIGMGNMSIIVFLTPITDFICIEFEENTNITAEKMVCYNNCVKYEYKSVSMDQTIVTEFDLICEREWMASFAQSVIMFGLVIGVSLFGWISDRFGRRVALLSSTILNIFSMVSSAFSPDYWIYNGLRLIMGIASGGSIIVCVPYVIEICSKKYREIAGALITLPDGLSELSLALFAYYFPTWKSLTLGFSSVSVIILLLVLTLPESPRWLVCNGRTEEAISTIMKAAEWNKLETDHVRDRVNKSVGAIVQKINPDVSVSYFDLFKKRLGTVTWSTIFIWAVVGTNYFGLYQYMTFLGTTVHTTVVMLALLQFPLCVFEVLLTKHFSRKATLIGVLVACGVPMLILIFTPKNHWVTSTLGVIGFSACYIAFGVIYVYHGELYPTCLRSMAYGITSGTNKVGAMASPFIARISPSWIPSSIFASLSFLAAAFCFLLPETKGSNLKDTVD-