Monarch geneset OGS2.0

DPOGS203163
TranscriptDPOGS203163-TA3147 bp
ProteinDPOGS203163-PA1048 aa
Genomic positionDPSCF300035 - 769578-781745
RNAseq coverage313x (Rank: top 36%)
Annotation
HeliconiusHMEL0109730.078.48% 
BombyxBGIBMGA011012-TA0.075.78% 
DrosophilaCG6126-PA0.063.44% 
EBI UniRef50UniRef50_Q961R90.063.44%CG6126 n=26 Tax=Neoptera RepID=Q961R9_DROME
NCBI RefSeqXP_002097637.10.063.83%GE26331 [Drosophila yakuba]
NCBI nr blastpgi|1955010530.063.83%GE26331 [Drosophila yakuba]
NCBI nr blastxgi|1949012300.056.73%GG20124 [Drosophila erecta]
Group
Gene OntologyGO:00550852.4e-32transmembrane transport
GO:00160212.4e-32integral to membrane
GO:00228572.4e-32transmembrane transporter activity
KEGG pathway 
InterPro domain[128-512] IPR0161961.2e-48Major facilitator superfamily domain, general substrate transporter
[132-503] IPR0058282.4e-32General substrate transporter
Orthology groupMCL15175 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203163-TA
ATGGACAAGGATCGTGCTTTGGAGGATATGATGGGGAAGTTGGGGGACTTCGGGCGATATCAAGGCTTACAATTCTTCCTTCACATCCTCTCAGCTTTGACGGCGGGACTTCACATGCTGTCATTAGTTACAGTCGCTGCCGTTCCTGAACACAGGTGTACAATAGATGGAGTAGACAGCTCTAATTACACGGCATCCTGGAATTCTTCTTTCGTACTAAATGCCATCCCTCTAAATTCACACGGAAAATTGGAGTCCTGTAAGATTTATGGCGAAAATAATACATTACAAACATGTGAATCTTGGGTTTACAATACCCAATACTTTACTTCGTCGCGAGGAATCGAATGGGATTTCGTATGCAGTCGAAGATGGATGGGAGCGGCAGCTCAAACGGCCTACATGTTTGGAGTCGTGATGGGGTCGTTTGTTCTAGGACGTTTCTGCGACAAGTTTGGCAGGAAGACTGTTTTTGTTTGGGCAGGCGTTTTTCAACTCATATTCGGCTCGTTAGTTGCCTTAGCGACGCAATATTATACATTTATTGTCCTAAGATTTGTTTACGGTATATTTGGATCTGGTTCATATATAGCTGGTTTTGTACTTACAATGGAATTAGTGGGACCAAGTCGGCGAACAATATGTGGTGTGGCTTTCCAAATTATGTTTGCAGTCGGAATCATGCTCGTGGCTGGATGGGGATATATCTTAGACAATCGATTTTACTTACAGATCCTGTATGGTCTACATGCCTTGATATTACTACCCCACTGGTTTCTTATGGACGAATCAATAAGATGGCTGTGGTCGCAAGGGCGTGCCCGCGAATCAGTCGCCTTAATAGAGAAGGCTTTGAAAATGAATGGTTCCAATGAAATAATTGAAACATCCGCTTTAGTATCTCAATGCAAAGCTACTTGTGCTAAATACTCTGATGACGAAGCAGCCGGTACAGGTGACCTTTTTAAAAGCCCGAATATGTTAAAAAAGACTCTCATAATCTGCGGTTGTTGGTTCGCCAACTCAGTAGTATATTATGGCCTGTCCCTTAATACTGGAAAACTGAATGGGAATCCTTATTTTTTAACTTTCCTATTCGGAATTGTTGAATTACCCAGTTACATTATAATAGTTTACTGCTTGGACAGAGTCGGACATAGAGCACTTATTAGCACGATGATGTTGTTTGGAGGAATTGCGTGTCTGGTTGTAGTGGCCCTACCGCATGGTTCAAATTCAGTAACGGGAGTTGTGATGATTGGCAAACTTTTTATTTCTGGTTCATATTCAATAATATACAAATATTCTGCAGAATTATTTCCCACAGTAGTGCGCAGTTCCGGAGTTGGATTGGGAAGCATGTGCGCCAGTGTATCGGGTGCTTTGACACCTTTAATAAGTTTACTGGATACGTTGAATCCCAAAATACCAACAATTATATTTGGATTATTAGCCCTTCTATCTGGATTTTCTACTTTTTTCTTACCCGAGACAATAGGTAAAGAGCTGCCTCAATCTGTAGAAGATGGAGAAAAGTTTGGAGTTAATGACACTTGCTTTACAAATTGTGTTGGAAGGCGAATGAGTACTGCTTCCGAAGATCTCCCAGAAGCTATGGAACCATTAGACAATACTGTGAAAAAGTGCTGGATTGATGGTGTCGACACAAACGAATCAGTGGCCTTATGGAATTCCTCAGAAATATTGAAATCTATACCTTTGACATCTACCGGTAGTCTCAGCAGTTGTTTGATGTACAATGAAGAGAATATAACAGTTACTTGCAACAAATGGGTATACGATTCAAAATACAGAACATCTTCTCGAGGCATCGATTGGGACTTAGTATGCGATCAAAGGTGGAGAGGCGCATTAGCCCAAACCATGTACATGCTAGGGGTTTTCACGGGAGCTGTTTATTTAGGAGGACTAGCGGACAAAATAGGTCGCAAAAAAGTATTTTGCATGTCCGCTGTGTTACAATTAATTTTGGGTGTAGTAGTTGCCTTTATACCAGAATATTGGACATTTGTAGTTATAACTTATTTTTACGCAATTTTTGGATCTGCCGGGGCATATATTCCTGCATTTGTGCTCACTATGGAATTAGTAGGCCCAAGTAAAAGAACTATTTGTGGTGTCGCTTTTCAAGCTACTTTTGCTTTAGGAATAATGTTGGTTGCAGGTTGGGGCGCGCTTATTGACAATAGGGTTGTACTTCAAGTTATTTATGGATTGCATGGTTTAATTCTTATTCCACATATTTGGATAATGGATGAGTCGCCACGTTGGCTCTGGGCCCAAGGTAGACCCAAAGAAGCTGTCGACATTGTCCAAAAAGCTCTAAAATATAATAAATCCGATAAAGTTTTAGACCGGGCAGTTTTAGTTTCTAAAGGAAAAGTAGAAAAGTCTAAAAATACGGAGTCGTCTGCAAGCGTTTTTGATTTGTTTAAAACACCAAATTTACGGATTAAAACTTTGAATGTTTGTCTTTGTTGGTTTGCTAATTCCTTAGTCTACTATGGACTAACTCTTAGTGCTGGAAAATTGGAAGGAAATCCTTATCTTATAACAGCTGTTTTTGGATTAGTAGAGTTACCTAGCTATGCGGCCGTTGTATATTTCCTTGATATTTGGGGGCGCAGGCCACTTATGACTTCCATGATGCTGGTAGGTGGCAGTGCATGTATCATTGCTGCCTTTATTGATCCAGATTATATCGTGTCAACTGTCGTTGTGATAGCGGGAAAGCTGTTCATAGCTGGTTCTTTTGCCATTATTTATAATTATTCCGCTGAATTATTTCCTACAGTTGTCCGAAATTCAGCTATAGGATTAGGATCAATGTGTGCTCGATTTTCAGGGGCTCTGACGCCTTTGATAACTTTGCTCGATTCTTTCGATCCGAAAATTCCGGCAGCCACCTTTGGTTTAGTGGCTATAGTATCAGGATTTCTCTGCTTCTTCTTACCTGAAACTATGAATCATCCAATGCCACAGTCATTGGAAGACGGAGAAAACTTTGGCAAGGGTGAAACTTGTTTCACAAGTTGTTTAGGTAAAAGGGATAGCATTGATTCATATAATGCCGATGATAAAGCAGAAGAAATGGTGGCGTTAGATGATATGAGTAAAAAAGTTTAA

Protein sequence:

>DPOGS203163-PA
MDKDRALEDMMGKLGDFGRYQGLQFFLHILSALTAGLHMLSLVTVAAVPEHRCTIDGVDSSNYTASWNSSFVLNAIPLNSHGKLESCKIYGENNTLQTCESWVYNTQYFTSSRGIEWDFVCSRRWMGAAAQTAYMFGVVMGSFVLGRFCDKFGRKTVFVWAGVFQLIFGSLVALATQYYTFIVLRFVYGIFGSGSYIAGFVLTMELVGPSRRTICGVAFQIMFAVGIMLVAGWGYILDNRFYLQILYGLHALILLPHWFLMDESIRWLWSQGRARESVALIEKALKMNGSNEIIETSALVSQCKATCAKYSDDEAAGTGDLFKSPNMLKKTLIICGCWFANSVVYYGLSLNTGKLNGNPYFLTFLFGIVELPSYIIIVYCLDRVGHRALISTMMLFGGIACLVVVALPHGSNSVTGVVMIGKLFISGSYSIIYKYSAELFPTVVRSSGVGLGSMCASVSGALTPLISLLDTLNPKIPTIIFGLLALLSGFSTFFLPETIGKELPQSVEDGEKFGVNDTCFTNCVGRRMSTASEDLPEAMEPLDNTVKKCWIDGVDTNESVALWNSSEILKSIPLTSTGSLSSCLMYNEENITVTCNKWVYDSKYRTSSRGIDWDLVCDQRWRGALAQTMYMLGVFTGAVYLGGLADKIGRKKVFCMSAVLQLILGVVVAFIPEYWTFVVITYFYAIFGSAGAYIPAFVLTMELVGPSKRTICGVAFQATFALGIMLVAGWGALIDNRVVLQVIYGLHGLILIPHIWIMDESPRWLWAQGRPKEAVDIVQKALKYNKSDKVLDRAVLVSKGKVEKSKNTESSASVFDLFKTPNLRIKTLNVCLCWFANSLVYYGLTLSAGKLEGNPYLITAVFGLVELPSYAAVVYFLDIWGRRPLMTSMMLVGGSACIIAAFIDPDYIVSTVVVIAGKLFIAGSFAIIYNYSAELFPTVVRNSAIGLGSMCARFSGALTPLITLLDSFDPKIPAATFGLVAIVSGFLCFFLPETMNHPMPQSLEDGENFGKGETCFTSCLGKRDSIDSYNADDKAEEMVALDDMSKKV-