Monarch geneset OGS2.0

DPOGS210546
Transcript:        DPOGS210546-TA  (1824 bp)
Protein:           DPOGS210546-PA  (607 aa)
Genomic position:  DPSCF300304 - 40758-42581
RNAseq coverage:   146x (Rank: top 54%)
Annotation
Heliconius:       HMEL003478              0.0  91.43%
Bombyx:           BGIBMGA013466-TA        0.0  88.01%
Drosophila:       VAChT-PA                0.0  72.51%
EBI UniRef50:     UniRef50_UPI00022C88AD  0.0  70.30%  UPI00022C88AD related cluster n=3 Tax=unknown RepID=UPI00022C88AD
NCBI RefSeq:      XP_975499.1             0.0  73.19%  PREDICTED: similar to vesamicol binding protein, putative [Tribolium castaneum]
NCBI nr blastp:   gi|347967725            0.0  73.34%  AGAP002369-PA [Anopheles gambiae str. PEST]
NCBI nr blastx:   gi|347967725            0.0  74.13%  AGAP002369-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology:    GO:0015893  9.1e-41  drug transport
                  GO:0016021  9.1e-41  integral to membrane
                  GO:0015238  9.1e-41  drug transmembrane transporter activity
                  GO:0055085  4.9e-34  transmembrane transport
KEGG pathway:     dmo:Dmoj_GI21330  1e-95
                  K08155 (SLC18A)  maps-> Parkinson's disease
InterPro domain:  [37-485]   IPR016196  7.4e-60  Major facilitator superfamily domain, general substrate transporter
                  [116-255]  IPR004734  9.1e-41  Multidrug resistance protein
                  [50-405]   IPR011701  4.9e-34  Major facilitator superfamily
Orthology group:  MCL14786  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210546-TA
ATGTCGGAGGAACCGCATACCGTGTGGCAAAAAATTGACAACGCCGTTATTCCAGTGATTAATCTGGAGATTCGAGAGGTCCGCGAGATACTATGGGAGAAGATAAAGGAACCCACGTCTCAAAGAAAGATAATTTTAGTAATAGTGTCTATAGCCCTCCTACTAGACAACATGCTCTATATGGTAATCGTACCAATCATACCAGATTATTTAAGATATATTGGAGCTTGGGGAGAGGCGGGCTACGATCACGTAGTCACGTTGCCGCCAATCAAGGAAGGAAATAGAACTATAATACCGACTAAAATTATACCAGCATCTCATGAAGGCCAGGATTCAGCGACTGGAGTTCTGTTCGCATCTAAGGCGATAGTACAATTAATGATTAATCCCTTTTCCGGTGCCTTAATTGACCGCATAGGTTATGATATTCCTATGATGATCGGACTTATAATAATGTTTCTTTCCACTTCGATATTTGCCTGTGGGCGGAGTTATAGTATGTTGTTCTTTGCGAGAAGTCTTCAGGGTGTTGGATCTGCCTTCGCAGATACTTCCGGTCTCGCAATGATTGCTGACAGATTTACCGAAGAATCTGAACGCTCAAAGGCACTTGGCATTGCTCTTGCCTTTATAAGTTTTGGATGTCTTGTTGCTCCCCCATTTGGTGGAGCTTTATATCAATTCGCCGGGAAAGAAGTACCATTTCTAATACTTGCACTGATATCTCTTTTGGACGGTTTTATGTTACTTCTTGTTATGAAACCGCTAAAGACACAAATGAAAGAGGCTTCCCAACCAAAACCAGCTGGCACTCCTATTTGGAAACTATTGATGGACCCCTATATCGCGGTATGCGCTGGAGCCTTAATGATGTCAAATGTGGCATTGGCCTTTCTGGAACCAACAATATCTCTATGGATGGAAGATAATCTGACAAAGGATAATTGGAAAATTGGTATGATTTGGCTACCGGCATTCTTTCCACACGTTTTAGGGGTCATTATAACAGTAAAAATGGCAAAACAGTATCCACAGCATCAATGGTTGATGGCAGCTGGTGGTTTGGCTTTAGAGGGCCTATGTTGTTTTATAATACCATTTGCAAGCTCATACAAAATGTTAATGATTCCAATTTGTGGTATCTGCTTTGGTATAGCTCTCATTGACACAGCTCTTTTACCAACTCTAGGATACCTCGTGGATGTACGATACGTATCTGTTTACGGAAGCATCTACGCTATTGCTGATATATCCTACTCTTTCGCTTATGCTGTTGGACCCATCATAGCTGGCGAGGTAGTGGAGGCTATTGGTTTCACCGCACTTAATTTGTTTATTGCCTTCAGTAATCTACTATACGCTCCTGTTTTAATATATCTGCGCCATATTTATGACTTCAAACCTTTTGAGAATGAGGCAAATATTTTGATGGCAGATCCTCCGGACAAAGAATATCAAACGTATAGCATGCAAGATCAGAGACCAGTGAATGGTGAATTTAAAAATCATTTGGAGTACAGCTCCATGGGTGGCCAAGGGGATTCCGTGCAAGAGTCCAACGTGGATGGGACACAATATGATTACCAAGACGGATATCAAAACTATTCACAAGGCTACGAACAGCAAGGTTTTCAGCAGCAAAGCTATCAGCAAGAAGGGTACAGTCAGCCTCGACAGTTGCCAGCGCAGCCCCAACCTCCAGCCAGCAACCCCTTCAGAGCCGGGTCAGCAGCTGCGGCGCCCACCCCAGCCCCCGCCGCGGCACCGACTCCCGCCCCGGCCACCGGTGCCATCAGGAATCCATTCCGACAGGGCTTCTAG

Protein sequence:

>DPOGS210546-PA
MSEEPHTVWQKIDNAVIPVINLEIREVREILWEKIKEPTSQRKIILVIVSIALLLDNMLYMVIVPIIPDYLRYIGAWGEAGYDHVVTLPPIKEGNRTIIPTKIIPASHEGQDSATGVLFASKAIVQLMINPFSGALIDRIGYDIPMMIGLIIMFLSTSIFACGRSYSMLFFARSLQGVGSAFADTSGLAMIADRFTEESERSKALGIALAFISFGCLVAPPFGGALYQFAGKEVPFLILALISLLDGFMLLLVMKPLKTQMKEASQPKPAGTPIWKLLMDPYIAVCAGALMMSNVALAFLEPTISLWMEDNLTKDNWKIGMIWLPAFFPHVLGVIITVKMAKQYPQHQWLMAAGGLALEGLCCFIIPFASSYKMLMIPICGICFGIALIDTALLPTLGYLVDVRYVSVYGSIYAIADISYSFAYAVGPIIAGEVVEAIGFTALNLFIAFSNLLYAPVLIYLRHIYDFKPFENEANILMADPPDKEYQTYSMQDQRPVNGEFKNHLEYSSMGGQGDSVQESNVDGTQYDYQDGYQNYSQGYEQQGFQQQSYQQEGYSQPRQLPAQPQPPASNPFRAGSAAAAPTPAPAAAPTPAPATGAIRNPFRQGF-
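
Consistency check (illustrative sketch):

The transcript and protein records above should agree: 1824 bp is 608 codons, that is 607 amino acids plus one stop codon, which this record writes as a trailing "-". The following is a minimal sketch of how that could be checked, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration and are not part of the geneset release. Only the first ten and last four codons of DPOGS210546-TA are used here; the same function could be applied to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0.

    Stop codons are written as `stop_symbol` ("-" matches the protein
    record above). Trailing bases that do not fill a codon are ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 nucleotides of DPOGS210546-TA, copied from the record.
first_codons = "ATGTCGGAGGAACCGCATACCGTGTGGCAA"
last_codons = "CAGGGCTTCTAG"

print(translate(first_codons))  # MSEEPHTVWQ
print(translate(last_codons))   # QGF-

# Length check: 1824 bp -> 608 codons -> 607 residues plus the stop codon.
print(1824 // 3 - 1)            # 607
```

Both translated fragments match the start (MSEEPHTVWQ) and end (QGF-) of DPOGS210546-PA, and the genomic span 40758-42581 covers 1824 positions, the same length as the transcript.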