Monarch geneset OGS2.0

DPOGS215970
Transcript         DPOGS215970-TA    3936 bp
Protein            DPOGS215970-PA    1311 aa
Genomic position   DPSCF300078 - 662474-669533
RNAseq coverage    45x (Rank: top 71%)
Annotation
Heliconius         HMEL018043         0.0   56.33%
Bombyx             BGIBMGA000927-TA   0.0   90.47%
Drosophila         CG5549-PB          0.0   65.81%
EBI UniRef50       UniRef50_B4JVT7    0.0   77.80%   Transporter n=14 Tax=Coelomata RepID=B4JVT7_DROGR
NCBI RefSeq        XP_002066140.1     0.0   70.06%   GK22198 [Drosophila willistoni]
NCBI nr blastp     gi|195436368       0.0   70.06%   GK22198 [Drosophila willistoni]
NCBI nr blastx     gi|195056207       0.0   45.30%   GH22876 [Drosophila grimshawi]
Group
Gene Ontology      GO:0016021   6.9e-203   integral to membrane
                   GO:0005328   6.9e-203   neurotransmitter:sodium symporter activity
                   GO:0006836   6.9e-203   neurotransmitter transport
KEGG pathway
InterPro domain    [39-632]   IPR000175   0   Sodium:neurotransmitter symporter
Orthology group    MCL16155   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS215970-TA
ATGAAAACGGAAAAGCAATCACCGACGCCGATTCCTGCGGTGACCAAGAACATACCAGCAATGGATCTAAATTATTCTGATTATAAGGACGACGTAAGTCGAGAACCGATTGTAGAGGAAGTGGAACCGGAAAGAGGGAATTGGACAGGACGTTTCGATTTCCTCCTATCTCTTCTAGGTTATAGCGTGGGGTTGGGCAACGTCTGGAGATTTCCCTACCTCTGCTATAATAACGGAGGAGGTGCCTTTCTGATTCCTTTCACCGTAATGCTGATAATAGCTGGTCTTCCTCTTATGTTCATGGAGCTGTCTTTCGGACAATACGCGGCCTTGGGTCCGGTAGCTGTTTACAATAGATTCTGCCCACTTTTCAGAGGATTGGGCTATGGGATGGTCATAGTATCAAGTATAGTTATGTTATACTATAACCTGATCATTGCTTGGACTATCTATTACATGGTGGTGTCGTTTACAAGCATTTTTTACCAATTGCCGTGGCAGAACTGCGACGCTGATTGGAGCACTAAATATTGTTATTCATACGAAGAAGCAGACATATGCGAGGCGAGCAATGGCACGTACTATCTGAGACAATGTGTGAATCAAAGCTATGCCATTGTTAATAATATCATTGCTCTAGCTGATACAGCAGTAAGAAAGCCCCCTGCTGAGGAATATTTTACTAATCAAGTGCTTGGACTTTCATCTGGCATTGAAGAAACAGGTCAAATCCGCTGGGGTATGGCTGCTTGTCTATTCGCTGCGTGGCTTATTGTATTTCTTTGCTTATGCAAAGGTGTACAGTCTTCGGGAAAGGTCGTCTATTTTACAGCGTTGTTTCCTTACGTGGTTTTGGTGATTTTATTTTTTCGTGGTGTTACTCTGCCTGGAGCGTCGACGGGTATACTGTTTTATCTTACACCAGACTTTAGCCAACTTGCAAATGCTCAGGTGTGGGGCGATGCAGCAGTGCAGATATTTTTTGCATTGAGTCCAGCTTGGGGAGGACTAATTACTTTATCCTCCTACAATAAATTCTCAAACAATTGCTACATTGATTCTCTCATAGTTGCTGTCTCTAACATTGCAACGTCGTTCTTCGCTGGGTTGGTGATTTTCTCCGTTATTGGATTTTTGGCTCATGAGCTTGAGGTAGAAGTTGACCGTGTGGTAGACCAGGGCGCTGGACTGGCATTTATTGTATACCCGGAGGTGGTTACAAGATTACCAGTATCACCTTTATGGTCCATACTTTTCTTCGTTATGCTTCTGACATTGGGACTTGATTCTCAGTTTGCTCTCATGGAGACTGTAACAACAGCTATTTTGGATCGATTTCCAAAATTTCGACAGAAGAAGACATGGGTAGTGCTAGCAGTGGCTGTGTTCGGTTATTTGGGTGGCTTGATTTTCACGACAAATAGTGGTATGTATTGGTTACAATTGATGGACAAATACGCAGCGAACTGGTCGGTTCTGATCATAGCTATTGGAGAGTGCATTTTGATTGCTTGGATATATGGAGCTGAAAAGTTCTGCATCGATATACAAACGATGATTGGTAGGCAGTCGAGGCTTTGGGTGGTGTTCTGGAGTGCCATGTGGAGGATTATTACGCCAGCAGCCTTAGTTTTTATTTTGGTGTTTAATTGGATTGAGTACAAGCCAGCTTCTTATGGTCATTACGTATACCCAATGTGGGCGGACGCGGTGGGTTGGACACTGGGAGTGTTGCCTATAGTCGTCGTAGTACTGATGGCCGTCAATCAAATATGCAGTGGACCCGACGATCTTACTATTATGGAGAAAGCAAGAGTCCTCGCTCGTCCTACTGAGGAATGGGGTCCAGCGTCTGCATCCTGTGCTAGCACTTTCCGCGAGACGGAGCTGTCTCCACCTGTACTGCTGCTTCACGGCAGGCCGACATCACAGGATGGTGCAGACTCGACATCCATTGGTAGTGGATTAGATAACGAACTAAAGAAACCCAGTACAAGCAC
TAACAAAGTAAAGAAAAATTCTATTAAAAGGAAAGTGCCAAACTTCAGCCATGATTCAGATGACGCTATCATAGAACCGTTGTTGCAAACTGAGAAAGATAATTCTGTGATTGCTAACAAAAGTGCCAAATATTTATTAAACAAATCTAAGACTGAGAGCATGTGCGAAAAAGATGCAAAACCATATGAAAAAGCCAAAATATCATCACCTTCGAGTATTTCAAATAATATTGCTTCTACGGTTGAAAAACGAACATCTCCCTTAACTTCTGAGTCACCACCTATCTATAATCACATGAACACGAATGTCTTTAGTGGATATGACTTAACAAAAGAAAAGACAACAACAGCAAGTAGTTCCGTTCTTGTAGAAATTGATGGATGTAAAATTGTATCCCAGGGAACAAACCCAGTTATCTTTACAACGGCAAAAATTCATGCAGATAATAAAACGAAGGCAATCGAACCAGTTCCAGTTACAACTGTGCCAATATTATCAACAATCGAAGGTAATATTGTCAAATCTGGAGTTATCTATGAAGATAATAAAAGATATAATGTATCACAAACCAATAAAGGCGATATAACGAAAAATGTGAATATAAAATCTGATTTAACGTTGGAATCATCAAATTTCAATAACCAGCCTAATGCCAAGGTAACACCTACTATACCTTTAACATCTATGCACATTAAAAATGAACAAAACAAAAATAAAGAACTGAAAGAACAATCTGCTGGTTCCTCTGAACAATCAGAAAAGTTGTTTAGCAAAGATCAAAAGGTGAGTCTTGGAAAGGTTAATACAAATACTTCTGATGAAAGATCTTTAATGCCTAAAGTAACTTGTGATGACAAAATAACAACAAAAGCAGAAAATACTATGTATAAAAGCAATGGTGTATTGCCGGATACTTCTAAAGAATTATTGAAATCTTCAAATAAACCAAGTACATTAAATAAAAGTAACATAGTCGAGAAAATAAAGCCATCAAGTGACGTTGAAAAGATAAGTACAAACCAGTCAACAACAAGCTCAACTAAACAAAGTAGCAAATCAAATGTATCACCTCAAAGCGATCAAACTGATTCTAGCAAGAACTTTTCTTCAAGTATTAAAAAACTGCAACGACAAAAGAATGCTATGATGGAGGAAGCTGTTGAAACTGGTAAAAAAGATTTGGTTAATTCAGCTAAGGATAAACCTTGTACAGTACAGAATATTATGTTAAAGGATGTAACTACTGCCGTTTCAGTGAATACAAAAACGATCCAAAAACCAGTCACGTCCTTGACAAAGGTGGCAATAACTTGTGCAGGTCCAAAAGCTGCATCTGAATTACCTTCGAAATCAATAAATACTACTGTACTTACCCCAGCTATAAAATCTCCTTTAATTACTCCGGGAACATCAACGACATTAACAACTAAACCGAGTACTAGTGCTGACACAGCGAAAGCAAAGTCAACATCAACTAAAATTCCAGAGACGTCTGGCAAATTAGCTACGAAGCCATCTTTAAATATTACAACTGTGACCACAGCTCCGTCTACAAAAACATCAACGGCTCCTTCTGTGAAAAGTATGAAGGTAACGGATGACCCAACCCCGTCAAAACAGGACAGTCCTTTAGTTTCCACTTCATCATCGAAAATACACAAAGCAGAAACTACTCCAACGTCATCTGTCGCTACTACTCTCCCACTGTATATAGTTTCTACGTCTAAAACTAAAATAGGTTCATCCTCAACTCCTAGTATATCTAACGTGACTAAGAAAACGCTTGATCAAAAACCTATTACGGAAGTTGTAAGTACTGCAAATGGGAAATCGGGAAAAGCGTTGATGTCAACATCATCGTCACAGAATAATCAGTCTGGTTCAAAACCAAAAACTTCTACAGTTAATTTAGTCAAAAAAGATGATAGAGCTTAA

Protein sequence:

>DPOGS215970-PA
MKTEKQSPTPIPAVTKNIPAMDLNYSDYKDDVSREPIVEEVEPERGNWTGRFDFLLSLLGYSVGLGNVWRFPYLCYNNGGGAFLIPFTVMLIIAGLPLMFMELSFGQYAALGPVAVYNRFCPLFRGLGYGMVIVSSIVMLYYNLIIAWTIYYMVVSFTSIFYQLPWQNCDADWSTKYCYSYEEADICEASNGTYYLRQCVNQSYAIVNNIIALADTAVRKPPAEEYFTNQVLGLSSGIEETGQIRWGMAACLFAAWLIVFLCLCKGVQSSGKVVYFTALFPYVVLVILFFRGVTLPGASTGILFYLTPDFSQLANAQVWGDAAVQIFFALSPAWGGLITLSSYNKFSNNCYIDSLIVAVSNIATSFFAGLVIFSVIGFLAHELEVEVDRVVDQGAGLAFIVYPEVVTRLPVSPLWSILFFVMLLTLGLDSQFALMETVTTAILDRFPKFRQKKTWVVLAVAVFGYLGGLIFTTNSGMYWLQLMDKYAANWSVLIIAIGECILIAWIYGAEKFCIDIQTMIGRQSRLWVVFWSAMWRIITPAALVFILVFNWIEYKPASYGHYVYPMWADAVGWTLGVLPIVVVVLMAVNQICSGPDDLTIMEKARVLARPTEEWGPASASCASTFRETELSPPVLLLHGRPTSQDGADSTSIGSGLDNELKKPSTSTNKVKKNSIKRKVPNFSHDSDDAIIEPLLQTEKDNSVIANKSAKYLLNKSKTESMCEKDAKPYEKAKISSPSSISNNIASTVEKRTSPLTSESPPIYNHMNTNVFSGYDLTKEKTTTASSSVLVEIDGCKIVSQGTNPVIFTTAKIHADNKTKAIEPVPVTTVPILSTIEGNIVKSGVIYEDNKRYNVSQTNKGDITKNVNIKSDLTLESSNFNNQPNAKVTPTIPLTSMHIKNEQNKNKELKEQSAGSSEQSEKLFSKDQKVSLGKVNTNTSDERSLMPKVTCDDKITTKAENTMYKSNGVLPDTSKELLKSSNKPSTLNKSNIVEKIKPSSDVEKISTNQSTTSSTKQSSKSNVSPQSDQTDSSKNFSSSIKKLQRQKNAMMEEAVETGKKDLVNSAKDKPCTVQNIMLKDVTTAVSVNTKTIQKPVTSLTKVAITCAGPKAASELPSKSINTTVLTPAIKSPLITPGTSTTLTTKPSTSADTAKAKSTSTKIPETSGKLATKPSLNITTVTTAPSTKTSTAPSVKSMKVTDDPTPSKQDSPLVSTSSSKIHKAETTPTSSVATTLPLYIVSTSKTKIGSSSTPSISNVTKKTLDQKPITEVVSTANGKSGKALMSTSSSQNNQSGSKPKTSTVNLVKKDDRA-