Monarch geneset OGS2.0

DPOGS207520
TranscriptDPOGS207520-TA3141 bp
ProteinDPOGS207520-PA1046 aa
Genomic positionDPSCF300177 + 4965-31178
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0087430.092.06% 
BombyxBGIBMGA004312-TA0.087.58% 
DrosophilaDAT-PA0.050.41% 
EBI UniRef50UniRef50_Q4F9L70.090.14%Transporter n=9 Tax=Coelomata RepID=Q4F9L7_OSTNU
NCBI RefSeqXP_975356.10.068.92%PREDICTED: similar to high-affinity octopamine transporter protein [Tribolium castaneum]
NCBI nr blastpgi|1242455510.090.14%high-affinity octopamine transporter protein [Ostrinia nubilalis]
NCBI nr blastxgi|874482080.089.98%high-affinity octopamine transporter [Trichoplusia ni]
Group
Gene OntologyGO:00160211.2e-215integral to membrane
GO:00053281.2e-215neurotransmitter:sodium symporter activity
GO:00068361.2e-215neurotransmitter transport
GO:00082702.2e-10zinc ion binding
KEGG pathway 
InterPro domain[81-649] IPR0001750Sodium:neurotransmitter symporter
[669-794] IPR0151531.2e-36EF-hand domain, type 1
[798-886] IPR0151542.9e-27EF-hand domain, type 2
[891-936] IPR0004332.2e-10Zinc finger, ZZ-type
Orthology groupMCL10151 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207520-TA
GCCAGTCTCAGCGGTAGTGGGGTTGTAACACACACGGCCCCGTCTTACGAAGAACAACGAGCGAGTCCCGCACTGTTGAGCAGGAACACGGGCACGCCTGGTGGACGGAGTACCAGGGATGACGGTTACTGTTCAGCTGGGAGTACTCCGAGAGCATTTGAGGAACCTAAAATACAATTAGACGAGGAAGTTTATTACAGTGACTCGTCGAAGACATCAAAAACTAACAACGTCCTTAACAAATCAGAAAATGACGATGGCCGCGAGACTTGGGGTACTGGAGCCGATTTTCTCCTGTCCATCATCGGCTTCGCTGTGGACCTCGCTAATGTCTGGCGTTTCCCCTACCTTTGTTATAGAAACGGAGGCGGAGCTTTCCTTATCCCGTACACGTTGATGTTGGTTTTCGGAGCTGTTCCTCTATTCTATATGGAGCTGATCCTCGGGCAATACAATCGACAAGGTCCGATAACGCTCTGGAAGATATGTCCGTTGTTCAAAGGTGTTGGCTTCTGCGCAGTGATGGTAGCCTTTTACGTGTCTTTCTATTACAATGTCATAATCGGTTGGGCCTTCTACTTTCTGGTGTCGTCGGCTCGCTCTGAGTTACCCTGGGTCCACTGTGATAACTCCTGGAACACGGAGCAGTGTTGGGACGCAGCTCGTCTGAATGCCACCAACCGTACTGATATACCATACCAGGGACCACTCTCACACTTCACACCGGCCAGCGAGTTCTTCCATCGGGCTGTTCTCGAGATGCAGCACTCCGAGGGTCTTAATGACTTGGGTCTGCCAAAATGGCAGCTGGCTGTCTGCCTGGGGGTGGTTTACTTCACACTGTATTTGTCACTTTTCAAGGGCGTCAAAAGTTCAGGTAAAGTGGTGTGGATGACTGCTACGATGCCGTACGTGGTGCTATCCATCCTGCTCGCACGAGGCTTGCTCCTCCCAGGTGCGACACGTGGCATCGCTTACTACCTGCACCCAGAGCTGACACGCCTCAAGGACACACAGGTGTGGGTCGATGCTGCTGTACAGATTTTTTATTCCGTCGGAGCCGGCTTCGGCGTCCACCTCTCGTACGCCAGTTACAACACATTCCACAACAATTGCTATCGGGACTGTCTTGTCACGACCCTGGTGAACTGCTTCACCTCTTTCTTTTCCGGATTCGTGATCTTCACGTATTTGGGCTTCATGTCACACAAACAAGGAGTCCCTATATCATCGGTAGCTACAGAAGGTCCAGGTCTAGTGTTCCAGGTGTATCCGGAGGCGGTCGCCACCTTGCCCGGAGCTAGTCTGTGGGCCATGCTGTTCTTCTTCATGCTTATAATGCTAGGACTTGATTCTGGAATGGGCGGCCTGGAGTGCGTGATCACCGGTCTACTGGACCAGGCGCGTGCTTGTGGCGCCTCCCGACTCCGCCGCGAACACTTCACACTGGCAGTCGTCTGTCTGTCGTTCTGCGTCGCCTGCATCAATGTTACTCCGGGCGGCATATACATGTTCCATTTACTCGACACCTACGCCGCTGGTATTTCTTTGCTGTGCTCGGCGTTGTTTGAAGCTGTCGCCGTTTCTTGGTTTTATGGTTTGAAGAGATTCTCTGATGATGTAGAGGAGATGCTGGGTTTCAGACCTGGACTGTACTGGAGGATCTGCTGGAAATTTGTCAGCCCCGTGTTTATTATTGGTGTGGTCGTGTTTGGGCTGCTGTATCAACAGCCGTTGCAATACCAACACTACACGTACCCTCCCTGGGCGGTCGTCCTCGGCTGGGGGCTCGCTTGCTCCTCCATACTCATGATACCAATCGTGGCGATCTACAAGCTCGTCACTACCCCGGGAACATGTCGACAGCGTCTGGCCTGTTGTATTTCGCCGGAATCCGAACACGAAGCGATCAGAGGTGGTGCCCCAGCCAAAATGTCATTAGAAAGAGAAACAATGGGTGTTATGCCACCCATTTCCGTTGGCAGCGGTCCACTGGATCCTCGCTCACAGCTGATACAAGAAATGAGAGACCAGAATTTCGATCTCATCAGATTCGCTTCATACAGGACGGCATGTAAACTCCGATTTGTTCAGAAGAAATGCAACTTACATGCAATCGATATATGGAATGTCATAGAGGCGTTCAGAGAGAATGCTCTAAACACTCTCGAGCCGACAGCCTGTGTGAATGTCACGAGACTTGAGACACTGGTGTCCTCACTGTATCATAACTTAAACAAGAGGCTACCTCCCGCTAACCAAGTCTCGGTGGAAGCATGCTCGGCATTGCTATTGAACTGGCTATTATCAGCATTTAGTACCGGCGAGAATGTGGGAAAGATAAGAGTTTTTTCTATCAAAGTGGCATTAGCAACGATGTGTGCCGGCAAGCTGATGGATAAATTGAGATACATATTCTCTCAACTCTCGGATGGTAATGGTCATTTACTTATGAAGAGGCTCTCAGACTATCTGCGGGAAGTGTTGGCTTTACCGGCAGCTGTTTACGAGTCACCATCCTTCAGTTACAATGACAATCTAGCTATGGCTATATTTAACCAGAATGTCAAAATAACGGTCAATGACTTCTTAGACACACTGATGTCGGATCCGGGACCGCCGTGCCTTGTGTGGCTACCGTTACTCCACAGACTGGCTAGCGTGGAGAATGTGGTGCACCCCCTCGCCTGCAGTGTGTGTCGTCGTGGTTCTCTGACTGGGTTCCGTTACCGCTGTACACGCTGCGCCGCCTACACCCTCTGTCAGGACTGCTTCTGGCGGGGTCACGTGAGCCCCCCGCACAGCAATGACCACGAGGTCAAAGAATACGCCACCTACAAATCTCCATCTAAGCAGATCGGTGCGACTCTTCGTAAGTCATTCCGCTGTGTTCCCGAGCGAGCACGACCAGCCCTTCCGAGGTATCCCGACCAGCCGGAGCGGACACTCAATTTAAGCCATATTGTTCCACCGTCACCAGTGCCGGCTCACAACGGCTTCCCGGAGTACACTGGCTACAACACCGGCTCATTGGACAGTCGTTCATCAAGATCGACACATAGAATGAAGCCTCAGCAGAGTGAATGCAATTTCACTCGTAGAAAATACCCTCGATTGAGACAGATAGAGGCAAGCAATTGA

Protein sequence:

>DPOGS207520-PA
ASLSGSGVVTHTAPSYEEQRASPALLSRNTGTPGGRSTRDDGYCSAGSTPRAFEEPKIQLDEEVYYSDSSKTSKTNNVLNKSENDDGRETWGTGADFLLSIIGFAVDLANVWRFPYLCYRNGGGAFLIPYTLMLVFGAVPLFYMELILGQYNRQGPITLWKICPLFKGVGFCAVMVAFYVSFYYNVIIGWAFYFLVSSARSELPWVHCDNSWNTEQCWDAARLNATNRTDIPYQGPLSHFTPASEFFHRAVLEMQHSEGLNDLGLPKWQLAVCLGVVYFTLYLSLFKGVKSSGKVVWMTATMPYVVLSILLARGLLLPGATRGIAYYLHPELTRLKDTQVWVDAAVQIFYSVGAGFGVHLSYASYNTFHNNCYRDCLVTTLVNCFTSFFSGFVIFTYLGFMSHKQGVPISSVATEGPGLVFQVYPEAVATLPGASLWAMLFFFMLIMLGLDSGMGGLECVITGLLDQARACGASRLRREHFTLAVVCLSFCVACINVTPGGIYMFHLLDTYAAGISLLCSALFEAVAVSWFYGLKRFSDDVEEMLGFRPGLYWRICWKFVSPVFIIGVVVFGLLYQQPLQYQHYTYPPWAVVLGWGLACSSILMIPIVAIYKLVTTPGTCRQRLACCISPESEHEAIRGGAPAKMSLERETMGVMPPISVGSGPLDPRSQLIQEMRDQNFDLIRFASYRTACKLRFVQKKCNLHAIDIWNVIEAFRENALNTLEPTACVNVTRLETLVSSLYHNLNKRLPPANQVSVEACSALLLNWLLSAFSTGENVGKIRVFSIKVALATMCAGKLMDKLRYIFSQLSDGNGHLLMKRLSDYLREVLALPAAVYESPSFSYNDNLAMAIFNQNVKITVNDFLDTLMSDPGPPCLVWLPLLHRLASVENVVHPLACSVCRRGSLTGFRYRCTRCAAYTLCQDCFWRGHVSPPHSNDHEVKEYATYKSPSKQIGATLRKSFRCVPERARPALPRYPDQPERTLNLSHIVPPSPVPAHNGFPEYTGYNTGSLDSRSSRSTHRMKPQQSECNFTRRKYPRLRQIEASN-