Monarch geneset OGS2.0

DPOGS210320
TranscriptDPOGS210320-TA2175 bp
ProteinDPOGS210320-PA724 aa
Genomic positionDPSCF300025 - 897071-913183
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0051290.093.32% 
BombyxBGIBMGA004216-TA1e-14142.90% 
DrosophilaCG10804-PC0.061.94% 
EBI UniRef50UniRef50_D6WF780.074.93%Transporter n=4 Tax=Coelomata RepID=D6WF78_TRICA
NCBI RefSeqXP_969026.10.074.17%PREDICTED: similar to sodium- and chloride-dependent neurotransmitter transporter [Tribolium castaneum]
NCBI nr blastpgi|3407167840.075.34%PREDICTED: sodium-dependent neutral amino acid transporter B(0)AT2-like [Bombus terrestris]
NCBI nr blastxgi|3838529950.075.45%PREDICTED: sodium-dependent neutral amino acid transporter B(0)AT2-like [Megachile rotundata]
Group
Gene OntologyGO:00160215.3e-191integral to membrane
GO:00053285.3e-191neurotransmitter:sodium symporter activity
GO:00068365.3e-191neurotransmitter transport
GO:00160203.3e-05membrane
GO:00058873.3e-05integral to plasma membrane
KEGG pathway 
InterPro domain[52-663] IPR0001750Sodium:neurotransmitter symporter
Orthology groupMCL11341 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210320-TA
ATGGCGAACACGGCACATTTGATGAGGCGACAGAGCTCGCGAGATTTGTACGCCCAACGATCTTTGGATAGAATGGAGATGAAGGAACTGAGAGGTCGGCTGGTGACCATGGAAAATGGACATCCCATACACCGCACCCACGTCATATACGGCGCCACCAATCAAGCCTTCGAGGACACGAGCCCCAATAGAAGATCCTCTGAGACACCGACTAGCAAGGCCAGCTCTACAGAAGGTCCCACGGGCTTAAAGCCCGAAGGAGGCGAGTGCGAACGGGAATCTTGGGACAGCAAGTTGACGTTCTTACTTGCGACCGTCGGATATGCTGTGGGTCTTGGTAACGTTTGGAGATTCCCCTATTTAGCACAGAAAAATGGCGGTGGAGCGTTCTTAATCCCGTATTTTGTGATGTTAGCTATTGAGGGTATACCGATTTTTTATTTGGAGCTGGCCATCGGGCAAAGATTGAGGAAAGGCGCGATTGGTGTTTGGCACCAGGTCTCACCGTACTTGGGTGGGATTGGTATCAGCTCAGCTGTTGTATCTTTCAACGTAGCGTTGTACTATAATTCCATCATCGCCTGGTGTCTGTTCTACTTTGCGCAGAGTTTCCAAGCTGAGCTGCCTTGGTCGGAGTGCCCGAAAAAATACTTCACCAATGGTTCCTACATCAGCGAACCAGAATGTGCTGTAAGCAGTCCCACTCAATATTTCTGGTACCGAACCACTCTCCAGATATCGGAAGACATCAATCATCCCGAAACATTTAATTACAAGATCGCGCTCGCTCTCATTATCGCCTGGATCTTAGTCTACATGTGTATGATCAAAGGCATAGCGTCTTCTGGGAAAGTTGTCTACGTCACAGCCACCTTCCCATATATAGTTCTGGTTATATTTTTCTTCAGAGGGATAACTTTGAAAGGAATGGGAGATGGGCTCGTTCATCTCTTTACGCCAAAATGGCATAAAATCCTAGATCCAGTTGTCTGGTTGGAAGCGGGCACCCAAATTTTCTTCTCCTTGGGCCTCGCTTTCGGAGGCCTTATCGCATTCAGCTCATACAACCCGGTGGACAATAACTGCTACAGAGATGCGATCATGGTCTCAATGACGAACTGTCTAACGTCCATGTTCGCTGGCATTGTTGTTTTCTCCATCATCGGTTTTAAAGCAACTATGGTTTATGAGAAATGTTTGGCCATACGAAATGAGACCCTCCTTGAAGTGTTTGGTCCTGAGTATACTAGAATGTCGTTTCCCGAAGCCGGTCAAAAATTGGCCGTCATGCTGAACGATACTGTAACCAATTTTATTATGCCACATCTTCCGGAATGCGATCTCCAGAAAGAGCTGGACAACACTGCATCGGGCACTGGTCTAGCCTTCATAATATTTACGGAGGCCATCAACCAGTTCCCTGGAGCTCAGTTCTGGTCCGTGCTGTTCTTCTTGATGCTGTTCACTCTCGGCATCGACTCGCAGTTCGGGACGCTGGAAGGCGTGGTTACCAGTATTGTTGATATGAAACTGTTTCCAAATCTTCGCAAGGAGTACCTCACTGGAGGTCTATGTCTGATCTGCTGTTTATTGTCTATGGGTTTCGCTCATGGCTCTGGGAGCTATATATTCCTGCTCTTTGACATGTACAGTGGAAACTTCCCGCTACTGATCATCGCTTTCTGTGAATGTATTGGGATAGCATACGTCTATGGCGTGAAGAGATTCGCGGATGACATAGAGCTGATGACGGGTCAACGTCCCGGGATATATTGGTTGATATGCTGGAAGTATCTCTCGCCTCTGGCGATGCTGTCCATACTGATATCCTCGTTCGCGGAGCTGATCATGGAAGGTTCGAGCTACGAAGCTTGGATCAGCTCCGAAGGCGATACTGTGAAGAAGCCCTGGCCTCTGTGGGCTGTTTTGTTGGTGTTGGTGATGGTGCTGGCTTCGGTACTTTGGATACCAGGACTGGCGATTTGCAGATATTTGGGTATCCAGGTTATAGATGATGAGGAGCGCGCGTGGTTCCCGGCCGAGGAGTTACGTGATTTCCACGGCTTGGAGCCGAGGCCTGTCTCCGCGTTGGAGACGCTGCTGTTCTGTACACGACCGGACGGCAGTGAGAAGTGCTGTTGGCCGGGGTGCTGCGATACCGACGACGAGGAATAG

Protein sequence:

>DPOGS210320-PA
MANTAHLMRRQSSRDLYAQRSLDRMEMKELRGRLVTMENGHPIHRTHVIYGATNQAFEDTSPNRRSSETPTSKASSTEGPTGLKPEGGECERESWDSKLTFLLATVGYAVGLGNVWRFPYLAQKNGGGAFLIPYFVMLAIEGIPIFYLELAIGQRLRKGAIGVWHQVSPYLGGIGISSAVVSFNVALYYNSIIAWCLFYFAQSFQAELPWSECPKKYFTNGSYISEPECAVSSPTQYFWYRTTLQISEDINHPETFNYKIALALIIAWILVYMCMIKGIASSGKVVYVTATFPYIVLVIFFFRGITLKGMGDGLVHLFTPKWHKILDPVVWLEAGTQIFFSLGLAFGGLIAFSSYNPVDNNCYRDAIMVSMTNCLTSMFAGIVVFSIIGFKATMVYEKCLAIRNETLLEVFGPEYTRMSFPEAGQKLAVMLNDTVTNFIMPHLPECDLQKELDNTASGTGLAFIIFTEAINQFPGAQFWSVLFFLMLFTLGIDSQFGTLEGVVTSIVDMKLFPNLRKEYLTGGLCLICCLLSMGFAHGSGSYIFLLFDMYSGNFPLLIIAFCECIGIAYVYGVKRFADDIELMTGQRPGIYWLICWKYLSPLAMLSILISSFAELIMEGSSYEAWISSEGDTVKKPWPLWAVLLVLVMVLASVLWIPGLAICRYLGIQVIDDEERAWFPAEELRDFHGLEPRPVSALETLLFCTRPDGSEKCCWPGCCDTDDEE-