Monarch geneset OGS2.0

DPOGS205278
TranscriptDPOGS205278-TA3411 bp
ProteinDPOGS205278-PA1136 aa
Genomic positionDPSCF300021 - 428602-435795
RNAseq coverage323x (Rank: top 35%)
Annotation
HeliconiusHMEL0174850.072.71% 
BombyxBGIBMGA011081-TA0.067.12% 
Drosophilabdg-PC2e-12836.90% 
EBI UniRef50UniRef50_D6WAN60.051.97%Transporter n=4 Tax=Neoptera RepID=D6WAN6_TRICA
NCBI RefSeqXP_001815298.10.052.19%PREDICTED: similar to sodium/chloride dependent transporter [Tribolium castaneum]
NCBI nr blastpgi|2700015240.051.97%hypothetical protein TcasGA2_TC000366 [Tribolium castaneum]
NCBI nr blastxgi|3504190660.044.77%PREDICTED: hypothetical protein LOC100741389 [Bombus impatiens]
Group
Gene OntologyGO:00160211.9e-45integral to membrane
GO:00053281.9e-45neurotransmitter:sodium symporter activity
GO:00068361.9e-45neurotransmitter transport
KEGG pathway 
InterPro domain[346-1119] IPR0001751.9e-45Sodium:neurotransmitter symporter
Orthology groupMCL15960 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205278-TA
ATGGAAACAATTGCAATGGAAGAGGGTAGGTATGAACAAAAGGAAAGTTTTCCAGGAAGTAGTGGTATTGACAGTGATACATCAAGTGTATTTGAGAGTAATGAAAGTAGTGAACAAATAAGCGAAGATGAAGAAGATTTGAAAGATGACCTCACAAAACATTTACCTATATTTTATAAGTCTGATGCAAACCATGAATCACATAACAACATCACAAACGATGAGGAATTTCAAGCAGCCGCTCTTGGTCAGTTTATGGACATATTAAATGATTTAGATGAAGTTCTAGATAAATCCCTGCTTGCATGTCTTGACGATGGTACTAAATCTTTAGACACTGACGAAGGGGATATTATCTGTAAAATAAAGGAATGTATAGTGGATACTGAAAATATATCTGAACATTTAGGATCCCAGAACTCTTTAGATGATGATACGCAAGTCGTGTGCTCCGTTCTTCCGTCTTCGACAGCATCAATAGATGATTTAGAACTTTCACAATGTGCTCCCATTTCACGCAATAATCTCATCCGTTCGAAAAGTTTCTCTGAAATTCCACGAAATCATACACAGTTTACCACCGCTCAGGCTTTAGAAAGGGCTAATACGGTAAACAACACGCTAAGAAATTCAATGAGAAGATTAGATCCTATAGTTCTACCAGCTATAACAAACCAGGAATCTGAACCTCTTACACTGCCCGTCATATTGTTTTTGGAGCATCATGTTAATGCGCGACCAACAAGTTCTCCTATACAATTGCAAGTGACCGCAGCAAATTTAAGTAGCGATGGGAGTTCCGGTCCACTCATAGTTGGAAGACGGACACTGTTAATGAACAGAGCGTTATCATTACCGTCGCCTGTTGATAGTGATATCACGACTAATTGGAACGAACGAATATCGAGAAGTCTAGCGAGAAGCGCAAGTGCATCTAGTTCGTCAGAAGATTCACTGCCAGGGTTAGCAGCTGTAAATCACAGTCCTGCTTCTGACGACCGACATGAGGATGCTGACATGCCCCCGTTTGGAGTTTGGCCGCATAGAATGAGCGCGATGCTAGCGTGTTTCAGTTGCACGATAGGAATATTCAACATCTCAAGATTTGCCATATTTAGCGTTAACTTTGGAGCCAGTTTTATAGTGCAATTTATTATACTATCATTAATAGTAGGTATACCATTGTTTACATTGCACTTGTGTTTGGGTCAAGTTTTGGAATCCGGGCCAGTTGACATGTGGAAGATATCTCCAATATTCCAAGGTGTTGGTATATCATTATTGTTAACACAAGCTGTTATTGGGATGTATAGTATAATAGGATTGTCCTGGATATTCGTTTACTTCAGAGATTCCTTTATAACATCAGATGATAGATACAAATGGGCATTACCAAATGAATACAACTTTGACAGTCACAGAAATAACACTAAAATATATGAGACACTGCCAAAGTATTTCCATACTGAAGTGCTTCAAAGAAATGGGAATTCAAACAGTTTTGGTACTATAAAGTTTCAAGTGGCATTCAACTTGGCTGTTGTATGGATGATAGTTTTTGTTTCTCTCAGCAAAGGATTGAGGTCGTACGGCAAGGCTGTGTATATGTTGATATTTTTACCTATCTGTGGTACTTTAGTGCTTTCTATCAAACTATTGACTCTAATACCTTATGATACTGTGACCAATATATTCCCTGAGACTGAATGGAGCGAATTTTTCATAAATAGTAGTAGTTGGGCTGCCGCTGCCCAAGAGACTTATCTGACATGGGGTTTGTTGTCAGCTTGTGTAATGCAGTTGACTACACACAAACATCCGAAACACAAAACACATCTTATACTACAACGGGAGAGCGCATGTATAGTTGTGTTCACCATGAGCGTTCTATTTTTAGGAGCTTTCCTTGCTAATACATGTGTCGTTATATTGAAGAGTTACGGTTTCACCTACGTGCCTAGTAGTTTCGAGACAGTTAAATCATCACAATTCTTATGGCCAGTCTCGGAACCGCTACCTGGTAACACAGTATCAACTCCCTTGCGGTATATGGGGCATTATGGGAGTCTGGTAGGAGTTACAGTATGGAAGACTGGTAACATTGCAAGAACTTTGAGTGGTTGGCAACCTTTACAACTGGCAACACAGATAGTTCCTGCAACACTGGCTGTGCTGCCGACAAATTTTCTGTCACCAGCGTGGGCTGTGATATTCTACTTCATCTTAATAATGTTTGGTATAGCCCAACAGCTTGCTATATGGCATTGCGTCATAACAGGAATCATGGCTATTAACGCTAAGGCCCTCAAAGTATGGGAGACGACTATAACTTTCCTAAGCTGTGTTTTTGGTCTTGCTGTGGGATTGCTTTTATCTACTGATGCGGGGATACGTATAGTACATTTCATCGACTACGTGTGGGTGGGATGTTGGTGGCAGTGCATAGTACACGTGTCGCTAGTCGTAGGTGTGTTCGTGGTACGAGGGCGGCCGTACTCGCCGGACGCGGTGGTGGGGGCGCTGTACACCGCGGGCTCTCGTCTGTCCGCCACACTCGCCGCTCTATTAAGTTTCACGTGGACCGTGGTGCTGCCTGTGTTGCTCTGTGCAATATGCATAATGGATTTTCGGACTGGACAGCAACGACAATTGTACAGTTGGCGGAAACCTATCAGTTACTGGCCAATATGGACACGCCAAGTGGCAGTTTTCTTACAGCTGACCGCACTTCTGATTGTACCTGTAACAGCTTTCGTACAAACTTGGATATACATATATAAAGGACCTACCGATATATTAGAGGATGACGAGTCCATCATTTGCGCCGATGACCCGAGAATTCAGAATCTGTATCGTCCTCGTATTGGTTCGTCGGGCTCCACGCCGATTGTTATCGGTGCGGTTGAAGATAGGCCTTCGCCCCCCGACCCTCCCCCGAAATATACGCCGCCGCCATCATACTCGACCGCCACAGGAGCCCGACTTATGCATACCTTGAGGCGGAGCTTTAGAACACTAAGGAGAATAACATCAACACGCGAGGAGACGGTCGACGAGACATCACTACCCATCACACTGAGCGAGAGTGTCGACAGACAGCCGCAGACTAGCGACCAGCGGACCTTCACACCGGTGACACTCACCGACGAGCCGGACACGACACGACAAATCCGTCCGACGTCCTCACGACAATCGCTCACACTCACAAGGGATTACCTCAGGCGATCCTTCGTCAGGAAGAACGACTCCAAAGGCATACGGTCCAGCTTACGAAGAAGCTTCAAGTACGGTGGTAATCTGACGACCAGCCACGAACATTTGGTGAGGGATATTGAACCAATATCAAACACTGTAGCCATGTCCGCCATGACCGAACCCTCTCATACAAGGAGCTTGGGATCAGTCTTATGA

Protein sequence:

>DPOGS205278-PA
METIAMEEGRYEQKESFPGSSGIDSDTSSVFESNESSEQISEDEEDLKDDLTKHLPIFYKSDANHESHNNITNDEEFQAAALGQFMDILNDLDEVLDKSLLACLDDGTKSLDTDEGDIICKIKECIVDTENISEHLGSQNSLDDDTQVVCSVLPSSTASIDDLELSQCAPISRNNLIRSKSFSEIPRNHTQFTTAQALERANTVNNTLRNSMRRLDPIVLPAITNQESEPLTLPVILFLEHHVNARPTSSPIQLQVTAANLSSDGSSGPLIVGRRTLLMNRALSLPSPVDSDITTNWNERISRSLARSASASSSSEDSLPGLAAVNHSPASDDRHEDADMPPFGVWPHRMSAMLACFSCTIGIFNISRFAIFSVNFGASFIVQFIILSLIVGIPLFTLHLCLGQVLESGPVDMWKISPIFQGVGISLLLTQAVIGMYSIIGLSWIFVYFRDSFITSDDRYKWALPNEYNFDSHRNNTKIYETLPKYFHTEVLQRNGNSNSFGTIKFQVAFNLAVVWMIVFVSLSKGLRSYGKAVYMLIFLPICGTLVLSIKLLTLIPYDTVTNIFPETEWSEFFINSSSWAAAAQETYLTWGLLSACVMQLTTHKHPKHKTHLILQRESACIVVFTMSVLFLGAFLANTCVVILKSYGFTYVPSSFETVKSSQFLWPVSEPLPGNTVSTPLRYMGHYGSLVGVTVWKTGNIARTLSGWQPLQLATQIVPATLAVLPTNFLSPAWAVIFYFILIMFGIAQQLAIWHCVITGIMAINAKALKVWETTITFLSCVFGLAVGLLLSTDAGIRIVHFIDYVWVGCWWQCIVHVSLVVGVFVVRGRPYSPDAVVGALYTAGSRLSATLAALLSFTWTVVLPVLLCAICIMDFRTGQQRQLYSWRKPISYWPIWTRQVAVFLQLTALLIVPVTAFVQTWIYIYKGPTDILEDDESIICADDPRIQNLYRPRIGSSGSTPIVIGAVEDRPSPPDPPPKYTPPPSYSTATGARLMHTLRRSFRTLRRITSTREETVDETSLPITLSESVDRQPQTSDQRTFTPVTLTDEPDTTRQIRPTSSRQSLTLTRDYLRRSFVRKNDSKGIRSSLRRSFKYGGNLTTSHEHLVRDIEPISNTVAMSAMTEPSHTRSLGSVL-