Monarch geneset OGS2.0

DPOGS210778
Transcript        DPOGS210778-TA   1974 bp
Protein           DPOGS210778-PA   657 aa
Genomic position  DPSCF300386 - 24317-34475
RNAseq coverage   264x (Rank: top 40%)
Annotation
Heliconius        HMEL016542        1e-151  58.43%
Bombyx            BGIBMGA004216-TA  0.0     67.16%
Drosophila        CG43066-PC        0.0     54.98%
EBI UniRef50      UniRef50_Q8MZ19   0.0     54.53%  Transporter n=43 Tax=Eumetazoa RepID=Q8MZ19_DROME
NCBI RefSeq       XP_002006688.1    0.0     54.33%  GI18448 [Drosophila mojavensis]
NCBI nr blastp    gi|195124415      0.0     54.33%  GI18448 [Drosophila mojavensis]
NCBI nr blastx    gi|195381203      0.0     56.31%  GJ21533 [Drosophila virilis]
Group
Gene Ontology     GO:0016021  3.4e-239  integral to membrane
                  GO:0005328  3.4e-239  neurotransmitter:sodium symporter activity
                  GO:0006836  3.4e-239  neurotransmitter transport
KEGG pathway
InterPro domain   [37-589]  IPR000175  3.4e-239  Sodium:neurotransmitter symporter
Orthology group   MCL11358  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210778-TA
ATGGCGGCTAAAGCGGAGCCGATAGGACCTAGGAATGGACATGAATTGGCACCGTTGAATACAAGAGCTGATGGCAGCGAGCGACCCCATGGGGTGACCATCGTCCTCCAAGGATCCAGGAACTCCTTGCAGAGAGAGGCTCCAGACGAAGACAGAGCAGCGTGGTCAGGGAAGCTGCAGTTCTTTCTATCCATTATCGGCTATTCCGTCGGCCTCGGGAACATTTGGAGATTCCCGTATCTGTGTCAACAAAATGGCGGCGGTGCTTTTCTGATTCCCTTCCTCATCATGTTGGTTTTGGAAGGCATACCGCTGTTCCTTATCGAAATGGCCATTGGACAGAAGATGAGATTAGGATCTCTAGGCGTGTGGAACACTATCCATCCATGGCTTGGCGGGATCGGTATAGCCAGCTGTGTCGTCACTCTGTTCGTAGCCTTATACTACAACGTCATCATCACATGGGTCTTCTTCTACCTCTTCAATAGTATTAGGCTTAGCGCCGACCAGCTACCTTGGGCGCACTGCCCTCAAGACAACGGAACTGCTGAGGCGGAATGCACCAAGGCTTCAGCGACAGTATACTACTGGTACAGAGAAGCACTCGACGCAGCACCCAGCATCGAGGAGCCGGGCGTCCCTCGATGGTGGATCGTCCTGTACTTACTTCTGGCATGGGTCATCGTGTTTTTCATAGTGATGAAGGGAATTCAGAGTAGTGGGAAGGTGGTATATTTCACGTCATTATTTCCTTACGTTGTGCTGACAATATTTTTTGTACGCGGAATCACTTTACCGGGATCAGCTGATGGCGTCATCCATATGTATAAACCAAAGGTAATCTATAAAATCCAAAAGGAGATCAAAGTTCTAGCACTGCATCATATCGGTGGGTTCACACTGAACTCCAGCCAGGAGTTCTATGACGAACACTACCCGCAGCTCAGCGCTAACATCACAACCGCCCTCAATCTGACCGGCTGCACGATGAGCAAGCAGTTGGATGAGGCTGCTGAAGGGACTGGTTTAGCGTTTATAGTTTTCACCCAAGCTATTCTCAAGTTGACACCGGCTCCGTTCTGGTCCATAATTTTCTTCTTGATGCTGCTGTCTCTGGGTCTAGGAAGTCAGATTGGTATCATGGAAGGCATGTTGTGCACTATCTTCGATATTGATTTCTTCAAAAGATTCAGCAAGCCTGTTATCACTGGTGCTGTATGCACGCTATGTTTTTTCGTGGGTCTCATCTTCACTACGGGAGCTGGGGAGTACTGGCTCAAGATGTTCGATTCATTCGCTGGTACCATCGGTCTGGTGGTGGTGGCACTGCTTGAAATGGTATCCGTCATATACATATACGGACATGAGAGGTTCACCAACGACATCTACGAGATGACGGGCTACCGTCCAGGATGGTACTGGCAGGTCACCTGGCGCTACATAGGACCGGTCATAGTCTCATGCATCCTAGTGTCTTCTTTAATATTCATGTTGCTGAACCCGCCCATGTATGGAGCCTGGAACGCTGAGGAGGGTCGTGTGGAGAAGACGCCTTACCCCCCGTGGGTGCTGTTTATCGCGGTGGCGATGATACTAGCTGGGATCCTCCCCATCCCGGTGGTGCTCCTCCTCAGACGGTTCCAATGCTTGGCTCTGGATGTAGACATCCACCAGGGCTCCATCAGGAGAATCGAAACCACCGTCTCCACCAAGGAGATGATGAGCGATCAAGATGTGTTGTCTCCAGAGAGTGTGCCGCCGTCGCAGCTGCTGCCTGATGCCGCCAGATTCACCATTGGAGACTTTGATTCTTCCGATTCGGAGAGTCAGGTCCGAGCCACCGACCGTTACGAAGATTCTAGTAGTGACGATGAAATTTTGAACGCGAAGTATTCGACCGGAACAGGAATACGTCTCTCCATGAAACCCATCGGTCACAATCCCAACGCATTCAAAATGGACGTCGTCGACTGA

Protein sequence:

>DPOGS210778-PA
MAAKAEPIGPRNGHELAPLNTRADGSERPHGVTIVLQGSRNSLQREAPDEDRAAWSGKLQFFLSIIGYSVGLGNIWRFPYLCQQNGGGAFLIPFLIMLVLEGIPLFLIEMAIGQKMRLGSLGVWNTIHPWLGGIGIASCVVTLFVALYYNVIITWVFFYLFNSIRLSADQLPWAHCPQDNGTAEAECTKASATVYYWYREALDAAPSIEEPGVPRWWIVLYLLLAWVIVFFIVMKGIQSSGKVVYFTSLFPYVVLTIFFVRGITLPGSADGVIHMYKPKVIYKIQKEIKVLALHHIGGFTLNSSQEFYDEHYPQLSANITTALNLTGCTMSKQLDEAAEGTGLAFIVFTQAILKLTPAPFWSIIFFLMLLSLGLGSQIGIMEGMLCTIFDIDFFKRFSKPVITGAVCTLCFFVGLIFTTGAGEYWLKMFDSFAGTIGLVVVALLEMVSVIYIYGHERFTNDIYEMTGYRPGWYWQVTWRYIGPVIVSCILVSSLIFMLLNPPMYGAWNAEEGRVEKTPYPPWVLFIAVAMILAGILPIPVVLLLRRFQCLALDVDIHQGSIRRIETTVSTKEMMSDQDVLSPESVPPSQLLPDAARFTIGDFDSSDSESQVRATDRYEDSSSDDEILNAKYSTGTGIRLSMKPIGHNPNAFKMDVVD-
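
Consistency check (illustrative):

The lengths reported in this record agree with each other if the transcript is assumed to be coding sequence only, with no UTR: 657 residues plus one stop codon gives 658 codons, or 1974 bp. The Python sketch below shows that arithmetic together with a basic start/stop codon check. The function names are hypothetical and not part of any MonarchBase tool, and the example sequence is only the first 9 and last 12 bases of DPOGS210778-TA joined for brevity.

```python
# Standard stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals 3 * (residues + 1 stop codon).

    Assumes the transcript contains only coding sequence (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


def has_start_and_stop(cds: str) -> bool:
    """Return True if the sequence is a whole number of codons,
    begins with ATG, and ends with a standard stop codon."""
    cds = cds.upper()
    return len(cds) % 3 == 0 and cds.startswith("ATG") and cds[-3:] in STOP_CODONS


# Values taken from the record: 1974 bp transcript, 657 aa protein.
print(cds_length_consistent(1974, 657))  # True

# Truncated illustration: first 9 and last 12 bases of the transcript.
print(has_start_and_stop("ATGGCGGCT" + "GTCGTCGACTGA"))  # True
```

The trailing "-" in the protein sequence corresponds to the terminal TGA stop codon, which is why one extra codon is counted.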