Monarch geneset OGS2.0

DPOGS203406
Transcript: DPOGS203406-TA  1992 bp
Protein: DPOGS203406-PA  663 aa
Genomic position: DPSCF300003  +  1261466-1267046
RNAseq coverage: 33x (Rank: top 75%)
Annotation
Heliconius: HMEL006375  2e-173  49.85%
Bombyx: BGIBMGA010528-TA  3e-93  36.31%
Drosophila: blot-PB  3e-35  24.87%
EBI UniRef50: UniRef50_B6CY69  4e-44  27.09%  Transporter n=4 Tax=Tribolium castaneum RepID=B6CY69_TRICA
NCBI RefSeq: NP_001137203.1  7e-45  27.09%  bloated tubules [Tribolium castaneum]
NCBI nr blastp: gi|219522042  1e-43  27.09%  bloated tubules [Tribolium castaneum]
NCBI nr blastx: gi|219522042  5e-53  27.14%  bloated tubules [Tribolium castaneum]
Group
Gene Ontology: GO:0016021  1.3e-18  integral to membrane
               GO:0005328  1.3e-18  neurotransmitter:sodium symporter activity
               GO:0006836  1.3e-18  neurotransmitter transport
KEGG pathway:
InterPro domain: [40-496]  IPR000175  1.3e-18  Sodium:neurotransmitter symporter
Orthology group: MCL31895 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203406-TA
ATGTGCGCTCAACGTGTGGTAAGATCTCGACTGGTTAATGGCAGCCGGCCGGCTGGTCAAGCGCAGAACTGGCTTAGCTGGCGTTTGCTTCGAACTCACTTGTACACTTTGGCTGTGAGTGCAGGCTTTAGCAGTAGTTGGCGGACTCCAAGGGAGGCGTTTCTGTATGGAGGTCTTACTTTTTCGTTAGCGCTAACAGTAGCAGCGGTGCTGATTGCACTGCCGACGGTGTTGCTGCAACTTGCCGTGGGCCAGCTTAGTCAGCAAGATGCTGCGGGAGTGTGGAGAGCGGTTCCATTTTTTAAAGGCGTTGGTTTTTTACGTTTGTTGATATCTTTTGTGGGCTCCATTTACACTGTGATTTATGTTGGCATGTGTGCTATTAATTTTTTCTACACCTTAAACAGTACATTTGCCGACTGCATGGAGTTAATAGCGATGGATGATGTTTTACAGGTTGAGGGCGATATTTATGGAACTGCGTGTTCAAACGGAACCTTTTTGGCTCCTGTTAATGAGCAACCGGAGTATTACTTAGCTCTGGCATTAATTATTATTGTGCTTTGGACAGTGTTTCCTTTTATTTTTTACAATCCAGTAAAATTTATGAAGAGAATATTCTACATTACTGGACCTATAGTTTATTTACTTGGAGTTATTATCGTATCTTTTATCGGCACGACCGAGGGTTTATCGTCCTTTACCGAAAGCGATGATTGGATTAACTTTTTGAGGCCGTACATTTGGTATAGTGCGTTAACTCAGGCTTTGCTTTCCACACAGACAGCTGGAGGCTATCTTATATCGTCCGGCGATTCTCTATACTCCGATACCGACGTTCAGTATACCTCATTTGTTTTTGTCGGCTTCAATATCGTTTCATGTTGGACTGGAACATTGTTCTGGTTCGCTGTAGGCGGAAGTGATGCCAAAGTCACCAGCAGCGCGGCCGTGCTCGTACAAATACACAAGGTTACCATAGAACGAAATCTAAGCAACGCGTGGCCGATCTTGTTATATTTACTATTATTTATATCGGGCATTATAACAATGATTACATTCCTGTATCCTTTGTATGATCGTTTCCGCCGCGTGGGCGGGTACAAATGGCGATATTTATCTGTGGGCAGTTCGATTGTGGGCGTTGGAGCCGCTATCGCCGTGCTAGCTTTCGGTCACTCACCGCTCACCCTGCTTGAGGACGTCGTTATGCCCCTTATGATCAGCTTCGCCACCTTGGTTGAGATATCCGCTTTTATATTTATTTATGGATGGAAAGTGTTGGTTGAAGATATCGAATTTCTCATTGGATACAATCTTCGCAAGTACTGGGTGTTAGGGTGGTTCGTTGTCATTGGAGTACTTGCACCGTTCACGGCATGGTGGACGGCGGACTGGTTTCTTGACACACCCAACTGGATAGAGCCTCCTTGGGAAGCGACTACCATAGTTATAACTGCGGGCTTCTCTGTTGTATTATTTTTAATTTGTGCTGCTATATCCGTTTCAAAACAAGTCCAATATGATTTCATGGGGAAGTTAAAATCGTCATTTACACCTTCGAGGCACTGGGGCCCGCGGGACCCTATAACTCACTACTATTGGCTGGCGCGTCGTGAGGGGAGCCGCGACGTCCCGCAACCGGTGCTTCGCCGTCCGGTCTTAGGAGAGTTTTCAGGAACATCAAGCTTGTCGATCATTATTAGCCCACACGAAGATTTTGCTCACGTAAATCAACGGCGATCGAAATCTGACGATCAGGTCGTTGTGAACAGAAAACAATATACCGCGGAATTGAATCAAATTCAAAGCGTCTTCCAAAGACGTTCTAAATCCTTAGATTGGAATTTCTCGTCTAAAAATAATATTCAAAATAACGTTTCCATAAAGAAGAGTCATGATTTAGAGAGAAAGACACATCCAAATGAACCGATAGTTATTATAAGGGAGATGAATAATAATACTCTTTCATTAGAATCTTTAGACATTTAA

Protein sequence:

>DPOGS203406-PA
MCAQRVVRSRLVNGSRPAGQAQNWLSWRLLRTHLYTLAVSAGFSSSWRTPREAFLYGGLTFSLALTVAAVLIALPTVLLQLAVGQLSQQDAAGVWRAVPFFKGVGFLRLLISFVGSIYTVIYVGMCAINFFYTLNSTFADCMELIAMDDVLQVEGDIYGTACSNGTFLAPVNEQPEYYLALALIIIVLWTVFPFIFYNPVKFMKRIFYITGPIVYLLGVIIVSFIGTTEGLSSFTESDDWINFLRPYIWYSALTQALLSTQTAGGYLISSGDSLYSDTDVQYTSFVFVGFNIVSCWTGTLFWFAVGGSDAKVTSSAAVLVQIHKVTIERNLSNAWPILLYLLLFISGIITMITFLYPLYDRFRRVGGYKWRYLSVGSSIVGVGAAIAVLAFGHSPLTLLEDVVMPLMISFATLVEISAFIFIYGWKVLVEDIEFLIGYNLRKYWVLGWFVVIGVLAPFTAWWTADWFLDTPNWIEPPWEATTIVITAGFSVVLFLICAAISVSKQVQYDFMGKLKSSFTPSRHWGPRDPITHYYWLARREGSRDVPQPVLRRPVLGEFSGTSSLSIIISPHEDFAHVNQRRSKSDDQVVVNRKQYTAELNQIQSVFQRRSKSLDWNFSSKNNIQNNVSIKKSHDLERKTHPNEPIVIIREMNNNTLSLESLDI-
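
Consistency check (illustrative sketch, not part of the original record): the lengths listed above can be cross-checked with a few lines of Python. The function names are hypothetical. The sketch assumes that the 1992 bp transcript is the coding sequence including its stop codon (the nucleotide sequence above begins with ATG and ends with TAA, and the protein sequence ends with "-"), and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon is counted in the CDS length but encodes no residue.
    return codons - 1 if includes_stop else codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1992
protein_aa = 663
genomic_start, genomic_end = 1261466, 1267046

print(expected_protein_length(transcript_bp) == protein_aa)  # True
print(span_length(genomic_start, genomic_end))               # 5581
```

Under these assumptions, the 1992 bp transcript corresponds to 664 codons, i.e. 663 residues plus the stop codon, which matches the listed protein length. The genomic interval spans 5581 bp.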