Monarch geneset OGS2.0

DPOGS207668
Transcript        DPOGS207668-TA  2022 bp
Protein           DPOGS207668-PA  673 aa
Genomic position  DPSCF300133 + 404001-413368
RNAseq coverage   400x (Rank: top 30%)
Annotation
Heliconius        HMEL006462        0.0    66.67%
Bombyx            BGIBMGA010528-TA  0.0    62.38%
Drosophila        blot-PB           2e-41  25.65%
EBI UniRef50      UniRef50_B6CY69   2e-50  26.40%  Transporter n=4 Tax=Tribolium castaneum RepID=B6CY69_TRICA
NCBI RefSeq       NP_001137203.1    4e-51  26.40%  bloated tubules [Tribolium castaneum]
NCBI nr blastp    gi|219522042      8e-50  26.40%  bloated tubules [Tribolium castaneum]
NCBI nr blastx    gi|219522042      2e-61  25.93%  bloated tubules [Tribolium castaneum]
Group
Gene Ontology     GO:0016021  2.6e-21  integral to membrane
                  GO:0005328  2.6e-21  neurotransmitter:sodium symporter activity
                  GO:0006836  2.6e-21  neurotransmitter transport
KEGG pathway
InterPro domain   [43-506] IPR000175  2.6e-21  Sodium:neurotransmitter symporter
Orthology group   MCL25745  Lepidoptera specific

Nucleotide sequence:

>DPOGS207668-TA
ATGGCAGTGGTGAGCCGACGGTTTGGTGTTGTACCTTGTGCGAGAGTCGACAGTCAACAGGGCAGATCTCTACAAAATGGCTGGACAGTCAGGCGGATTCTTCACTCAATCAAAACTCAAATATGCACAATAGCTATATCATTCACATTATTCAACACGGTCCGTCTACCTCGCGAGGCGTTCAAGTATGGCGATATTCCATACTTGGTTACATACTCCTTTTGGCTGCTCGCGGTCGCGCTGCCAACAACGCTGCTGCAGCTTGCGATGGGGCAGCTCAGCCAGCAAGATCCTGTTGGTGTCTGGAGAGCTGTTCCTATACTACGAGGTGTAGGTCACTTGAAGATTCTAACCTCATATATTTGTTGTATATACAGTATGATATATATTGCTTTATCAGCCGCCTACATGATTTGGATAGCAAAGGGCACGTTTCCACTGAAAGATTGTACAAAACTACATTTAACTCCGAACGGATACGAAAATAAAATGAACGCGTCTGAATGTTTTAACTCCACGTTTTTGGCGCCATTTGCCGAACAGCCACAATATCTCGGCGTAATGGCTATTTTAATATTCGTTTTATGGTTCTTTGTGCCCATTATGTTGTATCGACTCAGAAAAACTTTACGCATAGCACTTATTATCTGCACACCTATAATAATAGTAATTGCCATACTACTGTTTACATTTTTATCGGACTCCTTTACATTGACCACTATGTTGGAATCTTGCGATCATTGGTCAATACTAACTCATCCATATATTTGGCAGAGCGCGTTAGTTCAGGCGCTCTTGTCTTCAAATATAGTAAGTGGCTATTTAATTAGTGCTGGAGGAACGGTTTATAGACAATCAGACGTTAGATGGACAGCAATATTTGTAACAATCACTAATTTAGTATCGGCTTGGTTGTCCGTATTTATGTGGCAGTCCATAAGCGGTGAAAGCAATAAGGATACCAGCTTTGTATCAATATTAGTTTTAATATATCAGTCTTCGGTGTCCCAAAAGAGATCCAAAGAATGGCCTCTGTTAGGTTTCGGGTTAGTACTAGCTTCGGGGATTGTCACCGTGCTAGCTCTTTTATATCCTATATACGACAAACTGCACCGCCTAGCAGGCGATCATTGGCGACTATTTGCGTGTGCATCTTCTGCATTAGGCACAGCACTAACTGTTACTGTGCTAGCACAAGGGCTAGACGTTACATCCCTGCTTGACGAACTGGTCGTACCAGTGCTAGCTGTGTTCACTACACTTGTCGAAATCGTGGGGTTCGTTTTTATTTACGGTTGGTACTACTTAACAGTGGACATACATTTTGTTACCGGTTCAAGATTGCCGATATTTTGGTTGATATCTTGGTGGGTAACACCGGTACTGCTTATGGGTGTGATGGGCTGGTGGCTACGATCACTTTTAAGATTCACTTGGGGAAGTCAGACTTTGTGGCCCTTGGGAGGAACTTTAATTGGAATCTTGGTGGTGATGATAATCATGGCAGCAGTAGCAGTCGCAAAGGAAGAACAATTCAACTTGTTAACGAAAATAGCATCTGCGTTCCGACCTTCACGACTTTGGGGCCCAGAAGAACCGATGGCCCGGTACGTCTGGATGTCCCAAAGATATATTAATGAAAATGGAAATTCCGAGCAAGAATTTAATATAGACAAATCTTTTAATCCTACTATAATTAGTAAATATCACAGTAAATACAACGAACCTTACAAAGAAAATTGGTTGCATACATCGAATATTTATCAAACTACCACAAAAAATAATATTAAGCATGATTTTATGAAGAGCCAAGAGCCTTCATTTGATGATATAGGAGCAATATATTCTATACCTATAGTACCTATAAATAAAAGGAAAAAAAACGAAGGCGAGCAATTTCGCTCGCCTAACATTTGCGTTGCCAAAGGTGATATGGGTGGTCTAATAAATTGCAATTGTAATAGACATTTCCAATTAAATGTGCCTGACTTAAGAACCAA
CGAAGTATCGACTAGTCTTTAA

Protein sequence:

>DPOGS207668-PA
MAVVSRRFGVVPCARVDSQQGRSLQNGWTVRRILHSIKTQICTIAISFTLFNTVRLPREAFKYGDIPYLVTYSFWLLAVALPTTLLQLAMGQLSQQDPVGVWRAVPILRGVGHLKILTSYICCIYSMIYIALSAAYMIWIAKGTFPLKDCTKLHLTPNGYENKMNASECFNSTFLAPFAEQPQYLGVMAILIFVLWFFVPIMLYRLRKTLRIALIICTPIIIVIAILLFTFLSDSFTLTTMLESCDHWSILTHPYIWQSALVQALLSSNIVSGYLISAGGTVYRQSDVRWTAIFVTITNLVSAWLSVFMWQSISGESNKDTSFVSILVLIYQSSVSQKRSKEWPLLGFGLVLASGIVTVLALLYPIYDKLHRLAGDHWRLFACASSALGTALTVTVLAQGLDVTSLLDELVVPVLAVFTTLVEIVGFVFIYGWYYLTVDIHFVTGSRLPIFWLISWWVTPVLLMGVMGWWLRSLLRFTWGSQTLWPLGGTLIGILVVMIIMAAVAVAKEEQFNLLTKIASAFRPSRLWGPEEPMARYVWMSQRYINENGNSEQEFNIDKSFNPTIISKYHSKYNEPYKENWLHTSNIYQTTTKNNIKHDFMKSQEPSFDDIGAIYSIPIVPINKRKKNEGEQFRSPNICVAKGDMGGLINCNCNRHFQLNVPDLRTNEVSTSL-