Monarch geneset OGS2.0

DPOGS216041
TranscriptDPOGS216041-TA3105 bp
ProteinDPOGS216041-PA1034 aa
Genomic positionDPSCF300067 - 298137-318504
RNAseq coverage7x (Rank: top 87%)
Annotation
HeliconiusHMEL0089250.072.18% 
BombyxBGIBMGA009019-TA0.062.82% 
DrosophilaCG8546-PB3e-12744.18% 
EBI UniRef50UniRef50_D6WXM62e-13246.45%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WXM6_TRICA
NCBI RefSeqXP_972346.13e-13647.04%PREDICTED: similar to pickpocket [Tribolium castaneum]
NCBI nr blastpgi|910891177e-13547.04%PREDICTED: similar to pickpocket [Tribolium castaneum]
NCBI nr blastxgi|910891171e-13346.96%PREDICTED: similar to pickpocket [Tribolium castaneum]
Group
Gene OntologyGO:00160206.1e-154membrane
GO:00052726.1e-154sodium channel activity
GO:00068146.1e-154sodium ion transport
KEGG pathway 
InterPro domain[40-519] IPR0018736.1e-154Na+ channel, amiloride-sensitive
Orthology groupMCL18346 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216041-TA
ATGACATCCGGAAGTGTTGGAAGTATTATAGATATAACCCTAGAGGCAAATGAAAATCTTTCCAAGGACGAAGAAGTGAAAACAAAAAAACATAAATTAGGTTTACTCAAAAAGCATTTAATTGACTACTCAGCGAACTCAAATCTCCACGGACTTAAGTACATTGGTGAAAAGGACAGAACTCTGTTTGAAAAATGGAACGAAAGTCCCGTCATTGTAAGCTTCGCGGAAAAATCAACACCAGTTTGGCAGATTCCTTTTCCGGCTGTAACAATCTGTTCAGAAACAAAGGCGCGTCAAACGATTTTTAATTTAACAAAATATTATCATCTTTATGACGACGACATCACACGTTTAAATTTAACCGAGAAAGAGCGCCGATTATTCGAAGATGTTTCAATGGTTTGCGATGTGAACGTGGCATCTTACTTTGGAACTAAATTTTCTGATGCAAAAGAAACTGTTCAAAATATTAAAGAGCTATCTCCGAAAATAAATGATACATTTTATGCCTGTGTTTGGAAGAACTCGCTAAGTATCTGTTTAGACGAATTTTTGCCGATCATCACTGAAGAAGGTGTTTGCTACACCTTCAACACCTTGGGTGCTGAGGAATTATTTAGAGTTGAAAACCTCAATAAGGACTACGATTACTTAGAATATTCCAAACGAAATTCCAGTCTTTGGACGTTAGAAGATGGATATCCAACTGATAGTCCGGTAGAGACATACCCTCATAGAGGAATTGGTTTTGGCATTAAATCAGGATTGAATATATTTTTACAATCTAAAGAAATTGATCAGGACTTCCTTTGTAGAGGTCCTGTTAAGGGATTTAAGATATTACTGCATAATCCGGCCGAACTGCCTCGTCTTTCCAAGCAATACTTCAGGGCACCTTTATCTCATGAGGTGGTTGTTGCAGTTAAACCTAACATGATGACGACCTCTAAAGGCTTGAAATCTCTTGATTCTTCGAGACGTCAATGCTATTTCCCAACGGAGCGTTTCCTTCAGTATTTTAAAATTTACACACAGGCTAATTGTGAAATAGAGTGTCTATCAAACTTCACGTACGCCAGATGTGGCTGTGTTCATTTCGGCATGCCTCATGGTCCTAAAATTCCGGTCTGCAACGCCCGCAAAATCATCTGTATGAGTACAGCACAAATGGAACTAGCCACAGCAGAAATACAAAGTCATCTGGGAAAAGATACAACTGATAACGGCACTCTGGGTAACGCTCTGTTAGTAGCTACAAAATGCAAATGTCTTCAATCCTGTACATCTATAGAATACGATGCTGAAACATCACAAGGCGATTACAATTGGCAACCCCTATTTAAAGCCCTCAAGATAGATATTAGCAAAGAAGACACGGATGTTTCTATTAGTCGGGTTTCGATTTTCTTCAAAGAAGACCAATTCATTACTTCACGAAGATCTGAATTGTATGGTCAGACAGAGTTTTTAGCCAATGTCGGTGGTCTGCTAGGACTCTTTTTGGGCTTCTCCATACTAAGTCTAGCTGAAATATTTTATTTCCTTACCTTGAGATCAGGAAGTATAGGAGGTATCCAGGACGTAGACCCTGAAGTAAACAAAGGTCATCTCAATACCAAAAATGAGAAAATTAAAAAAGGAAAGCTGAGTGCCATCAAGCGGTATTTGATTGACTACACCGCAAACTCAAATCTTCACGGTCTGAAGTATATTGGAGAAAAGGAGAGAACTTTGATTGAAAAAATTTTCTGGCTGTTAATGTTTTCCTGTTCTTTAATATTCTGTATTGGAAAGATTCACTTAATATGGATTAGATGGAACGAAAGTCCCGTCATTGTAAGCTTCGCGGAAAAATCAACACCAGTTTGGCAGATCCCATATCCAGCCGTGACTATTTGTTTTGAAACAAAGGCTCGACAAACCATATTTAATTTTACCGAATACTATCATCTGTACAAGAATGAGACTACACGTGCGAATTTAACCGAAGAAGAACGCCACCTTTTTGAGGACGTATCTATGGTTTGCGATGATCACTTGGCCCCATCAAGTGGAAGAAGATTTTCTAATGGAAACGTAACAGTTGAGAATCTTAAAGAGCTATCACCAAACATAACTGAGATGCTCTTCGCTTGTAAATGGAAAGATGTTTCCCGCGTGAATTGTTCGGATTTATTTTTGCCGATCATCACTGAAGAAGGTGTTTGCTATACCTTCAATACCTTGGGTGCTGAGGAATTATTTAGAGTTGAAAACCTCAATAAGGACTATGGTTATTTAGAATATTCAAAACGAAATTCTAGTCAAATTTGGACGTTAGAAGATGGATATCCTCCTGATAGTCCGGTAGAGACGTACCCTCATAGAGGCACTGGATTCGGCGCAAAATCAGGATTAACGTTTTTGTTGAAAGCTAAGCAAATGGATCTTGACTACCTTTGTAAGGGTCCGGTTCAGGGGTTTAAGATATTACTTCATAATCCGGCAGAATTGCCTCGTCTGTCAAAACAATATTTCAGATCACCTTTATCCCAAGAGGTAGTAGTTGCAGTAAAACCTAATATGATGACGACTTCTGAAGGATTGAAACCTTACGACCCTACAAGACGTCAATGCTATTTCCCAACGGAGCGTTACCTACAGTATTTTAAAATTTACACACAAGCTAATTGTGAAATAGAGTGTCTATCAAACTTCACATACACTAGGTGTGGCTGTGTTCATTTCGGCATGCCTCATGGTCCTACAATACCCGTATGCAATGCCGGCATGGAATTAGTCACAGCAGAAATTCAAACCAATTTGGAAAAAGATGCAGCTGATAACGGTACCCTTGGTGAGGCTCTACTAGTAGCCGCAAAATGCAAATGTCTTCAAGCTTGCACGTCTATAGAATACGATGCAGAAACATCACAAGCTGACTACGATTGGCAATCCATATTCAGAGCTCATCGTCAAGAAATTGAAGAACAGGATAAGGAACTTTACTGCGTTCTGTCAAAGCGCCGACAATACTGGGACCTTGCTTCTTTTTTGGAGCGTAACCGATTCTATTATAAACTACCGAAGGAATTAAAGGGACAAAACAGTTCCCCATATTAA

Protein sequence:

>DPOGS216041-PA
MTSGSVGSIIDITLEANENLSKDEEVKTKKHKLGLLKKHLIDYSANSNLHGLKYIGEKDRTLFEKWNESPVIVSFAEKSTPVWQIPFPAVTICSETKARQTIFNLTKYYHLYDDDITRLNLTEKERRLFEDVSMVCDVNVASYFGTKFSDAKETVQNIKELSPKINDTFYACVWKNSLSICLDEFLPIITEEGVCYTFNTLGAEELFRVENLNKDYDYLEYSKRNSSLWTLEDGYPTDSPVETYPHRGIGFGIKSGLNIFLQSKEIDQDFLCRGPVKGFKILLHNPAELPRLSKQYFRAPLSHEVVVAVKPNMMTTSKGLKSLDSSRRQCYFPTERFLQYFKIYTQANCEIECLSNFTYARCGCVHFGMPHGPKIPVCNARKIICMSTAQMELATAEIQSHLGKDTTDNGTLGNALLVATKCKCLQSCTSIEYDAETSQGDYNWQPLFKALKIDISKEDTDVSISRVSIFFKEDQFITSRRSELYGQTEFLANVGGLLGLFLGFSILSLAEIFYFLTLRSGSIGGIQDVDPEVNKGHLNTKNEKIKKGKLSAIKRYLIDYTANSNLHGLKYIGEKERTLIEKIFWLLMFSCSLIFCIGKIHLIWIRWNESPVIVSFAEKSTPVWQIPYPAVTICFETKARQTIFNFTEYYHLYKNETTRANLTEEERHLFEDVSMVCDDHLAPSSGRRFSNGNVTVENLKELSPNITEMLFACKWKDVSRVNCSDLFLPIITEEGVCYTFNTLGAEELFRVENLNKDYGYLEYSKRNSSQIWTLEDGYPPDSPVETYPHRGTGFGAKSGLTFLLKAKQMDLDYLCKGPVQGFKILLHNPAELPRLSKQYFRSPLSQEVVVAVKPNMMTTSEGLKPYDPTRRQCYFPTERYLQYFKIYTQANCEIECLSNFTYTRCGCVHFGMPHGPTIPVCNAGMELVTAEIQTNLEKDAADNGTLGEALLVAAKCKCLQACTSIEYDAETSQADYDWQSIFRAHRQEIEEQDKELYCVLSKRRQYWDLASFLERNRFYYKLPKELKGQNSSPY-