Monarch geneset OGS2.0

DPOGS210747
TranscriptDPOGS210747-TA1917 bp
ProteinDPOGS210747-PA638 aa
Genomic positionDPSCF300013 + 549109-556515
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0161857e-3428.53% 
BombyxBGIBMGA006279-TA8e-3833.06% 
DrosophilaCG31065-PB2e-3930.00% 
EBI UniRef50UniRef50_UPI0000D569CD8e-4230.39%UPI0000D569CD related cluster n=1 Tax=unknown RepID=UPI0000D569CD
NCBI RefSeqXP_975313.11e-4230.39%PREDICTED: similar to pickpocket [Tribolium castaneum]
NCBI nr blastpgi|910871493e-4130.39%PREDICTED: similar to pickpocket [Tribolium castaneum]
NCBI nr blastxgi|1610786841e-3730.03%CG31065 [Drosophila melanogaster]
Group
Gene OntologyGO:00160204.7e-43membrane
GO:00052724.7e-43sodium channel activity
GO:00068144.7e-43sodium ion transport
KEGG pathway 
InterPro domain[238-633] IPR0018734.7e-43Na+ channel, amiloride-sensitive
Orthology groupMCL30412 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210747-TA
ATGGAAGCTGTTAAACAGTATCTTAAGAAGTACAATATATCGACGAATGCTACAAAATTTTTCCACGAGGTATCGTTTTGGGATCTCAAATACTGTACGTCTTGCACGATTTGCAAATTAAATGACAGTTGCGTTGAAGATTTTACGAGCGCCATCCCTGAAATAAGACAGGGCTGTTCACAATTGTTTACCGAATGCAAATTCGGCGGGAGCGATTTCAATTGTTGTGATAAGTTTCAACCTATAGAAACTGAGTTTGGCAGCTGTTATGTCTTTAACTCCGCACTTTTAAGTAACGCTAGCTTGTTGACGGTGAATAGAACAATCGGCCTGCCTGATTTAGTATTTCACGTCAGGAAAGTCGTAGCGGTAAGGATTCACGCTCCTAGAGATATTGTTTCAGGTGGAATGCTAAATATATTACAAGTTCAGTCAGTACCATTAGTTACTGAGATGGATGTGATGCTGAGGGCTGAACCAACAATTAATGACGAATCAGTTACGACTCTGTCCGAGGCGTCACGTGACTGTCTGTTAGATGATGAGCGACCTCCTTACCCCGACTGGCCGTTCGGATACTATACAAGGAGTGCTTGCATTTTGTATTGCAGGGCGCTCGCTCAGATGAGTCGTTGTAATTGTACGCATCACTTTTTAGCTAAAATAGTTTCGTCTTCAATCATCACGACAATAAAATATAAGGTCCGTCGATTCGGTGATATGAGCAACCTGCACGGCGTGGGATATGTTTTCTCAATGTCAAACATTCCGTATTTCAAAAGATTCTTCTGGCTGATAATCCTATGTATATGCTGCTTTGGTGCTTGGGAGATACTAAAGTCATCTCTGTATATATTATCCACTGGAGCTGGTTCTTATGTGGTGGAGACGAATAACTTGGAGTGGAACACACCATTTCCAGGTGTTACTGTTTGCAAGCATACCGATATGGAAGCTGTTAAACAGTATCTTAAGAAGTACAATATATCGACGAATGCTACAAAATTTTTCCACGAGGTATCGTTTTGGGATCTCAAATACTGTACGTCTTGCACGATTTGCAAATTAAATGACAGTTGCGTTGAAGATTTTACGAGCGCCATCCCTGAAATAAGACAGGGCTGTTCACAATTGTTTACCGAATGCAAATTCGGCGGGAGCGATTTCAATTGTTGTGATAAGTTTCAACCTATAGAAACTGAGTTTGGCAGCTGTTATGTCTTTAACTCCGCACTTTTAAGTAACGCTAGCTTGTTGACGGTGAATAGAACAATCGGCCTGCCTGATTTAGTATTTCACGTCAGGAAAGTCGTAGCGGTAAGGATTCACGCTCCTAGAGATATTGTTTCAGGTGGAATGCTAAATATATTACAAGTTCAGTCAGTACCATTAGTTACTGAGATGGATGTGATGCTGAGGGCTGAACCAACAATTAATGACGAATCAGTTACGACTCTGTCCGAGGCGTCACGTGACTGTCTGTTAGATGATGAGCGACCTCCTTACCCCGACTGGCCGTTCGGATACTATACAAGGAGTGCTTGCATTTTGTATTGCAGGGCGCTCGCTCAGATGAGTCGTTGTAATTGTACGCATCACTTTTTAGCAAAAATAGATTCTATCGTACATTTTGCAAAAGAAAAATGTGCTTGTCCGATGGCTTGCGAGGAAACTGTTTACGACGCTATTCATGTTTTTTCAAGACGAGTTAGCACTACAATGACGGAACAACAAAGGCTAGGTACTATGATTAAAGTACGCTTTACAAATTTGCCTTCGTTGCGGGTGAGGAGACTAGCGATCACGACTCCTCTTAAACTTGTTGTTGACATGGGCGGTATTGGCGGAGTGTTTTTCGGCGCTTCCCTCCTCAGTGTCATAGAGCTGATTTATCTCCTTTGCATCCGACGTAGCTAA

Protein sequence:

>DPOGS210747-PA
MEAVKQYLKKYNISTNATKFFHEVSFWDLKYCTSCTICKLNDSCVEDFTSAIPEIRQGCSQLFTECKFGGSDFNCCDKFQPIETEFGSCYVFNSALLSNASLLTVNRTIGLPDLVFHVRKVVAVRIHAPRDIVSGGMLNILQVQSVPLVTEMDVMLRAEPTINDESVTTLSEASRDCLLDDERPPYPDWPFGYYTRSACILYCRALAQMSRCNCTHHFLAKIVSSSIITTIKYKVRRFGDMSNLHGVGYVFSMSNIPYFKRFFWLIILCICCFGAWEILKSSLYILSTGAGSYVVETNNLEWNTPFPGVTVCKHTDMEAVKQYLKKYNISTNATKFFHEVSFWDLKYCTSCTICKLNDSCVEDFTSAIPEIRQGCSQLFTECKFGGSDFNCCDKFQPIETEFGSCYVFNSALLSNASLLTVNRTIGLPDLVFHVRKVVAVRIHAPRDIVSGGMLNILQVQSVPLVTEMDVMLRAEPTINDESVTTLSEASRDCLLDDERPPYPDWPFGYYTRSACILYCRALAQMSRCNCTHHFLAKIDSIVHFAKEKCACPMACEETVYDAIHVFSRRVSTTMTEQQRLGTMIKVRFTNLPSLRVRRLAITTPLKLVVDMGGIGGVFFGASLLSVIELIYLLCIRRS-