Monarch geneset OGS2.0

DPOGS207910
TranscriptDPOGS207910-TA1131 bp
ProteinDPOGS207910-PA376 aa
Genomic positionDPSCF300478 - 2913-5705
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0126861e-2724.37% 
BombyxBGIBMGA003843-TA1e-4454.05% 
DrosophilaCG30181-PA1e-2130.49% 
EBI UniRef50UniRef50_D6WCY11e-5335.61%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WCY1_TRICA
NCBI RefSeqXP_971659.29e-3928.64%PREDICTED: similar to CG34369 CG34369-PA [Tribolium castaneum]
NCBI nr blastpgi|2700023784e-5335.61%hypothetical protein TcasGA2_TC004432 [Tribolium castaneum]
NCBI nr blastxgi|2700023785e-5535.82%hypothetical protein TcasGA2_TC004432 [Tribolium castaneum]
Group
Gene OntologyGO:00160201.4e-38membrane
GO:00052721.4e-38sodium channel activity
GO:00068141.4e-38sodium ion transport
KEGG pathway 
InterPro domain[6-351] IPR0018731.4e-38Na+ channel, amiloride-sensitive
Orthology groupMCL30109 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207910-TA
ATGATGGAGAGCTGGAAGGAGTTCTGCTCGGAGACCAGCTTCCACGGATTCAACCACATCACCGCTGAGAGGAGGCACTGGAGCGAGAGGTTGTTGTGGGTCTGCTTCATAGTGGCGTCAGTGTGGTTGGTGTTGGACATATCTATAGGCCAGTGGCAGAGGTACGACGAGAACCCCACCATCGTGACTCTGGAGAAGGACTTCAGGACCTGGAGGATTCATATGCCGGCCGTCACTGCTTGCTTGAAGAACAAAGTGGATGTAAAGAAACTTCCGAATGCAATCAAATCTCGTTGGAACGTCGAAGCTGGACATCCCAAGTACGTCTATTACAGTCGTTTCGTGAGCGCCGTCGCCAGCTCTACTCTGGAAAACCTCAAGGTGTTTGAACAGTTCGGCGATGACCCGACACTGGACGTAGACCTATTCAGACTCGCCGTGGACTACTATCTTCACTCTCCATACGATGTGGTCGACTACACGGAAGCACATACCACCACCTCCCCACCACTCATCGCACACGTGGACGTCACCACCACGGAGATAGGAGTGGGCCCAGGAGTGAGAGCTCTCCTGCCGAGGAGACGTCAGTGCCTGTTCACTGATGAACCCACCACAGCCTCCAGACAGGTGTACAGCACACACTCCTGCAGACAAGACTGCAGGAGACGTCTGGCGATGGAGCTGTGCGGGTGTCAACCATTCTACTACTTCTATGCTGCTGGCCCGACATGTACAGTGCGCGGCATGCGCTGCCTGTCTCTACATCAGCACCGTCTGTTCACTCTGGAGGGCCAGCGCTGCTCCTGCAGCCAGCAGTGTGTGGACGCCGTATTCAAGGAAGCCCTGGAGAAGATTGAGAACATGACAGGAGGTCCGTTCGGGCTCCAAGGGTCCGTTCACTACACCCTGGAACAGCCTCGCGAGAGATACGTCAGATATATCGTGTTCTACTTCCAGGACCTGGTGGTGTCATTCGGCGGCGCGGCTGGTCTCTTCCTGGGAGCCAGCTTCATCAGCTTCGTGGAGGTCGGATACTTCCTCATCGAGAGACTCTCGAGGTCGTCTCCGACACGGACTGCGACGGATGTGAGTGTTTCAGATATTCGAAGGAGGCTGGCACAGAAGTAG

Protein sequence:

>DPOGS207910-PA
MMESWKEFCSETSFHGFNHITAERRHWSERLLWVCFIVASVWLVLDISIGQWQRYDENPTIVTLEKDFRTWRIHMPAVTACLKNKVDVKKLPNAIKSRWNVEAGHPKYVYYSRFVSAVASSTLENLKVFEQFGDDPTLDVDLFRLAVDYYLHSPYDVVDYTEAHTTTSPPLIAHVDVTTTEIGVGPGVRALLPRRRQCLFTDEPTTASRQVYSTHSCRQDCRRRLAMELCGCQPFYYFYAAGPTCTVRGMRCLSLHQHRLFTLEGQRCSCSQQCVDAVFKEALEKIENMTGGPFGLQGSVHYTLEQPRERYVRYIVFYFQDLVVSFGGAAGLFLGASFISFVEVGYFLIERLSRSSPTRTATDVSVSDIRRRLAQK-