Monarch geneset OGS2.0

DPOGS200906
TranscriptDPOGS200906-TA1842 bp
ProteinDPOGS200906-PA613 aa
Genomic positionDPSCF300066 + 133826-155197
RNAseq coverage10x (Rank: top 84%)
Annotation
HeliconiusHMEL0134032e-11456.61% 
BombyxBGIBMGA000674-TA6e-9055.59% 
Drosophilappk16-PA3e-6230.37% 
EBI UniRef50UniRef50_B0WX521e-7434.50%Pickpocket 16 n=4 Tax=Culicidae RepID=B0WX52_CULQU
NCBI RefSeqXP_318613.42e-7835.36%AGAP009590-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582984434e-7735.36%AGAP009590-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582984431e-7635.36%AGAP009590-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160203.8e-88membrane
GO:00052723.8e-88sodium channel activity
GO:00068143.8e-88sodium ion transport
KEGG pathway 
InterPro domain[145-552] IPR0018733.8e-88Na+ channel, amiloride-sensitive
Orthology groupMCL15693 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200906-TA
ATGAGATCCTCAATAGCCGCTTACGGGCCGGAGCGGGACTGTCAAGCTCGGAACACTCTCCATAGATTCCGGGACGTTTCCGAGCTTCCCTGGCCACTCGTCGAGCGAGCACTGCGTGTCACGCTCCTCCCACTCACTTCGTTTCAGGACCTTATCGCTCAGGACTGCACCAGTGCACCTCGTCTCGGGAAGTCCTTGGTGGAAGGACCCGCCTGGCGTTGGTTGCCCTCACCCCCCAGAATATGTGGGGCAGAGCAAGGGATTCCCCAGCCTCATCAAGGCTGGGACCCTTCTGGAGTAGTTTTAGTAAAAGATAGAGAAAGACTGGTGATCAGCCCTCTGAGCACCTTACCCTGTGTAAAGGGCGGAGCGGCACACCCAGACGTTGCCCAGGGCCACCAACGAGGAGGAACCGTTTGTCTTTTCTGCCAGGTTATAGAAAGTACTCAAGGAGCAATATGGGATGTTCCATTTCCGGCTGTGACGATCTGCGACCTCAATATGATTTCACGGAGAGCTGCGAGAGATTTGTCAAACGTCTTGACCCTACCCGAAAATGTAACAGCTGATTTTGTTTTCAAAACATTAAAAGTAGCCCCGCTTTTGCATTCCTCAAACATGGCCGATCCAATACAAAAAAGGGATCTCCACATTCTGCAAGACGTTCTGGATCTTAATAACATCACGGTCGAAACGCTTTTTAAAAAATTGTCTCCAGCTTCATCCTGCGGAGACCTCTTAGAGCGGTGCATGTGGAAGAATACAATATATCATTGCGATCAGATCTTTCAGCATGTATTCAATACAGTCTTACTTTGTTGTACCTTCAATTACTATGCTTTGGACCAGCCGAGTAACGAACTGATGATATTTCGTCATTCGACTCCAAGACGCGTGGCTTCCTGCGGCTATCAGACAGCGCTGACGGCTATTTTGAAAACTAACCCAAGTGACTACTACAGCACTAGCGTAGCATCTTTAGGTATACTGGTCTTTATAGACAATCCATATAATGTTCCTGACTTCGAATCACCAGTTCGTATGGTAAATCCGTCAACTGAAATGATGATTGCTGTGTCACCCGAACAGACGTACGCTACGCCTGGAACTAAATCATTTACACCTGATGAGAGACAATGTTACTACAGCGACGAGGTAAAAATACCTCACATTCACAAGTACTCCTACCATAATTGTATGATGTTAAGGAAAATACAGATCTTAATAAACGTCTGTAATTGTGTACCATTCTACTTTCCTCACGACGGTGACAGCAGAATTTGTAATTTCCACGATGTAGAATGCTTGGAAAATGTGAAAAGTCCTTACAGGGTGAATAATGAGTCTGAGGAAGAAGATCCAAGAACCTTGAATTTGATCAAATGTTTACCAGAATGTGAACATTTTGATTATCCTCTGGAGGTGGCCCTTGGCAAGCTATTTGTGAACGTGCCTCTTGGAGTTACATCCTTTTATGAAGGTATCAATTTGGAAAATCGATCAGTTCTCAACGTGTTCTTCAATGATTTAGTCTCTACTAAATACAGACGAGAAGTTTACTTGAACTGGCAGAATATTTTAGCCGCGTTTGGTGGTCTCTTGAGTTTGATGCTGGGTTTCACATTAATAGCGGGATTCGATTTCATATTGTTTTTTATATTGAAAGTGGCTTACGACTTTTTAATTAAATGTTTTAAAAATGATTCCAAACCGACTAACAGTCATATAATCAACGTGGAAGAACATAAAAAAGAAAGATGGATAAATAATACTAGAAGAAAGAAAAACTATTCGGAACACGGAAAAATGTTTGTAAAAGCAAATGAAAGGAATAGATATTGA

Protein sequence:

>DPOGS200906-PA
MRSSIAAYGPERDCQARNTLHRFRDVSELPWPLVERALRVTLLPLTSFQDLIAQDCTSAPRLGKSLVEGPAWRWLPSPPRICGAEQGIPQPHQGWDPSGVVLVKDRERLVISPLSTLPCVKGGAAHPDVAQGHQRGGTVCLFCQVIESTQGAIWDVPFPAVTICDLNMISRRAARDLSNVLTLPENVTADFVFKTLKVAPLLHSSNMADPIQKRDLHILQDVLDLNNITVETLFKKLSPASSCGDLLERCMWKNTIYHCDQIFQHVFNTVLLCCTFNYYALDQPSNELMIFRHSTPRRVASCGYQTALTAILKTNPSDYYSTSVASLGILVFIDNPYNVPDFESPVRMVNPSTEMMIAVSPEQTYATPGTKSFTPDERQCYYSDEVKIPHIHKYSYHNCMMLRKIQILINVCNCVPFYFPHDGDSRICNFHDVECLENVKSPYRVNNESEEEDPRTLNLIKCLPECEHFDYPLEVALGKLFVNVPLGVTSFYEGINLENRSVLNVFFNDLVSTKYRREVYLNWQNILAAFGGLLSLMLGFTLIAGFDFILFFILKVAYDFLIKCFKNDSKPTNSHIINVEEHKKERWINNTRRKKNYSEHGKMFVKANERNRY-