Monarch geneset OGS2.0

DPOGS205460
TranscriptDPOGS205460-TA5763 bp
ProteinDPOGS205460-PA1920 aa
Genomic positionDPSCF300166 - 192011-202956
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0021930.075.10% 
BombyxBGIBMGA008399-TA0.065.31% 
DrosophilaSym-PA0.037.11% 
EBI UniRef50UniRef50_E0VQ070.043.56%Symplekin, putative n=11 Tax=Coelomata RepID=E0VQ07_PEDHC
NCBI RefSeqXP_002428201.10.043.56%Symplekin, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3320308790.042.39%Symplekin [Acromyrmex echinatior]
NCBI nr blastxgi|3320308790.041.32%Symplekin [Acromyrmex echinatior]
Group
Gene OntologyGO:00054887.3e-15binding
KEGG pathwaymdo:1000108857e-168 
 K06100 (SYMPK)maps-> Tight junction
InterPro domain[104-320] IPR0218503.9e-53Protein of unknown function DUF3453
[1717-1888] IPR0220752.4e-47Symplekin tight junction protein C-terminal
[33-771] IPR0160247.3e-15Armadillo-type fold
Orthology groupMCL12185 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205460-TA
ATGTCGAGTAAATATCACCCAAGCAGTGGCGGTACCGCTCAAACACAGCTAGTACAATGGATAAACGAGGCAGGAATGTCGGAGGGCAACAAAAAGGTGAATTTGTTGCGAAAAATTATAGAGGTGTTATTACATCAAGCACCTCAAATGGTGCCAGTTTATATGGATAACATTCTAAGTTGCGTCAATGATAAATCTGCTGATGTTAAAAGGCAGGTTGTCGCATTTATCGAAGAATTAAGCAAATCATATCCTGAATATCTTCCTAAAATAATTGGTCAGCTGCAATTACTGGTGATAGATACGGTTATTGCGGTACAGAAGAGGGCCATTCAGGCGGCGAGCCTGGTTTACAGAAATGTCCTCCTTTGGATATGTAAAGGAACCTCGGAAATGAAAGACATGCAATATGTTTGGGAACATTTATCCGAATTAAAGCTTCTTATTCTCAATATGATAGATAGTGACAATGAAGGTATAAGGACACATTCAATCAAGTTCTTGGAGGAAGTTGTTGTACTTCAAAGTCCTCATGCAAATGATGACGATGACTTTAGTTTGGACTGTTTACCGTCACACCTGCCATTCCTCAACAGAAAGGCCATGGAAGAAGAGTCCGACCATATTTTCCAATTACTGTTAAAATTCCACAACTCGCAACATATATCAAGCGTTAACCTCATGGCTTGTATGACAACGTTATGCATAGTGGCGAAACTTAGACCAAAATATATGTCGAGTGTGGTGAAAGCTTTAAACGACCTGCACACGACACTACCACCTACATTATCACAGTCACAGGTGAATTCAGTGCGGAAACACCTCAAGATGCAGCTACTGATATTGGTCAAACATCCATCCTCATATGACATGATGCCGCAATTGACGCAGTTGCTCATGGACATTGGAATGACCCCACAAGAAATAAATAAAGCTCTACCTAGAGACAGAAGGAACAAACGATTGGGAGAGTTGAGGGCGTCAGAAAATCCAGCAAAAAGATTCAGGGTTGACTCGCCTCAGAGCACCTTGGGCAGCGATAGCAACAGTAACAGCAGAAGTGAATTCAATTTGTTTGATGATGACAACAGCCAGCAGGGCAGCCAACAGAGTATAACCAAAGCATCCTGTACAGAAGAATCAATTTTAGATGGTCTGAATAGTTTAGAAAATGTCGTCAATCTGGTTGTGACAACCCTGATTAATAACCTGCCGACTGAAATGCCCACTAGTTTCATCTTGGCTTATAAACCTATTCCAAATTCGGGAAGCAAGGTCCAGAAGCAAAGTCTTGCTAAAATGATGATGGCATTGATCAAAGACGAACCATTGCCAATGCCTACCACCATGAAGTCTTTGGATACAACCACAAAAATCCCATTACTGAGAGACGACGACGATAAAATCAATCTAAAGAACGCGGTCGCAAAGCTACAGGAGTCCACCAAAGTAGACAAGCAAATAGAAAATGCTGTATCGAAACTGATGGAAGAAACTAGACAGGAGCATCTCAAAGAAGAGGAGAGAAAGAATAAGGATAAAGAAAAACCAGTCGCTCCACCAACGCCATCTATACCGAAATTAAAACAGAAAGTGAAACTATTAAAACTCCAAGAACTAACTCGACCCATACCAAAGGAAATTAAAGAGAAACTGATGATTCAAGCCGTGGAAAGAATATTGCGAGCCGAGAAAGAGAGCGTTATCGGGGGAGCGGCTCAAATAAGAACGAAATTCATCACGATATTCGCATCAAGTTACACTCCGGAGATACGAGAACTGGTCCTCAACTACATACTGGAAGATCCGTTAAACAGAATCGACCTGGCATTATCCTGGCTGTACGAAGAATACGCGTACATGCAAGGCTTCAACCGGCATCCGGTGACGCTGCAGCCCAAACTGCACGAAAAACACGGCGAAAACTACAACCAGCTGCTATGCGCTCTCATCACGCAGATATCAGAGAGAGGGGATCCGGTGATGGAAGGGAGTAAGGACGTTCTGCTGAGGAAGGTTTACTCCGAAGCGCCCGTAGTCACCGACGAGGCGGTGGACTACTTGAAGCATCTGGTCACTGAGGAAAAGTCAGCGACGGTAGCCCTGGAACTGCTCGAGGAGTTGTGTCTGCTAAGACCACCTAGGGCGCACAAATTTGTTGCCGCCCTGGTATGTCACGTGTTGAGTGAAAACGAGGAAATTCGCAATATAGCCTTGAAATCGTCAACCAAAATCTACAAACACAGTACGGACGCCGCTAAGAAGGTTATAGAGAAACACGCTATGTTGTACCTCGGCTTTATCTCGCTGTCAACGCCGCCCCAAGAGTTGTATGGCAACAGACACGCGAGCAGACCCTGGTCCGACGACTTGTATAAAATGTGCCTCAATCTGGTCATGGCGTTGTTCCCAGAGAAGGAAGACGTGATCATTGAGATCGCCCGCGTCTACGGAACCACAGGCGCTGAGGCGAAGCGCTGTGTGCTGCGACAGCTGGAAGTGCCTGTGCGTGCTCTGGCCGCCTCAGAGCCTCCTGGACACCTGTCACCTGCGCTCGCAGCACTGCTGGATGCGTGCCCGCGCGGCGCCGAGACGCTACTGACGCGGATCGTGCACGTGCTTACTGATAAATACCCGCCGAGCCCCGAACTGGTGTCTCGTGTCCGCGAGCTGTACGCGACCCGAGTCTCAGATGTGCGGTTCCTTATACCGGTGCTGAATGGACTTACTAAGAAGGAGCTGCAATTACTGGTGATAGATACGGTTATTGCGGTACAGAAGAGGGCCATTCAGGCGGCGAGCCTGGTTTACAGAAATGTCCTCCTTTGGATATGTAAAGGAACCTCGGAAATGAAAGACATGCAATATGTTTGGGAACATTTATCCGAATTAAAGCTTCTTATTCTCAATATGATAGATAGTGACAATGAAGGTATAAGGACACATTCAATCAAGTTCTTGGAGGAAGTTGTTGTACTTCAAAGTCCTCATGCAAATGATGACGATGACTTTAGTTTGGACTGTTTACCGTCACACCTGCCATTCCTCAACAGAAAGGCCATGGAAGAAGAGTCCGACCATATTTTCCAATTACTGTTAAAATTCCACAACTCGCAACATATATCAAGCGTTAACCTCATGGCTTGTATGACAACGTTATGCATAGTGGCGAAACTTAGACCAAAATATATGTCGAGTGTGGTGAAAGCTTTAAACGACCTGCACACGACACTACCACCTACATTATCACAGTCACAGGTGAATTCAGTGCGGAAACACCTCAAGATGCAGCTACTGATATTGGTCAAACATCCATCCTCATATGACATGATGCCGCAATTGACGCAGTTGCTCATGGACATTGGAATGACCCCACAAGAAATAAATAAAGCTCTACCTAGAGACAGAAGGAACAAACGATTGGGAGAGTTGAGGGCGTCAGAAAATCCAGCAAAAAGATTCAGGGTTGACTCGCCTCAGAGCACCTTGGGCAGCGATAGCAACAGTAACAGCAGAAGTGAATTCAATTTGTTTGATGATGACAACAGCCAGCAGGGCAGCCAACAGAGTATAACCAAAGCATCCTGTACAGAAGAATCAATTTTAGATGGTCTGAATAGTTTAGAAAATGTCGTCAATCTGGTTGTGACAACCCTGATTAATAACCTGCCGACTGAAATGCCCACTAGTTTCATCTTGGCTTATAAACCTATTCCAAATTCGGGAAGCAAGGTCCAGAAGCAAAGTCTTGCTAAAATGATGATGGCATTGATCAAAGACGAACCATTGCCAATGCCTACCACCATGAAGTCTTTGGATACCACCACAAAAATCCCATTACTGAGAGACGACGACGATAAAATCAATCTAAAGAACGCGGTCGCAAAACTACAGGAGTCCACCAAAGTAGACAAGCAAATAGAAAATGCTGTATCGAAACTGATGGAAGAAACTAGACAGGAGCATCTCAAAGAAGAGGAGAGAAAGAATAAGGATAAAGAAAAACCAGTCGCTCCACCAACGCCATCTATACCGAAATTAAAACAGAAAGTGAAACTATTAAAACTCCAAGAACTAACTCGACCCATACCAAAGGAAATTAAAGAGAAACTGATGATTCAAGCCGTGGAAAGAATATTGCGAGCCGAGAAAGAGAGCGTTATCGGGGGAGCGGCTCAAATAAGGACGAAATTCATCACGATATTCGCGTCAAGTTACACTCCGGAGATACGAGAACTGGTCCTCAACTACATACTGGAAGATCCGTTAAACAGAATCGACCTGGCATTATCCTGGCTGTACGAAGAATACGCGTACATGCAAGGCTTCAACCGGCATCCGGTGACGCTGCAGCCCAAACTGCACGAAAAACACGGCGAAAACTACAACCAGCTGCTATGCGCTCTGATCACGCAGATATCAGAGAGGGGGGATCCGGTGATGGAAGGGAGTAAGGACGTCCTGCTGAGGAAGGTTTACTCCGAAGCGCCCGTAGTCACCGACGAGGCGGTGGACTACTTGAAGCATCTGGTCACTGAGGAAAAGTCAGCGACGGTAGCCCTGGAACTGCTCGAGGAGTTGTGTCTGCTAAGACCACCTAGGGCGCACAAATTTGTTGCCGCCCTAGTATGTCACGTGTTGAGTGAAAACGAGGAAATTCGCAATATAGCCTTGAAATCGTCAACCAAAATCTACAAACACAGTACGGACGCCGCTAAGAAGGTTATAGAGAAACACGCTATGTTGTACCTCGGCTTTATCTCGCTGTCAACGCCGCCCCAAGAGTTGTATGGCAACAGACACGCGAGCAGACCCTGGTCCGACGACTTGTATAAAATGTGCCTCAATCTGGTCATGGCGTTGTTCCCAGAGAAGGAAGACGTGATCATTGAGATCGCCCGCGTCTACGGAACCACAGGCGCTGAGGCGAAGCGCTGTGTGCTGCGACAGCTGGAAGTGCCTGTCCGTGCTCTGGCCGCCTCAGAGCCTCCTGGACACCTGTCTCCTGCGCTCGCAGCACTGCTGGATGCGTGCCCGCGCGGCGCCGAGACGCTACTGACGCGGATCGTGCACGTGCTCACTGATAAATATCCGCCGAGCCCCGAACTGGTGTCTCGTGTCCGCGAGCTGTACGCGACCCGAGTCTCAGATGTGCGGTTCCTTATACCGGTGCTGAATGGACTTACCAAGAAGGAGATTCTGGCTGCCCTGCCGAAGTTGATCAAATTAAATCCAATAGTAGTGAAGGAAGTTTTCAACAAATTACTCGGCCTGCAGAATCCCAACGAAGAACAATTACCGGTCTCTCCCGAAGAACTACTGGTAGCTTTGCATCTTATAGACCCGAGCAAAGCAGATCTCAAGTACATCATCAAAGCGACCGCTTTATGTTTCGCTGAAAAGAACACTTACACACAGGAGGTGTTGTCTTCAGTTCTCCAGCGCCTGGCTGAGGAGCAGCAGACGCCAGTACTGATGATGCGCTCTGTTCTGCAAGCGTTGACCCTTCACCCATCACTAGCGCCGCTCGCCCTCAACATACTATGCCTCCTGTGCGAGAGAGAGGTTTGGAACAACAAAGTGGCTTGGGAGGGTTGGGTGAAGTGCGCTGAACGACTTGGACCTCGAGCGGGTCCCGCGCTAAGGTCACTACCACCGAGGGCGAGAGACATGCTACCATCGCACCTTACAGCCTCGTGTCCGTCGGATGCTCCTTATTCTGGCCCAAACCCGATAGAGCCGTTACCCCCCGGAATGGAATGA

Protein sequence:

>DPOGS205460-PA
MSSKYHPSSGGTAQTQLVQWINEAGMSEGNKKVNLLRKIIEVLLHQAPQMVPVYMDNILSCVNDKSADVKRQVVAFIEELSKSYPEYLPKIIGQLQLLVIDTVIAVQKRAIQAASLVYRNVLLWICKGTSEMKDMQYVWEHLSELKLLILNMIDSDNEGIRTHSIKFLEEVVVLQSPHANDDDDFSLDCLPSHLPFLNRKAMEEESDHIFQLLLKFHNSQHISSVNLMACMTTLCIVAKLRPKYMSSVVKALNDLHTTLPPTLSQSQVNSVRKHLKMQLLILVKHPSSYDMMPQLTQLLMDIGMTPQEINKALPRDRRNKRLGELRASENPAKRFRVDSPQSTLGSDSNSNSRSEFNLFDDDNSQQGSQQSITKASCTEESILDGLNSLENVVNLVVTTLINNLPTEMPTSFILAYKPIPNSGSKVQKQSLAKMMMALIKDEPLPMPTTMKSLDTTTKIPLLRDDDDKINLKNAVAKLQESTKVDKQIENAVSKLMEETRQEHLKEEERKNKDKEKPVAPPTPSIPKLKQKVKLLKLQELTRPIPKEIKEKLMIQAVERILRAEKESVIGGAAQIRTKFITIFASSYTPEIRELVLNYILEDPLNRIDLALSWLYEEYAYMQGFNRHPVTLQPKLHEKHGENYNQLLCALITQISERGDPVMEGSKDVLLRKVYSEAPVVTDEAVDYLKHLVTEEKSATVALELLEELCLLRPPRAHKFVAALVCHVLSENEEIRNIALKSSTKIYKHSTDAAKKVIEKHAMLYLGFISLSTPPQELYGNRHASRPWSDDLYKMCLNLVMALFPEKEDVIIEIARVYGTTGAEAKRCVLRQLEVPVRALAASEPPGHLSPALAALLDACPRGAETLLTRIVHVLTDKYPPSPELVSRVRELYATRVSDVRFLIPVLNGLTKKELQLLVIDTVIAVQKRAIQAASLVYRNVLLWICKGTSEMKDMQYVWEHLSELKLLILNMIDSDNEGIRTHSIKFLEEVVVLQSPHANDDDDFSLDCLPSHLPFLNRKAMEEESDHIFQLLLKFHNSQHISSVNLMACMTTLCIVAKLRPKYMSSVVKALNDLHTTLPPTLSQSQVNSVRKHLKMQLLILVKHPSSYDMMPQLTQLLMDIGMTPQEINKALPRDRRNKRLGELRASENPAKRFRVDSPQSTLGSDSNSNSRSEFNLFDDDNSQQGSQQSITKASCTEESILDGLNSLENVVNLVVTTLINNLPTEMPTSFILAYKPIPNSGSKVQKQSLAKMMMALIKDEPLPMPTTMKSLDTTTKIPLLRDDDDKINLKNAVAKLQESTKVDKQIENAVSKLMEETRQEHLKEEERKNKDKEKPVAPPTPSIPKLKQKVKLLKLQELTRPIPKEIKEKLMIQAVERILRAEKESVIGGAAQIRTKFITIFASSYTPEIRELVLNYILEDPLNRIDLALSWLYEEYAYMQGFNRHPVTLQPKLHEKHGENYNQLLCALITQISERGDPVMEGSKDVLLRKVYSEAPVVTDEAVDYLKHLVTEEKSATVALELLEELCLLRPPRAHKFVAALVCHVLSENEEIRNIALKSSTKIYKHSTDAAKKVIEKHAMLYLGFISLSTPPQELYGNRHASRPWSDDLYKMCLNLVMALFPEKEDVIIEIARVYGTTGAEAKRCVLRQLEVPVRALAASEPPGHLSPALAALLDACPRGAETLLTRIVHVLTDKYPPSPELVSRVRELYATRVSDVRFLIPVLNGLTKKEILAALPKLIKLNPIVVKEVFNKLLGLQNPNEEQLPVSPEELLVALHLIDPSKADLKYIIKATALCFAEKNTYTQEVLSSVLQRLAEEQQTPVLMMRSVLQALTLHPSLAPLALNILCLLCEREVWNNKVAWEGWVKCAERLGPRAGPALRSLPPRARDMLPSHLTASCPSDAPYSGPNPIEPLPPGME-