New model in OGS2.0 | DPOGS205460  |
---|---|
Genomic Position | scaffold2266:+ 122-6156 |
See gene structure | |
CDS Length | 3087 |
Paired RNAseq reads   | 1378 |
Single RNAseq reads   | 3164 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008399 (0.0) |
Best Drosophila hit   | symplekin (4e-151) |
Best Human hit | symplekin (2e-133) |
Best NR hit (blastp)   | Symplekin, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | Symplekin, putative [Pediculus humanus corporis] (0.0) |
GeneOntology terms    | GO:0005923 tight junction GO:0006379 mRNA cleavage GO:0005848 mRNA cleavage stimulating factor complex GO:0005488 binding GO:0006398 histone mRNA 3'-end processing GO:0005634 nucleus |
InterPro families    | IPR016024 Armadillo-type fold IPR021850 Protein of unknown function DUF3453 IPR022075 Symplekin tight junction protein C-terminal |
Orthology group | MCL12507 |
Nucleotide sequence:
ATGTATATCTTAATATTTTCAAGCAAATCATATCCTGAATATCTTCCTAAAATAATTGGT
CAGCTGCAATTACTGGTGATAGATACGGTTATTGCGGTACAGAAGAGGGCCATTCAGGCG
GCGAGCCTGGTTTACAGAAATGTCCTCCTTTGGATATGTAAAGGAACCTCGGAAATGAAA
GACATGCAATATGTTTGGGAACATTTATCCGAATTAAAGCTTCTTATTCTCAATATGATA
GATAGTGACAATGAAGGTATAAGGACACATTCAATCAAGTTCTTGGAGGAAGTTGTTGTA
CTTCAAAGTCCTCATGCAAATGATGACGATGACTTTAGTTTGGACTGTTTACCGTCACAC
CTGCCATTCCTCAACAGAAAGGCCATGGAAGAAGAGTCCGACCATATTTTCCAATTACTG
TTAAAATTCCACAACTCGCAACATATATCAAGCGTTAACCTCATGGCTTGTATGACAACG
TTATGCATAGTGGCGAAACTTAGACCAAAATATATGTCGAGTGTGGTGAAAGCTTTAAAC
GACCTGCACACGACACTACCACCTACATTATCACAGTCACAGGTGAATTCAGTGCGGAAA
CACCTCAAGATGCAGCTACTGATATTGGTCAAACATCCATCCTCATATGACATGATGCCG
CAATTGACGCAGTTGCTCATGGACATTGGAATGACCCCACAAGAAATAAATAAAGCTCTA
CCTAGAGACAGAAGGAACAAACGATTGGGAGAGTTGAGGGCGTCAGAAAATCCAGCAAAA
AGATTCAGGGTTGACTCGCCTCAGAGCACCTTGGGCAGCGATAGCAACAGTAACAGCAGA
AGTGAATTCAATTTGTTTGATGATGACAACAGCCAGCAGGGCAGCCAACAGAGTATAACC
AAAGCATCCTGTACAGAAGAATCAATTTTAGATGGTCTGAATAGTTTAGAAAATGTCGTC
AATCTGGTTGTGACAACCCTGATTAATAACCTGCCGACTGAAATGCCCACTAGTTTCATC
TTGGCTTATAAACCTATTCCAAATTCGGGAAGCAAGGTCCAGAAGCAAAGTCTTGCTAAA
ATGATGATGGCATTGATCAAAGACGAACCATTGCCAATGCCTACCACCATGAAGTCTTTG
GATACCACCACAAAAATCCCATTACTGAGAGACGACGACGATAAAATCAATCTAAAGAAC
GCGGTCGCAAAACTACAGGAGTCCACCAAAGTAGACAAGCAAATAGAAAATGCTGTATCG
AAACTGATGGAAGAAACTAGACAGGAGCATCTCAAAGAAGAGGAGAGAAAGAATAAGGAT
AAAGAAAAACCAGTCGCTCCACCAACGCCATCTATACCGAAATTAAAACAGAAAGTGAAA
CTATTAAAACTCCAAGAACTAACTCGACCCATACCAAAGGAAATTAAAGAGAAACTGATG
ATTCAAGCCGTGGAAAGAATATTGCGAGCCGAGAAAGAGAGCGTTATCGGGGGAGCGGCT
CAAATAAGGACGAAATTCATCACGATATTCGCGTCAAGTTACACTCCGGAGATACGAGAA
CTGGTCCTCAACTACATACTGGAAGATCCGTTAAACAGAATCGACCTGGCATTATCCTGG
CTGTACGAAGAATACGCGTACATGCAAGGCTTCAACCGGCATCCGGTGACGCTGCAGCCC
AAACTGCACGAAAAACACGGCGAAAACTACAACCAGCTGCTATGCGCTCTGATCACGCAG
ATATCAGAGAGGGGGGATCCGGTGATGGAAGGGAGTAAGGACGTCCTGCTGAGGAAGGTT
TACTCCGAAGCGCCCGTAGTCACCGACGAGGCGGTGGACTACTTGAAGCATCTGGTCACT
GAGGAAAAGTCAGCGACGGTAGCCCTGGAACTGCTCGAGGAGTTGTGTCTGCTAAGACCA
CCTAGGGCGCACAAATTTGTTGCCGCCCTAGTATGTCACGTGTTGAGTGAAAACGAGGAA
ATTCGCAATATAGCCTTGAAATCGTCAACCAAAATCTACAAACACAGTACGGACGCCGCT
AAGAAGGTTATAGAGAAACACGCTATGTTGTACCTCGGCTTTATCTCGCTGTCAACGCCG
CCCCAAGAGTTGTATGGCAACAGACACGCGAGCAGACCCTGGTCCGACGACTTGTATAAA
ATGTGCCTCAATCTGGTCATGGCGTTGTTCCCAGAGAAGGAAGACGTGATCATTGAGATC
GCCCGCGTCTACGGAACCACAGGCGCTGAGGCGAAGCGCTGTGTGCTGCGACAGCTGGAA
GTGCCTGTCCGTGCTCTGGCCGCCTCAGAGCCTCCTGGACACCTGTCTCCTGCGCTCGCA
GCACTGCTGGATGCGTGCCCGCGCGGCGCCGAGACGCTACTGACGCGGATCGTGCACGTG
CTCACTGATAAATATCCGCCGAGCCCCGAACTGGTGTCTCGTGTCCGCGAGCTGTACGCG
ACCCGAGTCTCAGATGTGCGGTTCCTTATACCGGTGCTGAATGGACTTACCAAGAAGGAG
ATTCTGGCTGCCCTGCCGAAGTTGATCAAATTAAATCCAATAGTAGTGAAGGAAGTTTTC
AACAAATTACTCGGCCTGCAGAATCCCAACGAAGAACAATTACCGGTCTCTCCCGAAGAA
CTACTGGTAGCTTTGCATCTTATAGACCCGAGCAAAGCAGATCTCAAGTACATCATCAAA
GCGACCGCTTTATGTTTCGCTGAAAAGAACACTTACACACAGGAGGTGTTGTCTTCAGTT
CTCCAGCGCCTGGCTGAGGAGCAGCAGACGCCAGTACTGATGATGCGCTCTGTTCTGCAA
GCGTTGACCCTTCACCCATCACTAGCGCCGCTCGCCCTCAACATACTATGCCTCCTGTGC
GAGAGAGAGGTTTGGAACAACAAAGTGGCTTGGGAGGGTTGGGTGAAGTGCGCTGAACGA
CTTGGACCTCGAGCGGGTCCCGCGCTAAGGTCACTACCACCGAGGGCGAGAGACATGCTA
CCATCGCACCTTACAGCCTCGTGTCCGTCGGATGCTCCTTATTCTGGCCCAAACCCGATA
GAGCCGTTACCCCCCGGAATGGAATGA
Protein sequence:
MYILIFSSKSYPEYLPKIIGQLQLLVIDTVIAVQKRAIQAASLVYRNVLLWICKGTSEMK
DMQYVWEHLSELKLLILNMIDSDNEGIRTHSIKFLEEVVVLQSPHANDDDDFSLDCLPSH
LPFLNRKAMEEESDHIFQLLLKFHNSQHISSVNLMACMTTLCIVAKLRPKYMSSVVKALN
DLHTTLPPTLSQSQVNSVRKHLKMQLLILVKHPSSYDMMPQLTQLLMDIGMTPQEINKAL
PRDRRNKRLGELRASENPAKRFRVDSPQSTLGSDSNSNSRSEFNLFDDDNSQQGSQQSIT
KASCTEESILDGLNSLENVVNLVVTTLINNLPTEMPTSFILAYKPIPNSGSKVQKQSLAK
MMMALIKDEPLPMPTTMKSLDTTTKIPLLRDDDDKINLKNAVAKLQESTKVDKQIENAVS
KLMEETRQEHLKEEERKNKDKEKPVAPPTPSIPKLKQKVKLLKLQELTRPIPKEIKEKLM
IQAVERILRAEKESVIGGAAQIRTKFITIFASSYTPEIRELVLNYILEDPLNRIDLALSW
LYEEYAYMQGFNRHPVTLQPKLHEKHGENYNQLLCALITQISERGDPVMEGSKDVLLRKV
YSEAPVVTDEAVDYLKHLVTEEKSATVALELLEELCLLRPPRAHKFVAALVCHVLSENEE
IRNIALKSSTKIYKHSTDAAKKVIEKHAMLYLGFISLSTPPQELYGNRHASRPWSDDLYK
MCLNLVMALFPEKEDVIIEIARVYGTTGAEAKRCVLRQLEVPVRALAASEPPGHLSPALA
ALLDACPRGAETLLTRIVHVLTDKYPPSPELVSRVRELYATRVSDVRFLIPVLNGLTKKE
ILAALPKLIKLNPIVVKEVFNKLLGLQNPNEEQLPVSPEELLVALHLIDPSKADLKYIIK
ATALCFAEKNTYTQEVLSSVLQRLAEEQQTPVLMMRSVLQALTLHPSLAPLALNILCLLC
EREVWNNKVAWEGWVKCAERLGPRAGPALRSLPPRARDMLPSHLTASCPSDAPYSGPNPI
EPLPPGME