DPGLEAN10977 in OGS1.0

New model in OGS2.0DPOGS205460 
Genomic Positionscaffold2266:+ 122-6156
See gene structure
CDS Length3087
Paired RNAseq reads  1378
Single RNAseq reads  3164
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008399 (0.0)
Best Drosophila hit  symplekin (4e-151)
Best Human hitsymplekin (2e-133)
Best NR hit (blastp)  Symplekin, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx)  Symplekin, putative [Pediculus humanus corporis] (0.0)
GeneOntology terms




  
GO:0005923 tight junction
GO:0006379 mRNA cleavage
GO:0005848 mRNA cleavage stimulating factor complex
GO:0005488 binding
GO:0006398 histone mRNA 3'-end processing
GO:0005634 nucleus
InterPro families

  
IPR016024 Armadillo-type fold
IPR021850 Protein of unknown function DUF3453
IPR022075 Symplekin tight junction protein C-terminal
Orthology groupMCL12507

Nucleotide sequence:

ATGTATATCTTAATATTTTCAAGCAAATCATATCCTGAATATCTTCCTAAAATAATTGGT
CAGCTGCAATTACTGGTGATAGATACGGTTATTGCGGTACAGAAGAGGGCCATTCAGGCG
GCGAGCCTGGTTTACAGAAATGTCCTCCTTTGGATATGTAAAGGAACCTCGGAAATGAAA
GACATGCAATATGTTTGGGAACATTTATCCGAATTAAAGCTTCTTATTCTCAATATGATA
GATAGTGACAATGAAGGTATAAGGACACATTCAATCAAGTTCTTGGAGGAAGTTGTTGTA
CTTCAAAGTCCTCATGCAAATGATGACGATGACTTTAGTTTGGACTGTTTACCGTCACAC
CTGCCATTCCTCAACAGAAAGGCCATGGAAGAAGAGTCCGACCATATTTTCCAATTACTG
TTAAAATTCCACAACTCGCAACATATATCAAGCGTTAACCTCATGGCTTGTATGACAACG
TTATGCATAGTGGCGAAACTTAGACCAAAATATATGTCGAGTGTGGTGAAAGCTTTAAAC
GACCTGCACACGACACTACCACCTACATTATCACAGTCACAGGTGAATTCAGTGCGGAAA
CACCTCAAGATGCAGCTACTGATATTGGTCAAACATCCATCCTCATATGACATGATGCCG
CAATTGACGCAGTTGCTCATGGACATTGGAATGACCCCACAAGAAATAAATAAAGCTCTA
CCTAGAGACAGAAGGAACAAACGATTGGGAGAGTTGAGGGCGTCAGAAAATCCAGCAAAA
AGATTCAGGGTTGACTCGCCTCAGAGCACCTTGGGCAGCGATAGCAACAGTAACAGCAGA
AGTGAATTCAATTTGTTTGATGATGACAACAGCCAGCAGGGCAGCCAACAGAGTATAACC
AAAGCATCCTGTACAGAAGAATCAATTTTAGATGGTCTGAATAGTTTAGAAAATGTCGTC
AATCTGGTTGTGACAACCCTGATTAATAACCTGCCGACTGAAATGCCCACTAGTTTCATC
TTGGCTTATAAACCTATTCCAAATTCGGGAAGCAAGGTCCAGAAGCAAAGTCTTGCTAAA
ATGATGATGGCATTGATCAAAGACGAACCATTGCCAATGCCTACCACCATGAAGTCTTTG
GATACCACCACAAAAATCCCATTACTGAGAGACGACGACGATAAAATCAATCTAAAGAAC
GCGGTCGCAAAACTACAGGAGTCCACCAAAGTAGACAAGCAAATAGAAAATGCTGTATCG
AAACTGATGGAAGAAACTAGACAGGAGCATCTCAAAGAAGAGGAGAGAAAGAATAAGGAT
AAAGAAAAACCAGTCGCTCCACCAACGCCATCTATACCGAAATTAAAACAGAAAGTGAAA
CTATTAAAACTCCAAGAACTAACTCGACCCATACCAAAGGAAATTAAAGAGAAACTGATG
ATTCAAGCCGTGGAAAGAATATTGCGAGCCGAGAAAGAGAGCGTTATCGGGGGAGCGGCT
CAAATAAGGACGAAATTCATCACGATATTCGCGTCAAGTTACACTCCGGAGATACGAGAA
CTGGTCCTCAACTACATACTGGAAGATCCGTTAAACAGAATCGACCTGGCATTATCCTGG
CTGTACGAAGAATACGCGTACATGCAAGGCTTCAACCGGCATCCGGTGACGCTGCAGCCC
AAACTGCACGAAAAACACGGCGAAAACTACAACCAGCTGCTATGCGCTCTGATCACGCAG
ATATCAGAGAGGGGGGATCCGGTGATGGAAGGGAGTAAGGACGTCCTGCTGAGGAAGGTT
TACTCCGAAGCGCCCGTAGTCACCGACGAGGCGGTGGACTACTTGAAGCATCTGGTCACT
GAGGAAAAGTCAGCGACGGTAGCCCTGGAACTGCTCGAGGAGTTGTGTCTGCTAAGACCA
CCTAGGGCGCACAAATTTGTTGCCGCCCTAGTATGTCACGTGTTGAGTGAAAACGAGGAA
ATTCGCAATATAGCCTTGAAATCGTCAACCAAAATCTACAAACACAGTACGGACGCCGCT
AAGAAGGTTATAGAGAAACACGCTATGTTGTACCTCGGCTTTATCTCGCTGTCAACGCCG
CCCCAAGAGTTGTATGGCAACAGACACGCGAGCAGACCCTGGTCCGACGACTTGTATAAA
ATGTGCCTCAATCTGGTCATGGCGTTGTTCCCAGAGAAGGAAGACGTGATCATTGAGATC
GCCCGCGTCTACGGAACCACAGGCGCTGAGGCGAAGCGCTGTGTGCTGCGACAGCTGGAA
GTGCCTGTCCGTGCTCTGGCCGCCTCAGAGCCTCCTGGACACCTGTCTCCTGCGCTCGCA
GCACTGCTGGATGCGTGCCCGCGCGGCGCCGAGACGCTACTGACGCGGATCGTGCACGTG
CTCACTGATAAATATCCGCCGAGCCCCGAACTGGTGTCTCGTGTCCGCGAGCTGTACGCG
ACCCGAGTCTCAGATGTGCGGTTCCTTATACCGGTGCTGAATGGACTTACCAAGAAGGAG
ATTCTGGCTGCCCTGCCGAAGTTGATCAAATTAAATCCAATAGTAGTGAAGGAAGTTTTC
AACAAATTACTCGGCCTGCAGAATCCCAACGAAGAACAATTACCGGTCTCTCCCGAAGAA
CTACTGGTAGCTTTGCATCTTATAGACCCGAGCAAAGCAGATCTCAAGTACATCATCAAA
GCGACCGCTTTATGTTTCGCTGAAAAGAACACTTACACACAGGAGGTGTTGTCTTCAGTT
CTCCAGCGCCTGGCTGAGGAGCAGCAGACGCCAGTACTGATGATGCGCTCTGTTCTGCAA
GCGTTGACCCTTCACCCATCACTAGCGCCGCTCGCCCTCAACATACTATGCCTCCTGTGC
GAGAGAGAGGTTTGGAACAACAAAGTGGCTTGGGAGGGTTGGGTGAAGTGCGCTGAACGA
CTTGGACCTCGAGCGGGTCCCGCGCTAAGGTCACTACCACCGAGGGCGAGAGACATGCTA
CCATCGCACCTTACAGCCTCGTGTCCGTCGGATGCTCCTTATTCTGGCCCAAACCCGATA
GAGCCGTTACCCCCCGGAATGGAATGA

Protein sequence:

MYILIFSSKSYPEYLPKIIGQLQLLVIDTVIAVQKRAIQAASLVYRNVLLWICKGTSEMK
DMQYVWEHLSELKLLILNMIDSDNEGIRTHSIKFLEEVVVLQSPHANDDDDFSLDCLPSH
LPFLNRKAMEEESDHIFQLLLKFHNSQHISSVNLMACMTTLCIVAKLRPKYMSSVVKALN
DLHTTLPPTLSQSQVNSVRKHLKMQLLILVKHPSSYDMMPQLTQLLMDIGMTPQEINKAL
PRDRRNKRLGELRASENPAKRFRVDSPQSTLGSDSNSNSRSEFNLFDDDNSQQGSQQSIT
KASCTEESILDGLNSLENVVNLVVTTLINNLPTEMPTSFILAYKPIPNSGSKVQKQSLAK
MMMALIKDEPLPMPTTMKSLDTTTKIPLLRDDDDKINLKNAVAKLQESTKVDKQIENAVS
KLMEETRQEHLKEEERKNKDKEKPVAPPTPSIPKLKQKVKLLKLQELTRPIPKEIKEKLM
IQAVERILRAEKESVIGGAAQIRTKFITIFASSYTPEIRELVLNYILEDPLNRIDLALSW
LYEEYAYMQGFNRHPVTLQPKLHEKHGENYNQLLCALITQISERGDPVMEGSKDVLLRKV
YSEAPVVTDEAVDYLKHLVTEEKSATVALELLEELCLLRPPRAHKFVAALVCHVLSENEE
IRNIALKSSTKIYKHSTDAAKKVIEKHAMLYLGFISLSTPPQELYGNRHASRPWSDDLYK
MCLNLVMALFPEKEDVIIEIARVYGTTGAEAKRCVLRQLEVPVRALAASEPPGHLSPALA
ALLDACPRGAETLLTRIVHVLTDKYPPSPELVSRVRELYATRVSDVRFLIPVLNGLTKKE
ILAALPKLIKLNPIVVKEVFNKLLGLQNPNEEQLPVSPEELLVALHLIDPSKADLKYIIK
ATALCFAEKNTYTQEVLSSVLQRLAEEQQTPVLMMRSVLQALTLHPSLAPLALNILCLLC
EREVWNNKVAWEGWVKCAERLGPRAGPALRSLPPRARDMLPSHLTASCPSDAPYSGPNPI
EPLPPGME