DPGLEAN10840 in OGS1.0

New model in OGS2.0DPOGS204547 
Genomic Positionscaffold424:+ 65972-80723
See gene structure
CDS Length2268
Paired RNAseq reads  222
Single RNAseq reads  639
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005337 (0.0)
Best Drosophila hit  CG11206, isoform D (1e-128)
Best Human hitkazrin isoform E (4e-69)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC010503 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010503 [Tribolium castaneum] (1e-175)
GeneOntology terms




  
GO:0001533 cornified envelope
GO:0005634 nucleus
GO:0005737 cytoplasm
GO:0030054 cell junction
GO:0030057 desmosome
GO:0031424 keratinization
InterPro families



  
IPR010993 Sterile alpha motif homology
IPR001660 Sterile alpha motif domain
IPR021129 Sterile alpha motif, type 1
IPR011510 Sterile alpha motif, type 2
IPR013761 Sterile alpha motif-type
Orthology groupMCL12137

Nucleotide sequence:

ATGTTTGGGATACTAACAGTGTCTTTTATTGTAGCTCTAAAAGCCGACAAGAAACGGCTC
AAAGCGGAGAAATTTGATTTGCTGAATCAAATGAAGCAACTCTACGCCACTCTTGAGGAC
AAAGAGAAGGAGCTTAGGGATTTCATCAGGAATTATGAACAGATGCGGTCTCGAAGCGGT
GCGTCCTCAGCGCTGGGTGCAGAACGAGCTGAACGCGAACGCGAGCGTGCGGCATTGTTG
CGGCATGCCCGCGACGAGGCCGAGCGCTCTTTACAACTGGCGGCCGCACTCAGCGCCCGT
GATACGCAGTTGCGGCATGCTAGGGAACAGCTTTTTGAGGCTCGAAGACAACTACAAGCA
GCAGGGTGTTTGTCCGAAGGTGAGAGTGTAGCGTCTTTGGGAATTGGTCCTCCAATGATG
CTTGGAGGTCCCACGGGTTTGATGGGTGATAGAGGTAGCTGCAGCGCAGATTCTGGAGTT
AGAGGTAGTAGCGATGGTGGCGCCACGTCGGTTTGCGGCGGAAACCTATCAGACTCCACC
GCAGAGGGCGCGCCTCCCACCCTCGACCCATACGATACAGATGCTGTATCGCTGGTGTCA
TCCGCGCACCCTATATACCAATTAAGCACGCCCCGTGACTGTAGCCCGACTCTCTCACCA
CATAACAGCGGTTCATCATTCACAAGATCTATTGATGCTGGATCACTATCTAGGTCAGTT
GAGCAGTTATCGAGTCCGGGGGAATGTGACTCTGGTATGGTTGGGATGCGGACCCGTCCT
GGGGGTTCCAAGGCCGGCCGCGGTCGGGGATCCGCTTGGGGATCCATATCTCGCGTTTTT
GCTAGAAGCAGACACCGCACCAAGTCCGGAAGCGCAGCCAGTAGTGGTCACGAGAGCGAG
CCAATATACGCTGGCACAGGCAGCACAAGTCGCGCTTGGTCTCCTTTAGGGAGTGAGGCG
GCATTACGCGAAGCTGCCTCTCTACCTCTATCAAGATGGCGGGCACCAGCCATCATCGCC
TGGCTTGAACTTGCTCTTGGCATGCCGCAATATGCAGCTGCTGTTGCTGATAATGTTAAA
AGTGGAAAGATCCGTGCTTTGAATGGGCAGGTTCTGCTCGAGTTGACGGACACGGATCTT
GAGGTCGGGTTGGGGGTGACTCAGCCAATGCACAGGAAGAAGCTGCGGCTGGCCATCGAA
GAGAGACGGCGGCCGGACCTCGTACGGAACCCTAGCATCGGACAGCTGAGTCATGCATGG
GTTGCGGCGGAGTGGTTGCCAGATCTAGGGCTATCCCAGTACGCAGAATCATTTTTAGCC
AATTTGGTGGATGCTAGAATGCTGGATACTATCAGCAAGAAGGAGCTGGAGAAATATCTT
GGTGTTACGAGAAAGTTCCATCAGGCATCCATTGTCCACGGCATTCATTTGCTACGAATC
ATGAAATATGATAGACAAGCACTGGCAGTACGGCGGCATCAGTGCGAAAATGTCGATGCG
GACCCTCTGGTTTGGACCAATCAAAGGTTTATGCGTTGGTCTCACAATATCGACTTGGGT
GAATTTGCTGAGAATCTTAAAGACAGCGGTGTCCATGGTGGTTTGGTGGTACTGGAACCA
TCATTCACTGGTGAGACCATGGCCACGGCGCTTGGTATACCACCGTCGAAGAGTATAATT
CGAAGGCATTTGGTAGCTGAATTTGATGCCCTTGTCATCCCAGCGAGGAATATGTTTGGT
CACCAAATAAGGATGTTGGGAAGACCGTTTTCAAGATCGGTTGCGACAGGCTTGCCTGGA
ATTGACTTTAGCGCTGATTCTAGACGACATAGTCTAAGGGGCTCTATAACACGAGCGTTG
GGTGTTCTCAAGCCGAAGCACGATAGACCATCACCATCTAGTTCAAGCGAGAGTTCTAGC
GTGATGAGTCTGACACAACCGTACATATCTTATTCACCTCCTATAGCAGTGCGGACGCTG
TCTCAATTGAGCATGACATACGCTCCACCACCGACACTGGCAGAGTATGAACCGATATAT
ACGCCTTTGAGTTTATATTCCCAGTCTAGCGTATCCACAAAGGATAGCCTTCAGCGCCTT
AATGATGGCAAAGATTATAATATCACCCACAGGTACGGACAAAAAGTAGATCAATCTCAT
CGAGTCAGTTCACCGTTACCTGAAACATCTGACGGAAATAAGCAAAGACGTCACAGACGA
GTGAAAAGTATAGGAGATATTAATGCTTCGAGCAAAACGACGGTTTAA

Protein sequence:

MFGILTVSFIVALKADKKRLKAEKFDLLNQMKQLYATLEDKEKELRDFIRNYEQMRSRSG
ASSALGAERAERERERAALLRHARDEAERSLQLAAALSARDTQLRHAREQLFEARRQLQA
AGCLSEGESVASLGIGPPMMLGGPTGLMGDRGSCSADSGVRGSSDGGATSVCGGNLSDST
AEGAPPTLDPYDTDAVSLVSSAHPIYQLSTPRDCSPTLSPHNSGSSFTRSIDAGSLSRSV
EQLSSPGECDSGMVGMRTRPGGSKAGRGRGSAWGSISRVFARSRHRTKSGSAASSGHESE
PIYAGTGSTSRAWSPLGSEAALREAASLPLSRWRAPAIIAWLELALGMPQYAAAVADNVK
SGKIRALNGQVLLELTDTDLEVGLGVTQPMHRKKLRLAIEERRRPDLVRNPSIGQLSHAW
VAAEWLPDLGLSQYAESFLANLVDARMLDTISKKELEKYLGVTRKFHQASIVHGIHLLRI
MKYDRQALAVRRHQCENVDADPLVWTNQRFMRWSHNIDLGEFAENLKDSGVHGGLVVLEP
SFTGETMATALGIPPSKSIIRRHLVAEFDALVIPARNMFGHQIRMLGRPFSRSVATGLPG
IDFSADSRRHSLRGSITRALGVLKPKHDRPSPSSSSESSSVMSLTQPYISYSPPIAVRTL
SQLSMTYAPPPTLAEYEPIYTPLSLYSQSSVSTKDSLQRLNDGKDYNITHRYGQKVDQSH
RVSSPLPETSDGNKQRRHRRVKSIGDINASSKTTV