DPGLEAN19976 in OGS1.0

New model in OGS2.0DPOGS202638 
Genomic Positionscaffold4292:- 6197-16957
See gene structure
CDS Length2802
Paired RNAseq reads  1902
Single RNAseq reads  5845
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000839 (2e-25)
Best Drosophila hit  CG12325 (0.0)
Best Human hitperiodic tryptophan protein 2 homolog (5e-168)
Best NR hit (blastp)  PREDICTED: similar to CG12325-PA [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG12325-PA [Apis mellifera] (0.0)
GeneOntology terms




  
GO:0004252 serine-type endopeptidase activity
GO:0007165 signal transduction
GO:0006508 proteolysis
GO:0005730 nucleolus
GO:0005634 nucleus
GO:0004871 signal transducer activity
InterPro families






  
IPR019781 WD40 repeat, subgroup
IPR007148 Small-subunit processome, Utp12
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
Orthology groupMCL12340

Nucleotide sequence:

ATGACTGCACCCAGTGCATTCACGGGAGAGTTCCGTTCATTCATAATGAGACGTGTTTTT
AAAAAATCACATGACGAAGTCACTTGCCTGGACTGGTCTAGTTGTGGAAAGTTACTTGCG
GTGGGATCCAAAGACACCACAACCAAATTATACACAGCCGAGTACTTGGACAACCTAAAT
ATGTACTCTTTAAGTGGTCACACAGATAAGATTGTTGGTGTATTCTTTGAGCAGAAGAGT
TTAGACCTTATAACTGTGAGCCGTAATGGTCAGGTTTGCTTGTGGGATGCCAGTCTGGAT
TCAGATAGCCTGGTTACTTCAGAGGTACAAATATCACATAAGAAGAGACGGAAATTGCAA
AAGGAAGCCGAATTAGTTGAGGATGAAGTTGATGAAGAGAATATAGTTGAAAAGGACAAA
GAATATGAGAGCGATAAAGATGTTGAGATAGAAGAGGAACAAAAGACAAATGATGGTAAA
AAACTCCAATACAACAAGCTGGGGAGGCATTATATTGGGGATTCCATAAGGAACGGTAAC
CATAAGGTGAAACTAACGGCTGCGGCATACCACAGGGGGACCAAGATATTAGTGACAGGT
TTCTCCACTGGTATATTCTTCCTTCACGAGATGCCAGATGTGAATCTCATCCACTCCTTG
AGTATATCAGAACACAGGATTGGCAGCATCTCGGTATCTCACCAGGGGGACTGGATAGCG
TTCGGTTGTCCCAACATTGGACAACTGTTGGTTTGGGAGTGGCAGAGCGAGCAATATGTT
ATGAAGCAGCAGGGCCACTCGCTAGACATGACCTGCCTCGCGTATTCGCCTGACGGGCTC
TACATAGTCACAGGCGGCTATGACGGGAAGGTCAAAGTATGGAATACCAGCTCGGGCTTC
TGCTTCGTTACATTCAGTGAACATAAGTCGACGGTGACCGGGATAACGTTCAGTGCCAAT
AAGAAATTCTTCGTGTCTTCATCTCTGGACGGCACCGTGAGATGTTACGATCTGACGAGG
TATCGTAACTTCCGTACTTTCTCGTCTCCGACCCTGGTTCAGTTCGGCTGCGTGTCCTTG
GACAGCAGCAGCGAACTGTGTGCTGCTGGAGGACAGGACGTCTTCGAGATATACCTGTGG
TCCGTCAAATTTGGGCGACTTTTGGAGGTGCTCGCAGGTCATGCAGCTCCAGTGGCTAGT
TTAGCTTTCAGTCCACTTCTGTCTAGTTCCAAACTGGCCTCCGCCTCCTGGGACAAGACG
GTAAAGATATGGAACTGTATAGAAACAAGCTCGGACTGTGAAACTATACAACTGGGTTCG
GACGCACTGCAAGTGAGCTTCAGACCTGATGGAGAAGAGATAGCAGTATCGACGTTAGAC
GGTAACATATCATTCTTCAACGCCACCACTTGCGACCAGACTGCCAGTTTGGAGGGGAGA
AACGATCTGGGAGCCGGCAGGGCCGACACCGATCTGGTTACACCGGAGAAGCTGTTGAAG
ACCAAAGCTTTCACTACGATATGCTACTCAGCGGATGGCACGTGTATCCTGGGGGCAGGA
AACTCCAAACACATATGTCTGTACAGTATTAAGGAGGGTGTACTCATCAAGAAGTTTGTG
ATCACGCAGAACAAGTCCTTGGACGCTATTAATGACTTTATAAATCGTCGGAACATCACC
GAATTTGGTAATATGGCGCTGGTTGAAGAGAGGGAGGAGTTGGAAGGAGGGGAGGTTAGG
GTGAGGCTGCCGGGGGTCGGCGGGGGAGATATGGCTGATAGGAGGCTGAAACCTGAGGTT
CGTGTGTGGTGTGTTCGTTTCTCTGGTGCTGATGAAAGCTTTGCAGCAGCATGTACCGAG
GGACTGCTGTTATATGGAACAAGAACGGGGAGTGGGTTCAGGCCATATCGTCTAGAAACA
GGTTCCACGCCGGCTGCAGTGAAAAATCTATTGTCGGAGAGATCATGGGGCTTCGCCCTC
ATTGGAGCCCTACAGCTCAACGACAACACTCTCATACAGCAATGCGTAGAAGCTGTCCCC
CCGAATGACATCGAACTAACAGCAAAGAGTTTGGAAGAAGATTACATGATACGTCTTCTT
AACTCGATCGCAAGTCTTCTAGAAGATAGTCCCCATCTAGAACATTTGCTCATCTGGGTT
AGGAGTCTCGTCACAGACAAGAAGAAATTCCCGCCCAGCGTGTTGCTAGCCTTAGAGAAG
GCGCTCACGGTGAAATATTCGCAGATTAATAAAATGCCATATCGTCTAGAAACAGGTTCC
ACTCCGGCTGCAGTGAAAAATCTATTGTCGGAGAGATCATGGGGCTTCGCCCTCATTGGA
GCCCTACAGCTCAACGACAACACTCTCATACAGCAATGCGTAGAAGCTGTCCCCCCGAAT
GACATCGAACTAACAGCAAAGAGTTTGGAAGAAGATTACATGATACGTCTTCTTAACTCG
ATCGCAAGTCTTCTAGAAGATAGTCCCCATCTAGAACATTTGCTCATCTGGGTTAGGAGT
CTCGTCACAGACAAGAAGAAATTCCCGCCCAGCGTGTTGCTAGCCTTAGAGAAGGCGCTC
ACGGTGAAATATTCGCAGATTAATAAAATTTCTTGTACGGACGCCCTTTTCCCTAATGTG
GAAGAAGTAGAAGAAGTTATAGAAGGTTCGAGGCGCCAGATCCAAGCACCTGGTATGCTA
TGGTTGAATGCTGTGCACAGTTGGCAATCCACGGCTGCACACATCATCACGTCGTTAATG
CATTTCACCAACACATTGACTTCGATATTGACCAGACCTTAG

Protein sequence:

MTAPSAFTGEFRSFIMRRVFKKSHDEVTCLDWSSCGKLLAVGSKDTTTKLYTAEYLDNLN
MYSLSGHTDKIVGVFFEQKSLDLITVSRNGQVCLWDASLDSDSLVTSEVQISHKKRRKLQ
KEAELVEDEVDEENIVEKDKEYESDKDVEIEEEQKTNDGKKLQYNKLGRHYIGDSIRNGN
HKVKLTAAAYHRGTKILVTGFSTGIFFLHEMPDVNLIHSLSISEHRIGSISVSHQGDWIA
FGCPNIGQLLVWEWQSEQYVMKQQGHSLDMTCLAYSPDGLYIVTGGYDGKVKVWNTSSGF
CFVTFSEHKSTVTGITFSANKKFFVSSSLDGTVRCYDLTRYRNFRTFSSPTLVQFGCVSL
DSSSELCAAGGQDVFEIYLWSVKFGRLLEVLAGHAAPVASLAFSPLLSSSKLASASWDKT
VKIWNCIETSSDCETIQLGSDALQVSFRPDGEEIAVSTLDGNISFFNATTCDQTASLEGR
NDLGAGRADTDLVTPEKLLKTKAFTTICYSADGTCILGAGNSKHICLYSIKEGVLIKKFV
ITQNKSLDAINDFINRRNITEFGNMALVEEREELEGGEVRVRLPGVGGGDMADRRLKPEV
RVWCVRFSGADESFAAACTEGLLLYGTRTGSGFRPYRLETGSTPAAVKNLLSERSWGFAL
IGALQLNDNTLIQQCVEAVPPNDIELTAKSLEEDYMIRLLNSIASLLEDSPHLEHLLIWV
RSLVTDKKKFPPSVLLALEKALTVKYSQINKMPYRLETGSTPAAVKNLLSERSWGFALIG
ALQLNDNTLIQQCVEAVPPNDIELTAKSLEEDYMIRLLNSIASLLEDSPHLEHLLIWVRS
LVTDKKKFPPSVLLALEKALTVKYSQINKISCTDALFPNVEEVEEVIEGSRRQIQAPGML
WLNAVHSWQSTAAHIITSLMHFTNTLTSILTRP