New model in OGS2.0 | DPOGS202638  |
---|---|
Genomic Position | scaffold4292:- 6197-16957 |
CDS Length | 2802 |
Paired RNAseq reads   | 1902 |
Single RNAseq reads   | 5845 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000839 (2e-25) |
Best Drosophila hit   | CG12325 (0.0) |
Best Human hit | periodic tryptophan protein 2 homolog (5e-168) |
Best NR hit (blastp)   | PREDICTED: similar to CG12325-PA [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG12325-PA [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0007165 signal transduction; GO:0006508 proteolysis; GO:0005730 nucleolus; GO:0005634 nucleus; GO:0004871 signal transducer activity |
InterPro families    | IPR019781 WD40 repeat, subgroup; IPR007148 Small-subunit processome, Utp12; IPR011046 WD40 repeat-like-containing domain; IPR001680 WD40 repeat; IPR015943 WD40/YVTN repeat-like-containing domain; IPR019775 WD40 repeat, conserved site; IPR019782 WD40 repeat 2; IPR017986 WD40-repeat-containing domain |
Orthology group | MCL12340 |
Nucleotide sequence:
ATGACTGCACCCAGTGCATTCACGGGAGAGTTCCGTTCATTCATAATGAGACGTGTTTTT
AAAAAATCACATGACGAAGTCACTTGCCTGGACTGGTCTAGTTGTGGAAAGTTACTTGCG
GTGGGATCCAAAGACACCACAACCAAATTATACACAGCCGAGTACTTGGACAACCTAAAT
ATGTACTCTTTAAGTGGTCACACAGATAAGATTGTTGGTGTATTCTTTGAGCAGAAGAGT
TTAGACCTTATAACTGTGAGCCGTAATGGTCAGGTTTGCTTGTGGGATGCCAGTCTGGAT
TCAGATAGCCTGGTTACTTCAGAGGTACAAATATCACATAAGAAGAGACGGAAATTGCAA
AAGGAAGCCGAATTAGTTGAGGATGAAGTTGATGAAGAGAATATAGTTGAAAAGGACAAA
GAATATGAGAGCGATAAAGATGTTGAGATAGAAGAGGAACAAAAGACAAATGATGGTAAA
AAACTCCAATACAACAAGCTGGGGAGGCATTATATTGGGGATTCCATAAGGAACGGTAAC
CATAAGGTGAAACTAACGGCTGCGGCATACCACAGGGGGACCAAGATATTAGTGACAGGT
TTCTCCACTGGTATATTCTTCCTTCACGAGATGCCAGATGTGAATCTCATCCACTCCTTG
AGTATATCAGAACACAGGATTGGCAGCATCTCGGTATCTCACCAGGGGGACTGGATAGCG
TTCGGTTGTCCCAACATTGGACAACTGTTGGTTTGGGAGTGGCAGAGCGAGCAATATGTT
ATGAAGCAGCAGGGCCACTCGCTAGACATGACCTGCCTCGCGTATTCGCCTGACGGGCTC
TACATAGTCACAGGCGGCTATGACGGGAAGGTCAAAGTATGGAATACCAGCTCGGGCTTC
TGCTTCGTTACATTCAGTGAACATAAGTCGACGGTGACCGGGATAACGTTCAGTGCCAAT
AAGAAATTCTTCGTGTCTTCATCTCTGGACGGCACCGTGAGATGTTACGATCTGACGAGG
TATCGTAACTTCCGTACTTTCTCGTCTCCGACCCTGGTTCAGTTCGGCTGCGTGTCCTTG
GACAGCAGCAGCGAACTGTGTGCTGCTGGAGGACAGGACGTCTTCGAGATATACCTGTGG
TCCGTCAAATTTGGGCGACTTTTGGAGGTGCTCGCAGGTCATGCAGCTCCAGTGGCTAGT
TTAGCTTTCAGTCCACTTCTGTCTAGTTCCAAACTGGCCTCCGCCTCCTGGGACAAGACG
GTAAAGATATGGAACTGTATAGAAACAAGCTCGGACTGTGAAACTATACAACTGGGTTCG
GACGCACTGCAAGTGAGCTTCAGACCTGATGGAGAAGAGATAGCAGTATCGACGTTAGAC
GGTAACATATCATTCTTCAACGCCACCACTTGCGACCAGACTGCCAGTTTGGAGGGGAGA
AACGATCTGGGAGCCGGCAGGGCCGACACCGATCTGGTTACACCGGAGAAGCTGTTGAAG
ACCAAAGCTTTCACTACGATATGCTACTCAGCGGATGGCACGTGTATCCTGGGGGCAGGA
AACTCCAAACACATATGTCTGTACAGTATTAAGGAGGGTGTACTCATCAAGAAGTTTGTG
ATCACGCAGAACAAGTCCTTGGACGCTATTAATGACTTTATAAATCGTCGGAACATCACC
GAATTTGGTAATATGGCGCTGGTTGAAGAGAGGGAGGAGTTGGAAGGAGGGGAGGTTAGG
GTGAGGCTGCCGGGGGTCGGCGGGGGAGATATGGCTGATAGGAGGCTGAAACCTGAGGTT
CGTGTGTGGTGTGTTCGTTTCTCTGGTGCTGATGAAAGCTTTGCAGCAGCATGTACCGAG
GGACTGCTGTTATATGGAACAAGAACGGGGAGTGGGTTCAGGCCATATCGTCTAGAAACA
GGTTCCACGCCGGCTGCAGTGAAAAATCTATTGTCGGAGAGATCATGGGGCTTCGCCCTC
ATTGGAGCCCTACAGCTCAACGACAACACTCTCATACAGCAATGCGTAGAAGCTGTCCCC
CCGAATGACATCGAACTAACAGCAAAGAGTTTGGAAGAAGATTACATGATACGTCTTCTT
AACTCGATCGCAAGTCTTCTAGAAGATAGTCCCCATCTAGAACATTTGCTCATCTGGGTT
AGGAGTCTCGTCACAGACAAGAAGAAATTCCCGCCCAGCGTGTTGCTAGCCTTAGAGAAG
GCGCTCACGGTGAAATATTCGCAGATTAATAAAATGCCATATCGTCTAGAAACAGGTTCC
ACTCCGGCTGCAGTGAAAAATCTATTGTCGGAGAGATCATGGGGCTTCGCCCTCATTGGA
GCCCTACAGCTCAACGACAACACTCTCATACAGCAATGCGTAGAAGCTGTCCCCCCGAAT
GACATCGAACTAACAGCAAAGAGTTTGGAAGAAGATTACATGATACGTCTTCTTAACTCG
ATCGCAAGTCTTCTAGAAGATAGTCCCCATCTAGAACATTTGCTCATCTGGGTTAGGAGT
CTCGTCACAGACAAGAAGAAATTCCCGCCCAGCGTGTTGCTAGCCTTAGAGAAGGCGCTC
ACGGTGAAATATTCGCAGATTAATAAAATTTCTTGTACGGACGCCCTTTTCCCTAATGTG
GAAGAAGTAGAAGAAGTTATAGAAGGTTCGAGGCGCCAGATCCAAGCACCTGGTATGCTA
TGGTTGAATGCTGTGCACAGTTGGCAATCCACGGCTGCACACATCATCACGTCGTTAATG
CATTTCACCAACACATTGACTTCGATATTGACCAGACCTTAG
Protein sequence:
MTAPSAFTGEFRSFIMRRVFKKSHDEVTCLDWSSCGKLLAVGSKDTTTKLYTAEYLDNLN
MYSLSGHTDKIVGVFFEQKSLDLITVSRNGQVCLWDASLDSDSLVTSEVQISHKKRRKLQ
KEAELVEDEVDEENIVEKDKEYESDKDVEIEEEQKTNDGKKLQYNKLGRHYIGDSIRNGN
HKVKLTAAAYHRGTKILVTGFSTGIFFLHEMPDVNLIHSLSISEHRIGSISVSHQGDWIA
FGCPNIGQLLVWEWQSEQYVMKQQGHSLDMTCLAYSPDGLYIVTGGYDGKVKVWNTSSGF
CFVTFSEHKSTVTGITFSANKKFFVSSSLDGTVRCYDLTRYRNFRTFSSPTLVQFGCVSL
DSSSELCAAGGQDVFEIYLWSVKFGRLLEVLAGHAAPVASLAFSPLLSSSKLASASWDKT
VKIWNCIETSSDCETIQLGSDALQVSFRPDGEEIAVSTLDGNISFFNATTCDQTASLEGR
NDLGAGRADTDLVTPEKLLKTKAFTTICYSADGTCILGAGNSKHICLYSIKEGVLIKKFV
ITQNKSLDAINDFINRRNITEFGNMALVEEREELEGGEVRVRLPGVGGGDMADRRLKPEV
RVWCVRFSGADESFAAACTEGLLLYGTRTGSGFRPYRLETGSTPAAVKNLLSERSWGFAL
IGALQLNDNTLIQQCVEAVPPNDIELTAKSLEEDYMIRLLNSIASLLEDSPHLEHLLIWV
RSLVTDKKKFPPSVLLALEKALTVKYSQINKMPYRLETGSTPAAVKNLLSERSWGFALIG
ALQLNDNTLIQQCVEAVPPNDIELTAKSLEEDYMIRLLNSIASLLEDSPHLEHLLIWVRS
LVTDKKKFPPSVLLALEKALTVKYSQINKISCTDALFPNVEEVEEVIEGSRRQIQAPGML
WLNAVHSWQSTAAHIITSLMHFTNTLTSILTRP
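
The CDS length, nucleotide sequence and protein sequence listed in this record can be cross-checked against each other: 2802 nt is 934 codons, which corresponds to the 933 residues shown plus a terminal stop codon (TAG). The following is a minimal sketch of such a check, assuming the standard genetic code; the function names `translate` and `check_record` are hypothetical helpers introduced for illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (assumption: this gene model uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS to protein; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def check_record(cds: str, protein: str, cds_length: int) -> bool:
    """Return True if the CDS has the stated length, ends in a stop codon,
    and translates to the given protein sequence."""
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    translated = translate(cds)
    return (
        len(cds) == cds_length
        and translated.endswith("*")
        and translated[:-1] == protein
    )


# Toy usage: the first four codons of this record followed by its stop codon.
print(translate("ATGACTGCACCCTAG"))
print(check_record("ATGACTGCACCCTAG", "MTAP", 15))
```

To check the full record, pass the complete nucleotide and protein sequences above together with the stated CDS length of 2802.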