DPGLEAN22298 in OGS1.0

New model in OGS2.0  DPOGS203773
Genomic Position  scaffold8079:- 151-1851
CDS Length  1701
Paired RNAseq reads  600
Single RNAseq reads  1437
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013351 (2e-141)
Best Drosophila hit  CG13097 (4e-57)
Best Human hit  U3 small nucleolar ribonucleoprotein protein MPP10 (1e-58)
Best NR hit (blastp)  PREDICTED: similar to U3 small nucleolar ribonucleoprotein protein mpp10 [Nasonia vitripennis] (4e-105)
Best NR hit (blastx)  PREDICTED: similar to M-phase phosphoprotein 10 [Apis mellifera] (3e-105)
GeneOntology terms
GO:0003676 nucleic acid binding
GO:0005634 nucleus
GO:0030529 ribonucleoprotein complex
GO:0006364 rRNA processing
InterPro families
IPR007151 Mpp10 protein
IPR012173 U3 small nucleolar ribonucleoprotein complex, subunit Mpp10p
Orthology group  MCL12364

Nucleotide sequence:

ATGACGAGTGAAAAAATTGATGATATAATAGATAAATTCAGTGTGCTCACTGAGAAACCT
GTGAAGTTTTTAACAGTTCAAGATGATATACAGGGAGATATCAGGAACTTGGTGAAGTCC
ATCTATGATCTTACTAAGTCACAAGAAAGCAGCAATAAGAAGAAAAAGGCGTTATCGAGT
ATGATAGTGAGTGATTGCGACGAGGAACAGATTTGGCAACAGATAGAGTTGCAGAATTCA
GAGCGATGGGACGAACTGGTCTGGGATGTAGCTAATAGCGTGTCCAGTAAAAACGATCTC
ACGTTTCCTTTAGAGTTTCCAGAAGAGAAAGACGAAGAAGATATTAAAAATGAAGATGAT
GTTATGTCAGAAGAAGAACATCAAGAAGTAGAACAGTCTAATGTTAAAGTGGCCAAAACG
AAACCTAGTAAAAAACAGTCAATTGTTGATGACGATTTCTTTAAATTACAAGATATGGAA
AACTTTTTATTAAAAGAAGAAAAAATGGAGGGAAAAAACAAAAAGAGTGAAGACGACGAG
GACTCGATAGACATGTTTGAAGATATAGATAGTGAGGGATCTGACGAAGAAGGCGGCAAA
GATGTTAAATACTATGACTTTTTCAATGAAGACAATGAAAGTGGTCAGGAAGATAATGAC
GAGGAAGATGATGATGAACATGAAGAAAATAATGATTATAAGACCGAACCAAAAGAACAT
AAGAAAAGTGACAAAAAAGTCAGGTTTCTTGAGCCAGATTCTGAATCAGGTGATTCCGAG
GACAGTGAAGAACAAAAATCTAAGCACATCAATGGCAGTGATAAGGAAAATAAATCAGAA
TTTGAACTTCGGCAAGAGCGGTTGCAGAAGCATATTTCGAGGCTGGAAGAGAAATCCATA
AATGAAGCCCCGTGGCAACTGAAAGGTGAAGTCGATGCCATGAAAAGACCACAGAACTCG
TTACTTGAAGAAGTTCTCGATTTCGATCTCACAACCCGACCACCTCCCGTCATAACAGAA
CAAACAACAGTCACATTAGAAGGCCTCATCAGACAACGCATCAAAGACAAGGCTTGGGAC
GACGTAACCAGGAAAGAAAAGCCAGTAGATGATCAGTTTCTGTTCAAAAAGCCCGAAGTT
CTAGATCAGTCTAAGAGTAAAATGAGTTTAGCACAAGTCTATGAAGCAGAGTATCTTAAA
CAGAAACAAGCGTCGTCTGGTGAAGTTGATGATGAAAAGGAACCTGAAAGTCATACTGAA
ATTCGGGAAGCTATGAAAAATTTATTTTCCAAGCTCGATGCTCTGTGTCATTATCACTAC
ACACCAAAACCACCTCAAGCTGAAGTTAAAATTGTTAGTAACACTCCGGCCATATCCATG
GAAGAAGTGGCTCCAGTGGCAGTGAGTGATGCTACCCTATTAGCACCCGAAGAAGTTAAA
AGGAAAACAAAAGGAGATCTTATGAGCAAGGAAGAGAGAACGCAGACAGATAAAAACAGA
GAGAGGAGGAAAAAGAAGAAACTGCAAAGGAAAAAGGGTTCTGTTACGAAAGTCACAGAT
AATAGAAACACTAAGGCAGCCGTTGAAAGCAACGACAAGTCCCTCAAGACGTCCAAAGCT
TTCTTCCAACAGCTAAATGATAATTCAACTAGTCTCATTAAATCTAAAACTAAGAAGCTT
ATTAAAAATAAGGGACAATAA

Protein sequence:

MTSEKIDDIIDKFSVLTEKPVKFLTVQDDIQGDIRNLVKSIYDLTKSQESSNKKKKALSS
MIVSDCDEEQIWQQIELQNSERWDELVWDVANSVSSKNDLTFPLEFPEEKDEEDIKNEDD
VMSEEEHQEVEQSNVKVAKTKPSKKQSIVDDDFFKLQDMENFLLKEEKMEGKNKKSEDDE
DSIDMFEDIDSEGSDEEGGKDVKYYDFFNEDNESGQEDNDEEDDDEHEENNDYKTEPKEH
KKSDKKVRFLEPDSESGDSEDSEEQKSKHINGSDKENKSEFELRQERLQKHISRLEEKSI
NEAPWQLKGEVDAMKRPQNSLLEEVLDFDLTTRPPPVITEQTTVTLEGLIRQRIKDKAWD
DVTRKEKPVDDQFLFKKPEVLDQSKSKMSLAQVYEAEYLKQKQASSGEVDDEKEPESHTE
IREAMKNLFSKLDALCHYHYTPKPPQAEVKIVSNTPAISMEEVAPVAVSDATLLAPEEVK
RKTKGDLMSKEERTQTDKNRERRKKKKLQRKKGSVTKVTDNRNTKAAVESNDKSLKTSKA
FFQQLNDNSTSLIKSKTKKLIKNKGQ
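
Consistency check (illustrative):

The coordinate, CDS length, and protein fields in this record can be cross-checked with simple arithmetic. The sketch below is not part of the database record. It assumes the genomic coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon (the nucleotide sequence above ends in TAA). The function and variable names are hypothetical, chosen only for this example.

```python
# Values copied from the record above; names are illustrative only.
start, end = 151, 1851   # Genomic Position on scaffold8079 (minus strand)
cds_length = 1701        # CDS Length field
protein_length = 566     # residues in the protein sequence listed above


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_len: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon."""
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len // 3 - 1  # drop the terminal stop codon


# Does the genomic span match the CDS length (as for an uninterrupted CDS)?
print(span_length(start, end) == cds_length)
# Does the CDS length account for the listed protein length?
print(expected_protein_length(cds_length) == protein_length)
```

Under these assumptions both checks agree with the record: the span 151-1851 covers 1701 positions, and 1701 nucleotides correspond to 567 codons, that is 566 residues plus the stop codon.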