DPGLEAN04297 in OGS1.0

New model in OGS2.0DPOGS207618 
Genomic Positionscaffold240:+ 222853-226565
See gene structure
CDS Length2100
Paired RNAseq reads  446
Single RNAseq reads  1076
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006359 (0.0)
Best Drosophila hit  CG9799 (6e-175)
Best Human hitWD repeat-containing protein 36 (2e-167)
Best NR hit (blastp)  PREDICTED: similar to wd-repeat protein [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to wd-repeat protein [Tribolium castaneum] (0.0)
GeneOntology terms
  
GO:0006364 rRNA processing
GO:0032040 small-subunit processome
InterPro families






  
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR011047 Quinonprotein alcohol dehydrogenase-like
IPR001680 WD40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR007319 Small-subunit processome, Utp21
IPR019781 WD40 repeat, subgroup
Orthology groupMCL14213

Nucleotide sequence:

ATGACTGGTGATAGTTTTCACGTTTATACAGCCAGCGAAAACGATATATACGCATGGAGA
CGAGGCTGCGAGCTAAAGCACGTTTACAAGGGACACCAGGCACCGATACACCAGCTATTA
CCATTTGGAGTTCATCTCATATCAATAGATAAAGATAATGTCCTTAAAATATTTGACATT
AAAGAGGGATCAGAGTTTCTCGATCTCAAGTTCGATGAAACTCATTTCAAAATTACAACT
TTATGTCATCCACCCACTTATCTTAATAAAATATTACTTGGCAGTAAACAGGGCCAACTC
CAGATATGGAATATTAGAACTTCAAAATTGGTGTATACATTTAAAGGTTGGGACTCACCT
GTGACAGTTACAGAAGCTGCTCCAGCAATTGATGTTGTAGCTATTGCTTTGGGTAATGGA
AAAATTATTCTTCATAATCTCCGTTATGATCAAGAGGTAATGGAGTTTATTCATGATTGG
GGCAGAGTTAGTTGTTTGTCATTTAGAATGGATGGAGTGCCCATAATGGTAACAGGAAGT
ACACAAGGACATTTAGTTATGTGGGATTTAGAAGAGAAAAGAGTGAAGTCACAGATACAG
TCAGCTCATTTTGCTAAAATAGCTGGTTTACAATGTTTAAATTCTGAACCACTAATGGTT
ACCAATTCCCAAGATAATTCATTAAAAATGTGGATTTTTGATATGCCAGATGGAGGGGCT
AGACTTTTGAAGAAAAGGGAAGGTCATTCTTTACCTCCAACGATAGTGCGCTACTGTGAG
CCAACTGGTGGAAACATTCTTGCAGCAGGCAGTGATAGCAGTCTTCATATTATGAATACA
GTAACAGAAACTTTTAACAAAAGCATGGGTAAAGCCTCATACAACAGGAAAGCATCCAAA
AAGAAAAAAAGATATCAGATAGATACAAAAATTCTTCCACAAATAACTAATATAAGCTCC
TGTATGCAAAGGGATAAGCAATGGGACAGTATTGCAACATTGCATGAAGGAAAGTACTTG
GCTACTACTTGGTCATATAATAGAATGTGTATGGGAACACACAAATTAAAGCCACCTGAT
ATGGAAAAAAGTACTCTGTCAACCTGCTTGACGGTAACACATTGTGGCAATTTTGTTATT
ATTGGTTATAGTAATGGACAAGTGCATAAGTTTAATATGCAGTCAGGCCTTTACCGAGGC
CATTACGGCAAAGAAAACAAACAGGCCCACAAAGGAGCACTGAGAGGCGTAGAAACAGAT
ATCTGTAATCAAAGGCTCATTACTGTTGGTGCTGACGATAAACTTAAATTCTGGCATTTT
AAAACTGCTACCACCCCATATCATGTACTGAGATTGGATGAATCTGTGAGTATGACAAAA
TGCCACAGGGAAAGTGGTTTGCTGGCGTTAGCAAATGAAGATTTTACAATTACACTGGTC
GATATAGACACCATGAGAGTTGTTAGAAACTTCGAAGGTCATGTTGGTAAAATAAACGAC
ATTGATTTTGATTGTCAAAGCAGATGGTTAGTGTCATCATCTATGGATTGTACAATTTGT
ACTTGGGATATACCAACTTCACAACTGGTTGATATATTTTCTGTTGAACAGCCATGTACA
TCTCTAACTATGTCACCAACCGGTGATTATCTGGCGACGTCCCATGTGGGTGAGCTTGGG
ATCTGTCTTTGGGCCAACAGATTGTTGTATAGCAAAGTCTTCCTCAAGCCCGTTGATAGA
AATGATGTGCCGCGATTGAAACTACCAACTACTGCAGCCGAGAAACCTGATATAGATGAT
ATAGGAACAATTGATTTGGGCGATGACGAATATAAATCACCGGAACAAATCAGCGAGGAA
CTTTTAACACTATCTGGCCAGCCTACATCAAGATGGCTGAATTTGCTCAATTTGGACGTA
ATAAAACGTAGGAATAAACCCAAAACGCCTTTGACGGTTCCCAAATCGGCGCCATTCTTT
CTCCCAACAATCCCAAGTCTTGACCTTGAATTCGATTTAGAAAAGGAAAAGGCGGGAAAC
ACGAAAAAGTTGCTCATACCGGATACATTGTCAACTTTAACGCCATTTGCAAAAAATTGA

Protein sequence:

MTGDSFHVYTASENDIYAWRRGCELKHVYKGHQAPIHQLLPFGVHLISIDKDNVLKIFDI
KEGSEFLDLKFDETHFKITTLCHPPTYLNKILLGSKQGQLQIWNIRTSKLVYTFKGWDSP
VTVTEAAPAIDVVAIALGNGKIILHNLRYDQEVMEFIHDWGRVSCLSFRMDGVPIMVTGS
TQGHLVMWDLEEKRVKSQIQSAHFAKIAGLQCLNSEPLMVTNSQDNSLKMWIFDMPDGGA
RLLKKREGHSLPPTIVRYCEPTGGNILAAGSDSSLHIMNTVTETFNKSMGKASYNRKASK
KKKRYQIDTKILPQITNISSCMQRDKQWDSIATLHEGKYLATTWSYNRMCMGTHKLKPPD
MEKSTLSTCLTVTHCGNFVIIGYSNGQVHKFNMQSGLYRGHYGKENKQAHKGALRGVETD
ICNQRLITVGADDKLKFWHFKTATTPYHVLRLDESVSMTKCHRESGLLALANEDFTITLV
DIDTMRVVRNFEGHVGKINDIDFDCQSRWLVSSSMDCTICTWDIPTSQLVDIFSVEQPCT
SLTMSPTGDYLATSHVGELGICLWANRLLYSKVFLKPVDRNDVPRLKLPTTAAEKPDIDD
IGTIDLGDDEYKSPEQISEELLTLSGQPTSRWLNLLNLDVIKRRNKPKTPLTVPKSAPFF
LPTIPSLDLEFDLEKEKAGNTKKLLIPDTLSTLTPFAKN