DPGLEAN04451 in OGS1.0

New model in OGS2.0DPOGS200680 
Genomic Positionscaffold364:- 60220-62326
See gene structure
CDS Length1785
Paired RNAseq reads  683
Single RNAseq reads  1498
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005552 (2e-110)
Best Drosophila hit  CG12301 (1e-12)
Best Human hitU3 small nucleolar RNA-associated protein 14 homolog A isoform 1 (1e-14)
Best NR hit (blastp)  PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum] (8e-106)
Best NR hit (blastx)  PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum] (1e-69)
GeneOntology terms


  
GO:0005730 nucleolus
GO:0005634 nucleus
GO:0006364 rRNA processing
GO:0032040 small-subunit processome
InterPro families  IPR006709 Small-subunit processome, Utp14
Orthology groupMCL11683

Nucleotide sequence:

ATGAGTTATGAAGAAATGGTTGAACACAGACAGCATCTTGCTAAGTTTAGAGCTCAACAG
TCTTATAGGGCTGCCAAGGCTAAGAGACAAAGCAAAATTAAAAGTAAAAAATATCATCGT
ATACTTAAGAAAGAAAAATTAAAACAACAGTTAAAGGAATTTGAGGAATTACAAGCAACT
AATCCAGAAGAGGCATTGAAAAAACTTGAGGAGTTAGAAAAAGCACGAGCTTTAGAAAGA
CATACCTTGAGACATAAAAACACTGGAAAATGGGCTAAGAACAAATTAGTTAGAGCAAAA
TATGATAAAGAGGTGAGGCAGCAATTAGCTGAACAACTGTCCGTAAGCAGAGGTCTTACA
CAGAAAACACAAAATGTTGAAAGTTCAGATGATGAGCCAGATGAAAGTGAAAACATCTGT
GACATTCAAATATCACAGGATCCCATGAATCCATGGATGTTAAAGAAATCCGACAAGAGT
AATATAGATGCCGAATTCAATTTCGGCTATAAAAAATATTTAAAAGACAAAATGTATAAA
TGCAAAGAGCAAAGTGACTCGGAAGAAGATGAAGCTGACCAAAATAGAGACAACGACATG
AGTTCTCTGAAAATGCTGGCAGAGAGCTTGAAAAAATTAAACAATGGTGAAAGTACAAAT
GTGCTTGAAAACAAAACTCAAGACGATGTCACTTTAATGGATGTTACCAGTGAAGAAAAT
AAATCACTAACAGTTCTTAATAAAAATATTACACAGAAACAGAAAAGTAATAATACTGGC
AGAAAAAACAAAAGAAAGATAGTCTCAACCTCAGACTGGCTTGTGGAAGAAATAAATCCC
AAAAATGCAACAACTGAAGAAGATATAAATACTGCATTTGATGATTATGAAGACAAAGTA
GCATTAAAGGTTGCTAAAAAACTTAAGGGTTTGAAACATGAGTTAAAAAATTTAGAATCA
TCATCTATCAAGCCAAATAAAAAGACAAATGAAACTAGTAAAGAAATTGACAACCTTGAA
TATTTAAAAATTAAAAAGCAGAAACAGATGCCTATAATTGATGAACCTCTTATAGAATCA
AATAAAAATATTGATGACATACCCGAGCAGACAAAACACTTATTAGACACATTAAAAGAT
ACAATAACAAGCACAAGCCAAAACGTAAACACAGATATTGATCCAAGTAGGTTTATTGAA
GTTAAACCAAAGTATTTGAATACAGCTGTGACAAATTCCGAGAATAATTTCGACGACTTA
GATGATGAAGAACAAGTGGTACCCAAAGTGGATATTGAAGAAGTTTTTGAAGAAGACGAT
GTGGTGACTAGTTTCAGACAAGAGAAGGAGGACGAAATTAATAAGAATAATCCAGAAGAA
CTAAGTTTAACACTCCCTGGATGGGGAGGCTGGGCCGGTAAAGGTGTGAAAGCACCTAAA
CGAAAGAAAAATAGATTTATTACAAAGAAACCACCAAAAACACTCAGAAGAGATGAAAAC
AAAGGTGATGTAATCATTAACGAGTCTAAAAATCCCAAGCTTGCTATACATAAAGTTTCA
GATTTACCACATCCATTCAACAGTGTGAAAGAATATGAGGAAAGTATAAGAACGCCTCTA
GGTAACACATTTGTGCCTGAAACAGCTCATAAGAAACTTATAAAACCTAATGTTATCACA
AGATCTGGAACAATCATTGAACCGATGGATGAAGAAGAACTGCTTGTGCCAAGAAATCGT
AACTTTAAAAATAAGTCTGTTATTAAGATTCTAGGCAAGCAATAA

Protein sequence:

MSYEEMVEHRQHLAKFRAQQSYRAAKAKRQSKIKSKKYHRILKKEKLKQQLKEFEELQAT
NPEEALKKLEELEKARALERHTLRHKNTGKWAKNKLVRAKYDKEVRQQLAEQLSVSRGLT
QKTQNVESSDDEPDESENICDIQISQDPMNPWMLKKSDKSNIDAEFNFGYKKYLKDKMYK
CKEQSDSEEDEADQNRDNDMSSLKMLAESLKKLNNGESTNVLENKTQDDVTLMDVTSEEN
KSLTVLNKNITQKQKSNNTGRKNKRKIVSTSDWLVEEINPKNATTEEDINTAFDDYEDKV
ALKVAKKLKGLKHELKNLESSSIKPNKKTNETSKEIDNLEYLKIKKQKQMPIIDEPLIES
NKNIDDIPEQTKHLLDTLKDTITSTSQNVNTDIDPSRFIEVKPKYLNTAVTNSENNFDDL
DDEEQVVPKVDIEEVFEEDDVVTSFRQEKEDEINKNNPEELSLTLPGWGGWAGKGVKAPK
RKKNRFITKKPPKTLRRDENKGDVIINESKNPKLAIHKVSDLPHPFNSVKEYEESIRTPL
GNTFVPETAHKKLIKPNVITRSGTIIEPMDEEELLVPRNRNFKNKSVIKILGKQ