DPGLEAN20657 in OGS1.0

New model in OGS2.0DPOGS207243 
Genomic Positionscaffold55:- 120927-124280
See gene structure
CDS Length1704
Paired RNAseq reads  385
Single RNAseq reads  1010
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012059 (3e-51)
Best Drosophila hit  CG7246 (2e-39)
Best Human hitU3 small nucleolar RNA-associated protein 6 homolog (1e-40)
Best NR hit (blastp)  PREDICTED: similar to RIKEN cDNA 4732497O03 [Apis mellifera] (3e-73)
Best NR hit (blastx)  PREDICTED: similar to RIKEN cDNA 4732497O03 [Apis mellifera] (5e-77)
GeneOntology terms



  
GO:0005634 nucleus
GO:0006364 rRNA processing
GO:0006396 RNA processing
GO:0005622 intracellular
GO:0005515 protein binding
InterPro families
  
IPR013949 U3 small nucleolar RNA-associated protein 6
IPR003107 RNA-processing protein, HAT helix
Orthology groupMCL12556

Nucleotide sequence:

ATGGCTGAACAAGTAAATCAACGTATAGAAGGTATGATAAATGAGTTGGAGCAAATGCGT
AGAACTAATCTTTACGAGGACGATGAAATAAGAGAAATTTCCCGTAAACGAAAGGAGTTT
GAATATAAAATACAACGAAGAATAAAAGAGAAAAGCGACTTTGTTCAATACATTGCATTT
GAATTGGCTCTTCTAGAAGATATTTCTCTTAGAAGGAAACAAGCAAAGTTGGGGGAAAAG
AAGAAAGATATTGAATATGCCATTGCTAAGAGACTAAATAAAGTTTTCAAACAATTTATA
TTTCGTTTTCAAAATGACATAGCTATTTACTTTGAGTACATAAAATTCTGTCAAGCTGTT
GGATTTGATTATGCGGTATCTGCTATTATTGACCAAATGTTGAGAGTACATGGTGATAAG
CCCAAAACATGGCAGTTGGCAAGCAAATGGGAAAGCAAGGAACAGAACAATCTAGAAAAT
GCTAGAAATTTTTTACTTAAAGGCATTCACAGACATCCCAATTCTGATATATTATACTTA
GATCTTTTTGATATCGAACTTATGATTGCTTTTAAAACTGAAGATGAAACAGAAAAAGCA
AAAAATTTCAAAAGGGCCGATGTTGTATGGAGAAATGGAATGAAGAACATTCCGGATGTG
AATTATTTATTTAAATTATGTGATATATCATTGAGGTATGGTGTTAACGAGGACATATCC
AATTCTATAAAGCAAGAAATATGGAACAGAAGATCCGAAAAGCGAGTTTGGTCATACATT
GCTTCTAAAGAATTGGAGGGATATCACTGGAAAGATATTGAGGAGTATGTGAGTGAAGAG
TTCAGTTATTCAAAAGAATTGAACTATTATATAGCTGTGTATGAGGAAGCTTTAATGCAG
TTTCCAGATGAAAATCTATCAACTATGTACATTCATGGGTTACTCGGTTTAAAGGATAAT
TTGTGTACAGATTTACAAAAAATTTGTGCCGTCAAACAAGCATGGTTCTTCAGTCACGAG
AACGGGTTGCTGAGTAATGATATGTATGTTTTTGGTATAAAAATGTTAAAGTTAGAAGGC
GAGATTACTGAAACTCAATTAACTGAGGTTCTGGATACAGCATTGGCAAAAAATCCGCTT
TATAGATATTTATGGGAAGAGAAAATTTTATTACACAAAAATGATGAAGATGTAATTCTA
AAAACACTAAAAGATGCCACAAAAATTTTAAAAACTGATGATGTTAGATGTCTTTGGAAT
TTTGTTTTTGATAATATTGAGTCACATATGGTTTTTAAGAATTGTTATTCTAAGCTACAG
TCATGTGAAAGTGTTGTATTTATGACTCTCAAACCCACACTCTTAAAGAAGATGTATGAA
CACAATGGATTGAAGGCGGCACGGCAAGTTTACGAAGAGTGTATAAGAACCCCTCCCACA
CAAGAAGAAGTACATAGCATAATGATTGACATTGAAATGAATCAAGAAAAACCATCACTC
AAGAATGTAAGAAAATGCTATGAGGCCCTAGTTCAACATCATGGCAAGAGTAATATTAAA
GTTTGGATGGACTACATAAACTTTGAACAAAAGTATGGAAATGCCCAAGCTGTGCCTTCC
ATTCATAGGAGGGCTATAGGAATGTTGGATAAAAATTCTGTCGATGATTTTATCAAAGCT
CAAACGCTAGCAAAATTAAATTAA

Protein sequence:

MAEQVNQRIEGMINELEQMRRTNLYEDDEIREISRKRKEFEYKIQRRIKEKSDFVQYIAF
ELALLEDISLRRKQAKLGEKKKDIEYAIAKRLNKVFKQFIFRFQNDIAIYFEYIKFCQAV
GFDYAVSAIIDQMLRVHGDKPKTWQLASKWESKEQNNLENARNFLLKGIHRHPNSDILYL
DLFDIELMIAFKTEDETEKAKNFKRADVVWRNGMKNIPDVNYLFKLCDISLRYGVNEDIS
NSIKQEIWNRRSEKRVWSYIASKELEGYHWKDIEEYVSEEFSYSKELNYYIAVYEEALMQ
FPDENLSTMYIHGLLGLKDNLCTDLQKICAVKQAWFFSHENGLLSNDMYVFGIKMLKLEG
EITETQLTEVLDTALAKNPLYRYLWEEKILLHKNDEDVILKTLKDATKILKTDDVRCLWN
FVFDNIESHMVFKNCYSKLQSCESVVFMTLKPTLLKKMYEHNGLKAARQVYEECIRTPPT
QEEVHSIMIDIEMNQEKPSLKNVRKCYEALVQHHGKSNIKVWMDYINFEQKYGNAQAVPS
IHRRAIGMLDKNSVDDFIKAQTLAKLN