DPGLEAN11749 in OGS1.0

New model in OGS2.0DPOGS206701 
Genomic Positionscaffold2021:+ 32144-34911
See gene structure
CDS Length1278
Paired RNAseq reads  477
Single RNAseq reads  1143
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008533 (2e-154)
Best Drosophila hit  CG9226 (6e-75)
Best Human hittelomerase Cajal body protein 1 isoform 4 (3e-56)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL008174 [Aedes aegypti] (3e-99)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL008174 [Aedes aegypti] (3e-99)
GeneOntology terms
  
GO:0019001 guanyl nucleotide binding
GO:0034512 box C/D snoRNA binding
InterPro families





  
IPR019775 WD40 repeat, conserved site
IPR019781 WD40 repeat, subgroup
IPR001680 WD40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR011046 WD40 repeat-like-containing domain
Orthology groupMCL14716

Nucleotide sequence:

ATGGAAGAAAACTTTATATTAGAGAATAATGACGAAAACTGTGAAACTTCTCACATGGAT
ACAGAAGAAACAGATATTTTTACGTATCCTACTCTGTTTAGTAGCAAATCCTTAGTAGAA
TTGTGCAACTCTTCTTGGTCGAGATCTCAAAACGCGAAACAGAATGTACAGCCCTATTTA
AGAGGATGTAAATGGTCTCCAGACGGTACTTGCTGTCTCACAGTGGTTAATAATGACGGG
GTTCACGTAACAGAGCTACCAAGAGATCTTTATTCTGGATCCATATCCCCTGATCGAACA
ATTAATATATTGGATTCTGTTATTCACGTCAAAGAGGCAGGTCTTGTCTATGATTTCTGT
TGGTATCCTGGTATGAACAGTAGCATACCTGAGACTTGCTGTTGGTTGACCACTCGTCAG
AATGCACCACTGCAATTTTGGGATGCTTTTGACGGTTCTTTGAGATGTTCATACAGAGGC
TTCAATGCAGTGGATGAAATGGAGCCAGCACTCACGGTCACATTTAATAGTGAGGGAGAT
AGAATTGTAGCTGGATATAAGAAATATTTGAGAACTTTTGACGTCGAAAGACCAGGAAGA
GATTTTGCTGAGCATAAGATCAATTCACCGGCTTCTTGTTTTGCTACACATGATAATCTA
TTAGCTATGGGCTCATGGAATACAACTATAACTTTATACAATACCAGTGAATTTGGAACA
TATAAGAGTATTGGGAAAATGCATGGCCACTCAGGGGGCGTCACTCACTTGAAGTTTACT
CAAGATGGTCAAAAATTAGTGTCGGGAGCGAGAAAGGATCACAGGCTACTCATTTGGGAT
ATTCGTTATTATCAAAGGCCGCTGAATGTATTAAGTAGAGTCGTTGACACAAACCAAAGG
ATATATTTTGATATATCACCATGCGGTAAATATTTGGTTACCGGAGGTACGGATGGTGTG
ATAAAAGTATGGGATGCGGATAACATTGATTGGATTAATAGATTAGATGCTACCGATGAC
AAAGATAATGCTACATACAGGTTTCCATTGCATAAAGACTGCTGCAATAGCCTGTCAATA
CATCCGTTAAGACCAATATTAGCTACCGGCTCCGGTCAATATCATTTCGAGGATCCTGCC
CAGGATTTGGAAGAGAATTTCGGAGTACAGGAAGACATTACTGAGACTGATAAAGGGTTA
AGAAATGATAAATACTCAAAAATAGCTGAAAATAGCTTAGGGTTTTGGTGGATCGGGGAT
ATTCCTCAAGTAACTTAG

Protein sequence:

MEENFILENNDENCETSHMDTEETDIFTYPTLFSSKSLVELCNSSWSRSQNAKQNVQPYL
RGCKWSPDGTCCLTVVNNDGVHVTELPRDLYSGSISPDRTINILDSVIHVKEAGLVYDFC
WYPGMNSSIPETCCWLTTRQNAPLQFWDAFDGSLRCSYRGFNAVDEMEPALTVTFNSEGD
RIVAGYKKYLRTFDVERPGRDFAEHKINSPASCFATHDNLLAMGSWNTTITLYNTSEFGT
YKSIGKMHGHSGGVTHLKFTQDGQKLVSGARKDHRLLIWDIRYYQRPLNVLSRVVDTNQR
IYFDISPCGKYLVTGGTDGVIKVWDADNIDWINRLDATDDKDNATYRFPLHKDCCNSLSI
HPLRPILATGSGQYHFEDPAQDLEENFGVQEDITETDKGLRNDKYSKIAENSLGFWWIGD
IPQVT