DPGLEAN12844 in OGS1.0

New model in OGS2.0  DPOGS215948
Genomic Position  scaffold2568:+ 6209-10953
CDS Length  2736
Paired RNAseq reads  677
Single RNAseq reads  1860
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001854 (7e-78)
Best Drosophila hit  CG8414 (5e-38)
Best Human hit  nucleolar protein 9 (2e-23)
Best NR hit (blastp)  GI20408 [Drosophila mojavensis] (2e-44)
Best NR hit (blastx)  PREDICTED: similar to GA21059-PA [Nasonia vitripennis] (2e-43)
GeneOntology terms
GO:0005634 nucleus
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families  IPR010655 Pre-mRNA cleavage complex II Clp1
Orthology group  MCL14630

Nucleotide sequence:

ATGGAATTTTTCGAAAAAGCACACGTTGCCTGCAGCAATGCAACATCTTTAAATAATAAG
TCAAAAAAAAATGTTAAGAAACAACTGAAACAAATGTTACATGGCTATAAACAAAGTGAT
ATTCATCTTATAAAGTCATCGGCTCTCGACAAAACAGAGTATGATAGTAATTCTACATCA
GTTAGTGGATATTCAGACTTAAATCTGACCGATTCTGGCGACGATGAACATTCAACTCGG
AAGGAGTTGGTAGAGAGTGTAACAGTAGACGAAAATGATTCCTTGGATAGAAGTGCCAGT
AATACTATACATAAAGTTACTTCATATAAGTCTTCTGAAATTAGTGCTAGCAAGGATAAA
AATTCTATTACTGAAGAATCCGATGATGTCTTTTCACTAGTAGACAGTGAGGCATCTAAT
GCAAATAGTCTGATTAAGTCAGAAAGTGAATCAAACAGTTCAGAACCATTTGACCCAGAT
GGTTTACTGGCTGCAAAAATTATTCAAAGACTTCAAATCAACAATGAAAAGAAAAAAAAC
CTCAAAAGAAAACATGATATTGAATCTAAGAAGCATAAAGTCTCAAAAGTCATTGAAAGT
AAAAGTAAATCTGTAAAAGGTCCCGTCATTCCATCAGTATCTGAAGAAGTTCAACAATTC
TCCGATGACGACTCATCTGTCTGTGCCATAGCTGTAGAAAAAGATTCCTCTGAATTTTCT
CCTCCATATGTTTCTATTTCAGCCGTTGATGTACCACTTCTAAGTTTCACAGATGTTTTA
GGAGATTACAAACATACAACTATAAAAAACATACCATCCCAACAAAAATCTGTAACAGTG
GATGAGTATTCAAATTTTATTGTAGATAAATCTTTGAGCGTTGACGCTAAAGTTGAGACA
GATATTGGAAGTTTGACTCCTGAACCTTCATCGGATAACGAGATTGGTGAAATAGAGGAA
ATAATCAATCTTGACTCAACAACAGAAACGATAGAAACGGATACTCCTATACCAACTAAT
GTACCTTTTGATGATACCATGCAATCGGAACCGTTGAGCGATAAAGAATCTTCAACGGAC
GAGGATTACTACAAGGTTATAGATACATTCAAAATATATTATGGCAACAAATGCTGCATC
ATCATATTGAAACATCCAACAGAATTGTTTGTTCAAGGAAAAGTTGCAATAAAAGTTTTA
GACGGCTCAATTGATATATTCGGTTACACCCCAAAGGACGATACTTGTAAATTATACGCT
CCCTTTTACAGCTACGCTCACAGTATAAAAACTACCGAAGAACCAAATGATTATTATGGG
TTGTTCGGGAAATTAACTGAAGCCGGTCTCTCGGTCGCAGAGGCTGAAGAAATAGTGATT
ACTATTGGGGAACACGATGGAGTCGTACTTTTAAAGCCGCTTGTGAACCAGTGTTTAGAT
TTTGTAGAAAATAATTTTAAAATAACAAACTTGTTTGTAAGATCTGTGAAAAACATTGAA
CCCTTCTTTACAAAAGCAACTGATATTTTAAATTGTTCTCTATTCTCAATAAGGCCAGCT
AGATGCTTCAAAGTCCATCCCAGTTGGAAAGAGGCACTTAAATATTCTCAGGAAAAACAT
AGCCGTGGGATTATTTGTGGCGGCAAAGGTGCTGGAAAATCTACATATTTGAGATACCAA
GTAAATAAATTAATCTCTCAAGGACCAGTTTTAGTTGTGGATCTTGACCCTGGGCAGTCC
GAGTTCACAGTGGCCGGTGGTATATCAGCTACTACCGTGTCCGAACCACTATTAGGGCCA
AGTTTCACACATCTAAAGAAGCCAGACATAATGTTTAATATCGGCATGATAAACACAATG
GACAATGCGAGGCGTTATGTTGCAGCACTGCAGCAATTGTTATCACACTGCCGCAATCAC
AAACCGTACTCCGAAATGCCGTGGATAGTCAACACAATGGGGATGACAAACTTCCTAGGG
CTCAAGTTCATAACACTTATCGTAATATTAACGCAGCCAACGTATCTTCTGCAATACGAA
TCCAAGAATTCTAAACGAAGATTTGAAAGCTTCTTAAGACCTTCCAACGTAAAACTTGTA
TTCCAAGATAATGAAAGCGATCCCTTGTTCAGCAACATCACCTTCCCGGAGCAGTTGAAT
TATAAGTTTGTAGTGGCAGACGAGGCGGATAGCTTCTTGAAAAATGGCTACTCTTTATCA
CCTAGAGACGAAAGATACCTAAATTTCTTGGCCTATTTTGGTCAATTACTAACTGTCCAC
AAACTAAAGAGTTTGCTTGAAATAACGCCTTATCAGGTGAACTTGAAAGATATCAACGTC
GCTACCAACGTGATCGTAATGAAGGAACGTATTACAAAAGTTATCAATGGACAAATCGTG
GCACTGTGTCAGCTGTTGAGACAATGCGACAATAGAGTTTTTACATTAGACGATAAACCA
TTCGTATGTTATGGATACGGTATTGTCCGAGGAGTTGATTGGGATAAGGAGGTTCTATAT
ATAATTACACCATTGGAAGGCGATTTCTTAGCATGTGTGGATACTTTGGTGTATGCAGAC
TGGAGTCCCGAGTTAGTGGGGCTAGAGACATGTCTACCGAATGGCACAAGCATCCCCTAC
CGCACCTACACAAGAAACAAGCATATACAGCTCATGTCCACACCAAAGAGGAGATTCAAT
CCACTTCAGCTTATAAAGATGACAAGGAATGCCTAA

Protein sequence:

MEFFEKAHVACSNATSLNNKSKKNVKKQLKQMLHGYKQSDIHLIKSSALDKTEYDSNSTS
VSGYSDLNLTDSGDDEHSTRKELVESVTVDENDSLDRSASNTIHKVTSYKSSEISASKDK
NSITEESDDVFSLVDSEASNANSLIKSESESNSSEPFDPDGLLAAKIIQRLQINNEKKKN
LKRKHDIESKKHKVSKVIESKSKSVKGPVIPSVSEEVQQFSDDDSSVCAIAVEKDSSEFS
PPYVSISAVDVPLLSFTDVLGDYKHTTIKNIPSQQKSVTVDEYSNFIVDKSLSVDAKVET
DIGSLTPEPSSDNEIGEIEEIINLDSTTETIETDTPIPTNVPFDDTMQSEPLSDKESSTD
EDYYKVIDTFKIYYGNKCCIIILKHPTELFVQGKVAIKVLDGSIDIFGYTPKDDTCKLYA
PFYSYAHSIKTTEEPNDYYGLFGKLTEAGLSVAEAEEIVITIGEHDGVVLLKPLVNQCLD
FVENNFKITNLFVRSVKNIEPFFTKATDILNCSLFSIRPARCFKVHPSWKEALKYSQEKH
SRGIICGGKGAGKSTYLRYQVNKLISQGPVLVVDLDPGQSEFTVAGGISATTVSEPLLGP
SFTHLKKPDIMFNIGMINTMDNARRYVAALQQLLSHCRNHKPYSEMPWIVNTMGMTNFLG
LKFITLIVILTQPTYLLQYESKNSKRRFESFLRPSNVKLVFQDNESDPLFSNITFPEQLN
YKFVVADEADSFLKNGYSLSPRDERYLNFLAYFGQLLTVHKLKSLLEITPYQVNLKDINV
ATNVIVMKERITKVINGQIVALCQLLRQCDNRVFTLDDKPFVCYGYGIVRGVDWDKEVLY
IITPLEGDFLACVDTLVYADWSPELVGLETCLPNGTSIPYRTYTRNKHIQLMSTPKRRFN
PLQLIKMTRNA
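
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below assumes the standard genetic code (NCBI translation table 1); the record does not state which table was used. The helper name `translate` is hypothetical. It translates only the first and last lines of the nucleotide sequence, which both start on a codon boundary because each full line holds 60 bases.

```python
# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAATTTTTCGAAAAAGCACACGTTGCCTGCAGCAATGCAACATCTTTAAATAATAAG"
last_line = "CCACTTCAGCTTATAAAGATGACAAGGAATGCCTAA"

print(translate(first_line))  # MEFFEKAHVACSNATSLNNK
print(translate(last_line))   # PLQLIKMTRNA*

# Length bookkeeping from the record: 2736 nt = 912 codons = 911 residues + 1 stop.
cds_length = 2736
protein_length = 911
assert cds_length // 3 - 1 == protein_length
```

Both translated fragments agree with the start and end of the protein sequence listed above, and the CDS length of 2736 corresponds to 911 residues plus a terminal stop codon.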