DPGLEAN10898 in OGS1.0

New model in OGS2.0DPOGS206136 
Genomic Positionscaffold4:+ 349760-358325
See gene structure
CDS Length2211
Paired RNAseq reads  624
Single RNAseq reads  1935
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000727 (2e-96)
Best Drosophila hit  CG32369, isoform A (4e-51)
Best Human hitLON peptidase N-terminal domain and RING finger protein 2 (3e-48)
Best NR hit (blastp)  GI12194 [Drosophila mojavensis] (1e-58)
Best NR hit (blastx)  GI12194 [Drosophila mojavensis] (2e-56)
GeneOntology terms


  
GO:0006508 proteolysis
GO:0005515 protein binding
GO:0008270 zinc ion binding
GO:0004176 ATP-dependent peptidase activity
InterPro families




  
IPR017907 Zinc finger, RING-type, conserved site
IPR015947 Pseudouridine synthase/archaeosine transglycosylase-like
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR001841 Zinc finger, RING-type
IPR003111 Peptidase S16, lon N-terminal
IPR018957 Zinc finger, C3HC4 RING-type
Orthology groupMCL10790

Nucleotide sequence:

ATGGATTCTAATGTGCATTCTGCTAGAACTAGAGGTCAACGTAGAAGAGTGTCATCAAGT
CAAAGATTACGACCATATGGAATGACAGATCTTAATGTGTATATTGATACGAATGTTCAC
TATAATCTCGATATTGATGAAGGAAGTGTTGCCAATAGGACCGAGGAGCCTTGTAGGGTA
TTGATTAAAAGAAACAACCAAGATTTCAACCATGGATTGGACGAAAGTCAGCGTGGTGAT
GATAGGATGGGTTCACCGACTCCAAACGCCAGCATAACTGCTGGTCATCTCCAGATAATA
GCCGAGGATGAGAATATCGTTCTGGTTGCGTCCCTGGCAAATACTGACACCGATTCTATC
GGTACTTTATCACCGACCCTATTACAATTACATCCCACGCACATCCAATCGAACAGTCCA
ACAGAAAATGGCGAGCAGTCTCTTAATTCTGATCAAATTTGCACTGCACCAGAACAGGAC
GCTGCTATGAATCCGTTATTCGACGAGGCTATGTACCAGGAGCCATCGGAGACGGAAGTT
AAGAAGGAAACCGTCGCTCAAACGGCACAAGTTAAAAGCAACGGTCAAACTCTCACAACG
AACTTATTCCTGAATCCTCATTCGCTGGTGAGGGAATTGCCGATGCCTAGCGTGAACGCC
CACAAGTTAGCCAACAACCTGATCAAACTGTCCAGATATCTGAAGTCGCCAGCTCCCAGC
GATTTATGCTGTCTAAACTGTTGTCGCATCCCGGTTCTGCCCGTCACCGGCCAATGCGGC
CACACTAGATGCATGAGGTGCATCGTCGTCAATGGGACATGCCCGTGCGGCGTGAACGCT
CCTAAGACGCTATTCGTAAACACGGTCATTAGAGAGATAATTGAAAAAATGATAAAATAC
ATAAGAAGCCCGAGAATACTAGATCCTGGGTCGCCTAGGAAAACCTGCGAGAAGAAATTC
CCGCTTATTAGAGCGAGACGTCGTTATTATCGACGCGGACAGAGCTTCTCTAGAGGCGCT
CCTCTCAGTAACTCTCCGACCTGGTGTTTCTCAAGACCCCGAGTGCCTTTGACCGTCCAG
GCGCGGTTCAAACGTGCTCGGGCGCTACTGGCGGCTGGAGAGTATTTGCAGGCTGCACCT
CACCTGGCCAGGGTCGCAGCCTCAACGGAGCCCTGCGCTAGGATGGCGAGATTGATGCTG
GCACAGACTATAAGCGCTTTAAGCGAAGGTCACAAGCGGAAGAGCGTCTCCCGGGAGCTG
TTCCAGTCTGTGAGGCAGCAGTCTACGATCAGCTGGTTAGCACCCTCGGATCTGGAGTGC
GTGTTGTGCACGAACAGCTACACGAACCCGGTTTCGACTCCCTGCGGCCATACATACTGC
AGGACCTGCATAGAAAGATCCTTGTACTATAAGAAAAAATGCGCGCTCTGTTTGGGACCA
TTGGAAAACTTTATGCTACCTGAGACTCAAGACACGTTGTTCATTAGTTCAATACTATCG
TCTATCGGAGTGTCGCAGTCTGTTCGTGATGAGGACGTGATACCCGTCGTAACATGCTAC
GTTGCGTTCCCTGGAATGCCCTGCCCGCTGTTTATGTTCAACCCTCGCTACTGGCAGATG
GTGAGACGAGTGTTGGAGTCAGGCACACGAAGATTCGGCATGCTGGCACACGAAGGTGGA
AATAACTTTGCTGATTACGGCACAGTGCTCGAGATCTGCGACTGCGTAGTGCTGGAAGAC
AACCGCTGTATAGTATCGACGGTCGGCGTCTCCAGGTTTAGAGTCATCGAGAGACACATT
AGAGACGGGTGTGACGTAGCCCGAATCCAGCCACTGACAGATGTGACACCAACTGAGGAC
GAGCTCCAAGACCTGCATACTCTGTCCTCGCAGATATCATCCAAAACTCAAACCTGGCTA
AAGAATATGGACGAGGGTGTTAGGAAAGAAATCGAAACTGCCTTCGGAGCTATGCCTTGT
AAGGACATTCCCGAAAACTGGTGGAACACATCCGATGGACCTAATTGGCTGTGGTGGCTG
ATAGCCATACTGCCCCTGAAGTCAGAGATCAAGATATTAATACTATCAACACGAAGTCTT
CTCAAACGGATGTTGGCTGTATCAAGGACTTTGGACGTCATGGACGCAGAGTTTGTATCA
AACGACTCAAAACTGAACATCACTAGCAGAAAGGAATGGCTGAGGAGATGA

Protein sequence:

MDSNVHSARTRGQRRRVSSSQRLRPYGMTDLNVYIDTNVHYNLDIDEGSVANRTEEPCRV
LIKRNNQDFNHGLDESQRGDDRMGSPTPNASITAGHLQIIAEDENIVLVASLANTDTDSI
GTLSPTLLQLHPTHIQSNSPTENGEQSLNSDQICTAPEQDAAMNPLFDEAMYQEPSETEV
KKETVAQTAQVKSNGQTLTTNLFLNPHSLVRELPMPSVNAHKLANNLIKLSRYLKSPAPS
DLCCLNCCRIPVLPVTGQCGHTRCMRCIVVNGTCPCGVNAPKTLFVNTVIREIIEKMIKY
IRSPRILDPGSPRKTCEKKFPLIRARRRYYRRGQSFSRGAPLSNSPTWCFSRPRVPLTVQ
ARFKRARALLAAGEYLQAAPHLARVAASTEPCARMARLMLAQTISALSEGHKRKSVSREL
FQSVRQQSTISWLAPSDLECVLCTNSYTNPVSTPCGHTYCRTCIERSLYYKKKCALCLGP
LENFMLPETQDTLFISSILSSIGVSQSVRDEDVIPVVTCYVAFPGMPCPLFMFNPRYWQM
VRRVLESGTRRFGMLAHEGGNNFADYGTVLEICDCVVLEDNRCIVSTVGVSRFRVIERHI
RDGCDVARIQPLTDVTPTEDELQDLHTLSSQISSKTQTWLKNMDEGVRKEIETAFGAMPC
KDIPENWWNTSDGPNWLWWLIAILPLKSEIKILILSTRSLLKRMLAVSRTLDVMDAEFVS
NDSKLNITSRKEWLRR