DPGLEAN09040 in OGS1.0

New model in OGS2.0DPOGS202770 
Genomic Positionscaffold30:- 21777-23961
See gene structure
CDS Length1917
Paired RNAseq reads  932
Single RNAseq reads  2681
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010493 (1e-60)
Best Drosophila hit  CG15141 (3e-52)
Best Human hitputative E3 ubiquitin-protein ligase UBR7 isoform 2 (3e-53)
Best NR hit (blastp)  PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] (4e-62)
Best NR hit (blastx)  AGAP009512-PA [Anopheles gambiae str. PEST] (3e-66)
GeneOntology terms

  
GO:0004842 ubiquitin-protein ligase activity
GO:0005515 protein binding
GO:0008270 zinc ion binding
InterPro families



  
IPR011011 Zinc finger, FYVE/PHD-type
IPR013032 EGF-like region, conserved site
IPR013993 Zinc finger, N-recognin, metazoa
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR003126 Zinc finger, N-recognin
Orthology groupMCL14557

Nucleotide sequence:

ATGATGGACGTCTTGCAGGAACAAGAAAATTTTGAAGAAGATGCTAACGCTGTGTTGGGA
GGTTCCGATGATAAAAACTGTACATATTCTAAGGGCTACATAAAGAGACAAGCATTGTAC
GCCTGCATGACCTGCTGCTCTGAAGCGAAATCTGACCCAGCTAAGAGAGCAGGTCTTTGT
CTAGCCTGCAGCCTCACTTGTCATGAAAATCATGAACTTATAGAACTGTATACTAAACGC
AATTTTAGATGTGACTGCGGTAACTCAAAATTCAACTCTAATCCTTGTCAGTTAGCACCC
AAAAAAGCAAATTTTAATGAGGAGAATAGTTACAACCAGAATTTTAGTGGAGTATACTGT
GTGTGTCGGAGGCCATACCCCGATCCAGATTGTGAAACTGAAGATGTAATGATCCAATGT
ACCATATGCGAGGACTGGTACCACGGCACACATTTAGAAACAACTGTCCCTAACAGTGAA
CTCTACACAGAGATGATTTGCAAAGGATGTATGGAAAAATATGACTTTTTACATTCCTAC
AGTTACATGGTTGTAAATGTTGAAAGCTCCGATGTTGATGTCATTAATGTTCCTGAGAAT
GGAATTAAAACTCGCAATGGAGACTTCAAAACAGATGCCACAGCTGTTGAAGATAGTGAA
AGGTCTCAGGAAAATGAAGATATTAGCCTCACTCCTAAAAAAGAAATTTCTTCTATTGAT
GAAAACATTGAAGAAAATAAAAAGAAAGTAGAAAATGATGGAACAAATAATTCTAAGATG
GAAGGTATTTCTGATGTTGATGTTAGTGTTGAGAATCCTTCAAGTGAAAGTCTCATAAGT
TGTAACAATAAAGACAACACTGATATTAAAGAGGAGGGAAGCAATGCAGACAATACCAAT
GACCGAGATACTAGTAGTGACCAGAGCCAAGATATTATCAATAGTGAGATCCAACGAGAC
ATGGAACTAAACAAAAATAATACTGCAAAAGAAATCGAAGACAAAAATGCTATGAAAACA
ACTAGCGAGGTGAAAAATAGACAAGATGAAAATACAGATGAAGAGAAGCCTCTGGTAGAT
TATGAAAGTGAAAGCTGCAAGTCAAAAATGAATTTAGATATCACAGAAACAAGTCAGGAA
GATATTAAGACTACACATGAGAACGGGAAAATGTATAAAAAAAATGATGCTGACAATACT
ATCAACAGAACAAGTGAGTTAAATAACTTAGAAAAAGGAGAAGGGACAGAGGAAAAAACG
GAAAACTTGGCATCTCATGATGAAAAAGATGTGGTACAAAACACATCAAAAGGAAAGGAA
ACAAAAAATGAAGGTGGTGGTAACTGTAGCAATCCTGTTGATGGTAATTATACAGACGCT
GCTACTGATGAAGTAACAGGGGACAGCAATCACAAAGGATCAGAAAAAAGAAAACTTTCC
ACAGAAGAAACAACAGATAGTTCAGTGAGTAAGAAAAGTAAATTAGGAGAGGTGACTGAC
AAACCATGCACTTGTCCTAAAAATGACAAAAAAGTGTACAGAGGAGCAACATTCTGGCCC
TCAACCTTCCGCCAGAGACTCTGCACATGCAATGAATGTCTGAGCATGTATAAGGACCTG
TCTGTTATGTTCCTTATGGACACTGAAGACACAGTCGTCGCCTACGAGAGCTTGGGCAAG
GAGAAAACCAACGGTAAGCCATCACAGTATGAAAAGGGGCTCCAAGCACTTTCATCGCTG
GATAGAATCCAACAGATCAATGCCTTGACAGAGTACAACAAAATGAGAGACAAGCTATTA
GACTTCCTTAAAAGCTTCAAGGACAGGAAAGAAATTGTCAAGGAGGAAGACATCAAAGCA
TTCTTTGCCGGAATGAAGCCCAAGAGGGAACCAGAGGGTGTGTACTTTTGTCGGTGA

Protein sequence:

MMDVLQEQENFEEDANAVLGGSDDKNCTYSKGYIKRQALYACMTCCSEAKSDPAKRAGLC
LACSLTCHENHELIELYTKRNFRCDCGNSKFNSNPCQLAPKKANFNEENSYNQNFSGVYC
VCRRPYPDPDCETEDVMIQCTICEDWYHGTHLETTVPNSELYTEMICKGCMEKYDFLHSY
SYMVVNVESSDVDVINVPENGIKTRNGDFKTDATAVEDSERSQENEDISLTPKKEISSID
ENIEENKKKVENDGTNNSKMEGISDVDVSVENPSSESLISCNNKDNTDIKEEGSNADNTN
DRDTSSDQSQDIINSEIQRDMELNKNNTAKEIEDKNAMKTTSEVKNRQDENTDEEKPLVD
YESESCKSKMNLDITETSQEDIKTTHENGKMYKKNDADNTINRTSELNNLEKGEGTEEKT
ENLASHDEKDVVQNTSKGKETKNEGGGNCSNPVDGNYTDAATDEVTGDSNHKGSEKRKLS
TEETTDSSVSKKSKLGEVTDKPCTCPKNDKKVYRGATFWPSTFRQRLCTCNECLSMYKDL
SVMFLMDTEDTVVAYESLGKEKTNGKPSQYEKGLQALSSLDRIQQINALTEYNKMRDKLL
DFLKSFKDRKEIVKEEDIKAFFAGMKPKREPEGVYFCR