DPGLEAN17838 in OGS1.0

New model in OGS2.0DPOGS207533 
Genomic Positionscaffold440:- 9449-14411
See gene structure
CDS Length1608
Paired RNAseq reads  981
Single RNAseq reads  2498
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001897 (0.0)
Best Drosophila hit  CG7741 (2e-100)
Best Human hitCWF19-like protein 1 (3e-109)
Best NR hit (blastp)  PREDICTED: similar to CG7741-PA isoform 1 [Apis mellifera] (1e-136)
Best NR hit (blastx)  PREDICTED: similar to CG7741-PA isoform 1 [Apis mellifera] (4e-134)
GeneOntology terms


  
GO:0003674 molecular_function
GO:0003824 catalytic activity
GO:0005575 cellular_component
GO:0008150 biological_process
InterPro families


  
IPR011146 Histidine triad-like motif
IPR006768 Cwf19-like, C-terminal domain-1
IPR006767 Cwf19-like protein, C-terminal domain-2
IPR011151 Histidine triad motif
Orthology groupMCL15488

Nucleotide sequence:

ATGGCAGATAAACAGAAAACTCTTATTTGTGGGGACGTAGATGGAAACTTCAACATATTG
TTTTCCCGCGTTGAATCAATTGTGAAAAAATCCGGAGCTTTCGAGGTACTATTGTGCGTT
GGAAACTTTTTTGGTGAAGATAATTCGCAATTGGATGCATATCGCATGAGGACAAGAAAA
GTGCCAGTAACAACATATGTTTTCGGACCATCTAATAGTGACCATGTGGAATATTACTGT
GAGGAAGGTGCTGAAATTGTTCCAAATGTCATTTACATGGGGAAAAGGGGGATATTCACC
ACAAGCGCTGATGTTAAAATTGCCTATCTGACGGGTATGTCCCGTCGGGAATTAGGCAAG
GAGATACCTTTGTGTACATTTGAACCGAGTGATTGTAGTGCAGTGAGAGATGCATGCTTC
AGAGGTACATCTGAATATAGGGGGGTTGACGTCTTAATAACAACCCTATGGCCATCTGGC
ATACAACAAGATGATTGCCAAAAGGCAGATATTGAGCCAGATAAATTATCCGATCTGATA
TCGTGGCTCGCAATCCACATAAAGCCGAGGTATCATTTCGTGCCGTCAAAAGAAAAATAT
TATGAAAGGCAACCTTATAGAAATCAAAGTGTACACCAGGATTACAAAGAGGGCGCCACA
CGTTTCATCGCATTAGCCCCCGTGGGTAATAAGGTTAAAGAAAAATGGATATACGCGTGT
TCATTACAGCCAATAAACAAAATGCGAATGACTGATATATTACAAAGCACTACCGATGAG
ACCTCTTGCCCCTTTGACCCTGAATTGCTGAAGCAGCATCAGCCAGGGAAGGTTGTGAAG
GTTTCTGGTAATGGACAATTCTTCTATAATATGGACGCTCAAGATGATGATAACGGAAAA
AGGAAACGGAAAAGTGGTGACAACCCTGAACGGAAAAGGAAAGAATTTGATCCAGATACC
TGCTGGTTTTGTCTGTCATCACCATCAGTGGAGAAACATTTAGTGATAAGTGTTGGTAGT
CATTGCTACCTCGCTCTACCAAAGGGTCCGTTGACCTCACACCATGTCCTCATACTGCCT
ATAGCGCATCATCAGTCCGTTACCAAGGCACCGGATGAGGTGATAAAAGAAATTAAGAGA
TTCAAAGATGCATTGAAGAAGCTGTATTCCTCGATGGACCAGCTGGGAGTGTTCTTTGAG
CGAAATTTCAGAACGTCACACATGCAGATACAGTGTGTACCGGTCGGGAAACAGTGTGGA
GATCAGTTACTGGAGGTGTTTCAGGACGAGGCCGGCATTAATAGCATTCAGTTAGAGGTG
CTGCCGCCTTATACCGACATCGCTCAAGTGTCTCTGCCGGGAGCGCCGTACTTCCACGCG
GAACTTCCCTCCGGGGAACAGATATACGCTAAGACACGACAGCATTTCCCATTACAGTTT
GGAAGAGATGTACTGTCAAGCCCGCCGATACTTAACTGCGAGGACAAAGCAGACTGGCGA
CAGTGTCTCCTCAGTAGAGAGGAGGAAGACCAGCTCGTGGCAGACTTCAGACAACAGTTC
AGACCATACGACTTCACCGCTGACGATAGTGGTAGCGATAGTGAATGA

Protein sequence:

MADKQKTLICGDVDGNFNILFSRVESIVKKSGAFEVLLCVGNFFGEDNSQLDAYRMRTRK
VPVTTYVFGPSNSDHVEYYCEEGAEIVPNVIYMGKRGIFTTSADVKIAYLTGMSRRELGK
EIPLCTFEPSDCSAVRDACFRGTSEYRGVDVLITTLWPSGIQQDDCQKADIEPDKLSDLI
SWLAIHIKPRYHFVPSKEKYYERQPYRNQSVHQDYKEGATRFIALAPVGNKVKEKWIYAC
SLQPINKMRMTDILQSTTDETSCPFDPELLKQHQPGKVVKVSGNGQFFYNMDAQDDDNGK
RKRKSGDNPERKRKEFDPDTCWFCLSSPSVEKHLVISVGSHCYLALPKGPLTSHHVLILP
IAHHQSVTKAPDEVIKEIKRFKDALKKLYSSMDQLGVFFERNFRTSHMQIQCVPVGKQCG
DQLLEVFQDEAGINSIQLEVLPPYTDIAQVSLPGAPYFHAELPSGEQIYAKTRQHFPLQF
GRDVLSSPPILNCEDKADWRQCLLSREEEDQLVADFRQQFRPYDFTADDSGSDSE