DPGLEAN00892 in OGS1.0

New model in OGS2.0DPOGS215400 
Genomic Positionscaffold486:+ 744-7064
See gene structure
CDS Length1281
Paired RNAseq reads  485
Single RNAseq reads  1208
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005736 (2e-06)
Best Drosophila hit  CG7526, isoform E (3e-09)
Best Human hitcollagen and calcium-binding EGF domain-containing protein 1 precursor (6e-21)
Best NR hit (blastp)  PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum] (6e-63)
Best NR hit (blastx)  PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum] (9e-46)
GeneOntology terms  GO:0005509 calcium ion binding
InterPro families






  
IPR006210 Epidermal growth factor-like
IPR001881 EGF-like calcium-binding
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR008160 Collagen triple helix repeat
IPR013091 EGF calcium-binding
IPR000742 Epidermal growth factor-like, type 3
Orthology groupMCL24101

Nucleotide sequence:

ATGCGCGCGCCGCTCGCACCGGCCTCGCTCCTCCTGACACATGCGCTCCTGGTACTCGGG
CGACTACATGAAGGATACTACGCCGAGGACGGTTACCATGACGACGCCCTGGATGTGGTG
GACCTAGTCTCGGCCTGTCCTACAGACAGACTGCTTCGTACCAGAGAGACTTGTCACGTG
GAAGGGGCTGACGTTCAGTGTATCACATTACACTGCTGTGAAGACTACAGTTACATCGCT
GGACGCTGTATCCGTAACTCTGTGGACGCGTGTAGTCTCCACCTGTGTGAGCAGGCTTGT
GAGGTTCAGGAGCAGCGTGTGTGGTGTTCCTGTCACCCTGGGTACAGGTTCGACGCTGAT
AGTTACAATCGGAAAAGGCAGCCTTACTGTGTAGATATAGATGAATGCACCATCAATAAT
GGCGGCTGTGAGCACCGTTGTGTGAACGACCCCGGCGGTTTTCACTGTGAGTGTAACGCG
CCGTATAGTGTCGGCATCGATGGAAGAAAGTGTGTACCGTCTGTGGCTGTCGGGATGCCG
GAACCTTTGCCCCTCGTCCGAACATCTTCTCGGTGCTACGCTCCGTGTGACACCGTGTCC
TGGCTCTCGCGGAAGGTGAAGCAGCTCAACGACCAGCTCCACAGCACGCAAGCTGCCTTG
AAGAAGTTGTTAGAGAACCCCGTGCTGACAGAAGACAGGAGTTTTGCTTATCGAGTACTG
GATTCCACGGCTCCCTTAGAGGGCGGCTACTGCCGGTGTGAGAGAGGTCCTCGTGGTCCC
GCCGGCCCACCGGGGATGGAAGGCCCGAAAGGCGACCCGGGACAACGCGGACCCCGAGGA
GCTCGAGGTCCCAAGGGATCTTTGGACCTTATGCTGCTTTTACTAGCAGACATAAGACAC
GACATCCATAATCTTGAGGAAAGGGTTTATAAAGAAGGGGAACGACCCGAACGCTTCAAC
CTTCAGAAGGCATGGCGTCGACAACGGAAGCAAGAAAACTTAGAGAAGGAAAATAGGACG
GAAGAAGAACTAGAAGCTTACACCTCGCCACCCGTCATTGAGGGGGCTGGAGACGTGACA
TCACGAGGTCCCGATGGTGACAAGCCCGAGTCCGGCACCACCAGGGATAATGTCCATAAT
GAGGATAATCAGAAGAGCACGGAATCCTTGGACCTTGCGGACATGGACGAGAAGCTGCGG
CAGATCAGACTCCTGGCGCAGTCCACCAGCACCGACGACGACGACGAGCCCGACGGAGAC
TACGACTACAGCTTCTACTAG

Protein sequence:

MRAPLAPASLLLTHALLVLGRLHEGYYAEDGYHDDALDVVDLVSACPTDRLLRTRETCHV
EGADVQCITLHCCEDYSYIAGRCIRNSVDACSLHLCEQACEVQEQRVWCSCHPGYRFDAD
SYNRKRQPYCVDIDECTINNGGCEHRCVNDPGGFHCECNAPYSVGIDGRKCVPSVAVGMP
EPLPLVRTSSRCYAPCDTVSWLSRKVKQLNDQLHSTQAALKKLLENPVLTEDRSFAYRVL
DSTAPLEGGYCRCERGPRGPAGPPGMEGPKGDPGQRGPRGARGPKGSLDLMLLLLADIRH
DIHNLEERVYKEGERPERFNLQKAWRRQRKQENLEKENRTEEELEAYTSPPVIEGAGDVT
SRGPDGDKPESGTTRDNVHNEDNQKSTESLDLADMDEKLRQIRLLAQSTSTDDDDEPDGD
YDYSFY