New model in OGS2.0 | DPOGS215400  |
---|---|
Genomic Position | scaffold486:+ 744-7064 |
See gene structure | |
CDS Length | 1281 |
Paired RNAseq reads   | 485 |
Single RNAseq reads   | 1208 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005736 (2e-06) |
Best Drosophila hit   | CG7526, isoform E (3e-09) |
Best Human hit | collagen and calcium-binding EGF domain-containing protein 1 precursor (6e-21) |
Best NR hit (blastp)   | PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum] (6e-63) |
Best NR hit (blastx)   | PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum] (9e-46) |
GeneOntology terms   | GO:0005509 calcium ion binding |
InterPro families    | IPR006210 Epidermal growth factor-like IPR001881 EGF-like calcium-binding IPR000152 EGF-type aspartate/asparagine hydroxylation site IPR013032 EGF-like region, conserved site IPR018097 EGF-like calcium-binding, conserved site IPR008160 Collagen triple helix repeat IPR013091 EGF calcium-binding IPR000742 Epidermal growth factor-like, type 3 |
Orthology group | MCL24101 |
Nucleotide sequence:
ATGCGCGCGCCGCTCGCACCGGCCTCGCTCCTCCTGACACATGCGCTCCTGGTACTCGGG
CGACTACATGAAGGATACTACGCCGAGGACGGTTACCATGACGACGCCCTGGATGTGGTG
GACCTAGTCTCGGCCTGTCCTACAGACAGACTGCTTCGTACCAGAGAGACTTGTCACGTG
GAAGGGGCTGACGTTCAGTGTATCACATTACACTGCTGTGAAGACTACAGTTACATCGCT
GGACGCTGTATCCGTAACTCTGTGGACGCGTGTAGTCTCCACCTGTGTGAGCAGGCTTGT
GAGGTTCAGGAGCAGCGTGTGTGGTGTTCCTGTCACCCTGGGTACAGGTTCGACGCTGAT
AGTTACAATCGGAAAAGGCAGCCTTACTGTGTAGATATAGATGAATGCACCATCAATAAT
GGCGGCTGTGAGCACCGTTGTGTGAACGACCCCGGCGGTTTTCACTGTGAGTGTAACGCG
CCGTATAGTGTCGGCATCGATGGAAGAAAGTGTGTACCGTCTGTGGCTGTCGGGATGCCG
GAACCTTTGCCCCTCGTCCGAACATCTTCTCGGTGCTACGCTCCGTGTGACACCGTGTCC
TGGCTCTCGCGGAAGGTGAAGCAGCTCAACGACCAGCTCCACAGCACGCAAGCTGCCTTG
AAGAAGTTGTTAGAGAACCCCGTGCTGACAGAAGACAGGAGTTTTGCTTATCGAGTACTG
GATTCCACGGCTCCCTTAGAGGGCGGCTACTGCCGGTGTGAGAGAGGTCCTCGTGGTCCC
GCCGGCCCACCGGGGATGGAAGGCCCGAAAGGCGACCCGGGACAACGCGGACCCCGAGGA
GCTCGAGGTCCCAAGGGATCTTTGGACCTTATGCTGCTTTTACTAGCAGACATAAGACAC
GACATCCATAATCTTGAGGAAAGGGTTTATAAAGAAGGGGAACGACCCGAACGCTTCAAC
CTTCAGAAGGCATGGCGTCGACAACGGAAGCAAGAAAACTTAGAGAAGGAAAATAGGACG
GAAGAAGAACTAGAAGCTTACACCTCGCCACCCGTCATTGAGGGGGCTGGAGACGTGACA
TCACGAGGTCCCGATGGTGACAAGCCCGAGTCCGGCACCACCAGGGATAATGTCCATAAT
GAGGATAATCAGAAGAGCACGGAATCCTTGGACCTTGCGGACATGGACGAGAAGCTGCGG
CAGATCAGACTCCTGGCGCAGTCCACCAGCACCGACGACGACGACGAGCCCGACGGAGAC
TACGACTACAGCTTCTACTAG
Protein sequence:
MRAPLAPASLLLTHALLVLGRLHEGYYAEDGYHDDALDVVDLVSACPTDRLLRTRETCHV
EGADVQCITLHCCEDYSYIAGRCIRNSVDACSLHLCEQACEVQEQRVWCSCHPGYRFDAD
SYNRKRQPYCVDIDECTINNGGCEHRCVNDPGGFHCECNAPYSVGIDGRKCVPSVAVGMP
EPLPLVRTSSRCYAPCDTVSWLSRKVKQLNDQLHSTQAALKKLLENPVLTEDRSFAYRVL
DSTAPLEGGYCRCERGPRGPAGPPGMEGPKGDPGQRGPRGARGPKGSLDLMLLLLADIRH
DIHNLEERVYKEGERPERFNLQKAWRRQRKQENLEKENRTEEELEAYTSPPVIEGAGDVT
SRGPDGDKPESGTTRDNVHNEDNQKSTESLDLADMDEKLRQIRLLAQSTSTDDDDEPDGD
YDYSFY