Genomic Position | scaffold1740:+ 86-7498 |
---|---|
See gene structure | |
CDS Length | 1914 |
Paired RNAseq reads   | 203 |
Single RNAseq reads   | 722 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009343 (6e-06) |
Best Drosophila hit   | down syndrome cell adhesion molecule, isoform H (6e-12) |
Best Human hit | hemicentin-1 precursor (3e-17) |
Best NR hit (blastp)   | hemicentin-1 [Culex quinquefasciatus] (2e-23) |
Best NR hit (blastx)   | polyprotein [Tetraodon nigroviridis] (3e-23) |
GeneOntology terms    | GO:0008218 bioluminescence GO:0005576 extracellular region GO:0005509 calcium ion binding GO:0050896 response to stimulus GO:0005604 basement membrane GO:0018298 protein-chromophore linkage GO:0007601 visual perception |
InterPro families    | IPR000152 EGF-type aspartate/asparagine hydroxylation site IPR013032 EGF-like region, conserved site IPR018097 EGF-like calcium-binding, conserved site IPR013098 Immunoglobulin I-set IPR013091 EGF calcium-binding IPR000884 Thrombospondin, type 1 repeat IPR000742 Epidermal growth factor-like, type 3 IPR007110 Immunoglobulin-like IPR003599 Immunoglobulin subtype IPR003598 Immunoglobulin subtype 2 IPR001881 EGF-like calcium-binding IPR006210 Epidermal growth factor-like IPR013783 Immunoglobulin-like fold |
Orthology group | MCL10002 |
Nucleotide sequence:
ATGTTTATAAGTAATTTACTTAGATTTTTCTTGTGGTTATTTTTAGTTCCCCCTGCTCCT
CAGAAGGGAGAAATCAAACGAGTGAAAACCTTGTCAGGGTTGTCACTCAATATAAGTTGT
CCGGTCGAAGGACATCCTCTACCGTTCATAAGGTGGTTTAAACAACCCTACTCGGAAATT
GTGGATAGTACGAGGACACTGTTGCTAGATTATAATTCCACTTTACATTTCCCAAACATC
GACACATCAGACTCCGGGTTATACTCCTGTATAGCCACAAACAGCGTTGGTACCACCGAG
CTGCTTTACGAGGTCACAGTACAGACAGCTCCCACCATCGCCGGGAATGATTCACAACTA
GTGGTAGCACTTGGCAGAAGCATCATCTTGAAATGTGAAGTTGCTGGCGTCCCTGAACCC
AAGATCACATGGTTTAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAGGTA
CTCCTCGATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTGGAAATCATACGTGAA
GCGGTTAGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCAGTA
GGTTTTGTGAGAGAGGGCATTAGGACTATAAAAACAAATGTCAAGCCTTACTCCATCCTT
AAAGCGGCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATCCCC
GAGGATATTTGTGCGTCGGCCTCCAGACCGGACATATTCATGTATTCGCGAATCTTAAAG
CGCGTTGTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCATACC
ATCAAGGTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTCGTG
GATTTATACGCGGTAGAAGTGGGAGCGAGAGGTATAACGGCTAAATCTCTCTACAACCTA
CTAAAAGACTTAGGCCTGTCCAGAACTCACATCAATTCGTTCTTGGAACGTACTTCGAAG
GCAGCCCTAGTAGGTTCTTTTCAAATATGGTTAGGTAGGGAGAGGAGCTTGGACAGTGGA
GATAGAAACAATAAGCCGGAAAATGTTCTCATAGACGAGTCCTCGTGCAGGGGTTCCAGC
GTCGAGAGGAGGAAGTGCCGCATGCCGCCGTGCGAGTCTCCCAGTCCGATGTGGTCTCGC
TGGTCGTTGTGGTCGCCCTGTTCCTCGTCCTGCGAGCCGGCCGTCCAGCTCCGGACCAGG
ACGTGTCTGCATGACGACTGTGACGGAGACCACGTGCAGGTCAGAAAATGTCCGGGTGTT
CCAAAGTGTGAGACCAATCGCTTGCATTACACTATAGGAGACGACCTCGAGGACGAAAAT
ATGCCAGATTACATTCCTGAAGCTACGTTTGAACTACAACCGGAGGCGAAAGATGTCACA
AAACCAAAATTATCAAAACCATTCAAGAAGAAACCCATTCCAGCATCTCCCGTGTATGAG
GTGACGGTTACAGAGAATCTTGACGGTAGTCAGAGAGGCCCGTGCAAGACAGGGGACTGG
TTCGATGCTGATAACAACAAATGTGAAGACGTGGACGAGTGTGTGACCTCAAGAAGCGTG
TGTCACTCGACCCAGGTGTGTGTGAACACTCGCGGAGGATACTCATGCACCTGTGAGAAG
GGATACACCTCGCTGGGGGCCGGACAACGCTGTATAGACGTGAACGAGTGCACGCTGGAT
GTACACGAGTGTGAGTACGCGTGTGTGAACACGGCCGGCGGGTATGTGTGTGCGTGTCCC
GCGCCGCTCAGGCTACATCGCGACAGGCGGCGGTGTGTGATGCCGTCGTTACAACACGAG
CCCGTTCAATACGAGGATTATGAAGACACGAAAACTTTCTACAAATATTTATAA
Protein sequence:
MFISNLLRFFLWLFLVPPAPQKGEIKRVKTLSGLSLNISCPVEGHPLPFIRWFKQPYSEI
VDSTRTLLLDYNSTLHFPNIDTSDSGLYSCIATNSVGTTELLYEVTVQTAPTIAGNDSQL
VVALGRSIILKCEVAGVPEPKITWFKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIRE
AVSLSVARAQKGITTNERSVGFVREGIRTIKTNVKPYSILKAATDWTIMMDTCEKQYKIP
EDICASASRPDIFMYSRILKRVVMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVV
DLYAVEVGARGITAKSLYNLLKDLGLSRTHINSFLERTSKAALVGSFQIWLGRERSLDSG
DRNNKPENVLIDESSCRGSSVERRKCRMPPCESPSPMWSRWSLWSPCSSSCEPAVQLRTR
TCLHDDCDGDHVQVRKCPGVPKCETNRLHYTIGDDLEDENMPDYIPEATFELQPEAKDVT
KPKLSKPFKKKPIPASPVYEVTVTENLDGSQRGPCKTGDWFDADNNKCEDVDECVTSRSV
CHSTQVCVNTRGGYSCTCEKGYTSLGAGQRCIDVNECTLDVHECEYACVNTAGGYVCACP
APLRLHRDRRRCVMPSLQHEPVQYEDYEDTKTFYKYL