DPGLEAN13235 in OGS1.0

Genomic Positionscaffold1740:+ 86-7498
See gene structure
CDS Length1914
Paired RNAseq reads  203
Single RNAseq reads  722
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009343 (6e-06)
Best Drosophila hit  down syndrome cell adhesion molecule, isoform H (6e-12)
Best Human hithemicentin-1 precursor (3e-17)
Best NR hit (blastp)  hemicentin-1 [Culex quinquefasciatus] (2e-23)
Best NR hit (blastx)  polyprotein [Tetraodon nigroviridis] (3e-23)
GeneOntology terms





  
GO:0008218 bioluminescence
GO:0005576 extracellular region
GO:0005509 calcium ion binding
GO:0050896 response to stimulus
GO:0005604 basement membrane
GO:0018298 protein-chromophore linkage
GO:0007601 visual perception
InterPro families











  
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR013098 Immunoglobulin I-set
IPR013091 EGF calcium-binding
IPR000884 Thrombospondin, type 1 repeat
IPR000742 Epidermal growth factor-like, type 3
IPR007110 Immunoglobulin-like
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
IPR001881 EGF-like calcium-binding
IPR006210 Epidermal growth factor-like
IPR013783 Immunoglobulin-like fold
Orthology groupMCL10002

Nucleotide sequence:

ATGTTTATAAGTAATTTACTTAGATTTTTCTTGTGGTTATTTTTAGTTCCCCCTGCTCCT
CAGAAGGGAGAAATCAAACGAGTGAAAACCTTGTCAGGGTTGTCACTCAATATAAGTTGT
CCGGTCGAAGGACATCCTCTACCGTTCATAAGGTGGTTTAAACAACCCTACTCGGAAATT
GTGGATAGTACGAGGACACTGTTGCTAGATTATAATTCCACTTTACATTTCCCAAACATC
GACACATCAGACTCCGGGTTATACTCCTGTATAGCCACAAACAGCGTTGGTACCACCGAG
CTGCTTTACGAGGTCACAGTACAGACAGCTCCCACCATCGCCGGGAATGATTCACAACTA
GTGGTAGCACTTGGCAGAAGCATCATCTTGAAATGTGAAGTTGCTGGCGTCCCTGAACCC
AAGATCACATGGTTTAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAGGTA
CTCCTCGATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTGGAAATCATACGTGAA
GCGGTTAGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCAGTA
GGTTTTGTGAGAGAGGGCATTAGGACTATAAAAACAAATGTCAAGCCTTACTCCATCCTT
AAAGCGGCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATCCCC
GAGGATATTTGTGCGTCGGCCTCCAGACCGGACATATTCATGTATTCGCGAATCTTAAAG
CGCGTTGTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCATACC
ATCAAGGTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTCGTG
GATTTATACGCGGTAGAAGTGGGAGCGAGAGGTATAACGGCTAAATCTCTCTACAACCTA
CTAAAAGACTTAGGCCTGTCCAGAACTCACATCAATTCGTTCTTGGAACGTACTTCGAAG
GCAGCCCTAGTAGGTTCTTTTCAAATATGGTTAGGTAGGGAGAGGAGCTTGGACAGTGGA
GATAGAAACAATAAGCCGGAAAATGTTCTCATAGACGAGTCCTCGTGCAGGGGTTCCAGC
GTCGAGAGGAGGAAGTGCCGCATGCCGCCGTGCGAGTCTCCCAGTCCGATGTGGTCTCGC
TGGTCGTTGTGGTCGCCCTGTTCCTCGTCCTGCGAGCCGGCCGTCCAGCTCCGGACCAGG
ACGTGTCTGCATGACGACTGTGACGGAGACCACGTGCAGGTCAGAAAATGTCCGGGTGTT
CCAAAGTGTGAGACCAATCGCTTGCATTACACTATAGGAGACGACCTCGAGGACGAAAAT
ATGCCAGATTACATTCCTGAAGCTACGTTTGAACTACAACCGGAGGCGAAAGATGTCACA
AAACCAAAATTATCAAAACCATTCAAGAAGAAACCCATTCCAGCATCTCCCGTGTATGAG
GTGACGGTTACAGAGAATCTTGACGGTAGTCAGAGAGGCCCGTGCAAGACAGGGGACTGG
TTCGATGCTGATAACAACAAATGTGAAGACGTGGACGAGTGTGTGACCTCAAGAAGCGTG
TGTCACTCGACCCAGGTGTGTGTGAACACTCGCGGAGGATACTCATGCACCTGTGAGAAG
GGATACACCTCGCTGGGGGCCGGACAACGCTGTATAGACGTGAACGAGTGCACGCTGGAT
GTACACGAGTGTGAGTACGCGTGTGTGAACACGGCCGGCGGGTATGTGTGTGCGTGTCCC
GCGCCGCTCAGGCTACATCGCGACAGGCGGCGGTGTGTGATGCCGTCGTTACAACACGAG
CCCGTTCAATACGAGGATTATGAAGACACGAAAACTTTCTACAAATATTTATAA

Protein sequence:

MFISNLLRFFLWLFLVPPAPQKGEIKRVKTLSGLSLNISCPVEGHPLPFIRWFKQPYSEI
VDSTRTLLLDYNSTLHFPNIDTSDSGLYSCIATNSVGTTELLYEVTVQTAPTIAGNDSQL
VVALGRSIILKCEVAGVPEPKITWFKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIRE
AVSLSVARAQKGITTNERSVGFVREGIRTIKTNVKPYSILKAATDWTIMMDTCEKQYKIP
EDICASASRPDIFMYSRILKRVVMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVV
DLYAVEVGARGITAKSLYNLLKDLGLSRTHINSFLERTSKAALVGSFQIWLGRERSLDSG
DRNNKPENVLIDESSCRGSSVERRKCRMPPCESPSPMWSRWSLWSPCSSSCEPAVQLRTR
TCLHDDCDGDHVQVRKCPGVPKCETNRLHYTIGDDLEDENMPDYIPEATFELQPEAKDVT
KPKLSKPFKKKPIPASPVYEVTVTENLDGSQRGPCKTGDWFDADNNKCEDVDECVTSRSV
CHSTQVCVNTRGGYSCTCEKGYTSLGAGQRCIDVNECTLDVHECEYACVNTAGGYVCACP
APLRLHRDRRRCVMPSLQHEPVQYEDYEDTKTFYKYL