DPGLEAN12071 in OGS1.0

New model in OGS2.0  DPOGS207247
Genomic Position  scaffold584:+ 22089-41709
CDS Length  2718
Paired RNAseq reads  29
Single RNAseq reads  76
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012062 (3e-147)
Best Drosophila hit  multiplexin, isoform E (9e-71)
Best Human hit  collagen alpha-1(XV) chain precursor (6e-45)
Best NR hit (blastp)  PREDICTED: similar to CG33171-PC, isoform C [Apis mellifera] (2e-116)
Best NR hit (blastx)  PREDICTED: similar to collagen alpha 1(xviii) chain [Tribolium castaneum] (4e-109)
Gene Ontology terms
GO:0005581 collagen
GO:0007155 cell adhesion
GO:0005198 structural molecule activity
GO:0031012 extracellular matrix
GO:0005488 binding
GO:0008045 motor axon guidance
InterPro families
IPR016186 C-type lectin-like
IPR010515 Collagenase NC10/endostatin
IPR008160 Collagen triple helix repeat
IPR016187 C-type lectin fold
Orthology group  MCL12955

Nucleotide sequence:

ATGGCGATGGTGATATCACAGACGACTAAAGCAGCTATTGCTGGATGTTTAGCTTTAATG
ATATTAGGAGCTGTTCTGGTCGTTGGTGTGGCGTTCGGTTGGTTTGATCCCAAACGAGAA
GATGAAGACACACCTGCAAAGATCAGTGCAAGACTTCAAGGCCGGACATCTTTCGTCACT
CACAGAGCTGTGAATCCAGACGAAAGCAGATTACCAGTTATTTTACATACATCCTCATAT
TCTCAGAAAGGAAATAAATTCGAAAATGGGTCGAGAGGAATAGGCCCGTCCGGGATGGAC
CGTAAAGACAAACGTGTAACGCTGTCGAGTCAATGCCAATGTACGGTAAATGATATATTT
AGAATGATGGAGGTTGTACCTGGTTTGGTTGGACCACCTGGACCTCCAGGGCCGACTGGT
GCCGATGGCATGACAGGTGCCCCAGGAAAAACTGGACAAATAGGAGAAACAGGAGCTCCT
GGACCTCAAGGGACAAAGGGTGATCGTGGAGAGAGGGGCGATACAGGTCCCACCGGAATA
GAGGGCCAACCAGGACCTAAAGGAGAGCCTGGTGCTGATGGTTCACCAGGTCTTCAGGGA
CCGCCGGGACCTCCAGGTCCACCTGGATTGCCTTCATCGCCAATGATGGAATCAACAGGA
TTATACGTAGCTGGGGATCGTGGAGTTATGGGACCTCCCGGCGAACGAGGCCCAATGGGT
CTACCCGGACCCCAAGGAGAAAGGGGTTACGCGGGGAATAAAGGCGAAAAAGGATTACAT
GGGGCTAAAGGCGATAAAGGTGAACGGGGGTACGTGGGATTGCGGGGGCCACATGGTGCG
AAAGGGGAACGAGGGGTGCCAGGAAAAGATGGCACACCTGGTTTGCCCGGCGCTCACGGA
CGCCCAGCGGAAAAGGGTGAAAAGGGTGCTCGCGGACTTCCTGGTTTACCCGGACCTTCA
GTAGTCGGCATTTCTGAAAATTCTGTTTTAAGTGAAATCGCACTTCCAGGATCTCGTGAC
GTCATGAAGTTGAAAGGCGAAAGAGGAGAAAGGGGCGAAAAGGGTGAAAAAGGTAGCAGA
GGTATGGAAGGCCCACAAGGTTTTCCAGGCACTGATGGAAGGCCTGGTGAAAGAGGGGAT
ATCGGCCCATCAGGTGTCCCTGGACCTCAAGGAGTACTAGGTCCACCCGGACCTGTAGCA
ATTTCAAGAGAAGAAGCTTTGATCATGACCAAGGGTGAAAAGGGTGAGACCGGTCCCAGG
GGAAAACGGGGTCACCCCGGCGCTCCAGGCCCGAGAGGACCGCCAGGGCTTCCTGGACCC
CCCGGAGTCCCTGGAATTAATGGACCTTCTGGTGATATTGGCCTGCCAGGATGGACGGGT
CCACCGGGTGTAGCGGGTCAACCAGGCCCGCCGGGACAAAAAGGTGAAAAAGGAGACTCC
GGTATATCACCAGCCGACCTCGAAAAGGTAAAAGGTGAAAAAGGTGAACGTGGCTACGAT
GGAACCTCTGGACCACCGGGTAAAGATGGTCCTAGAGGTCCGCCTGGACCTCCAGGAACT
CCAAGCACAAGTTTGCAATATATTCCGGTTCCTGGCCCTCCTGGCCCCCCAGGGCCACCC
GGGCCTCCTGCGGTTTTCACGAATAACGTTCCAATCGACGCTTTGACAGATAGCCCTGGG
ATTAATCGCCTTCAACCTGGCACAGGAAAACCACGAGATCCGCTACAAATTCTAAGAAAC
TTGAATAATTTGATGCAGTACCGCCAAGAACAATTTGAGCCTGGAATTCGAGATTCACTA
GATAGTGACGGAGAAAATACAGATTTCGATGATGAAGAAGATGGCAGGACTCTGGTCGGC
ACTATACTATTTAAATCAACTGAATCATTATTACGGTTGGGAACAAACACTCCTCGAGGA
ACATTAGCATACGTATTGCAAGAGCAAGCGCTCCTTGTAAGAGTCAACAAAGGCTGGCAA
TATGTTGCAATGGGTTCGCTTCTAAAGATACCGAGTCCGCCGGGTAGCGGCGTTACGCTC
ACTCCAGTTCAAAATATATTAGAAACTTCTAGTTTGGTACATCATAAAAACTCGGCAACA
GGCGGACCTGCGCTTCGTCTGGCAACACTTAATGAGCCTCACACAGGCGATATGCATGGA
GTCAGCAGCACAAACTACGAATGTCACAGACAAGCTGAAAGATCCGGATTAGATGGAACT
TTCAGAGCCTTTATTACTTCAAGGGTACAAAACATAGAGTCCATAGTGAATTGGGTGGAC
CGTGAAATACCAGTAGTGAATATCCGAGGGGACATTCTCTTCAATTCGTGGGGTGAAATG
TTGGATGGGTCTGGTGCTGTATTTGCACACGCTCCTAAATTATACAGCTTCAATGGAAAA
AACGTAATGATGGATCCCAGTTGGCCAACAAAAGCTGTTTGGCATGGGGCCACACCAAAT
GGGGAACCGGCAATGGATGCGTATTGTGACGCATGGCACAGCAGTAGCCCGACAAAATTC
GGATTGGCCTCTTCATTACGCTCTAACAAGCTTTTAGATCAAGAAACGTACCCGTGCAGC
ACGCGACTAATCGTGCTCTGCATTGAAACTACTCCGCTTAACACAGTGAGAAGAAAAAAA
CGTTCCAAATATCGGGTATCCGACAAAACACATTTCCTCAAAGACATCGAAAAACGAAAC
GAAACTCTAAACTTATAG

Protein sequence:

MAMVISQTTKAAIAGCLALMILGAVLVVGVAFGWFDPKREDEDTPAKISARLQGRTSFVT
HRAVNPDESRLPVILHTSSYSQKGNKFENGSRGIGPSGMDRKDKRVTLSSQCQCTVNDIF
RMMEVVPGLVGPPGPPGPTGADGMTGAPGKTGQIGETGAPGPQGTKGDRGERGDTGPTGI
EGQPGPKGEPGADGSPGLQGPPGPPGPPGLPSSPMMESTGLYVAGDRGVMGPPGERGPMG
LPGPQGERGYAGNKGEKGLHGAKGDKGERGYVGLRGPHGAKGERGVPGKDGTPGLPGAHG
RPAEKGEKGARGLPGLPGPSVVGISENSVLSEIALPGSRDVMKLKGERGERGEKGEKGSR
GMEGPQGFPGTDGRPGERGDIGPSGVPGPQGVLGPPGPVAISREEALIMTKGEKGETGPR
GKRGHPGAPGPRGPPGLPGPPGVPGINGPSGDIGLPGWTGPPGVAGQPGPPGQKGEKGDS
GISPADLEKVKGEKGERGYDGTSGPPGKDGPRGPPGPPGTPSTSLQYIPVPGPPGPPGPP
GPPAVFTNNVPIDALTDSPGINRLQPGTGKPRDPLQILRNLNNLMQYRQEQFEPGIRDSL
DSDGENTDFDDEEDGRTLVGTILFKSTESLLRLGTNTPRGTLAYVLQEQALLVRVNKGWQ
YVAMGSLLKIPSPPGSGVTLTPVQNILETSSLVHHKNSATGGPALRLATLNEPHTGDMHG
VSSTNYECHRQAERSGLDGTFRAFITSRVQNIESIVNWVDREIPVVNIRGDILFNSWGEM
LDGSGAVFAHAPKLYSFNGKNVMMDPSWPTKAVWHGATPNGEPAMDAYCDAWHSSSPTKF
GLASSLRSNKLLDQETYPCSTRLIVLCIETTPLNTVRRKKRSKYRVSDKTHFLKDIEKRN
ETLNL
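
Consistency check (illustrative sketch, not part of the database record): the listed CDS length of 2718 nt corresponds to 906 codons, i.e. 905 residues plus one stop codon, which matches the 905-residue protein above. The snippet below assumes the standard genetic code (NCBI translation table 1) and translation in frame 0. The helper name `translate` and the variables `first_line` and `last_line` are hypothetical, introduced only for this example. It translates the first and last lines of the nucleotide sequence.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCGATGGTGATATCACAGACGACTAAAGCAGCTATTGCTGGATGTTTAGCTTTAATG"
last_line = "GAAACTCTAAACTTATAG"

print(translate(first_line))  # MAMVISQTTKAAIAGCLALM
print(translate(last_line))   # ETLNL*
print(2718 // 3 - 1)          # 905 residues expected, excluding the stop codon
```

The two translated fragments agree with the start and end of the protein sequence, and the terminal TAG accounts for the three nucleotides not represented in the protein.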