DPGLEAN21700 in OGS1.0

New model in OGS2.0DPOGS215444 
Genomic Positionscaffold939:+ 7042-18441
See gene structure
CDS Length3915
Paired RNAseq reads  7392
Single RNAseq reads  19106
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005736 (1e-06)
Best Drosophila hit  CG31999 (1e-70)
Best Human hitfibulin-5 precursor (1e-52)
Best NR hit (blastp)  fibulin 1 [Culex quinquefasciatus] (4e-96)
Best NR hit (blastx)  AGAP011322-PA [Anopheles gambiae str. PEST] (6e-106)
GeneOntology terms

  
GO:0005578 proteinaceous extracellular matrix
GO:0007155 cell adhesion
GO:0005509 calcium ion binding
InterPro families






  
IPR002919 Protease inhibitor I8, cysteine-rich trypsin inhibitor-like
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR000742 Epidermal growth factor-like, type 3
IPR006210 Epidermal growth factor-like
IPR001881 EGF-like calcium-binding
IPR013091 EGF calcium-binding
Orthology groupMCL11026

Nucleotide sequence:

ATGAAGTTATTAGTGATGAATATAGTGCTGTGTGTTGTCAGGTTTTCAGTCCAGGGAGCT
CTTACGTCAGAGGAAATAGTGGATATAACTGAAACATGCTGCAGTTACGGGGAGATGTTC
CTGATGACTTCTCCGGACAAAGATTGTTCTAAACTAGGCACACCTGAAGATATTGAACCC
GAACAGATGGAAGCTTGCAAACCAGCCGCAAAAACCTGCTGTGAACAGCAAATACTAAAA
ATAGACGAATGCAACGCTGGCATAAAGTGGGCTGTTGCAAAGAAATGTCAGACTCCTGAA
GATGAAATTGGAAAGACATGTTGCGACGAGTGTTCATTTGGTCGTCTTGCTGGGACTCAG
GGTAAGCAGGCCTGTGGAGATGAACCTTCGGAATTCTTGAGCCCTTTAACAGCTTTGAGA
AAGATGGCCTATCATAAATGTTGTGTGGAAGCTGCGCAGGAATTAGAGACGACGACGGAG
AAAAAGAAAGTAACTACAACCGAAAAACCAAAGGAAAAATGTAAGGCGAACTCTTGTGAG
CATAATTGTTCGGACAGTGACGGCAAGGTCACGTGTCTGTGTAAAGATGGTTATAGACTT
CAACAAGATAAAAAATCTTGTAAAGATATAAATGAATGTGCAGAAGCCGTAGATGACCTG
TGCACAGATAAGGACACTGTGTGCCACAATACTGAGGGATCATTTAAATGTGTGCCTCTT
AAGAAGCGAGATGTTGGCCTAAGTTGTCCTCCAGGATTTAAACGAAATGTCGTTAACCAA
GTCTGTGACGATATTAATGAATGTCGTCTTCCAAGGCCCCCGTGTCCCAAATACCTTTGT
GAAAACACTATCGGTGGTTACAAATGTGCCGGTAAAGTTGGAAAGCCTTACACAGAAGAT
GGTACAGGACCAACAACTGAGGCCGGAGCTTCAACTTCCTCGACAGTAAGAAATGATATC
TGCCCGCCGGGTTTCAGAGCCGGCCCTGACGATGAATGCCTCGATATCGACGAGTGCGAG
GAACATTTGGACGACTGCCAGCGTCTGTCACAATATTGTATTAACACTCACGGAAGCTAT
TTCTGCCAGGACCATGTCTCCAAGCGATGCGCTCCCGGCTTCAAGGTCAATAGTAACACT
GGTATATGTGAAGATATCGACGAATGCGAAGAAAGCTCAGAAGTGTGCAAGCGAAACGAA
GTTTGCATTAATCTGCCAGGAGCCTACAATTGCAAGTCGAAAATTAGTACACTACCAAAG
CTGGCCACACAGAATTGCCAAGAAGGTACTCGCAGAAGAGGAAGCAGTTGCGAAGATATT
GACGAATGTCGGGAAGGAACGCATTTGTGCGACCAGTTTCAGAACTGCATTAATACCTTC
GCCGGACATGAATGTCGCTGTAAGAACGGTTTCGAGTTAGACTCTACATCTGGATCATGT
GTAGATATTGATGAGTGCGCTCTAAAGTTAGACAACTGTGGATCAGAACTGCGTTGTTTG
AATGTACTGGGTTCTTTCACTTGTACACGATCAACATCAACACCACCGGCCCCAGTTTAT
GAATATGAATATTACGACTCCGAAGAGGACAATTCAGTAATTCCAAGTCCAGAAACTACA
TCATCTACAACGACTTCAACCACAACATCTACAACTACGACAACGACCACGCCAAGACCG
ACCACAACCAGCTCTACTACTACTTCTACCAGACCATCCACCACACCGAAACCATACCAA
CCCAGAAGATACCCTAACACACCAAGAAGACCATTCTATCATAGATCTTCCACTTCTACC
ACCACTAGCACGACTTCAACAACTCCGCCACCGGTTCCAAAATATCCAGAATGGTCGGAC
TATCCAAGAGAAAACACAACTCCAAAAGAAGTAACAGTTCCAAAACCAGATATAACGAAT
GTTATCGAAACAGACAAAGAACCAGACGGCAGCTTTGTCCTCAACACCAATGATATCCCA
AAGGACAGATGGACCAATGTTATAAACAGAGAGCATGAAAGGTTCAACCCAAACTGGTTA
CATTGTCTTGATGGATATGAGAGGAACGAACGGGGAGAATGCGTTGACATCAATGAATGC
GGAGCCAATCGACATAGTTGCAGTTCCTTAGAGTACTGTATAAATACACCAGGAAGTTAT
GACTGCGAGTGTATTCCTGGTTTTGTGAGGGATCCATCCGGTTGGTGCGGTGTTGCCACT
ACTCCCAGTACTTCTCCATCACCACCAACTCAGAGACCAACCACCCTAAGGCTAACTACT
TCAAAACCAACCACAACTTCAAGACCTACCACTACTCCAAGACCTACTAGACCACCTAGA
ATACCTGCGGCTAGACCCACTAGACCTATACCAAGAATAACTCCTAGGACTACAATTAGA
CCAACAACGACAAGCACTACATCAACGACGTCAACAACCTCACGTAGTACCAACACAAAC
GAAGTTGCTCCTCTAACACCAACGCCAGCCTGGTATCCGAGTCCATCACGTGGTCATCTC
AGCCCTGTTAATTGCGAGCTAGGGTATACCTACAACCACAATGAAAGAAAATGTGTTGAT
ATAGATGAATGTGCTACCCAAAGAGCTAGCTGTGGACCTACAGAGGACTGCGTAAATACA
GAAGGAGGATATCGCTGCGAATGTGGCCCTAGATGTCTATCTCGCAGACAAAACACCTCT
TATACTTACCACGACAACCCGCCAGTCATCAGTCCAGATTCCAATGTGATCACAATAGGC
GCTCAGTACGGCCAGCGAGGGCCGAGGTACATGCGCCCGACATACAAGCGACTCCACGAC
ACGGGATCTGTGCTTACTACATGTCCATGGGGATACAAACTTACACCAGATAGAGTTTGT
ATGGATTTGGATGAATGTGAGATGAATATCTCCGAGTGTGGCCCGCAGCAGCGTTGTGAA
AACTTTTATGGAGGCTACTCGTGCCAATGTCCAGCCGGCCATTGGAGCAACGGCAAGCAA
TGTGATGACATCGATGAGTGCAGTTATGGCAATACATGCTCCTACAACGCGCGATGCATC
AACACTGTCGGGTCCTACCGTTGTGAGTGTTCAGAGGGCTTCAGGAACGCTCCATCTAAC
GACAAAGTCTGCGTGGATGTAGACGAGTGCTCCGAGCCTGAACCTTTATGTGAACAAGTG
TGCGTGAACGCTTGGGGGGGATACAGGTGCTATTGCAATAGGGGCTATAGACTCAGCAAT
GACAATCGGACTTGTACGGATGTAGATGAATGCGCAGAGTCAGGTTCCCGTATATGCACA
GCTCAGTGCGTTAACACCGTGGGCTCCTATCGTTGCGCTTGCCCTTCAGGTTACCGACTG
GCTGACGATAAACGATCTTGTCTAGATATTGACGAATGTGAAAATGGCCAGGCTCGCTGC
GGTGGAGTGGGAGAGGTTTGTCAGAACACCCGCGGTGGCTACCACTGCCATCAGATAAAA
TGCCCGCCAGGGTACCGCCTCGAAGGAAAACACAAATGCGCTCGGATACAACGCTCGTGT
CCAGTCTCGGACTGGTCGTGTCTTCAGCAACCGAGTACCTACAGCTACAATTTTATAACA
TTCGTCTCCAACTTGTATTTGCCTCTAGGAAGTGTGGATCTATTCTCTATGCAAGGTCCT
GCATGGCGTGATGCTGTAGTGAACTTTGAGATGCGTCTCTTAGACGTGCAAGCGGCGCCT
GGAGTCAAACCGGCAGATATCACGTGCTTTGGCATGAGGCCTAGTAGCAACGTCTGTGTG
ATCTCTCTCCAATGTTCCCTTCAAGGTCCACAAGTAGCTGAATTGGAACTAACCATGTCT
CTATACCAAAGATCTATGTTCGCTGGCAACGCTGTCGCCAGACTAGTCGTGATCGTATCA
GAATACGAGTACTAA

Protein sequence:

MKLLVMNIVLCVVRFSVQGALTSEEIVDITETCCSYGEMFLMTSPDKDCSKLGTPEDIEP
EQMEACKPAAKTCCEQQILKIDECNAGIKWAVAKKCQTPEDEIGKTCCDECSFGRLAGTQ
GKQACGDEPSEFLSPLTALRKMAYHKCCVEAAQELETTTEKKKVTTTEKPKEKCKANSCE
HNCSDSDGKVTCLCKDGYRLQQDKKSCKDINECAEAVDDLCTDKDTVCHNTEGSFKCVPL
KKRDVGLSCPPGFKRNVVNQVCDDINECRLPRPPCPKYLCENTIGGYKCAGKVGKPYTED
GTGPTTEAGASTSSTVRNDICPPGFRAGPDDECLDIDECEEHLDDCQRLSQYCINTHGSY
FCQDHVSKRCAPGFKVNSNTGICEDIDECEESSEVCKRNEVCINLPGAYNCKSKISTLPK
LATQNCQEGTRRRGSSCEDIDECREGTHLCDQFQNCINTFAGHECRCKNGFELDSTSGSC
VDIDECALKLDNCGSELRCLNVLGSFTCTRSTSTPPAPVYEYEYYDSEEDNSVIPSPETT
SSTTTSTTTSTTTTTTTPRPTTTSSTTTSTRPSTTPKPYQPRRYPNTPRRPFYHRSSTST
TTSTTSTTPPPVPKYPEWSDYPRENTTPKEVTVPKPDITNVIETDKEPDGSFVLNTNDIP
KDRWTNVINREHERFNPNWLHCLDGYERNERGECVDINECGANRHSCSSLEYCINTPGSY
DCECIPGFVRDPSGWCGVATTPSTSPSPPTQRPTTLRLTTSKPTTTSRPTTTPRPTRPPR
IPAARPTRPIPRITPRTTIRPTTTSTTSTTSTTSRSTNTNEVAPLTPTPAWYPSPSRGHL
SPVNCELGYTYNHNERKCVDIDECATQRASCGPTEDCVNTEGGYRCECGPRCLSRRQNTS
YTYHDNPPVISPDSNVITIGAQYGQRGPRYMRPTYKRLHDTGSVLTTCPWGYKLTPDRVC
MDLDECEMNISECGPQQRCENFYGGYSCQCPAGHWSNGKQCDDIDECSYGNTCSYNARCI
NTVGSYRCECSEGFRNAPSNDKVCVDVDECSEPEPLCEQVCVNAWGGYRCYCNRGYRLSN
DNRTCTDVDECAESGSRICTAQCVNTVGSYRCACPSGYRLADDKRSCLDIDECENGQARC
GGVGEVCQNTRGGYHCHQIKCPPGYRLEGKHKCARIQRSCPVSDWSCLQQPSTYSYNFIT
FVSNLYLPLGSVDLFSMQGPAWRDAVVNFEMRLLDVQAAPGVKPADITCFGMRPSSNVCV
ISLQCSLQGPQVAELELTMSLYQRSMFAGNAVARLVVIVSEYEY