DPGLEAN20685 in OGS1.0

New model in OGS2.0  DPOGS205117
Genomic Position  scaffold1001:+ 14590-20563
CDS Length  3783
Paired RNAseq reads  1787
Single RNAseq reads  4291
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005876 (0.0)
Best Drosophila hit  Nidogen/entactin, isoform A (1e-175)
Best Human hit  nidogen-1 precursor (5e-43)
Best NR hit (blastp)  PREDICTED: similar to nidogen [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to nidogen [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0005604 basement membrane
GO:0008218 bioluminescence
GO:0005509 calcium ion binding
GO:0016020 membrane
GO:0007160 cell-matrix adhesion
GO:0018298 protein-chromophore linkage
InterPro families
IPR009017 Green fluorescent protein-like
IPR006605 G2 nidogen/fibulin G2F
IPR000033 LDLR class B repeat
IPR003886 Nidogen, extracellular domain
IPR013091 EGF calcium-binding
IPR011042 Six-bladed beta-propeller, TolB-like
IPR006210 Epidermal growth factor-like
IPR001881 EGF-like calcium-binding
IPR000742 Epidermal growth factor-like, type 3
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
Orthology group  MCL11896

Nucleotide sequence:

ATGTGGTCAGCTGCGTTGGTCGTGTGCCTTGCGGCTTGCGCGCGCGCAATCACCCGCGAT
CAATTTTACCCCCACGGTCATGGATTGGACCAACGCTTACCCCGGGGAGCTGAGGTCTCT
TCCCCAGAAGTAACCCTGGCTGTACCAGTCGTGTTTTACGGTCAGACCTACGAGTCGATT
TTCGTTAATAACTTCGGAGTGCTTTCATTCAGAGCCGACATTCCGACATTTCTGAATGCT
GAATTCCCTCTACCATATCCATCGATAGCTGCGTTTTATACCAATATTAATACAACGGAT
GTTGGCACGGTCTACTATAGAGAAACAAATGAATCCCATGTGCTATTAAAAGCTGAGGAG
AGCGTTCAGAATAACTTCCACGATTATTATGACTTCATCCCGACCAGTGTGTTTATTGCC
ACTTGGATTGATGTCACTTACTCGGGTTCCCAGTGGCAGAATCGCAAGAATAGCTTTCAA
ATAGCTATCATAAGTAATGGGACGGAAACGTTCGTCGAGCTTTTGTATCCGGAGAGAGAA
ATTCAATGGATACAACGAGAAACAAAAGATGGCGGCCTACCGGATGCTAAAGCGCAAGCT
GGTTTTGTTGCGGAAGACGGACGTGTATTTACTCTAAGAGGTTCTGGCAGTCATCAGATC
AGGAATGTCGTTTCTTGGTCAAACATTCATGACCCTGGAAGATACGTCTATCGTGTAGGA
AATATCCCATTAGAGGGCACCATAGCCGTTCCTGATCAGTATGATCAATATGAGGCTGAA
GTTGAAGAAGAATCTAAAACTTGTGCCCAAAGTGGTCCTAGTGTATGCCACTTGCAAGCA
AGATGTGTTGATTATCAAGCCGGTATTTGTTGTCAATGTAATGAAGGTTTTTATGGCAAT
GGAAAATCGTGCATCAAAGACGATGTGCCGCTACGTGTACATGGAAAAATGAATGGTATT
ATTAACAATCAAAATTTAAATGATGTTGATATTCAGGCTTATGTGGTTCTAGCTGATGGC
AGATCATATACAGCTTTATCTCAGACACCGTCATCTTTAGGTAGCAGTTTGCAACTTCTT
AGTGTACTGGGAAGTGTCGTAGGTTGGCTTTTTGCCAAGCCATTAGGCGAAGCCCAAAAT
GGTTATCAGTTGACTGGTGGTCTTTTCAATCACACCGCAGATATTTACTTTCCTGAATCT
GGTGACAGAGTAACAATTAATCAAGAATACGTTGGACATGACGTGTTTGACCAAATAACT
TTGGACACTGATGTACGTGGTACAATCCCTAACGTACCTACAGGATCCAGACTAGAAGTG
TCTGAATACGATGAGCAATACACTATTGTCGAACCAGGTCTCATTCAAAGTGTATCAACA
CGGATATTCATGAACAAAATAACAGGACAAAAATATGAACAGAGAGTTTCGCAGACGTTT
ACTTATAGTCCATGCAAGTTTGCACCACCGTCTGAAAACGCAAATAAGCCGCTAACATTG
AAAGTTATAAAAAATTATTTAGGATACGAAACAAGAGGAAATATTGTTCGATATGGAACA
ACAAATAAAATTCAGTCAAATATTCAAGATCCCTGCGCAGTAGGCAGGAATTCTTGCGGT
CCCCATAGCACATGTGTTGTACAGGGTGATTCCTTTGTATGCGTATGTCAATTAGGTTTT
AAGAATAATAATGAAAATTGTATTGACATAAATGAATGTGAAGCCGGAACACATAACTGT
GACAATAACGCTGACTGTTACAATCAAGATGGCGACTACCAGTGTATATGCCGAGAGGGT
TACGAAGGGGATGGAATAAGCTGTAGAAGCATTTCAAATTGTAGGAACAAAGTCTGTGAT
CAGAATGCTCAGTGCACAGAAAATCCTCTTGAAGGCCCAGTCTGTGTATGTAATCCAGGA
TTTACTGGAGACGGGGAAAGATGTTGGACCGCATACTATAATGCGTGCATTAACTGCTCC
CCAAATGCTCAATGTCGACGGTCAGATGACAGTAATACCGAAAGATGTTATTGCAATCCC
GGTTTTATTGGTGACGGACAATCTTGTGTGGAAGAAGTGACAACTGAACCATACGAGCCA
GAGACCACCTCAGTGTCTGTTGCTTTTACACAGTCGACTACTACAGTAATACCTGAAAGT
GAATACAATCAAACCTATGTTTTACCTAACTGTGATCTTTACGAATGCATTTGTCCTCCA
GGATATTCAAGTTTCAAAGATGATAGAAATAACGACCTGTGTCGTCTTGATAACAATGAC
CAGGAGAATGATTTGGATGAAAACAAATACAACTCGAATTCTATGAGGTGTACTGCGGAC
GCCGATTGTCCACCAAACGCGGTATGCGCGTTTAGCTACTACTACTCTTCAGACGATTCT
GGTTTAGGACATTGTGTTTGTCCGGAAGGATATGAAGGTGACGCATATGAGTGTATTGAA
AAAACAGGACCCAGTTGTTCCTGTGGTCCTGCGGCCCATTGTATCGATACAGTAGGCGGC
CAGCTCATATGTGTATGCGATGCTGGTTATCATGGGGATGGTTATATATGCCGTCCGAAC
TTCAGCTGTACAAACAATTCAGACTGTGAATACAACGCTGAATGTCGACCTGATGCAAGC
ACCAATGAATACGTTTGTCAGTGTATAGAAGGATATGTCAAAGATGAGAGCGATGCGTGT
ATTAAAGATGGACAGCTCTGTAATGGCGCCGTATGTTCAGAACACGCTTCCTGCTTATAC
GACGCCGCTATCGATATTAGTTATTGTTACTGTGATGAGGGATATGATGGTGATGGTATT
TCTAAATGTGTCCCCAAAGGAAAAACCTGTGACGTTGCCAATGATTGCGATCCGAATGCC
ATTTGTACTCCAACAGAAATTTCTTATCAGTGTATCTGTCGTGAGGGCTTTACTGGCGAC
GGCTATACCTGTACCCCAGAAATGAATTGCAAATATAATATATATTTATGTGATGATCAT
GCTTCGTGTTTAAAGACGAGCGATGGGTATGAGTGCGAGTGTAATACTGGTTATAACGGT
AATGGAACTCATTGCCAGCTCAATCCACGACAGGCCGGGAACTTCCTGGTGGCGAGCGAT
GGCGCTTCCGTTTATCGCGTACCATTCAGAGTGACACCGAGGGAGTTTGCAGCTCCAATA
AACAGCGGTGCAATTCAGATAGCTGTGGGCATAGACGTAGACTGTTTGACCGGAAAAATT
TATTGGGGAGACGTCAGTGGTGCCACAATCAAACGAGCGTCATATGATGGTTCTGGATTC
GAGTCGTTCCTATCAAATGATGTCCAATCGCCAGAAGGTTTGTCAGTGGACTGGTCAGCT
AGAAACGTCTTTTGGACGGACTCGAAAAAGTTGACTATTGAGGTAGCCAACATTGACACT
AAAATAAGGAAAGTCTTGTTCCAAAGAGAAGGTATACACAATCCAAGGGGTATAGCCGTT
CATCCAGGGAAAGGTAAAATCTTTTGGAGCGACTGGAATCGCGGTGGACCAAAGATAGAG
TGGGCGAGTATGGATGGTTCTCAGAGGGGTATCTTTTTGGACCAATCAGATGTAAAATTG
CCAAACTCATTGGCCATAGATTGGTCCAGAGATAGACTGTGTTACTCCGACGCTGGGTTT
GCTAGCATAAAGTGCGTCGGTATAGATACCTTGGAAAAGGAAACCATAGCTGTGAATTGC
TCGTATCCATTTGGTTTGGCCATCAGTGGAGATACTTACTACTGGACCGATTGGAAAACG
TAA

Protein sequence:

MWSAALVVCLAACARAITRDQFYPHGHGLDQRLPRGAEVSSPEVTLAVPVVFYGQTYESI
FVNNFGVLSFRADIPTFLNAEFPLPYPSIAAFYTNINTTDVGTVYYRETNESHVLLKAEE
SVQNNFHDYYDFIPTSVFIATWIDVTYSGSQWQNRKNSFQIAIISNGTETFVELLYPERE
IQWIQRETKDGGLPDAKAQAGFVAEDGRVFTLRGSGSHQIRNVVSWSNIHDPGRYVYRVG
NIPLEGTIAVPDQYDQYEAEVEEESKTCAQSGPSVCHLQARCVDYQAGICCQCNEGFYGN
GKSCIKDDVPLRVHGKMNGIINNQNLNDVDIQAYVVLADGRSYTALSQTPSSLGSSLQLL
SVLGSVVGWLFAKPLGEAQNGYQLTGGLFNHTADIYFPESGDRVTINQEYVGHDVFDQIT
LDTDVRGTIPNVPTGSRLEVSEYDEQYTIVEPGLIQSVSTRIFMNKITGQKYEQRVSQTF
TYSPCKFAPPSENANKPLTLKVIKNYLGYETRGNIVRYGTTNKIQSNIQDPCAVGRNSCG
PHSTCVVQGDSFVCVCQLGFKNNNENCIDINECEAGTHNCDNNADCYNQDGDYQCICREG
YEGDGISCRSISNCRNKVCDQNAQCTENPLEGPVCVCNPGFTGDGERCWTAYYNACINCS
PNAQCRRSDDSNTERCYCNPGFIGDGQSCVEEVTTEPYEPETTSVSVAFTQSTTTVIPES
EYNQTYVLPNCDLYECICPPGYSSFKDDRNNDLCRLDNNDQENDLDENKYNSNSMRCTAD
ADCPPNAVCAFSYYYSSDDSGLGHCVCPEGYEGDAYECIEKTGPSCSCGPAAHCIDTVGG
QLICVCDAGYHGDGYICRPNFSCTNNSDCEYNAECRPDASTNEYVCQCIEGYVKDESDAC
IKDGQLCNGAVCSEHASCLYDAAIDISYCYCDEGYDGDGISKCVPKGKTCDVANDCDPNA
ICTPTEISYQCICREGFTGDGYTCTPEMNCKYNIYLCDDHASCLKTSDGYECECNTGYNG
NGTHCQLNPRQAGNFLVASDGASVYRVPFRVTPREFAAPINSGAIQIAVGIDVDCLTGKI
YWGDVSGATIKRASYDGSGFESFLSNDVQSPEGLSVDWSARNVFWTDSKKLTIEVANIDT
KIRKVLFQREGIHNPRGIAVHPGKGKIFWSDWNRGGPKIEWASMDGSQRGIFLDQSDVKL
PNSLAIDWSRDRLCYSDAGFASIKCVGIDTLEKETIAVNCSYPFGLAISGDTYYWTDWKT
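
Consistency check (illustrative):

The record lists a CDS length of 3783 nt, and the nucleotide sequence ends in a TAA stop codon. The sketch below shows one way to check that a CDS length agrees with its protein translation. The helper names `clean_sequence` and `expected_protein_length` are hypothetical and are not part of the database or of any library. The sketch assumes the CDS is a complete open reading frame whose final codon is a stop codon.

```python
def clean_sequence(block: str) -> str:
    """Join a wrapped sequence block into one uppercase string."""
    # Remove line breaks and any other whitespace left by the 60-column wrap.
    return "".join(block.split()).upper()


def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of residues encoded by a CDS of the given length.

    Assumes a complete reading frame; the stop codon (if counted in
    cds_length) does not contribute a residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # CDS length taken from the record above.
    print(expected_protein_length(3783))  # 1260
```

Under these assumptions, 3783 nt corresponds to 1261 codons, that is 1260 residues plus the stop codon. This matches the protein sequence listed above (21 lines of 60 residues).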