New model in OGS2.0 | DPOGS205117  |
---|---|
Genomic Position | scaffold1001:+ 14590-20563 |
See gene structure | |
CDS Length | 3783 |
Paired RNAseq reads   | 1787 |
Single RNAseq reads   | 4291 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005876 (0.0) |
Best Drosophila hit   | Nidogen/entactin, isoform A (1e-175) |
Best Human hit | nidogen-1 precursor (5e-43) |
Best NR hit (blastp)   | PREDICTED: similar to nidogen [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to nidogen [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005604 basement membrane GO:0008218 bioluminescence GO:0005509 calcium ion binding GO:0016020 membrane GO:0007160 cell-matrix adhesion GO:0018298 protein-chromophore linkage |
InterPro families    | IPR009017 Green fluorescent protein-like IPR006605 G2 nidogen/fibulin G2F IPR000033 LDLR class B repeat IPR003886 Nidogen, extracellular domain IPR013091 EGF calcium-binding IPR011042 Six-bladed beta-propeller, TolB-like IPR006210 Epidermal growth factor-like IPR001881 EGF-like calcium-binding IPR000742 Epidermal growth factor-like, type 3 IPR000152 EGF-type aspartate/asparagine hydroxylation site IPR013032 EGF-like region, conserved site IPR018097 EGF-like calcium-binding, conserved site |
Orthology group | MCL11896 |
Nucleotide sequence:
ATGTGGTCAGCTGCGTTGGTCGTGTGCCTTGCGGCTTGCGCGCGCGCAATCACCCGCGAT
CAATTTTACCCCCACGGTCATGGATTGGACCAACGCTTACCCCGGGGAGCTGAGGTCTCT
TCCCCAGAAGTAACCCTGGCTGTACCAGTCGTGTTTTACGGTCAGACCTACGAGTCGATT
TTCGTTAATAACTTCGGAGTGCTTTCATTCAGAGCCGACATTCCGACATTTCTGAATGCT
GAATTCCCTCTACCATATCCATCGATAGCTGCGTTTTATACCAATATTAATACAACGGAT
GTTGGCACGGTCTACTATAGAGAAACAAATGAATCCCATGTGCTATTAAAAGCTGAGGAG
AGCGTTCAGAATAACTTCCACGATTATTATGACTTCATCCCGACCAGTGTGTTTATTGCC
ACTTGGATTGATGTCACTTACTCGGGTTCCCAGTGGCAGAATCGCAAGAATAGCTTTCAA
ATAGCTATCATAAGTAATGGGACGGAAACGTTCGTCGAGCTTTTGTATCCGGAGAGAGAA
ATTCAATGGATACAACGAGAAACAAAAGATGGCGGCCTACCGGATGCTAAAGCGCAAGCT
GGTTTTGTTGCGGAAGACGGACGTGTATTTACTCTAAGAGGTTCTGGCAGTCATCAGATC
AGGAATGTCGTTTCTTGGTCAAACATTCATGACCCTGGAAGATACGTCTATCGTGTAGGA
AATATCCCATTAGAGGGCACCATAGCCGTTCCTGATCAGTATGATCAATATGAGGCTGAA
GTTGAAGAAGAATCTAAAACTTGTGCCCAAAGTGGTCCTAGTGTATGCCACTTGCAAGCA
AGATGTGTTGATTATCAAGCCGGTATTTGTTGTCAATGTAATGAAGGTTTTTATGGCAAT
GGAAAATCGTGCATCAAAGACGATGTGCCGCTACGTGTACATGGAAAAATGAATGGTATT
ATTAACAATCAAAATTTAAATGATGTTGATATTCAGGCTTATGTGGTTCTAGCTGATGGC
AGATCATATACAGCTTTATCTCAGACACCGTCATCTTTAGGTAGCAGTTTGCAACTTCTT
AGTGTACTGGGAAGTGTCGTAGGTTGGCTTTTTGCCAAGCCATTAGGCGAAGCCCAAAAT
GGTTATCAGTTGACTGGTGGTCTTTTCAATCACACCGCAGATATTTACTTTCCTGAATCT
GGTGACAGAGTAACAATTAATCAAGAATACGTTGGACATGACGTGTTTGACCAAATAACT
TTGGACACTGATGTACGTGGTACAATCCCTAACGTACCTACAGGATCCAGACTAGAAGTG
TCTGAATACGATGAGCAATACACTATTGTCGAACCAGGTCTCATTCAAAGTGTATCAACA
CGGATATTCATGAACAAAATAACAGGACAAAAATATGAACAGAGAGTTTCGCAGACGTTT
ACTTATAGTCCATGCAAGTTTGCACCACCGTCTGAAAACGCAAATAAGCCGCTAACATTG
AAAGTTATAAAAAATTATTTAGGATACGAAACAAGAGGAAATATTGTTCGATATGGAACA
ACAAATAAAATTCAGTCAAATATTCAAGATCCCTGCGCAGTAGGCAGGAATTCTTGCGGT
CCCCATAGCACATGTGTTGTACAGGGTGATTCCTTTGTATGCGTATGTCAATTAGGTTTT
AAGAATAATAATGAAAATTGTATTGACATAAATGAATGTGAAGCCGGAACACATAACTGT
GACAATAACGCTGACTGTTACAATCAAGATGGCGACTACCAGTGTATATGCCGAGAGGGT
TACGAAGGGGATGGAATAAGCTGTAGAAGCATTTCAAATTGTAGGAACAAAGTCTGTGAT
CAGAATGCTCAGTGCACAGAAAATCCTCTTGAAGGCCCAGTCTGTGTATGTAATCCAGGA
TTTACTGGAGACGGGGAAAGATGTTGGACCGCATACTATAATGCGTGCATTAACTGCTCC
CCAAATGCTCAATGTCGACGGTCAGATGACAGTAATACCGAAAGATGTTATTGCAATCCC
GGTTTTATTGGTGACGGACAATCTTGTGTGGAAGAAGTGACAACTGAACCATACGAGCCA
GAGACCACCTCAGTGTCTGTTGCTTTTACACAGTCGACTACTACAGTAATACCTGAAAGT
GAATACAATCAAACCTATGTTTTACCTAACTGTGATCTTTACGAATGCATTTGTCCTCCA
GGATATTCAAGTTTCAAAGATGATAGAAATAACGACCTGTGTCGTCTTGATAACAATGAC
CAGGAGAATGATTTGGATGAAAACAAATACAACTCGAATTCTATGAGGTGTACTGCGGAC
GCCGATTGTCCACCAAACGCGGTATGCGCGTTTAGCTACTACTACTCTTCAGACGATTCT
GGTTTAGGACATTGTGTTTGTCCGGAAGGATATGAAGGTGACGCATATGAGTGTATTGAA
AAAACAGGACCCAGTTGTTCCTGTGGTCCTGCGGCCCATTGTATCGATACAGTAGGCGGC
CAGCTCATATGTGTATGCGATGCTGGTTATCATGGGGATGGTTATATATGCCGTCCGAAC
TTCAGCTGTACAAACAATTCAGACTGTGAATACAACGCTGAATGTCGACCTGATGCAAGC
ACCAATGAATACGTTTGTCAGTGTATAGAAGGATATGTCAAAGATGAGAGCGATGCGTGT
ATTAAAGATGGACAGCTCTGTAATGGCGCCGTATGTTCAGAACACGCTTCCTGCTTATAC
GACGCCGCTATCGATATTAGTTATTGTTACTGTGATGAGGGATATGATGGTGATGGTATT
TCTAAATGTGTCCCCAAAGGAAAAACCTGTGACGTTGCCAATGATTGCGATCCGAATGCC
ATTTGTACTCCAACAGAAATTTCTTATCAGTGTATCTGTCGTGAGGGCTTTACTGGCGAC
GGCTATACCTGTACCCCAGAAATGAATTGCAAATATAATATATATTTATGTGATGATCAT
GCTTCGTGTTTAAAGACGAGCGATGGGTATGAGTGCGAGTGTAATACTGGTTATAACGGT
AATGGAACTCATTGCCAGCTCAATCCACGACAGGCCGGGAACTTCCTGGTGGCGAGCGAT
GGCGCTTCCGTTTATCGCGTACCATTCAGAGTGACACCGAGGGAGTTTGCAGCTCCAATA
AACAGCGGTGCAATTCAGATAGCTGTGGGCATAGACGTAGACTGTTTGACCGGAAAAATT
TATTGGGGAGACGTCAGTGGTGCCACAATCAAACGAGCGTCATATGATGGTTCTGGATTC
GAGTCGTTCCTATCAAATGATGTCCAATCGCCAGAAGGTTTGTCAGTGGACTGGTCAGCT
AGAAACGTCTTTTGGACGGACTCGAAAAAGTTGACTATTGAGGTAGCCAACATTGACACT
AAAATAAGGAAAGTCTTGTTCCAAAGAGAAGGTATACACAATCCAAGGGGTATAGCCGTT
CATCCAGGGAAAGGTAAAATCTTTTGGAGCGACTGGAATCGCGGTGGACCAAAGATAGAG
TGGGCGAGTATGGATGGTTCTCAGAGGGGTATCTTTTTGGACCAATCAGATGTAAAATTG
CCAAACTCATTGGCCATAGATTGGTCCAGAGATAGACTGTGTTACTCCGACGCTGGGTTT
GCTAGCATAAAGTGCGTCGGTATAGATACCTTGGAAAAGGAAACCATAGCTGTGAATTGC
TCGTATCCATTTGGTTTGGCCATCAGTGGAGATACTTACTACTGGACCGATTGGAAAACG
TAA
Protein sequence:
MWSAALVVCLAACARAITRDQFYPHGHGLDQRLPRGAEVSSPEVTLAVPVVFYGQTYESI
FVNNFGVLSFRADIPTFLNAEFPLPYPSIAAFYTNINTTDVGTVYYRETNESHVLLKAEE
SVQNNFHDYYDFIPTSVFIATWIDVTYSGSQWQNRKNSFQIAIISNGTETFVELLYPERE
IQWIQRETKDGGLPDAKAQAGFVAEDGRVFTLRGSGSHQIRNVVSWSNIHDPGRYVYRVG
NIPLEGTIAVPDQYDQYEAEVEEESKTCAQSGPSVCHLQARCVDYQAGICCQCNEGFYGN
GKSCIKDDVPLRVHGKMNGIINNQNLNDVDIQAYVVLADGRSYTALSQTPSSLGSSLQLL
SVLGSVVGWLFAKPLGEAQNGYQLTGGLFNHTADIYFPESGDRVTINQEYVGHDVFDQIT
LDTDVRGTIPNVPTGSRLEVSEYDEQYTIVEPGLIQSVSTRIFMNKITGQKYEQRVSQTF
TYSPCKFAPPSENANKPLTLKVIKNYLGYETRGNIVRYGTTNKIQSNIQDPCAVGRNSCG
PHSTCVVQGDSFVCVCQLGFKNNNENCIDINECEAGTHNCDNNADCYNQDGDYQCICREG
YEGDGISCRSISNCRNKVCDQNAQCTENPLEGPVCVCNPGFTGDGERCWTAYYNACINCS
PNAQCRRSDDSNTERCYCNPGFIGDGQSCVEEVTTEPYEPETTSVSVAFTQSTTTVIPES
EYNQTYVLPNCDLYECICPPGYSSFKDDRNNDLCRLDNNDQENDLDENKYNSNSMRCTAD
ADCPPNAVCAFSYYYSSDDSGLGHCVCPEGYEGDAYECIEKTGPSCSCGPAAHCIDTVGG
QLICVCDAGYHGDGYICRPNFSCTNNSDCEYNAECRPDASTNEYVCQCIEGYVKDESDAC
IKDGQLCNGAVCSEHASCLYDAAIDISYCYCDEGYDGDGISKCVPKGKTCDVANDCDPNA
ICTPTEISYQCICREGFTGDGYTCTPEMNCKYNIYLCDDHASCLKTSDGYECECNTGYNG
NGTHCQLNPRQAGNFLVASDGASVYRVPFRVTPREFAAPINSGAIQIAVGIDVDCLTGKI
YWGDVSGATIKRASYDGSGFESFLSNDVQSPEGLSVDWSARNVFWTDSKKLTIEVANIDT
KIRKVLFQREGIHNPRGIAVHPGKGKIFWSDWNRGGPKIEWASMDGSQRGIFLDQSDVKL
PNSLAIDWSRDRLCYSDAGFASIKCVGIDTLEKETIAVNCSYPFGLAISGDTYYWTDWKT