New model in OGS2.0 | DPOGS203716  |
---|---|
Genomic Position | scaffold95:+ 137456-166685 |
See gene structure | |
CDS Length | 6102 |
Paired RNAseq reads   | 1390 |
Single RNAseq reads   | 3209 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003503 (2e-09) |
Best Drosophila hit   | terribly reduced optic lobes, isoform A (8e-36) |
Best Human hit | agrin precursor (6e-98) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC004709 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to agrin [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005515 protein binding GO:0005576 extracellular region GO:0005578 proteinaceous extracellular matrix GO:0005604 basement membrane GO:0005605 basal lamina GO:0005615 extracellular space GO:0007009 plasma membrane organization GO:0007213 muscarinic acetylcholine receptor signaling pathway GO:0007268 synaptic transmission GO:0007416 synapse assembly GO:0007528 neuromuscular junction development GO:0008582 regulation of synaptic growth at neuromuscular junction GO:0009986 cell surface GO:0010469 regulation of receptor activity GO:0030548 acetylcholine receptor regulator activity GO:0043113 receptor clustering GO:0043236 laminin binding GO:0045202 synapse GO:0045213 neurotransmitter receptor metabolic process GO:0045944 positive regulation of transcription from RNA polymerase II promoter GO:0031012 extracellular matrix |
InterPro families    | IPR011497 Protease inhibitor, Kazal-type IPR012680 Laminin G, subdomain 2 IPR002350 Proteinase inhibitor I1, Kazal IPR002049 EGF-like, laminin IPR006209 EGF IPR001791 Laminin G domain IPR000742 Epidermal growth factor-like, type 3 IPR008985 Concanavalin A-like lectin/glucanase IPR013032 EGF-like region, conserved site IPR003645 Follistatin-like, N-terminal IPR006210 Epidermal growth factor-like IPR001881 EGF-like calcium-binding IPR013320 Concanavalin A-like lectin/glucanase, subgroup |
Orthology group | MCL13114 |
Nucleotide sequence:
ATGAACAGGTCGGGCTACGATTACAATGTACCTTTGTCAACAACCTACTATGCATCAAAT
TCAAGACCACCACAGTATGAAAGAGATTACGGACAACTTGCTTCCGTACCTGTGTACAGT
CCTTCAACGCACCCACTTTCAACGCTCCGGGCGGAAGAAAAACATTGGTATATAGAAAAC
CATAAAAATATTCAAAGTAATTATAATGAAAACAGTGACGTCATTTCTCGAGAAACAAAA
AATAAAAAAGAAGAAGAGAAAAAAGTTGATGTACAAGCTACGAGATCTGACGCTGCGCAG
TTAAGACAGAGAAGAAAGGAAGACAGGTGTTCTTGTGAATGTTGGCCTAGTACAGACAAT
AATGTAGTACAAGATACTGGAGCGAACGACACGACCCGATCGCTTTCAGGCGTCGACATG
GTAGCATCGTATGGAGTATACGGTAATTCACAAGATGAGGAGAGCTCGGGATTTGCTCAT
CTTTCTGAAAAAAATATATCATATGGTCAGAGAAATGAAACTCCGGTTCAGGGTGTGATA
GTTAATCAACAAGGAAATAGCGCAGTGGTACGGTCTACATCCGAACCTGATTCTTCGTGT
AGCAAATGTTGTCCGTGGTCATCTAATGCCCCAGCAGCTAACTCAGAGGATTACTGTTGG
ATACACTACTGCCGACCAGCACTTATAGTTCTGTTGTTAATATTTTTTCTAATACTACTT
GGCGTGTCAACAGGGTTCTTATTGACAAATAATTATTTACATTATCAACCTCGTCCACCT
GCTCCTGAAGAAGCTTGTGAGAAGACATTTTGTCGATGGGGCGCAGAGTGTGTGTCATTA
GGCGACGGACGCGCCCATTGCGCTTGTCCCACTTCCTGTCCGTCCTCTGCGTCACCCGTT
TGTTCCACCGCTGGAAGAACTTACCGTAACCATTGTTTTTTGCGTAAAGAAGCCTGCGAG
CGAAGGCTCAATTTACGCGTCAAGCATGAGGGTGAATGTGAAGCGGGAGATCCTTGTTCC
GGACTAACCTGTCCCACTGGTGCGAGATGCGTAGTTACCTATGGCAACGCGGAATGTCGG
TGTCCCCGTAATTGTCAACGCCGAAAAACCGTCTGCGGCAGCGATGGCCGGGAGTATCCA
TCTACTTGCCATCTGGATAAGCACGCTTGTGACAATCAGATAAACATCACTATCAAGTAT
CACGGCAAATGTGATCCTTGCCTGGAACATGAATGCGTTGATGGAGGAATATGTCAACTG
AATGAAGTTCGTGCACCAGTATGTAGGTGTGGTCCACCTTGTAACTTAATCGTTCGACCA
GGCTCGGCCGTTTGCGGTTCAGATTTCAAGGACTATGCTAGCGAGTGTTCGCTTCGCAGA
GAATCATGCAGAACGAGACAGCAGCTTTCGATAGCTTATAGAGGAGAGTGTGCCTCAGCT
CAACATCCCTGCGAATCTGTGAAGTGTGGCATCCGAGAGCGTTGCATATTGGATGCACGA
GGTGTGGCTGTATGTGGCTGTGGCCCAGAATGTGAAGACGTTCTGCGACCAGTCTGTGGA
AGCGACGATCGCACATATGCAAGCCCTTGTTTATTAGAGCGAACAGCATGTTTGGAAAAC
CGAGATGTGCGGGTCGCTTATATGGGAGCTTGTGGTCTGGAAAATCCATGCGCACGTGCA
ACTTGTCCTTGGGGTGGAGCATGTGTGACTCGCAGCGGCGCTGCTCACTGTTCGTGTCCA
GTATGCGATGCAACTCTCTCTCCGGTTTGCGCATCTGACCACAACACTTACGGGAGCGAG
TGCAAAATGCGAATGCACGCGTGTCAGGAGAGTCTTAAGGAGGGAGAGCTTCGGGTGCTC
TATAACGGCACTTGTCAAACATGTGCTGATGTTATGTGCATGGGCGACGGGACTTGTGAA
ATGGACGAAACAGGCCGTCCAATCTGCCACTGCAACCACAATTGCACAGCACAGGAATCA
GACGTCGTATGCGGTACTGACGGCCAGACTTATCAGTCACAATGTGAGCTCGATTTGACA
GCGTGTCGCGACCAGAGCGAGCTGCGGTCGGCCTACAGTGGAGATTGTGCTCTCTGCAAT
GGCGTTCAGTGCTCCTATGGTGCTCATTGTGTAGCTGGAGAGTGTATTTGCCCTACGGAT
TGTAGCGGTGCTCCTCGAGAACCTGTATGTGGAAGCACTATGCAAACATATCAAAATGAA
TGTGAACTTCAGAAAGCAGCATGTAACCTTCCTCCTCCAACCAAACTTCATGTGATTTTT
TATGGAGATTGCAAGGATAGGCTGGCAGTGGTTCCACCAATAGCTATGACTACGACTGCG
ATGATAACAACTGAAGCCGATGAAAGTTCAACTGATATTGTGGAAGTGACACAGAAAGTA
GATACGACAAGTCCTTCAGCTTGCCGCGATATCCGCTGTGACTTTGACGCCAGTTGTGAA
ATTGGTTACGACGGATATCCGCGTTGTTCTTGCTTATTTGAATGTCCAGCCGACGATGAA
TATTTTCCGGTTTGCGCTTCTGACTTTCGACTTTACCCAAGCTTATGCGCTATGCGGAAG
GAAGGCTGTCAAAAACAGTTGGAACTTAGATTAAGACCTCTGGATTTATGTAAAGGTATG
GAAGTTAGACCTTGCGGAAATAATAGAGCTATAATAGACAAATCGTCAGGTCTTGAAATA
GATTGCGGCAACGGACCCCATCGTCAAGACTGTCCTGCTGGAAGTTACTGCCATATCACT
TTGACCGCCGCAAAGTGCTGCCCTAAAAACGACACAAAACAAGTAGATGAAAGGAAGACG
ACTACCCACTGTTCGGAGAGCGCCTACGGTTGTTGTTCAGATGGCTCGACCGCAGCAAGT
GGCCCGGGTGAAGAGGGTTGTCCAATTACGACTTCAACTTGTGGCTGCAACCGCCTAGGG
TCAATATCTGATCGGTGTGATGATAGCGGCCAGTGTGTATGTCGCCCCGGTGTAGGGGGC
CTTAAGTGTGATAGATGTGAGCCTGGATACTGGGGTCTGCCTCGCATAGGCTCAGGACAT
ACTGGATGTATTCCATGCGGTTGTTCTGCATTTGGTTCTGTAAGAGAGGATTGTGAGCAA
ATGACCGGTCGATGCGTGTGCCGTACTGGGGTTCAGGGTCAGAAGTGTACCGTGTGCGCA
GACCATCGCCGTCGCTTAGGACCTAACGGCTGTTCCGATCCGGAAAACGGCAGTTCGGTG
GAGTCATGTGCGGATTTGTCATGTTATTTCGGAGCAGTGTGCACAGAACGTACGGGTGGA
GCGTTGTGTGAGTGTGCTGCTGCCAACTGTCCTGACAGTGATCTTAACATGATGGTTTGC
GGCAGTGACGGTAAAACTTACGAATCAGAATGCCACCTGAAATTGCAAGCCTGTCGCACT
CAAGAGGATATTGTCGTTCAAGCATTTGGTCCTTGCAAGTTGTCCGAAGCATCGGGAACT
GCGGGGCCACCACGACCGTCTTCACCGATACAGTTCACTCAACAAGATGATGGAGCAGCT
TCGAAGTCTACAAGGCATCTGCTTAACCCTGACAAATATTACAATAAATATGATTGGACA
AGGAAAGAAACTCCAAGTGATTTTGAAAACATTGTATCGGGCCAGAAAGTAAAAGGTTCG
CAAACAGCGACAACAGCAACAGTGGGTGCGGTGGGAGCATTGCTTGGGGACTTATGTGCT
GAAGACGCAGACTGTGCAGCTTTGCCGGGTGCTCTCTGTACACGTGGTGGCTGTGTGTGT
CGTCCGGGTTACACACCCACTGCGCATAGGAAGGCTTGTATTGAAGAATTTCCTCAAGAA
ACCACAGAAGAATACAGTGCATGTTTGTCAGATCCTTGTTATAATTTTGGAACTTGCATT
GACCTGCCAGGTTCCACTTATACTTGTGTCTGCTCCGAATCCTATACCGGCTCGAACTGT
GAATCACTTATCAAAGACGGTCCACCGATTACGTACATCGAAACTCCATCATTTGTCGGT
TCGTCTTACATACGCTTAAGACCACTGAAGGCTTACCATAAGTTGAACATCGATATCGAA
TTTAAGGCATTCTCAGAAAATGGGGTCTTATTGTACAATCAGCAGAAACTCGACGGAACG
GGTGATTTTGTTTCGCTAGCGTTGGTAAACGGATATTTGGAGTTTAGGTACAATCTTGGG
AATGGAGTAATAATTTTAACATCTTTAGAAAAGATATCATTGAACGAATATCATAAAGTG
TCGGCTAAAAGATATCACCGCGATGGTATTCTGACGGTAGATGATATGGAAGATGTAGCA
GGACAGTCAGATGGTAATTTAAAAGCTTTAGATCTTGCGGACGATGCCTTCATTGGTAGT
GTACCAAGTAATTACACGAGGGTATTCGAAAATATTGGAACCCGAAACGGTTTTATTGGT
TGTATTAAATATTTGAGAATTATTCGGCACCAAATAACGAAGAAATTGGGTCGTCCAGAC
TCATTAGTTGTCGCCATGGAAAACGTTCGAGAGTGTCAATCTAATCCTTGTATGAGTATG
CCGTGTAGAAATGGGGCCACGTGTCAGGCTGTTGAAGGTTCGGTGACTGAATATACATGT
AGCTGTCCTTTCGGATTTCAAGGAGCCAATTGTAACGAGAGAATAGATCCATGCGAATCC
AATCCCTGTGGATATGATGAGGGGTTATTGTGTGATATTGGTCCTGACGGCGGACATATT
TGTCGGTGTTTGTTTGGGGGAAATATCGAATCCGATGGAAATAATTGCAATAAAGATGTT
AATGTTATCCATGAAACTTGGTCACCTCAATTCAATGGTACTAGCTATATCGAGTTACCG
CCGCTCGAGGGCTTGGGAAAAGCATTCCGTATCGAAATTTGGTTTTTAACGAACCGTTTT
TCCGGAATGCTTCTTTATACCGGGCAGTCAAATAAAGCCAAGGGGGATTTTATAGCGATT
AACTTGGTTAATGGATATCTACAGTTTAGGTATAATTTAGGAAGTGGAATTGCAAACATC
ACTTCCCCAACACCGATAACTAAGGGGCAATGGCATCGCGTTCGTGTAAGCCGAGTTGGC
AGACATGGTAGTCTACAGCTTGATCAGTTGCCTGTACAGCGCGGCCTTTCCCCACCACCC
CTCACTCACTTGGAGCTTAATCTCCCTCTATTTATTGGTTCTTTGCCTGCTTACGTCCGT
CCTCACAAAATGTCCGGGGTGACCAGCAGTTTTATCGGGGTTATGCAACAGGTATTCGTA
AACGGCAATCCACTATCGCTATACAGTGAGGATACAGCAAAATGCTTTGTAGTTGCTGAG
GAAGAGCGACTTCCGTGTGCTACGAGCGGCGTCACTAAATACACCGGACCGCCGTGTGGT
GATGATCTAACCCCTTGTAAAAATAATGGTTCGTGCGTACCGTTATTGAACGAATATAAA
TGTATATGCCCAGACGGATATCAAGGACGGAACTGTGAGCTCCAATTAAAAGTAGAGATG
TTAAACGATGGAGCGCCAATTAAGTTCGACGGAAATAATTACTACTCCTACAGAAGTCGT
GGCGGCCGTAGGAACCGTGGATTTCGTGGTATTAGATATGAAATAAAGTTTCGGACTTAT
AATAACTCCGGTCTTTTAATGTGGAGACGAAAAATTGGTATACGACCCCGGGACTTCATC
GGACTCGGATTAAGTAATGGAAAATTACAATTAATATACACTGACACAGATGTAAAAGAG
AACAGTTTGGCTTTGAACGAGGAGTGGTTTCAAAGTGTTGAATCGAAGGAGAGAGTAGAT
GATGGACGTTGGCATACAGCGACTGTTAGAAGAAGGAAGCGGCTCGCAATGTTGCAAGTA
GACGATACACCGCCTGTGAGGGGGTACTCGCAATCATTGCTGGTACCTTCGAAAGCTAAT
CCAAAGTTATGGATAGGAGGATCTCCATCGCTTCCTTTAGGATTGCCAGGGGACCTTTAC
TCAGGATTCCGAGGTTGTATCGCCAGCGTGAAGTCTAACGGTAGGCACATCGACATTACA
ACACCTATACGACCGACGACTACAATACGATATTGTGATTAA
Protein sequence:
MNRSGYDYNVPLSTTYYASNSRPPQYERDYGQLASVPVYSPSTHPLSTLRAEEKHWYIEN
HKNIQSNYNENSDVISRETKNKKEEEKKVDVQATRSDAAQLRQRRKEDRCSCECWPSTDN
NVVQDTGANDTTRSLSGVDMVASYGVYGNSQDEESSGFAHLSEKNISYGQRNETPVQGVI
VNQQGNSAVVRSTSEPDSSCSKCCPWSSNAPAANSEDYCWIHYCRPALIVLLLIFFLILL
GVSTGFLLTNNYLHYQPRPPAPEEACEKTFCRWGAECVSLGDGRAHCACPTSCPSSASPV
CSTAGRTYRNHCFLRKEACERRLNLRVKHEGECEAGDPCSGLTCPTGARCVVTYGNAECR
CPRNCQRRKTVCGSDGREYPSTCHLDKHACDNQINITIKYHGKCDPCLEHECVDGGICQL
NEVRAPVCRCGPPCNLIVRPGSAVCGSDFKDYASECSLRRESCRTRQQLSIAYRGECASA
QHPCESVKCGIRERCILDARGVAVCGCGPECEDVLRPVCGSDDRTYASPCLLERTACLEN
RDVRVAYMGACGLENPCARATCPWGGACVTRSGAAHCSCPVCDATLSPVCASDHNTYGSE
CKMRMHACQESLKEGELRVLYNGTCQTCADVMCMGDGTCEMDETGRPICHCNHNCTAQES
DVVCGTDGQTYQSQCELDLTACRDQSELRSAYSGDCALCNGVQCSYGAHCVAGECICPTD
CSGAPREPVCGSTMQTYQNECELQKAACNLPPPTKLHVIFYGDCKDRLAVVPPIAMTTTA
MITTEADESSTDIVEVTQKVDTTSPSACRDIRCDFDASCEIGYDGYPRCSCLFECPADDE
YFPVCASDFRLYPSLCAMRKEGCQKQLELRLRPLDLCKGMEVRPCGNNRAIIDKSSGLEI
DCGNGPHRQDCPAGSYCHITLTAAKCCPKNDTKQVDERKTTTHCSESAYGCCSDGSTAAS
GPGEEGCPITTSTCGCNRLGSISDRCDDSGQCVCRPGVGGLKCDRCEPGYWGLPRIGSGH
TGCIPCGCSAFGSVREDCEQMTGRCVCRTGVQGQKCTVCADHRRRLGPNGCSDPENGSSV
ESCADLSCYFGAVCTERTGGALCECAAANCPDSDLNMMVCGSDGKTYESECHLKLQACRT
QEDIVVQAFGPCKLSEASGTAGPPRPSSPIQFTQQDDGAASKSTRHLLNPDKYYNKYDWT
RKETPSDFENIVSGQKVKGSQTATTATVGAVGALLGDLCAEDADCAALPGALCTRGGCVC
RPGYTPTAHRKACIEEFPQETTEEYSACLSDPCYNFGTCIDLPGSTYTCVCSESYTGSNC
ESLIKDGPPITYIETPSFVGSSYIRLRPLKAYHKLNIDIEFKAFSENGVLLYNQQKLDGT
GDFVSLALVNGYLEFRYNLGNGVIILTSLEKISLNEYHKVSAKRYHRDGILTVDDMEDVA
GQSDGNLKALDLADDAFIGSVPSNYTRVFENIGTRNGFIGCIKYLRIIRHQITKKLGRPD
SLVVAMENVRECQSNPCMSMPCRNGATCQAVEGSVTEYTCSCPFGFQGANCNERIDPCES
NPCGYDEGLLCDIGPDGGHICRCLFGGNIESDGNNCNKDVNVIHETWSPQFNGTSYIELP
PLEGLGKAFRIEIWFLTNRFSGMLLYTGQSNKAKGDFIAINLVNGYLQFRYNLGSGIANI
TSPTPITKGQWHRVRVSRVGRHGSLQLDQLPVQRGLSPPPLTHLELNLPLFIGSLPAYVR
PHKMSGVTSSFIGVMQQVFVNGNPLSLYSEDTAKCFVVAEEERLPCATSGVTKYTGPPCG
DDLTPCKNNGSCVPLLNEYKCICPDGYQGRNCELQLKVEMLNDGAPIKFDGNNYYSYRSR
GGRRNRGFRGIRYEIKFRTYNNSGLLMWRRKIGIRPRDFIGLGLSNGKLQLIYTDTDVKE
NSLALNEEWFQSVESKERVDDGRWHTATVRRRKRLAMLQVDDTPPVRGYSQSLLVPSKAN
PKLWIGGSPSLPLGLPGDLYSGFRGCIASVKSNGRHIDITTPIRPTTTIRYCD