New model in OGS2.0 | DPOGS203712  |
---|---|
Genomic Position | scaffold95:+ 33683-40542 |
See gene structure | |
CDS Length | 3267 |
Paired RNAseq reads   | 691 |
Single RNAseq reads   | 1599 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003498 (2e-15) |
Best Drosophila hit   | CG32354 (7e-15) |
Best Human hit | agrin precursor (6e-54) |
Best NR hit (blastp)   | PREDICTED: similar to agrin [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC004709 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0016021 integral to membrane GO:0005605 basal lamina |
InterPro families    | IPR011497 Protease inhibitor, Kazal-type IPR002049 EGF-like, laminin IPR012680 Laminin G, subdomain 2 IPR001791 Laminin G domain IPR008985 Concanavalin A-like lectin/glucanase IPR003645 Follistatin-like, N-terminal IPR002350 Proteinase inhibitor I1, Kazal IPR003884 Factor I / membrane attack complex IPR013320 Concanavalin A-like lectin/glucanase, subgroup |
Orthology group | MCL13114 |
Nucleotide sequence:
ATGCTAACTTCACCGAATGCGATTTTATTAATGGTCCCTAATATATTGGGATGCTATATT
TTTCCGAATGACGTGACGAATCCGTGTCGAGGCGTGATCTGCGGCCCAGGAGAGCTGTGT
CGCCCTACTGCAGACGGAAAGAATTACAGTTGTGAATGTCCAACATCTTGTCCAAGTTAC
GGGGATCATGAAGGTTCACGCCCCTTATGCGCTAGCGACGCTAAAGATTATCCCGGAACA
TGTGAAATGCGAAGAGCTGCTTGCGAGAGCAACACAAACATAACTTTTAAATATCATGGC
AAATGTGACCCCTGCGCCGGCGTATCCTGTCCAGATCCAGAAGTTTGCCAATTGGATGAC
CAACGCCAACCTTCCTGTCGCTGCGCAGAACCTTGTCCTTTAGAATTTTCCCCTGTATGT
GCGTCTGATGGAAAGACTTATTCGAACGAGTGTCAAATGCATAGAGAGTCCTGTCGAGCC
AGAAAACAATTAAAGATTATTTTTAAAGGACAATGTATTTCAGGTGTAAACCCATGTGCG
GAGGTGGAGTGTCGTCACGGCGCTGAATGTCGTGTGGAGGGTAGTGGCGCGGTCTGCGCT
TGTCCGCCCCCCTGTGAACAAGTGCTGCGACCTGTCTGCGGTTCTGATGCGAGGACCCAT
GACAGCGAATGTGAACTTCGACGTGCTGGCTGTTTGTTAGGAAGAGAGTTGAAGGTCGTT
CACGCTGGAGCCTGCGGTTCCAACGGTGTTTGTGCTGGGCGGGTATGCCCTCACGGTGGT
GAATGTGTTTCCTCAGGAGGTCGAGGCGTTTGTCGATGTCCAAAATGTTCTAATGAATTT
GCTCCCGTGTGTGGTTCTGATGGTATTTCCTACGGCAACAGATGCAAGCTGCAGTTAGAG
TCCTGTAGACATCGTCGTCACGTTCAAGTGTTGTACGATGGACCTTGCAATGGATGTGAA
AATAAAAAGTGTGAATATTATGCCGTTTGCGAGAGTGATGGTGTTTCTGAAGCTAGCTGC
GTTTGTCCAAAACATTGTGAAGAAGGAACTGAAACTGAAGAAGTTTGCGGTAATGACAAC
AAAACTTATAGCAGCGTTTGTGCTCTTCGTAACATAGCCTGCCGCGAAAAGAGGAGACTC
CACGTTAAACATATGGGTTCTTGTGAATCTTGCGGCAATGTCGAATGTCCGCTAGGTATG
TGGTGTTCTCGAGGCAAATGTGCTTGTGCAAGTTGTGCTGATGTTCCTCGAGAGACCGTT
TGTTCTGACACGAGGCGAACGTTCCCCAACGAGTGTTCATTGCATAAAGCTGCGTGCGAG
GCGCGAGCCCGCGGGGAACCTCCTCCGCAGGTTGCTTACTATGGAGACTGTACTGATGCT
AATAAAGATAATAGTTCAGGGGCTAATGTATCAGAGAAAATGGAACAGAGTAATGAGATA
CGGAATGGTGTGGAGGTCGACAGTACCAGTGAACCATCTGTTGGAACAGCTGTTTGTGCT
AGAGTGCAATGCGCCTACGAGGCGACCTGCGCTGTGGACGGTAACGGCCAGCCACGTTGT
GCATGTCTGTTTGACTGCGCCGCTGCGGCAGCTTCTTCCTCAGCGCCCGTCTGCGCCTCC
GACTTACGCATGTACCCCACGCTGTGTCATATGAAACTGGAGTCTTGTCGCCGTCAAGAG
GACCTTCGACTGAGGCCTTTAGCATTGTGTAGGGGTCTCGAGTTCAGGCCATGTGGTGAT
GATGAAACCGTAACAGATTCGGAAGGTCTTCCAGTTGATTGTGGCGGTGGACCTCATCGT
AAGGACTGTCCGACGGATAGCTACTGCCATCATACTGCTAAGGCTGCAAGATGTTGTAGG
AAAGACAAAGCTGTCGCAGAGAAGAAAGACTGTCAAGAATCTTGGTACGGCTGTTGTGCG
GACGGGGTGACGTCAGCACGTGGTCCCGGGGGGGCGGGATGTCCCTCACAATGTGGTTGT
CACAGGCTGGGTTCTGTGTCCGAGATGTGTGATGAAAGCGGCCAGTGTCAATGTAGACCT
GGTGTGGGAGGGCACAAATGTGACAGATGCGAACCAGGTTATTGGGGCTTACCCCGGATC
GGTACTGGACATACTGGCTGTATACCATGTGGGTGTTCGGCCTTCGGATCAGTTCGTGAG
GACTGCGAACAGATGACCGGGCGGTGTGTTTGCAAACCGGGCATTCAGGGGCAGAAGTGT
ACTGTTTGTTCCAACCATGAACATACTCTAGGACCTAACGGCTGCTTTGACCCGGAATCC
ACCCAACTACCAGCTACCGACTGTGAACGTATGACATGTTATTTCGGAGCCTACTGCGCT
ATACGTAGCGGTCTTGCCACTTGTGAATGTAATGCTCAGGAATGTTTTACAACCGAGGGC
CCGTCTGTTTGCGGTAGTGACGGACGGACATATCTATCAGCTTGTCATGCGAGGGCGCAC
GCTTGTCGGACACAATCGGACATAGTTGTACAGGCGTTTGGTCCCTGTGCTGAAGATACG
CCGTCTGTGAAGCGAGAGGAAATAAATTCATCTATTATTTCGAAAGAAAATGCCGAAGAA
GGTTATTGTAACAAAAACCCATCTCAAATAGATACCGATATTGAAGTTACAGAATCGGAG
GAAGAGCAATACATAACAAATGAGGTTGAAGAAAATTATCCAATATACGAAGAATACATC
GAGGAAAACGAAAACGAAATATACTCATCGCCATTGTTCGACGGGCATGCTCGGATGACA
GCTCGCACAAGATTGCCCGCTAAACGATTCGATATTTGGGCCGAAGTATCGGCGGTGTGC
GGTAAAGGCGCTTTAATAAGTGCCTCAGGTGTGCGAGATTATTTATGGCTCGGGTTCGTA
AAAGACAGAGCTGTATTGCGTTGGGACGCTGGCAATGGCCCTTTAGAGTTACGATCTGGT
AAAATAAGAGTTGATACTAAGTCTAAAATATCGGCGCGGCGATATAAGAAGGACGCCATG
TTGAAACTTGAATCTTATACAGTTAGGGGTACGACACATGGACGCATGAGTTCATTAGAC
GTTGATCCTTATATTTATATTGGCCATCCGCCGGATAACGTTACAAAGTTATCTGGTGTA
CACACAATGAACGGTTTTGTGGGATGTGTACATCGCTTGCGTGTGAGCGGACGTGACGTC
ATCCCCCCGTCCCGAGGCCTAAATATTGTGGCTCATGGTCTGCGACCATGCACTCCTTAC
AATCTAGCCAAGGTCGTGTGTCCTTAG
Protein sequence:
MLTSPNAILLMVPNILGCYIFPNDVTNPCRGVICGPGELCRPTADGKNYSCECPTSCPSY
GDHEGSRPLCASDAKDYPGTCEMRRAACESNTNITFKYHGKCDPCAGVSCPDPEVCQLDD
QRQPSCRCAEPCPLEFSPVCASDGKTYSNECQMHRESCRARKQLKIIFKGQCISGVNPCA
EVECRHGAECRVEGSGAVCACPPPCEQVLRPVCGSDARTHDSECELRRAGCLLGRELKVV
HAGACGSNGVCAGRVCPHGGECVSSGGRGVCRCPKCSNEFAPVCGSDGISYGNRCKLQLE
SCRHRRHVQVLYDGPCNGCENKKCEYYAVCESDGVSEASCVCPKHCEEGTETEEVCGNDN
KTYSSVCALRNIACREKRRLHVKHMGSCESCGNVECPLGMWCSRGKCACASCADVPRETV
CSDTRRTFPNECSLHKAACEARARGEPPPQVAYYGDCTDANKDNSSGANVSEKMEQSNEI
RNGVEVDSTSEPSVGTAVCARVQCAYEATCAVDGNGQPRCACLFDCAAAAASSSAPVCAS
DLRMYPTLCHMKLESCRRQEDLRLRPLALCRGLEFRPCGDDETVTDSEGLPVDCGGGPHR
KDCPTDSYCHHTAKAARCCRKDKAVAEKKDCQESWYGCCADGVTSARGPGGAGCPSQCGC
HRLGSVSEMCDESGQCQCRPGVGGHKCDRCEPGYWGLPRIGTGHTGCIPCGCSAFGSVRE
DCEQMTGRCVCKPGIQGQKCTVCSNHEHTLGPNGCFDPESTQLPATDCERMTCYFGAYCA
IRSGLATCECNAQECFTTEGPSVCGSDGRTYLSACHARAHACRTQSDIVVQAFGPCAEDT
PSVKREEINSSIISKENAEEGYCNKNPSQIDTDIEVTESEEEQYITNEVEENYPIYEEYI
EENENEIYSSPLFDGHARMTARTRLPAKRFDIWAEVSAVCGKGALISASGVRDYLWLGFV
KDRAVLRWDAGNGPLELRSGKIRVDTKSKISARRYKKDAMLKLESYTVRGTTHGRMSSLD
VDPYIYIGHPPDNVTKLSGVHTMNGFVGCVHRLRVSGRDVIPPSRGLNIVAHGLRPCTPY
NLAKVVCP