DPGLEAN16601 in OGS1.0

New model in OGS2.0DPOGS203712 
Genomic Positionscaffold95:+ 33683-40542
See gene structure
CDS Length3267
Paired RNAseq reads  691
Single RNAseq reads  1599
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003498 (2e-15)
Best Drosophila hit  CG32354 (7e-15)
Best Human hitagrin precursor (6e-54)
Best NR hit (blastp)  PREDICTED: similar to agrin [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC004709 [Tribolium castaneum] (0.0)
GeneOntology terms
  
GO:0016021 integral to membrane
GO:0005605 basal lamina
InterPro families







  
IPR011497 Protease inhibitor, Kazal-type
IPR002049 EGF-like, laminin
IPR012680 Laminin G, subdomain 2
IPR001791 Laminin G domain
IPR008985 Concanavalin A-like lectin/glucanase
IPR003645 Follistatin-like, N-terminal
IPR002350 Proteinase inhibitor I1, Kazal
IPR003884 Factor I / membrane attack complex
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
Orthology groupMCL13114

Nucleotide sequence:

ATGCTAACTTCACCGAATGCGATTTTATTAATGGTCCCTAATATATTGGGATGCTATATT
TTTCCGAATGACGTGACGAATCCGTGTCGAGGCGTGATCTGCGGCCCAGGAGAGCTGTGT
CGCCCTACTGCAGACGGAAAGAATTACAGTTGTGAATGTCCAACATCTTGTCCAAGTTAC
GGGGATCATGAAGGTTCACGCCCCTTATGCGCTAGCGACGCTAAAGATTATCCCGGAACA
TGTGAAATGCGAAGAGCTGCTTGCGAGAGCAACACAAACATAACTTTTAAATATCATGGC
AAATGTGACCCCTGCGCCGGCGTATCCTGTCCAGATCCAGAAGTTTGCCAATTGGATGAC
CAACGCCAACCTTCCTGTCGCTGCGCAGAACCTTGTCCTTTAGAATTTTCCCCTGTATGT
GCGTCTGATGGAAAGACTTATTCGAACGAGTGTCAAATGCATAGAGAGTCCTGTCGAGCC
AGAAAACAATTAAAGATTATTTTTAAAGGACAATGTATTTCAGGTGTAAACCCATGTGCG
GAGGTGGAGTGTCGTCACGGCGCTGAATGTCGTGTGGAGGGTAGTGGCGCGGTCTGCGCT
TGTCCGCCCCCCTGTGAACAAGTGCTGCGACCTGTCTGCGGTTCTGATGCGAGGACCCAT
GACAGCGAATGTGAACTTCGACGTGCTGGCTGTTTGTTAGGAAGAGAGTTGAAGGTCGTT
CACGCTGGAGCCTGCGGTTCCAACGGTGTTTGTGCTGGGCGGGTATGCCCTCACGGTGGT
GAATGTGTTTCCTCAGGAGGTCGAGGCGTTTGTCGATGTCCAAAATGTTCTAATGAATTT
GCTCCCGTGTGTGGTTCTGATGGTATTTCCTACGGCAACAGATGCAAGCTGCAGTTAGAG
TCCTGTAGACATCGTCGTCACGTTCAAGTGTTGTACGATGGACCTTGCAATGGATGTGAA
AATAAAAAGTGTGAATATTATGCCGTTTGCGAGAGTGATGGTGTTTCTGAAGCTAGCTGC
GTTTGTCCAAAACATTGTGAAGAAGGAACTGAAACTGAAGAAGTTTGCGGTAATGACAAC
AAAACTTATAGCAGCGTTTGTGCTCTTCGTAACATAGCCTGCCGCGAAAAGAGGAGACTC
CACGTTAAACATATGGGTTCTTGTGAATCTTGCGGCAATGTCGAATGTCCGCTAGGTATG
TGGTGTTCTCGAGGCAAATGTGCTTGTGCAAGTTGTGCTGATGTTCCTCGAGAGACCGTT
TGTTCTGACACGAGGCGAACGTTCCCCAACGAGTGTTCATTGCATAAAGCTGCGTGCGAG
GCGCGAGCCCGCGGGGAACCTCCTCCGCAGGTTGCTTACTATGGAGACTGTACTGATGCT
AATAAAGATAATAGTTCAGGGGCTAATGTATCAGAGAAAATGGAACAGAGTAATGAGATA
CGGAATGGTGTGGAGGTCGACAGTACCAGTGAACCATCTGTTGGAACAGCTGTTTGTGCT
AGAGTGCAATGCGCCTACGAGGCGACCTGCGCTGTGGACGGTAACGGCCAGCCACGTTGT
GCATGTCTGTTTGACTGCGCCGCTGCGGCAGCTTCTTCCTCAGCGCCCGTCTGCGCCTCC
GACTTACGCATGTACCCCACGCTGTGTCATATGAAACTGGAGTCTTGTCGCCGTCAAGAG
GACCTTCGACTGAGGCCTTTAGCATTGTGTAGGGGTCTCGAGTTCAGGCCATGTGGTGAT
GATGAAACCGTAACAGATTCGGAAGGTCTTCCAGTTGATTGTGGCGGTGGACCTCATCGT
AAGGACTGTCCGACGGATAGCTACTGCCATCATACTGCTAAGGCTGCAAGATGTTGTAGG
AAAGACAAAGCTGTCGCAGAGAAGAAAGACTGTCAAGAATCTTGGTACGGCTGTTGTGCG
GACGGGGTGACGTCAGCACGTGGTCCCGGGGGGGCGGGATGTCCCTCACAATGTGGTTGT
CACAGGCTGGGTTCTGTGTCCGAGATGTGTGATGAAAGCGGCCAGTGTCAATGTAGACCT
GGTGTGGGAGGGCACAAATGTGACAGATGCGAACCAGGTTATTGGGGCTTACCCCGGATC
GGTACTGGACATACTGGCTGTATACCATGTGGGTGTTCGGCCTTCGGATCAGTTCGTGAG
GACTGCGAACAGATGACCGGGCGGTGTGTTTGCAAACCGGGCATTCAGGGGCAGAAGTGT
ACTGTTTGTTCCAACCATGAACATACTCTAGGACCTAACGGCTGCTTTGACCCGGAATCC
ACCCAACTACCAGCTACCGACTGTGAACGTATGACATGTTATTTCGGAGCCTACTGCGCT
ATACGTAGCGGTCTTGCCACTTGTGAATGTAATGCTCAGGAATGTTTTACAACCGAGGGC
CCGTCTGTTTGCGGTAGTGACGGACGGACATATCTATCAGCTTGTCATGCGAGGGCGCAC
GCTTGTCGGACACAATCGGACATAGTTGTACAGGCGTTTGGTCCCTGTGCTGAAGATACG
CCGTCTGTGAAGCGAGAGGAAATAAATTCATCTATTATTTCGAAAGAAAATGCCGAAGAA
GGTTATTGTAACAAAAACCCATCTCAAATAGATACCGATATTGAAGTTACAGAATCGGAG
GAAGAGCAATACATAACAAATGAGGTTGAAGAAAATTATCCAATATACGAAGAATACATC
GAGGAAAACGAAAACGAAATATACTCATCGCCATTGTTCGACGGGCATGCTCGGATGACA
GCTCGCACAAGATTGCCCGCTAAACGATTCGATATTTGGGCCGAAGTATCGGCGGTGTGC
GGTAAAGGCGCTTTAATAAGTGCCTCAGGTGTGCGAGATTATTTATGGCTCGGGTTCGTA
AAAGACAGAGCTGTATTGCGTTGGGACGCTGGCAATGGCCCTTTAGAGTTACGATCTGGT
AAAATAAGAGTTGATACTAAGTCTAAAATATCGGCGCGGCGATATAAGAAGGACGCCATG
TTGAAACTTGAATCTTATACAGTTAGGGGTACGACACATGGACGCATGAGTTCATTAGAC
GTTGATCCTTATATTTATATTGGCCATCCGCCGGATAACGTTACAAAGTTATCTGGTGTA
CACACAATGAACGGTTTTGTGGGATGTGTACATCGCTTGCGTGTGAGCGGACGTGACGTC
ATCCCCCCGTCCCGAGGCCTAAATATTGTGGCTCATGGTCTGCGACCATGCACTCCTTAC
AATCTAGCCAAGGTCGTGTGTCCTTAG

Protein sequence:

MLTSPNAILLMVPNILGCYIFPNDVTNPCRGVICGPGELCRPTADGKNYSCECPTSCPSY
GDHEGSRPLCASDAKDYPGTCEMRRAACESNTNITFKYHGKCDPCAGVSCPDPEVCQLDD
QRQPSCRCAEPCPLEFSPVCASDGKTYSNECQMHRESCRARKQLKIIFKGQCISGVNPCA
EVECRHGAECRVEGSGAVCACPPPCEQVLRPVCGSDARTHDSECELRRAGCLLGRELKVV
HAGACGSNGVCAGRVCPHGGECVSSGGRGVCRCPKCSNEFAPVCGSDGISYGNRCKLQLE
SCRHRRHVQVLYDGPCNGCENKKCEYYAVCESDGVSEASCVCPKHCEEGTETEEVCGNDN
KTYSSVCALRNIACREKRRLHVKHMGSCESCGNVECPLGMWCSRGKCACASCADVPRETV
CSDTRRTFPNECSLHKAACEARARGEPPPQVAYYGDCTDANKDNSSGANVSEKMEQSNEI
RNGVEVDSTSEPSVGTAVCARVQCAYEATCAVDGNGQPRCACLFDCAAAAASSSAPVCAS
DLRMYPTLCHMKLESCRRQEDLRLRPLALCRGLEFRPCGDDETVTDSEGLPVDCGGGPHR
KDCPTDSYCHHTAKAARCCRKDKAVAEKKDCQESWYGCCADGVTSARGPGGAGCPSQCGC
HRLGSVSEMCDESGQCQCRPGVGGHKCDRCEPGYWGLPRIGTGHTGCIPCGCSAFGSVRE
DCEQMTGRCVCKPGIQGQKCTVCSNHEHTLGPNGCFDPESTQLPATDCERMTCYFGAYCA
IRSGLATCECNAQECFTTEGPSVCGSDGRTYLSACHARAHACRTQSDIVVQAFGPCAEDT
PSVKREEINSSIISKENAEEGYCNKNPSQIDTDIEVTESEEEQYITNEVEENYPIYEEYI
EENENEIYSSPLFDGHARMTARTRLPAKRFDIWAEVSAVCGKGALISASGVRDYLWLGFV
KDRAVLRWDAGNGPLELRSGKIRVDTKSKISARRYKKDAMLKLESYTVRGTTHGRMSSLD
VDPYIYIGHPPDNVTKLSGVHTMNGFVGCVHRLRVSGRDVIPPSRGLNIVAHGLRPCTPY
NLAKVVCP