DPGLEAN03021 in OGS1.0

New model in OGS2.0DPOGS206535 
Genomic Positionscaffold74:- 101099-110030
See gene structure
CDS Length5739
Paired RNAseq reads  33091
Single RNAseq reads  78724
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014039 (0.0)
Best Drosophila hit  collagen type IV, isoform A (0.0)
Best Human hitcollagen alpha-2(IV) chain preproprotein (6e-129)
Best NR hit (blastp)  GF15711 [Drosophila ananassae] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Collagen type IV CG4145-PA, isoform A isoform 1 [Apis mellifera] (0.0)
GeneOntology terms




  
GO:0005587 collagen type IV
GO:0007391 dorsal closure
GO:0005604 basement membrane
GO:0005201 extracellular matrix structural constituent
GO:0005488 binding
GO:0005581 collagen
InterPro families

  
IPR008160 Collagen triple helix repeat
IPR001442 Collagen IV, non-collagenous
IPR016187 C-type lectin fold
Orthology groupMCL10157

Nucleotide sequence:

ATGATCAATTCTTGGTTAGTTCTTAGAGTAGAAGTTACAGAGGCACTTACAGCAGATACA
GGAGCCGAAATCCTGGAGAGAAGGAGGAGAACGGGCACAAGGTGTACAGCGAGGAGGCGC
GCACGCGCCGGGCCCGACATGGCTGCGCCGCACTACTGGTCGCGCGCCCTCACACACAGG
GACGAGCGCAGCGCCTCCCGTACAGTCAGGCGCGGGGCGGGGGCTCAGAGACCGCCGCCC
GAGCGCTCCTCTTTCCTTAGTATCCCGGCTCAAACGGCGCGCGCGCACGGGACACTGGCG
CAGCGCGCTTTGGGCCATGGCGGTCCACGTACTTTGCAACGAAACGAATATGAAAGAGGA
GACGTCGAAGAAAATAATATTTATGATAATCAAGACTGGATGTATCAGAATAGTTACAAT
CCGCAACCAACAATAAATTCATACTATGGTTTATCTAGAAGGCAGGACTTACCGACACCC
TCTGCCCCCCCTTCTCCTCCACCGGAAAGAGTACAGCCCTCAAGAAGTTTTGGACAAAAC
TTTGCTGTGTATGACCCAGTAACTCGTCAGCGGACAAATGCATTTGATCGTAATTGTACG
GCCCCTGGCTGCTGTGTACCAAAATGTTTTGCAGAAAAGGGTAGTAGGGGTTTCCCAGGA
ATGCGGGGACCACCTGGAATCACAGGTCTACCAGGACACGTTGGCGCTGAAGGTCCACAG
GGACTTAAGGGTCAAAAAGGTCAAGATGGACCACAAGGTCCTCGGGGTCCGCGAGGAGAA
AAGGGTAAACCTGGAGCTCAAGGATTTATAGGCTTAGCAGGACCACCAGGTCCTCAAGGA
GAACCTGGTATGCCAGGGATTCCTGGACGTGATGGTTGTAACGGAACTGATGGAGAACCT
GGAATGGTGGGGATCAAAGGTTCACAGGGTCCACGTGGATTTGCTGGGCCTAAAGGTAAC
AAGGGTGATAAAGGAGAGCCGGCTTATATGGGTCGATACCCAAAAGGTGAAAAAGGAGAA
CCTGGAGCTGATGGTTTACAAGGCCAATCTGGCCCAGCTGGACCAACAGGTCCTCCAGGT
TTGGCTGGTCCCAAAGGAATGACTGGACCTATGGGACCACCTGGATATAAAGGTGATAAA
GGTCCTAAAGGATCTAAAGGACAATCTATTCAAGGTGATAAAGGAGACCGAGGTGACAAA
GGTGACAGAGGACCCGGTTGTCCATCAACAACGTTACCTTCATTGGATAATAAAGGAGCA
ATAAAAGGTGTCAAAGGTGATATGGGATCAAAAGGTGAAAAGGGAGAACCTGGGAGAATG
GGTGAAAAGGGAGAAACAGGTCCAATGGGGGAACCTGGCTTGCCTGGATTAATGGGCATT
AAAGGAGAAAAGGGCTTAAGAGGAAATCCTGGGGAACGGGGTCGTGAAGGAATGTATGGT
GAACCCGGACCTATGGGAAGAAAAGGTGATAGAGGCATTGATGGACTGAATGGTCTTCCC
GGCCGACCGGGTTTGAAAGGAGAACCCGGCAGGGATGGAGCAACAGGTCTAATGGGCTTA
AAAGGAGTGCCAGGTCCACCTGGTGGTCGAGCTGGAGCACGAGGTCCACCTGGGCCACCA
GGTCCTCGGGGCTATATCGGCGTTGCTGGTGCACCAGGGTCTAGTGGTAGGCCCGGAGAA
AATGGATTACCAGGACCTATGGGTCCAAGAGGTGGACAGGGAGAACCAGGTGACACAGGC
ATTGAAGGTCCAGCAGGTCAAAAAGGAGAAAAAGGAGAACCTGGTCTTGATGGCTTGCCT
GGAGAAATAGGTCAACGAGGATATGATGGACCCATTGGTCCTCAAGGACCTAGGGGACTA
AAAGGAGAAGAAGGTCAATCAATTCCTGGTGACAAGGGAAACAGTGGCCAACCAGGAATT
CCAGGAGATAAGGGAGCCAAGGGCGAAAGAGGTTATCCAGGATTACGAGGTACACCTGGA
AACTCTACATTAGGTACACCAGGAAGTCCCGGAGAAATGGGTCCACCTGGTGAAAAAGGT
GAAAAAGGAACTCCTGGGTACGATGGAATACCTGGTAATCCTGGACAAAAAGGTGACATT
GGAGGACGGTGTAACGAATGTCGACCTGGAAGTATGGGCGAAAAGGGAGACCGCGGTGCT
GATGGTCTACCTGGTGAACGGGGTGAACGAGGTCACATCGGACCCATCGGGATGACCGGG
GAGCGTGGTGCTGACGGTATGAATGGAATGCCTGGAGCTGCTGGAGCACCGGGTGAACGT
GGATTGGACGGACCAATAGGACCACCAGGAATGAGGGGAGCAGATGCAATGATACCGTCC
AATTTAGTAAAAGGACCTCCCGGAGAAAGAGGTGAACCAGGAGAAAAAGGAAACATGGGA
CCTAAGGGTGAAAGAGGACCTGATGGAATAATGGGTGATCGTGGATTAAATGGCATGCCC
GGACAGAAGGGTGACATGGGTAGAATGGGACCTTCTGGTATAGATGGCACACCTGGTAGT
GATGGAATACCGGGACGGCCAGGAATGAAAGGCATGTCCATCAAAGGTGAAAAAGGAATA
TCTGGTGATCAGGGTGAAAAAGGTGACAAAGGATTTTCTGGAAGACCAGGACTTAAAGGT
GAACCTGGTCAATGTCCCAATGAGTTAAAAATTCGCACAAAGGGAGAAAAAGGCAACCCT
GGCGTTCCAGGACCACAAGGACCATTAGGTATGAAGGGTGAAAAAGGTAATCAAGGGCCA
TTCGGTTTTACTGGTCCAAAGGGAGAGATGGGTTTACCAGGACGAGCTGGACCGGTAGGT
CCACGTGGTCTTCCGGGTTTCAAAGGCGATAAAGGTGAAATGGGTTCAATGGGATTTCCG
GGAACACCAGGGGATTTAGGCCCTAGAGGTTTTCCAGGGTTACCAGGATTAAAAGGAGAC
AAAGGTGAGATTGGTCCTTCTATGCCTGGACCACCTGGACCTGCTGGATTAAAGGGAGAT
AAAGGAGAACAAGGTCCAAGAGGTCAACCTGGAATAGAAGGAAAGGATGGTCCTCCAGGA
TTAGCTGGCTTACAAGGTGAAAAAGGTGATATGGGATTAATAGGAAGGCAAGGTTATCCA
GGACCTATTGGATTAAAGGGCGAACCGGGTCCTATAGGACCATCCGGAGTTCCGGGCATT
CCTGGTACGCCAGGAAGAGATGGACCTAAAGGTCAACAAGGATTTCCCGGTCCACCTGGT
AAACCTGGTGTAATTGGCTTACCTGGACAAAAAGGTGAACCAGGTATTCAAGGTCCAGAT
GGCCCGAAAGGTTTCCCAGGACCTCGTGGTCATGTTGGTATGCAAGGGCAAACTGGTCTT
GATGGAAGTCCCGGTGAAAAAGGAGATAAGGGTGATATAGGATTCCCGGGTGAGCCTGGT
AGACCTGGTCTTGATGGACCTAGAGGATTAGCTGGTGCACCTGGTGAGAAAGGTGATATA
GGTTTCCCAGGAAACCCTGGGTTGAATGGATTTATTGGACCAGCTGGCCCAAGAGGTGAT
ATAGGCTTCAAGGGTTCCAGGGGACCAAAAGGAGAACCTGGTTTAGCTTCAGAAAAGGGA
GAAAAAGGAGATCAAGGTTTTCCAGGATTACCTGGTGTTGATGGAAGACCTGGGCAAGAT
GGAGAAAAAGGTGACAAAGGTTTCCCTGGCTATCCAGGTCAAGGCATTCCAGGAAGTCAA
GGTGAAAAGGGAGATGCAGGTTTGCCTGGAAAAATGGGTTTTCCTGGTATTCCTGGCGAT
AAAGGCGACCGAGGCTTTCCAGGACTGGCAGGTTTAAAGGGAGAAAGAGGCCCTGCAGGC
AAAGACGGTTTGCCAGGAATGCCGGGAAGAGATGGCAGTCCTGGTGCTCCAGGCCAAGAT
GGTTTACCAGGAATGGATGGCGAAAAGGGTGAAAGAGGTGATCGAGGATTACCAGGTCGT
GATGGTCTTGATGGATTGAAAGGTGACCAGGGTATTGCTGGACCACCAGGGCCAATAGGA
CCAATGGGTTTTCCGGGTCCTAAAGGAGACATTGGTTTACCTGGGCCATCTATAAATATC
AAAGGTGAAAAGGGAGATATAGGTTTTCCCGGTATTACTGGACTTCAAGGAGATAAGGGT
GATCGAGGTAGAGATGGCTTCCAAGGTCTACAAGGGGAAAAGGGTGATCAAGGATTCACT
GGACAAAAGGGTGAAATGGGTAGAATGGGCGCCATGGGTGAAAGAGGTGAAAGAGGTCCA
ATTGGACCGACTGGTATTCCTGGACTCACAGTAAAAGGTGAAAAAGGTTTACCTGGAAAT
AACGGAAAACACGGCAGACCTGGCATGCGCGGTGCTACTGGAGAAAAAGGAGAACAAGGA
TTACCTGGACTTCCAGGTCCAATTGGGCGCTCTGGCATGCCAGGAACACCGGGACCTAGA
GGTGAACCCGGTGAACCAGGAAGTGAAGGAGTCGCAGGACCCCCTGGGTTTGACGGTCCT
CCGGGGCTACAAGGTCGTCCTGGCGAATATGGTGAAAAAGGTAACAAGGGTGATAAGGGT
GCTGTTGGTTTTGGTTTACCTGGCCCGAAAGGAGACACTGGCTTGCCAGGATTACCGGGT
TTAAATGGTGAAAAAGGTGATAAAGGAGATCAGGGTTTCGATGGATTAGTTGGAGAGATG
GGTGAGAAAGGTAACCAAGGAGAAAAAGGTGACAGAGGCTATCCTGGTCGGCCTGGAATT
CCTGGCCTTGATGGTGTAAAAGGAGATAAGGGAGAAGCGGCTGCTATAGTTTATGGAAGT
AAGGGAGAACCAGGACCAAGAGGTCCTCCTGGATTGAATGGTCCACCTGGACTTGACGGA
TTACCTGGTCCTAAAGGCTGGGATGGTGCTCCAGGCATGAAAGGAGATAAAGGTTTCCAA
GGACCTATGGGCCCACCAGGCTTACCAGGACCTCAAGGAATAATGGGTATTCAAGGTGAA
CGTGGTGAAACAGGTCGTATGGGATTACAAGGTGTACCTGGAATACCTGGTGCTCCTTGT
GCTACTACAGACTATCTTACTGGCATCCTTTTAGTGCGTCATAGTCAAACAAACATAGTA
CCCCAATGTGAACCCGGACATATTAAATTGTGGGATGGCTATTCCTTACTTTACATTGAT
GGAAATGAAAAGGCTCATAATCAAGATCTGGGATATGCTGGATCTTGTGTAAGAAAGTTC
AGTACCATGCCATTCCTTTTCTGTGATCTTAATGATGTATGCAATTACGCAAGTCGAAAT
GATCGCAGTTATTGGCTTTCTACAAATTTGCCGATACCCATGATGCCAGTAAACAACAAT
GAAATTTCACGATATATTTCAAGATGTGTTGTTTGTGAGGTTCCAGCCAATGTCATAGCT
GTTCACAGTCAAACTCTTGATATACCTAGTTGTCCAGTGGGTTGGAACTCATTATGGATT
GGATACAGTTTTGTTATGCACACTGGAGCTGGTGGACAAGGCGGTGGTCAAGCCCTTGCT
AGTCCGGGATCTTGTCTTGAAGACTTCCGAGCGACACCATTTATTGAATGTAACGGTGAA
GGTGGTACTTGCCATCATTTCGCCAATAAACTTAGTTTTTGGCTAACAACTATAGATGAT
AAGAAGCAATTCGCAAAACCAGAGCGTGAAACTCTTAAATCTGGACGACTATTGCAGCGA
GTGTCTAGATGCGCTGTTTGCATTAAGAATACCACATAG

Protein sequence:

MINSWLVLRVEVTEALTADTGAEILERRRRTGTRCTARRRARAGPDMAAPHYWSRALTHR
DERSASRTVRRGAGAQRPPPERSSFLSIPAQTARAHGTLAQRALGHGGPRTLQRNEYERG
DVEENNIYDNQDWMYQNSYNPQPTINSYYGLSRRQDLPTPSAPPSPPPERVQPSRSFGQN
FAVYDPVTRQRTNAFDRNCTAPGCCVPKCFAEKGSRGFPGMRGPPGITGLPGHVGAEGPQ
GLKGQKGQDGPQGPRGPRGEKGKPGAQGFIGLAGPPGPQGEPGMPGIPGRDGCNGTDGEP
GMVGIKGSQGPRGFAGPKGNKGDKGEPAYMGRYPKGEKGEPGADGLQGQSGPAGPTGPPG
LAGPKGMTGPMGPPGYKGDKGPKGSKGQSIQGDKGDRGDKGDRGPGCPSTTLPSLDNKGA
IKGVKGDMGSKGEKGEPGRMGEKGETGPMGEPGLPGLMGIKGEKGLRGNPGERGREGMYG
EPGPMGRKGDRGIDGLNGLPGRPGLKGEPGRDGATGLMGLKGVPGPPGGRAGARGPPGPP
GPRGYIGVAGAPGSSGRPGENGLPGPMGPRGGQGEPGDTGIEGPAGQKGEKGEPGLDGLP
GEIGQRGYDGPIGPQGPRGLKGEEGQSIPGDKGNSGQPGIPGDKGAKGERGYPGLRGTPG
NSTLGTPGSPGEMGPPGEKGEKGTPGYDGIPGNPGQKGDIGGRCNECRPGSMGEKGDRGA
DGLPGERGERGHIGPIGMTGERGADGMNGMPGAAGAPGERGLDGPIGPPGMRGADAMIPS
NLVKGPPGERGEPGEKGNMGPKGERGPDGIMGDRGLNGMPGQKGDMGRMGPSGIDGTPGS
DGIPGRPGMKGMSIKGEKGISGDQGEKGDKGFSGRPGLKGEPGQCPNELKIRTKGEKGNP
GVPGPQGPLGMKGEKGNQGPFGFTGPKGEMGLPGRAGPVGPRGLPGFKGDKGEMGSMGFP
GTPGDLGPRGFPGLPGLKGDKGEIGPSMPGPPGPAGLKGDKGEQGPRGQPGIEGKDGPPG
LAGLQGEKGDMGLIGRQGYPGPIGLKGEPGPIGPSGVPGIPGTPGRDGPKGQQGFPGPPG
KPGVIGLPGQKGEPGIQGPDGPKGFPGPRGHVGMQGQTGLDGSPGEKGDKGDIGFPGEPG
RPGLDGPRGLAGAPGEKGDIGFPGNPGLNGFIGPAGPRGDIGFKGSRGPKGEPGLASEKG
EKGDQGFPGLPGVDGRPGQDGEKGDKGFPGYPGQGIPGSQGEKGDAGLPGKMGFPGIPGD
KGDRGFPGLAGLKGERGPAGKDGLPGMPGRDGSPGAPGQDGLPGMDGEKGERGDRGLPGR
DGLDGLKGDQGIAGPPGPIGPMGFPGPKGDIGLPGPSINIKGEKGDIGFPGITGLQGDKG
DRGRDGFQGLQGEKGDQGFTGQKGEMGRMGAMGERGERGPIGPTGIPGLTVKGEKGLPGN
NGKHGRPGMRGATGEKGEQGLPGLPGPIGRSGMPGTPGPRGEPGEPGSEGVAGPPGFDGP
PGLQGRPGEYGEKGNKGDKGAVGFGLPGPKGDTGLPGLPGLNGEKGDKGDQGFDGLVGEM
GEKGNQGEKGDRGYPGRPGIPGLDGVKGDKGEAAAIVYGSKGEPGPRGPPGLNGPPGLDG
LPGPKGWDGAPGMKGDKGFQGPMGPPGLPGPQGIMGIQGERGETGRMGLQGVPGIPGAPC
ATTDYLTGILLVRHSQTNIVPQCEPGHIKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKF
STMPFLFCDLNDVCNYASRNDRSYWLSTNLPIPMMPVNNNEISRYISRCVVCEVPANVIA
VHSQTLDIPSCPVGWNSLWIGYSFVMHTGAGGQGGGQALASPGSCLEDFRATPFIECNGE
GGTCHHFANKLSFWLTTIDDKKQFAKPERETLKSGRLLQRVSRCAVCIKNTT