New model in OGS2.0 | DPOGS206535  |
---|---|
Genomic Position | scaffold74:- 101099-110030 |
See gene structure | |
CDS Length | 5739 |
Paired RNAseq reads   | 33091 |
Single RNAseq reads   | 78724 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014039 (0.0) |
Best Drosophila hit   | collagen type IV, isoform A (0.0) |
Best Human hit | collagen alpha-2(IV) chain preproprotein (6e-129) |
Best NR hit (blastp)   | GF15711 [Drosophila ananassae] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Collagen type IV CG4145-PA, isoform A isoform 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0005587 collagen type IV GO:0007391 dorsal closure GO:0005604 basement membrane GO:0005201 extracellular matrix structural constituent GO:0005488 binding GO:0005581 collagen |
InterPro families    | IPR008160 Collagen triple helix repeat IPR001442 Collagen IV, non-collagenous IPR016187 C-type lectin fold |
Orthology group | MCL10157 |
Nucleotide sequence:
ATGATCAATTCTTGGTTAGTTCTTAGAGTAGAAGTTACAGAGGCACTTACAGCAGATACA
GGAGCCGAAATCCTGGAGAGAAGGAGGAGAACGGGCACAAGGTGTACAGCGAGGAGGCGC
GCACGCGCCGGGCCCGACATGGCTGCGCCGCACTACTGGTCGCGCGCCCTCACACACAGG
GACGAGCGCAGCGCCTCCCGTACAGTCAGGCGCGGGGCGGGGGCTCAGAGACCGCCGCCC
GAGCGCTCCTCTTTCCTTAGTATCCCGGCTCAAACGGCGCGCGCGCACGGGACACTGGCG
CAGCGCGCTTTGGGCCATGGCGGTCCACGTACTTTGCAACGAAACGAATATGAAAGAGGA
GACGTCGAAGAAAATAATATTTATGATAATCAAGACTGGATGTATCAGAATAGTTACAAT
CCGCAACCAACAATAAATTCATACTATGGTTTATCTAGAAGGCAGGACTTACCGACACCC
TCTGCCCCCCCTTCTCCTCCACCGGAAAGAGTACAGCCCTCAAGAAGTTTTGGACAAAAC
TTTGCTGTGTATGACCCAGTAACTCGTCAGCGGACAAATGCATTTGATCGTAATTGTACG
GCCCCTGGCTGCTGTGTACCAAAATGTTTTGCAGAAAAGGGTAGTAGGGGTTTCCCAGGA
ATGCGGGGACCACCTGGAATCACAGGTCTACCAGGACACGTTGGCGCTGAAGGTCCACAG
GGACTTAAGGGTCAAAAAGGTCAAGATGGACCACAAGGTCCTCGGGGTCCGCGAGGAGAA
AAGGGTAAACCTGGAGCTCAAGGATTTATAGGCTTAGCAGGACCACCAGGTCCTCAAGGA
GAACCTGGTATGCCAGGGATTCCTGGACGTGATGGTTGTAACGGAACTGATGGAGAACCT
GGAATGGTGGGGATCAAAGGTTCACAGGGTCCACGTGGATTTGCTGGGCCTAAAGGTAAC
AAGGGTGATAAAGGAGAGCCGGCTTATATGGGTCGATACCCAAAAGGTGAAAAAGGAGAA
CCTGGAGCTGATGGTTTACAAGGCCAATCTGGCCCAGCTGGACCAACAGGTCCTCCAGGT
TTGGCTGGTCCCAAAGGAATGACTGGACCTATGGGACCACCTGGATATAAAGGTGATAAA
GGTCCTAAAGGATCTAAAGGACAATCTATTCAAGGTGATAAAGGAGACCGAGGTGACAAA
GGTGACAGAGGACCCGGTTGTCCATCAACAACGTTACCTTCATTGGATAATAAAGGAGCA
ATAAAAGGTGTCAAAGGTGATATGGGATCAAAAGGTGAAAAGGGAGAACCTGGGAGAATG
GGTGAAAAGGGAGAAACAGGTCCAATGGGGGAACCTGGCTTGCCTGGATTAATGGGCATT
AAAGGAGAAAAGGGCTTAAGAGGAAATCCTGGGGAACGGGGTCGTGAAGGAATGTATGGT
GAACCCGGACCTATGGGAAGAAAAGGTGATAGAGGCATTGATGGACTGAATGGTCTTCCC
GGCCGACCGGGTTTGAAAGGAGAACCCGGCAGGGATGGAGCAACAGGTCTAATGGGCTTA
AAAGGAGTGCCAGGTCCACCTGGTGGTCGAGCTGGAGCACGAGGTCCACCTGGGCCACCA
GGTCCTCGGGGCTATATCGGCGTTGCTGGTGCACCAGGGTCTAGTGGTAGGCCCGGAGAA
AATGGATTACCAGGACCTATGGGTCCAAGAGGTGGACAGGGAGAACCAGGTGACACAGGC
ATTGAAGGTCCAGCAGGTCAAAAAGGAGAAAAAGGAGAACCTGGTCTTGATGGCTTGCCT
GGAGAAATAGGTCAACGAGGATATGATGGACCCATTGGTCCTCAAGGACCTAGGGGACTA
AAAGGAGAAGAAGGTCAATCAATTCCTGGTGACAAGGGAAACAGTGGCCAACCAGGAATT
CCAGGAGATAAGGGAGCCAAGGGCGAAAGAGGTTATCCAGGATTACGAGGTACACCTGGA
AACTCTACATTAGGTACACCAGGAAGTCCCGGAGAAATGGGTCCACCTGGTGAAAAAGGT
GAAAAAGGAACTCCTGGGTACGATGGAATACCTGGTAATCCTGGACAAAAAGGTGACATT
GGAGGACGGTGTAACGAATGTCGACCTGGAAGTATGGGCGAAAAGGGAGACCGCGGTGCT
GATGGTCTACCTGGTGAACGGGGTGAACGAGGTCACATCGGACCCATCGGGATGACCGGG
GAGCGTGGTGCTGACGGTATGAATGGAATGCCTGGAGCTGCTGGAGCACCGGGTGAACGT
GGATTGGACGGACCAATAGGACCACCAGGAATGAGGGGAGCAGATGCAATGATACCGTCC
AATTTAGTAAAAGGACCTCCCGGAGAAAGAGGTGAACCAGGAGAAAAAGGAAACATGGGA
CCTAAGGGTGAAAGAGGACCTGATGGAATAATGGGTGATCGTGGATTAAATGGCATGCCC
GGACAGAAGGGTGACATGGGTAGAATGGGACCTTCTGGTATAGATGGCACACCTGGTAGT
GATGGAATACCGGGACGGCCAGGAATGAAAGGCATGTCCATCAAAGGTGAAAAAGGAATA
TCTGGTGATCAGGGTGAAAAAGGTGACAAAGGATTTTCTGGAAGACCAGGACTTAAAGGT
GAACCTGGTCAATGTCCCAATGAGTTAAAAATTCGCACAAAGGGAGAAAAAGGCAACCCT
GGCGTTCCAGGACCACAAGGACCATTAGGTATGAAGGGTGAAAAAGGTAATCAAGGGCCA
TTCGGTTTTACTGGTCCAAAGGGAGAGATGGGTTTACCAGGACGAGCTGGACCGGTAGGT
CCACGTGGTCTTCCGGGTTTCAAAGGCGATAAAGGTGAAATGGGTTCAATGGGATTTCCG
GGAACACCAGGGGATTTAGGCCCTAGAGGTTTTCCAGGGTTACCAGGATTAAAAGGAGAC
AAAGGTGAGATTGGTCCTTCTATGCCTGGACCACCTGGACCTGCTGGATTAAAGGGAGAT
AAAGGAGAACAAGGTCCAAGAGGTCAACCTGGAATAGAAGGAAAGGATGGTCCTCCAGGA
TTAGCTGGCTTACAAGGTGAAAAAGGTGATATGGGATTAATAGGAAGGCAAGGTTATCCA
GGACCTATTGGATTAAAGGGCGAACCGGGTCCTATAGGACCATCCGGAGTTCCGGGCATT
CCTGGTACGCCAGGAAGAGATGGACCTAAAGGTCAACAAGGATTTCCCGGTCCACCTGGT
AAACCTGGTGTAATTGGCTTACCTGGACAAAAAGGTGAACCAGGTATTCAAGGTCCAGAT
GGCCCGAAAGGTTTCCCAGGACCTCGTGGTCATGTTGGTATGCAAGGGCAAACTGGTCTT
GATGGAAGTCCCGGTGAAAAAGGAGATAAGGGTGATATAGGATTCCCGGGTGAGCCTGGT
AGACCTGGTCTTGATGGACCTAGAGGATTAGCTGGTGCACCTGGTGAGAAAGGTGATATA
GGTTTCCCAGGAAACCCTGGGTTGAATGGATTTATTGGACCAGCTGGCCCAAGAGGTGAT
ATAGGCTTCAAGGGTTCCAGGGGACCAAAAGGAGAACCTGGTTTAGCTTCAGAAAAGGGA
GAAAAAGGAGATCAAGGTTTTCCAGGATTACCTGGTGTTGATGGAAGACCTGGGCAAGAT
GGAGAAAAAGGTGACAAAGGTTTCCCTGGCTATCCAGGTCAAGGCATTCCAGGAAGTCAA
GGTGAAAAGGGAGATGCAGGTTTGCCTGGAAAAATGGGTTTTCCTGGTATTCCTGGCGAT
AAAGGCGACCGAGGCTTTCCAGGACTGGCAGGTTTAAAGGGAGAAAGAGGCCCTGCAGGC
AAAGACGGTTTGCCAGGAATGCCGGGAAGAGATGGCAGTCCTGGTGCTCCAGGCCAAGAT
GGTTTACCAGGAATGGATGGCGAAAAGGGTGAAAGAGGTGATCGAGGATTACCAGGTCGT
GATGGTCTTGATGGATTGAAAGGTGACCAGGGTATTGCTGGACCACCAGGGCCAATAGGA
CCAATGGGTTTTCCGGGTCCTAAAGGAGACATTGGTTTACCTGGGCCATCTATAAATATC
AAAGGTGAAAAGGGAGATATAGGTTTTCCCGGTATTACTGGACTTCAAGGAGATAAGGGT
GATCGAGGTAGAGATGGCTTCCAAGGTCTACAAGGGGAAAAGGGTGATCAAGGATTCACT
GGACAAAAGGGTGAAATGGGTAGAATGGGCGCCATGGGTGAAAGAGGTGAAAGAGGTCCA
ATTGGACCGACTGGTATTCCTGGACTCACAGTAAAAGGTGAAAAAGGTTTACCTGGAAAT
AACGGAAAACACGGCAGACCTGGCATGCGCGGTGCTACTGGAGAAAAAGGAGAACAAGGA
TTACCTGGACTTCCAGGTCCAATTGGGCGCTCTGGCATGCCAGGAACACCGGGACCTAGA
GGTGAACCCGGTGAACCAGGAAGTGAAGGAGTCGCAGGACCCCCTGGGTTTGACGGTCCT
CCGGGGCTACAAGGTCGTCCTGGCGAATATGGTGAAAAAGGTAACAAGGGTGATAAGGGT
GCTGTTGGTTTTGGTTTACCTGGCCCGAAAGGAGACACTGGCTTGCCAGGATTACCGGGT
TTAAATGGTGAAAAAGGTGATAAAGGAGATCAGGGTTTCGATGGATTAGTTGGAGAGATG
GGTGAGAAAGGTAACCAAGGAGAAAAAGGTGACAGAGGCTATCCTGGTCGGCCTGGAATT
CCTGGCCTTGATGGTGTAAAAGGAGATAAGGGAGAAGCGGCTGCTATAGTTTATGGAAGT
AAGGGAGAACCAGGACCAAGAGGTCCTCCTGGATTGAATGGTCCACCTGGACTTGACGGA
TTACCTGGTCCTAAAGGCTGGGATGGTGCTCCAGGCATGAAAGGAGATAAAGGTTTCCAA
GGACCTATGGGCCCACCAGGCTTACCAGGACCTCAAGGAATAATGGGTATTCAAGGTGAA
CGTGGTGAAACAGGTCGTATGGGATTACAAGGTGTACCTGGAATACCTGGTGCTCCTTGT
GCTACTACAGACTATCTTACTGGCATCCTTTTAGTGCGTCATAGTCAAACAAACATAGTA
CCCCAATGTGAACCCGGACATATTAAATTGTGGGATGGCTATTCCTTACTTTACATTGAT
GGAAATGAAAAGGCTCATAATCAAGATCTGGGATATGCTGGATCTTGTGTAAGAAAGTTC
AGTACCATGCCATTCCTTTTCTGTGATCTTAATGATGTATGCAATTACGCAAGTCGAAAT
GATCGCAGTTATTGGCTTTCTACAAATTTGCCGATACCCATGATGCCAGTAAACAACAAT
GAAATTTCACGATATATTTCAAGATGTGTTGTTTGTGAGGTTCCAGCCAATGTCATAGCT
GTTCACAGTCAAACTCTTGATATACCTAGTTGTCCAGTGGGTTGGAACTCATTATGGATT
GGATACAGTTTTGTTATGCACACTGGAGCTGGTGGACAAGGCGGTGGTCAAGCCCTTGCT
AGTCCGGGATCTTGTCTTGAAGACTTCCGAGCGACACCATTTATTGAATGTAACGGTGAA
GGTGGTACTTGCCATCATTTCGCCAATAAACTTAGTTTTTGGCTAACAACTATAGATGAT
AAGAAGCAATTCGCAAAACCAGAGCGTGAAACTCTTAAATCTGGACGACTATTGCAGCGA
GTGTCTAGATGCGCTGTTTGCATTAAGAATACCACATAG
Protein sequence:
MINSWLVLRVEVTEALTADTGAEILERRRRTGTRCTARRRARAGPDMAAPHYWSRALTHR
DERSASRTVRRGAGAQRPPPERSSFLSIPAQTARAHGTLAQRALGHGGPRTLQRNEYERG
DVEENNIYDNQDWMYQNSYNPQPTINSYYGLSRRQDLPTPSAPPSPPPERVQPSRSFGQN
FAVYDPVTRQRTNAFDRNCTAPGCCVPKCFAEKGSRGFPGMRGPPGITGLPGHVGAEGPQ
GLKGQKGQDGPQGPRGPRGEKGKPGAQGFIGLAGPPGPQGEPGMPGIPGRDGCNGTDGEP
GMVGIKGSQGPRGFAGPKGNKGDKGEPAYMGRYPKGEKGEPGADGLQGQSGPAGPTGPPG
LAGPKGMTGPMGPPGYKGDKGPKGSKGQSIQGDKGDRGDKGDRGPGCPSTTLPSLDNKGA
IKGVKGDMGSKGEKGEPGRMGEKGETGPMGEPGLPGLMGIKGEKGLRGNPGERGREGMYG
EPGPMGRKGDRGIDGLNGLPGRPGLKGEPGRDGATGLMGLKGVPGPPGGRAGARGPPGPP
GPRGYIGVAGAPGSSGRPGENGLPGPMGPRGGQGEPGDTGIEGPAGQKGEKGEPGLDGLP
GEIGQRGYDGPIGPQGPRGLKGEEGQSIPGDKGNSGQPGIPGDKGAKGERGYPGLRGTPG
NSTLGTPGSPGEMGPPGEKGEKGTPGYDGIPGNPGQKGDIGGRCNECRPGSMGEKGDRGA
DGLPGERGERGHIGPIGMTGERGADGMNGMPGAAGAPGERGLDGPIGPPGMRGADAMIPS
NLVKGPPGERGEPGEKGNMGPKGERGPDGIMGDRGLNGMPGQKGDMGRMGPSGIDGTPGS
DGIPGRPGMKGMSIKGEKGISGDQGEKGDKGFSGRPGLKGEPGQCPNELKIRTKGEKGNP
GVPGPQGPLGMKGEKGNQGPFGFTGPKGEMGLPGRAGPVGPRGLPGFKGDKGEMGSMGFP
GTPGDLGPRGFPGLPGLKGDKGEIGPSMPGPPGPAGLKGDKGEQGPRGQPGIEGKDGPPG
LAGLQGEKGDMGLIGRQGYPGPIGLKGEPGPIGPSGVPGIPGTPGRDGPKGQQGFPGPPG
KPGVIGLPGQKGEPGIQGPDGPKGFPGPRGHVGMQGQTGLDGSPGEKGDKGDIGFPGEPG
RPGLDGPRGLAGAPGEKGDIGFPGNPGLNGFIGPAGPRGDIGFKGSRGPKGEPGLASEKG
EKGDQGFPGLPGVDGRPGQDGEKGDKGFPGYPGQGIPGSQGEKGDAGLPGKMGFPGIPGD
KGDRGFPGLAGLKGERGPAGKDGLPGMPGRDGSPGAPGQDGLPGMDGEKGERGDRGLPGR
DGLDGLKGDQGIAGPPGPIGPMGFPGPKGDIGLPGPSINIKGEKGDIGFPGITGLQGDKG
DRGRDGFQGLQGEKGDQGFTGQKGEMGRMGAMGERGERGPIGPTGIPGLTVKGEKGLPGN
NGKHGRPGMRGATGEKGEQGLPGLPGPIGRSGMPGTPGPRGEPGEPGSEGVAGPPGFDGP
PGLQGRPGEYGEKGNKGDKGAVGFGLPGPKGDTGLPGLPGLNGEKGDKGDQGFDGLVGEM
GEKGNQGEKGDRGYPGRPGIPGLDGVKGDKGEAAAIVYGSKGEPGPRGPPGLNGPPGLDG
LPGPKGWDGAPGMKGDKGFQGPMGPPGLPGPQGIMGIQGERGETGRMGLQGVPGIPGAPC
ATTDYLTGILLVRHSQTNIVPQCEPGHIKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKF
STMPFLFCDLNDVCNYASRNDRSYWLSTNLPIPMMPVNNNEISRYISRCVVCEVPANVIA
VHSQTLDIPSCPVGWNSLWIGYSFVMHTGAGGQGGGQALASPGSCLEDFRATPFIECNGE
GGTCHHFANKLSFWLTTIDDKKQFAKPERETLKSGRLLQRVSRCAVCIKNTT