DPGLEAN11115 in OGS1.0

New model in OGS2.0: DPOGS204583
Genomic Position: scaffold411:- 18293-52847
CDS Length: 3372
Paired RNAseq reads: 1285
Single RNAseq reads: 2991
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA001435 (0.0)
Best Drosophila hit: CG4096, isoform B (7e-151)
Best Human hit: A disintegrin and metalloproteinase with thrombospondin motifs 7 preproprotein (7e-131)
Best NR hit (blastp): hypothetical protein TcasGA2_TC001014 [Tribolium castaneum] (0.0)
Best NR hit (blastx): hypothetical protein TcasGA2_TC001014 [Tribolium castaneum] (0.0)
GeneOntology terms:
GO:0004222 metalloendopeptidase activity
GO:0008270 zinc ion binding
GO:0005578 proteinaceous extracellular matrix
GO:0006508 proteolysis
InterPro families:
IPR000884 Thrombospondin, type 1 repeat
IPR002870 Peptidase M12B, propeptide
IPR001590 Peptidase M12B, ADAM/reprolysin
IPR010294 ADAM-TS Spacer 1
IPR013273 Peptidase M12B, ADAM-TS
Orthology group: MCL10351

Nucleotide sequence:

ATGCGTGCTAGTGACTGCCGGGTGTTTAGAAAAATGGATGTGCGGTGTTCGATATTCAAA
CTGCTGTTGTTTCAAATTGTGATTATAAGATGTGGAGCAAAACACATACGCACAGCGAGG
GTAGAGTTGGCTGGTGATGTGGAGAGGGAAGTCCAGGATAGCATAAGGAACCTTATACAC
ACCGGTATCTATTCACACCAACACCTGGATCATTCTCAAGTGCAAGTGGTGACCCCGGTG
AAGGTTTCATCAGAGGGCGAACTTATCTCTCATGAAGTTGAACATGCACACGGGCATGGT
CACGCGCGCTCACGCCGTGACCTTCATGGCATAGAACATCATCTGCCCCATTCTCTTCAC
TACAACCTCACTGTGGATGGACGGAATCTTAGGCTGGACCTAAGGCCCTCAGTAACGTTC
ATCACTCCAGCGTTGGTTGTAGAGCGACACAGGGTTAACTCGAGAACTCGGGCGAGGCCT
CTGGAGGCAGCTACTGCGTGCCACTACACCGGCTCCGTGAGAGGACAGCCCGGCTCGAAT
GTCGCACTATCAGCCTGCGATGGACTTGCAGGCATGTTGCGGACGAAATATGGAGAATAT
TGGATTGAGCCATCAAATCAACTAACGACCGACAGCTCGTCTGGGCGACCACATGTTATA
TTTAAGAGATCTGCTGTCGATAAAGTTGAGGCATTCCATAGAAGAAAGAGGGAAGTCGAT
AGGAGAATGACTTCTTCGTCTAGAAATAACAATGAAAAGAATTATAAAGAAAAAAATTCA
AAACGAAACACGAGAAGAAAGGATAAGGAGGACATGGATCAGCGTAGACGAGCGTATATA
GAAGAGAGACGGAGGAGATTAGAGGCAATGAGACGTGATCCAGCAGCATATAGAAGAAAT
AAAAAGAATCTACGAATGGAGCATAGAAGACCGCTGTCGGAATCAGCGTCTTCATCCGTA
TCGAGGAGCACATCCATGGAAAATAGTCATTCCTTGGAACAAAATTTAACAACGAGCAGG
TCGAGTATTGAACAGCGACCACGAAGGATAAGAACGAGACGAAAGAGACGGAGACGTTCA
AAGAACTGTGCAACAAAACAACCACCGTATCAATGGAAGTATAGAAATTTTAAAGACCTT
CATTCAAAGGAAGAACGTAGCAAATCAAAATATTACAAGAGAACCAATAAAACTCAACGC
AATAATGTAAACAGACGGTATCTAGAAGCATCAAAACGATCGACGCGATCCGTTAGCAAG
CCGCGTCACGTTGAAGTGTTACTTGTCGCCGATAAGTCTATGACTGATTTCCACAATCAA
GGCAACTTGGAAACTTATCTACTGACAATCATGAACATGGTATCATCCCTGTATATGGAT
CCATCAATCGGGAACTACATCAAAGTGGTGGTAGTCAAAATTATTTTGGTCGAGGAAATG
CACGCGGCACCTGAGCTATCAGTGTCCACTAACGCGGACTCAACCCTCGCGTCCTTCTGT
AGGTGGCAGCACCAGCTGAACCCGGACGACGACCACGATCCACACCACCATGACGTCGCC
ATACTCATCACACGACGGGATATATGCAGTCAGCAGGACACACCTTGTAGTACTTTGGGC
GTGGCGCACGTGGCGGGCATGTGCAAGCCGGACCGCAGCTGTTCCGTCAACGAGGACAAT
GGCATCATGCTGGCGCACACCATCACGCACGAGCTGGGACACAATTTCGGCCTGTACCAT
GACACCGAGAAAATTGGTTGTCACAGACGCGATGGGGCTACTCTTCATATTATGACACCG
ATCTTTGAACCAGATACGGTACAAGTCGCCTGGTCACGATGCAGTAAACGGGATGTCACC
AACTTTTTAGATGCCGGTTTAGGGGATTGCTTGAGCGACAGACCATCTCAGGAAGAACCG
TACGTCTATCCTGAACTACCCGCAGGAGTAACATTTGACGCGGCTTATCAATGTCACTTA
CAATTCGGTGAGGAAGCCGTAGTGTGTTCCAAGCTGACGGAGCTTTGTGAGCATCTGTGG
TGTCTCGTCAACAACACGTGCAAATCGATGTTAAGACCGGCCGCGCCGGGGACAACTTGT
GGAGAGAATATGTGGTGTCAGAATCAGACGTGCGTCCCAAAGGCACCGTCCCCCGTACCA
CGTGACGGTGGTTGGGGCGCCTGGAGCGAGTGGAGCGAATGTTCCCGCACTTGTGGTGCA
GGGGTGTCCACTCAACATCGAGAGTGCAACAATCCTGAACCACTCAACAATGGAAACTAC
TGCATCGGGGATAGGAGCCGGTACAAAGTTTGCAACACCGATCCGTGCCCTATCAATGAG
CCGACGTTCAGGGAAGTTCAGTGCTCCAAGTTTAATAACATGACATACAAGAACGAAACC
ATCTCCGAATGGATACCATACATTGATCAAGATAAACCGTGCGATTTACAATGCGTGCCT
CGCAATCGGAATGATATAGAGATGATCGGCAGCTTCGTAACAGACGGAACACCGTGCAGA
CAGGGTCTCGGAGTTCGGGACATGTGTATAGCTGGCGTGTGCTACAAAGTGGGTTGTGAC
TGGATTGTGGACTCTGATGCTGAAGAGGACGCGTGCGGTGAGTGCGGAGGAGACGGCTCC
GCGTGTAGGACAGTACAAGGGATATATAACAAGGATACTACTAGACAGTCTGGTTATAGC
GAGGTTGCGGTAATACCAGCTGGTTCTAGAAATGTGAAGATTCAAGAAAAAGTCAGCCCT
GGAAACTATATTTCAATCGGAAGCGCTAAATCTAGAAAGATATATCTCACTGGAGCGCGA
AATGCAACATTAACTGAGTACTTCGTAGCCGGAGCTCAGGCGATCTATGAAAGGGACCGT
GATTGGGAAAAAGTTAGGATCAGTGGACCCCTTGCTGAGGACATAAAGGTTTATCAACGC
ATCTTCCGCGGCAAGCATCGCAATCCTGGCGTGACGTATCAGTATACAGTGGACCAACCT
AAATCATTGGTCCGCTATCGTCTGTCAGACTGGAGCGCCTGTTCTGTAACCTGCGGCCTC
GGGCGGATGTACCGACACTACATCTGCGTGGACCAGCACAACCGTCAAGTTGACCTATCC
ATGTGTTACCACATGGAGCAACCTCGTCTTGGCGCGTTGGTGCAGCAGTGCCGGTCAGTC
TCCTGTACACACTGGTGGGTCGGACCATGGAAACCATGCTCCATTTGCCATATGCCGGGC
GAGGAAGCTACCAAGAAACGTAATGTCCACTGTGTGAACAAGACATCCAACCAGGTAGTT
GACGACTCAGAATGTGATATGTCAACTAAGCCGATAGGCGTTATCAAATGCGCAGACGTA
CCAGCTTGTTGA

Protein sequence:

MRASDCRVFRKMDVRCSIFKLLLFQIVIIRCGAKHIRTARVELAGDVEREVQDSIRNLIH
TGIYSHQHLDHSQVQVVTPVKVSSEGELISHEVEHAHGHGHARSRRDLHGIEHHLPHSLH
YNLTVDGRNLRLDLRPSVTFITPALVVERHRVNSRTRARPLEAATACHYTGSVRGQPGSN
VALSACDGLAGMLRTKYGEYWIEPSNQLTTDSSSGRPHVIFKRSAVDKVEAFHRRKREVD
RRMTSSSRNNNEKNYKEKNSKRNTRRKDKEDMDQRRRAYIEERRRRLEAMRRDPAAYRRN
KKNLRMEHRRPLSESASSSVSRSTSMENSHSLEQNLTTSRSSIEQRPRRIRTRRKRRRRS
KNCATKQPPYQWKYRNFKDLHSKEERSKSKYYKRTNKTQRNNVNRRYLEASKRSTRSVSK
PRHVEVLLVADKSMTDFHNQGNLETYLLTIMNMVSSLYMDPSIGNYIKVVVVKIILVEEM
HAAPELSVSTNADSTLASFCRWQHQLNPDDDHDPHHHDVAILITRRDICSQQDTPCSTLG
VAHVAGMCKPDRSCSVNEDNGIMLAHTITHELGHNFGLYHDTEKIGCHRRDGATLHIMTP
IFEPDTVQVAWSRCSKRDVTNFLDAGLGDCLSDRPSQEEPYVYPELPAGVTFDAAYQCHL
QFGEEAVVCSKLTELCEHLWCLVNNTCKSMLRPAAPGTTCGENMWCQNQTCVPKAPSPVP
RDGGWGAWSEWSECSRTCGAGVSTQHRECNNPEPLNNGNYCIGDRSRYKVCNTDPCPINE
PTFREVQCSKFNNMTYKNETISEWIPYIDQDKPCDLQCVPRNRNDIEMIGSFVTDGTPCR
QGLGVRDMCIAGVCYKVGCDWIVDSDAEEDACGECGGDGSACRTVQGIYNKDTTRQSGYS
EVAVIPAGSRNVKIQEKVSPGNYISIGSAKSRKIYLTGARNATLTEYFVAGAQAIYERDR
DWEKVRISGPLAEDIKVYQRIFRGKHRNPGVTYQYTVDQPKSLVRYRLSDWSACSVTCGL
GRMYRHYICVDQHNRQVDLSMCYHMEQPRLGALVQQCRSVSCTHWWVGPWKPCSICHMPG
EEATKKRNVHCVNKTSNQVVDDSECDMSTKPIGVIKCADVPAC
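
The nucleotide and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model; the record itself does not state which table was used. The helper name translate is hypothetical and not part of any database tool. Only the first and last 12 nucleotides of the CDS are translated here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the sequence T, C, A, G. Assumed here, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon).

    Whitespace and line breaks are ignored, so a sequence block pasted
    from the record can be passed in directly. Codons containing
    characters other than A, C, G, T are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First and last 12 nucleotides of the CDS listed in this record.
cds_start = "ATGCGTGCTAGT"
cds_end = "CCAGCTTGTTGA"

print(translate(cds_start))  # MRAS
print(translate(cds_end))    # PAC*
```

The two fragments translate to MRAS and PAC followed by a stop, which agrees with the start and end of the protein sequence above. By the same arithmetic, the stated CDS length of 3372 nucleotides corresponds to 1124 codons, that is, 1123 residues plus the stop codon.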