New model in OGS2.0 | DPOGS204583  |
---|---|
Genomic Position | scaffold411:- 18293-52847 |
See gene structure | |
CDS Length | 3372 |
Paired RNAseq reads   | 1285 |
Single RNAseq reads   | 2991 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001435 (0.0) |
Best Drosophila hit   | CG4096, isoform B (7e-151) |
Best Human hit | A disintegrin and metalloproteinase with thrombospondin motifs 7 preproprotein (7e-131) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC001014 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC001014 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity GO:0008270 zinc ion binding GO:0005578 proteinaceous extracellular matrix GO:0006508 proteolysis |
InterPro families    | IPR000884 Thrombospondin, type 1 repeat IPR002870 Peptidase M12B, propeptide IPR001590 Peptidase M12B, ADAM/reprolysin IPR010294 ADAM-TS Spacer 1 IPR013273 Peptidase M12B, ADAM-TS |
Orthology group | MCL10351 |
Nucleotide sequence:
ATGCGTGCTAGTGACTGCCGGGTGTTTAGAAAAATGGATGTGCGGTGTTCGATATTCAAA
CTGCTGTTGTTTCAAATTGTGATTATAAGATGTGGAGCAAAACACATACGCACAGCGAGG
GTAGAGTTGGCTGGTGATGTGGAGAGGGAAGTCCAGGATAGCATAAGGAACCTTATACAC
ACCGGTATCTATTCACACCAACACCTGGATCATTCTCAAGTGCAAGTGGTGACCCCGGTG
AAGGTTTCATCAGAGGGCGAACTTATCTCTCATGAAGTTGAACATGCACACGGGCATGGT
CACGCGCGCTCACGCCGTGACCTTCATGGCATAGAACATCATCTGCCCCATTCTCTTCAC
TACAACCTCACTGTGGATGGACGGAATCTTAGGCTGGACCTAAGGCCCTCAGTAACGTTC
ATCACTCCAGCGTTGGTTGTAGAGCGACACAGGGTTAACTCGAGAACTCGGGCGAGGCCT
CTGGAGGCAGCTACTGCGTGCCACTACACCGGCTCCGTGAGAGGACAGCCCGGCTCGAAT
GTCGCACTATCAGCCTGCGATGGACTTGCAGGCATGTTGCGGACGAAATATGGAGAATAT
TGGATTGAGCCATCAAATCAACTAACGACCGACAGCTCGTCTGGGCGACCACATGTTATA
TTTAAGAGATCTGCTGTCGATAAAGTTGAGGCATTCCATAGAAGAAAGAGGGAAGTCGAT
AGGAGAATGACTTCTTCGTCTAGAAATAACAATGAAAAGAATTATAAAGAAAAAAATTCA
AAACGAAACACGAGAAGAAAGGATAAGGAGGACATGGATCAGCGTAGACGAGCGTATATA
GAAGAGAGACGGAGGAGATTAGAGGCAATGAGACGTGATCCAGCAGCATATAGAAGAAAT
AAAAAGAATCTACGAATGGAGCATAGAAGACCGCTGTCGGAATCAGCGTCTTCATCCGTA
TCGAGGAGCACATCCATGGAAAATAGTCATTCCTTGGAACAAAATTTAACAACGAGCAGG
TCGAGTATTGAACAGCGACCACGAAGGATAAGAACGAGACGAAAGAGACGGAGACGTTCA
AAGAACTGTGCAACAAAACAACCACCGTATCAATGGAAGTATAGAAATTTTAAAGACCTT
CATTCAAAGGAAGAACGTAGCAAATCAAAATATTACAAGAGAACCAATAAAACTCAACGC
AATAATGTAAACAGACGGTATCTAGAAGCATCAAAACGATCGACGCGATCCGTTAGCAAG
CCGCGTCACGTTGAAGTGTTACTTGTCGCCGATAAGTCTATGACTGATTTCCACAATCAA
GGCAACTTGGAAACTTATCTACTGACAATCATGAACATGGTATCATCCCTGTATATGGAT
CCATCAATCGGGAACTACATCAAAGTGGTGGTAGTCAAAATTATTTTGGTCGAGGAAATG
CACGCGGCACCTGAGCTATCAGTGTCCACTAACGCGGACTCAACCCTCGCGTCCTTCTGT
AGGTGGCAGCACCAGCTGAACCCGGACGACGACCACGATCCACACCACCATGACGTCGCC
ATACTCATCACACGACGGGATATATGCAGTCAGCAGGACACACCTTGTAGTACTTTGGGC
GTGGCGCACGTGGCGGGCATGTGCAAGCCGGACCGCAGCTGTTCCGTCAACGAGGACAAT
GGCATCATGCTGGCGCACACCATCACGCACGAGCTGGGACACAATTTCGGCCTGTACCAT
GACACCGAGAAAATTGGTTGTCACAGACGCGATGGGGCTACTCTTCATATTATGACACCG
ATCTTTGAACCAGATACGGTACAAGTCGCCTGGTCACGATGCAGTAAACGGGATGTCACC
AACTTTTTAGATGCCGGTTTAGGGGATTGCTTGAGCGACAGACCATCTCAGGAAGAACCG
TACGTCTATCCTGAACTACCCGCAGGAGTAACATTTGACGCGGCTTATCAATGTCACTTA
CAATTCGGTGAGGAAGCCGTAGTGTGTTCCAAGCTGACGGAGCTTTGTGAGCATCTGTGG
TGTCTCGTCAACAACACGTGCAAATCGATGTTAAGACCGGCCGCGCCGGGGACAACTTGT
GGAGAGAATATGTGGTGTCAGAATCAGACGTGCGTCCCAAAGGCACCGTCCCCCGTACCA
CGTGACGGTGGTTGGGGCGCCTGGAGCGAGTGGAGCGAATGTTCCCGCACTTGTGGTGCA
GGGGTGTCCACTCAACATCGAGAGTGCAACAATCCTGAACCACTCAACAATGGAAACTAC
TGCATCGGGGATAGGAGCCGGTACAAAGTTTGCAACACCGATCCGTGCCCTATCAATGAG
CCGACGTTCAGGGAAGTTCAGTGCTCCAAGTTTAATAACATGACATACAAGAACGAAACC
ATCTCCGAATGGATACCATACATTGATCAAGATAAACCGTGCGATTTACAATGCGTGCCT
CGCAATCGGAATGATATAGAGATGATCGGCAGCTTCGTAACAGACGGAACACCGTGCAGA
CAGGGTCTCGGAGTTCGGGACATGTGTATAGCTGGCGTGTGCTACAAAGTGGGTTGTGAC
TGGATTGTGGACTCTGATGCTGAAGAGGACGCGTGCGGTGAGTGCGGAGGAGACGGCTCC
GCGTGTAGGACAGTACAAGGGATATATAACAAGGATACTACTAGACAGTCTGGTTATAGC
GAGGTTGCGGTAATACCAGCTGGTTCTAGAAATGTGAAGATTCAAGAAAAAGTCAGCCCT
GGAAACTATATTTCAATCGGAAGCGCTAAATCTAGAAAGATATATCTCACTGGAGCGCGA
AATGCAACATTAACTGAGTACTTCGTAGCCGGAGCTCAGGCGATCTATGAAAGGGACCGT
GATTGGGAAAAAGTTAGGATCAGTGGACCCCTTGCTGAGGACATAAAGGTTTATCAACGC
ATCTTCCGCGGCAAGCATCGCAATCCTGGCGTGACGTATCAGTATACAGTGGACCAACCT
AAATCATTGGTCCGCTATCGTCTGTCAGACTGGAGCGCCTGTTCTGTAACCTGCGGCCTC
GGGCGGATGTACCGACACTACATCTGCGTGGACCAGCACAACCGTCAAGTTGACCTATCC
ATGTGTTACCACATGGAGCAACCTCGTCTTGGCGCGTTGGTGCAGCAGTGCCGGTCAGTC
TCCTGTACACACTGGTGGGTCGGACCATGGAAACCATGCTCCATTTGCCATATGCCGGGC
GAGGAAGCTACCAAGAAACGTAATGTCCACTGTGTGAACAAGACATCCAACCAGGTAGTT
GACGACTCAGAATGTGATATGTCAACTAAGCCGATAGGCGTTATCAAATGCGCAGACGTA
CCAGCTTGTTGA
Protein sequence:
MRASDCRVFRKMDVRCSIFKLLLFQIVIIRCGAKHIRTARVELAGDVEREVQDSIRNLIH
TGIYSHQHLDHSQVQVVTPVKVSSEGELISHEVEHAHGHGHARSRRDLHGIEHHLPHSLH
YNLTVDGRNLRLDLRPSVTFITPALVVERHRVNSRTRARPLEAATACHYTGSVRGQPGSN
VALSACDGLAGMLRTKYGEYWIEPSNQLTTDSSSGRPHVIFKRSAVDKVEAFHRRKREVD
RRMTSSSRNNNEKNYKEKNSKRNTRRKDKEDMDQRRRAYIEERRRRLEAMRRDPAAYRRN
KKNLRMEHRRPLSESASSSVSRSTSMENSHSLEQNLTTSRSSIEQRPRRIRTRRKRRRRS
KNCATKQPPYQWKYRNFKDLHSKEERSKSKYYKRTNKTQRNNVNRRYLEASKRSTRSVSK
PRHVEVLLVADKSMTDFHNQGNLETYLLTIMNMVSSLYMDPSIGNYIKVVVVKIILVEEM
HAAPELSVSTNADSTLASFCRWQHQLNPDDDHDPHHHDVAILITRRDICSQQDTPCSTLG
VAHVAGMCKPDRSCSVNEDNGIMLAHTITHELGHNFGLYHDTEKIGCHRRDGATLHIMTP
IFEPDTVQVAWSRCSKRDVTNFLDAGLGDCLSDRPSQEEPYVYPELPAGVTFDAAYQCHL
QFGEEAVVCSKLTELCEHLWCLVNNTCKSMLRPAAPGTTCGENMWCQNQTCVPKAPSPVP
RDGGWGAWSEWSECSRTCGAGVSTQHRECNNPEPLNNGNYCIGDRSRYKVCNTDPCPINE
PTFREVQCSKFNNMTYKNETISEWIPYIDQDKPCDLQCVPRNRNDIEMIGSFVTDGTPCR
QGLGVRDMCIAGVCYKVGCDWIVDSDAEEDACGECGGDGSACRTVQGIYNKDTTRQSGYS
EVAVIPAGSRNVKIQEKVSPGNYISIGSAKSRKIYLTGARNATLTEYFVAGAQAIYERDR
DWEKVRISGPLAEDIKVYQRIFRGKHRNPGVTYQYTVDQPKSLVRYRLSDWSACSVTCGL
GRMYRHYICVDQHNRQVDLSMCYHMEQPRLGALVQQCRSVSCTHWWVGPWKPCSICHMPG
EEATKKRNVHCVNKTSNQVVDDSECDMSTKPIGVIKCADVPAC