DPGLEAN05226 in OGS1.0

New model in OGS2.0DPOGS206275 
Genomic Positionscaffold1236:- 13810-30393
See gene structure
CDS Length5250
Paired RNAseq reads  4230
Single RNAseq reads  10227
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010830 (0.0)
Best Drosophila hit  Neu3, isoform C (0.0)
Best Human hitdisintegrin and metalloproteinase domain-containing protein 12 isoform 1 preproprotein (3e-98)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC010657 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010657 [Tribolium castaneum] (0.0)
GeneOntology terms

  
GO:0004222 metalloendopeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
InterPro families




  
IPR001762 Blood coagulation inhibitor, Disintegrin
IPR006586 ADAM, cysteine-rich
IPR000742 Epidermal growth factor-like, type 3
IPR001590 Peptidase M12B, ADAM/reprolysin
IPR013032 EGF-like region, conserved site
IPR002870 Peptidase M12B, propeptide
Orthology groupMCL10437

Nucleotide sequence:

ATGCTATCAAAATATGCGTTTTATTATATAGGATCCCCCGCGCCAGAGTTTAGCCGGCAC
AGACTCGTACGACCGATCATCCAGCATGCGAGGACTAAGAGGGAGATAACATCGACCAGA
CACACGGAAGGCATCCATCATCCAGAGCTGGTTATGAAGATGAACTTTGATGGTCGCGAG
CACGTCCTTGACTTGAGACTGAACGAGGATCTCATTACCAAGGATCATGTGATAGCATAC
CAGAAGGATGGGGAGACGGTGATACATCGACCTACATTGAAGGAGCTCGACATATGCCAG
TACTCTGGCAAGGTGAGGGACAAGAAAGAATCGTGGGTCGCCGTGTCCACATGCGACGGA
GTGAGGGGGATCATTCACGATGGACAGACAATGAGATATATAGAACCAGCCGATAGAAAC
GAAATCGACTCTCAGCACTATCTATACGAGCACTCGGATCTGAACACCGATTTCCACTGC
GGGTACAGCGGAGGCATCACTACCAATGACACGTACGACCCCGAGCTCATGAAGCGACAC
ATGCATAGCAGGAACGTGGAGAAGAGCAGAATAAGTCGGTACAAACGTGATGCGTACGAG
GACACAGAGGTGAGGGGTCCGTTCAAGGTCAACAAACTGTCCCGCTTCGTGGAGTTGGTG
CTCGTGGCGGACAACAGAGAGTTCAGAGCCAACGGGGAGAGCAAGGAAACGGTGCACAGA
CAGCTCAAGGACGTCGCTAATATTATTAATTCTGTGTACACCCCGCTTAATATCTTCATA
GCGCTAGTGGGTGTTGTCGTGTGGAACGAAAGAGACGAAATACGGTTAGAGGAGGACGGA
GATAAAACTCTCACAGAGTTCCTACATTACAGGAAAAGGCTGCTCCCTGTCATGCCCAAC
GACAACGCACACCTGTTAACCCGTCAGAAATTTAAAGATGGCGTCGTGGGGAAGGCTCTA
AAAGGGCCGATATGTACGTACAATTTCTCTGGTGGTGTCGCCACAAACCATTCGGAGGTG
ATCGGTCTGGTGGCGACCACTATAGCCCACGAGATGGGCCACAACTTTGGCATGGAACAC
GACACTGAGGCCGACTGCGAGTGTCCCGATGAGAAGTGCATCATGAGCCCCTCCAGTACG
TCGGTCACCCCTACCAAATGGTCGTCCTGCAGCTTGAGATCACTCGCGCTGGCGTTCGAG
AGAGGCATGGATTACTGTCTGCGTAATAAGCCAAAGCGTCTATTCGAGCCTTCCACTTGC
GGCAACGGATTCATTGAACCCGGCGAGCAGTGTGATTGTGGCCTGGCGGGCGATCCAGCC
TGCACTGCTTGCTGTGACCCGCGGGCGTGCGTGTTACGCTCTAACGCGACCTGCGCGGCG
GGAGAGTGCTGTGATACGACGACTTGTCGTCCGAAGCCGGCGGGGACGGTGTGCAGGGCG
GCCGACAAGGAGTGTGATCTGGCGGAGTACTGCAGCGGACACTCGGAGTACTGTCCGCGG
GATGTGTACAAGATGGACGCCACGCCCTGTGGGGGAGGGAAAGCGTACTGTGCGGGCGGG
TCTTGTCGGACCCACACGGATCAATGCCGACTTCTCTGGGGTTTCTCCGGAGAGAACTCG
GACGTTCAATGTTACACCAACTCTAATACTAAAGGGGATAGGAAGGGGAACTGCGGCTAC
CATCGCGAAGACCCGCCCGTCTACTACAAATGTTCTAAAGAAGATTCTCTCTGCGGTCTG
CTGCAGTGTCGCCATCTCAATGAAAGACTCGAATTCGGCATGGAGTCCGTGTCTACACTG
TCAGCTGTCTTCATTAATAATAACGGCACGATAATTCCCTGCCGCACGGCCATGGTGGAC
ATGGGCACGAGCGATCCCGACCCGGGCTTCGTACCAGACGGCGCGAAATGTGGAGACGAT
AAAATGTGTATGAAACATAGATGCGTTTCAATAGCGGAAGTGACGTCAGAGATCGCTCGG
AAAGAAACATCCGTCTGTCCGTCCAACTGTTCGGGCCATGGAGTGTGTAACTCAGAAGGA
CATTGTCACTGCGACTCGGGCTTCGCCCCTCCACTGTGTGAGCTCCCCGGGCCGGGAGGT
TCCGTGGACTCCGGACCAGCCACTGACGCTTCAATTCAACGGAACTTCATGGTCGCTATG
TACATAATCTTCCTGGGCATCCTGCCGTCCGTGCTGCTGGTGATGCTGCTCATGTACTAC
TCGCGTCACAACGTGCTGCTGTGCTGGAAGAAACCCAAAAAATCGTACGTAAATAACATT
TTCAACGGCGACCGATTCAAAAGATTCAAAACATCGACCGATTCCTTCGTAAGACTAATC
AGCTTTAGGCGGACACAGAAGAAAAATATGTGCAGGAAATGTCAGGACGATATATACAGT
AATATATGCGAACACAAAGAAAATATAGACAGTACGTGGAGTTTTAACATTAGTTCTAAG
ATAATCAATATGTTGAATAGTAACAAACACGATGAAAGCAAAAAGTTTCAGAGGAAGATA
AACAAAGACGACATCAAAGTCGCCGACGACTTAGATCTAGCTAACGTTAGGGTTAAAGTT
GAACCGAAAATCAACAAATCAAACATAGTCATAGTTAAAACCGGCCTGGCCTCGACTACC
AACGAACATGTTAAGGCCGAGATCAACACAACCAAACAGGAAGTTAGCTTAGATAGGAAC
AAAACCAAAAAGAACATTGTTGTGACAAATAAAATTAATACAGATGTCATATATGAAAAC
TGTATACCGAATAAGGTAAAGGGGAGTGCGTTAAGGGTGGAAAGTAAAAATAAAAAGAAA
GATACAAAGATTACAGCACACACAGCAGCCAGCCACAGACATGTGCACACAAAAACTACA
GCGAGCAATGTAAAGAAACTAAGCCAGAGATTCGATTCATCAATCAAAGCTAACAAGCTT
GTACAATCGCCAAAGAAGAACAGCTTGCAGCGTCGCCTGTCGCGAAGTGCGACGAAATTT
GCTGCAAATTTCCAAAATAATTCACAAAACAACGCCCAGCCCGTCAACGTTCATACTCTG
TCGAACTCTGACGACATGAGCTCCAGCCTCCTCAGAAGTGATTCCGATCGTAGCCCTTCA
GGCAACATAAACCCCTCGGTAAATTTCTTCGGAAACTTCAAAGGATTCTCACTCACCCCG
ATGGATAAGAACTCACAAAACGAAACTGATGTCAAAGATAAAAAAGATAATGTACAGAAA
AGCGCCAAAATTACACCCGTGCATCGAAGCGGTAGCAACAGTCAGAATATAGCACAGGGA
TCGAAGCCGATACTGAGATCAGCACCACCGCTGCCTGTGGTTCCGAACACGGCTAAACTA
AGCCCGAAAACCAGCCCATCCATCAAAAGAACGAACAGCTCCGTTCAGAATCGCATTAAA
GCTTTCATGGGAACAGAAAAGGCTGAAGAAATACCCGTGAACACAGCTCCGAGACCGACA
ATATCTAGTCCCATTCTGGAAGCATCGACGTGTACAGCGAAAGAACTCATCTCTCCTCTC
CAGGGTTCCAAAACCTTGGGTCCTGTCCGCGCCGCTCCTACCGTTCCTAACTTCTCTCCG
GACTTACCGAAGAGGCCGTTAAGCATGCACTCAGCGGGAAATGTACCACAGAAACCACTG
CCGGAAGAACCGAAGAAAGTTAAAGAAGGCATATCTCTCAATAGGATTGCGTCGTTCCTG
AAACAAGATAAACCAAAAGAAAAAGATAGGAACCCTGTGGAGAGAAGCCATTCGCTACCC
AAAAATGGTAACAACCAATTAAAAGTCAAAACCGGTGATAAAGTCGCACTGCGCAATTTG
CAAATATCTGGTCCTATTTTGCAAAAGGAAATAGATTTACCTGTTACTACTGTCCCAGTC
GTTTCGGATTCAGAAGAAGCCGACGATTCGAAGGCCTTCGTAAACAGAGCGCAGAGTATG
CGAGCACCTGCCAGCCAAAAGCCAGTCCTACAAAGCTTTGCATCGATGAGACAAGCGCCA
GGTGTACCGCGGCCCTTATCGTGTGTGGGAAGACCAACAGCGCCCCCTCCCCCGTTACCA
TCACAACCGAAAAACGAAGAACAATCCATTTACCAAAATCCGAAAGTGCAAAATGACATC
AAATCGACTGATTATGTTGATTGCATAGAGGAAAAACAGGTCCCATTGGCGCACATCGAT
GAAGAATCTGGGGACAATATTTACGCCATCATAGAAGAAAGTCCCGAAAAGCATTTCAAA
CCGATGCCGGGACGTCCTCCTAAATCTACACAGGCGCCGTTCGAAGAATACAATGTACCA
AAACCTATTACTTCCAATTCGGGAAGCTCTGAAAGTTTAGGTCTACTCGGAGAAATTGTC
AATGAAATACAAAACCGCAATTTCGACTCCATTTACTGCACCAATTCTTTGGCGAGAATG
AAAGATAAGAACAAAAATACGGATAGTAACAGAGATAGCACGTACATGAACACAGACTAT
AAAAGCCCGGAGAGCGTTTACAGCAACTCTGAGACAAAATCTAGTGCAGCCTCCACAACT
AGCAGCGGCTACCTTCACCCGTCCGCCGTGAACGTACCGACTTACATGCAAAAAGACAGC
GATGAACTAGAAATTGAAAAGCCTCCGTCCCCCACATTAAAAACTAATTCGAAAATACCT
ACGTTTACCAGACAAGTCACCCCGCCGGGGTTAAGAACTTTCAAAAATATACCGCAATCA
CCGAAGACGACAACGAGGAGTAATCTTAAAACGATTCCGAACAGTCCCGACCTAGTATCG
AGCTGTGCTGTTCCCGAAACACAGAATGCTAAAGCTCCGGATGTTATAAACAATAATAAA
ACAGAACCACCTAAATTAGCGACTAAACCGAATACGACCAAGACGACCGATAACCGACCC
CCACTCAAACCGGTTCCGTCGGAGAAGAAACCTAACGTCAAACCAACACCGGTCCCAAAA
ACTAATTCTGCCTTAAGTATGAACAAAACAGATAAAAATCCTCCCCTCAACAGAACCACT
TCTAAGACAGACTCCAACGTTAAGGCGATAGCTGACAGTTTGAACAAAAATCGACCAAAA
ATTGTCCCAAAGCCTAACAACATACAGAAGACTGAAGCTGTGAAAACAAACGCTACCAAA
TTATCAGCGAAACCGTCAAACGTTGCAAGTTTGCAGCAAAAATTTGAAAACAGGAAGTCA
TTAGGAAAAGAAATAAGTGTCAAAAAATAA

Protein sequence:

MLSKYAFYYIGSPAPEFSRHRLVRPIIQHARTKREITSTRHTEGIHHPELVMKMNFDGRE
HVLDLRLNEDLITKDHVIAYQKDGETVIHRPTLKELDICQYSGKVRDKKESWVAVSTCDG
VRGIIHDGQTMRYIEPADRNEIDSQHYLYEHSDLNTDFHCGYSGGITTNDTYDPELMKRH
MHSRNVEKSRISRYKRDAYEDTEVRGPFKVNKLSRFVELVLVADNREFRANGESKETVHR
QLKDVANIINSVYTPLNIFIALVGVVVWNERDEIRLEEDGDKTLTEFLHYRKRLLPVMPN
DNAHLLTRQKFKDGVVGKALKGPICTYNFSGGVATNHSEVIGLVATTIAHEMGHNFGMEH
DTEADCECPDEKCIMSPSSTSVTPTKWSSCSLRSLALAFERGMDYCLRNKPKRLFEPSTC
GNGFIEPGEQCDCGLAGDPACTACCDPRACVLRSNATCAAGECCDTTTCRPKPAGTVCRA
ADKECDLAEYCSGHSEYCPRDVYKMDATPCGGGKAYCAGGSCRTHTDQCRLLWGFSGENS
DVQCYTNSNTKGDRKGNCGYHREDPPVYYKCSKEDSLCGLLQCRHLNERLEFGMESVSTL
SAVFINNNGTIIPCRTAMVDMGTSDPDPGFVPDGAKCGDDKMCMKHRCVSIAEVTSEIAR
KETSVCPSNCSGHGVCNSEGHCHCDSGFAPPLCELPGPGGSVDSGPATDASIQRNFMVAM
YIIFLGILPSVLLVMLLMYYSRHNVLLCWKKPKKSYVNNIFNGDRFKRFKTSTDSFVRLI
SFRRTQKKNMCRKCQDDIYSNICEHKENIDSTWSFNISSKIINMLNSNKHDESKKFQRKI
NKDDIKVADDLDLANVRVKVEPKINKSNIVIVKTGLASTTNEHVKAEINTTKQEVSLDRN
KTKKNIVVTNKINTDVIYENCIPNKVKGSALRVESKNKKKDTKITAHTAASHRHVHTKTT
ASNVKKLSQRFDSSIKANKLVQSPKKNSLQRRLSRSATKFAANFQNNSQNNAQPVNVHTL
SNSDDMSSSLLRSDSDRSPSGNINPSVNFFGNFKGFSLTPMDKNSQNETDVKDKKDNVQK
SAKITPVHRSGSNSQNIAQGSKPILRSAPPLPVVPNTAKLSPKTSPSIKRTNSSVQNRIK
AFMGTEKAEEIPVNTAPRPTISSPILEASTCTAKELISPLQGSKTLGPVRAAPTVPNFSP
DLPKRPLSMHSAGNVPQKPLPEEPKKVKEGISLNRIASFLKQDKPKEKDRNPVERSHSLP
KNGNNQLKVKTGDKVALRNLQISGPILQKEIDLPVTTVPVVSDSEEADDSKAFVNRAQSM
RAPASQKPVLQSFASMRQAPGVPRPLSCVGRPTAPPPPLPSQPKNEEQSIYQNPKVQNDI
KSTDYVDCIEEKQVPLAHIDEESGDNIYAIIEESPEKHFKPMPGRPPKSTQAPFEEYNVP
KPITSNSGSSESLGLLGEIVNEIQNRNFDSIYCTNSLARMKDKNKNTDSNRDSTYMNTDY
KSPESVYSNSETKSSAASTTSSGYLHPSAVNVPTYMQKDSDELEIEKPPSPTLKTNSKIP
TFTRQVTPPGLRTFKNIPQSPKTTTRSNLKTIPNSPDLVSSCAVPETQNAKAPDVINNNK
TEPPKLATKPNTTKTTDNRPPLKPVPSEKKPNVKPTPVPKTNSALSMNKTDKNPPLNRTT
SKTDSNVKAIADSLNKNRPKIVPKPNNIQKTEAVKTNATKLSAKPSNVASLQQKFENRKS
LGKEISVKK