DPGLEAN21163 in OGS1.0

New model in OGS2.0DPOGS206492 
Genomic Positionscaffold820:+ 54483-69238
See gene structure
CDS Length3666
Paired RNAseq reads  1153
Single RNAseq reads  2627
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002519 (1e-65)
Best Drosophila hit  tolkin, isoform A (0.0)
Best Human hittolloid-like protein 2 precursor (0.0)
Best NR hit (blastp)  bone morphogenetic protein [Aedes aegypti] (0.0)
Best NR hit (blastx)  PREDICTED: similar to tolkin CG6863-PA, isoform A [Apis mellifera] (0.0)
GeneOntology terms







  
GO:0004222 metalloendopeptidase activity
GO:0005509 calcium ion binding
GO:0008270 zinc ion binding
GO:0006508 proteolysis
GO:0008586 imaginal disc-derived wing vein morphogenesis
GO:0008045 motor axon guidance
GO:0007415 defasciculation of motor neuron axon
GO:0007411 axon guidance
GO:0051605 protein maturation by peptide bond cleavage
InterPro families









  
IPR001506 Peptidase M12A, astacin
IPR000859 CUB
IPR006026 Peptidase, metallopeptidase
IPR001881 EGF-like calcium-binding
IPR006210 Epidermal growth factor-like
IPR015446 Bone morphogenetic protein 1/tolloid-like protein
IPR013091 EGF calcium-binding
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR000742 Epidermal growth factor-like, type 3
Orthology groupMCL10409

Nucleotide sequence:

TTCAATGAAGTTACTGTAATAACTAAGGTGATTCCACCGGGCTTAGATAAATCGATATTG
ATGAAAAATGGTAAAATACCACCCGGCGAAGATTTAGATAGTATTCACATGGGCAATCAG
AGCAGGGATAGTCATTTAAACCAATCAAGTGTAACTGACAACTTAACTGAATCTGAAAGT
CTTGAACAGGATGGCGTGTTAGTTGTTAATGTCTCAACTGATAATATATTGAATCTAGAT
GAATTTTATCCTAATATACAACTGTCAAACTTTTCAAACAAAATGGATGAAGTTACAAAC
AAAATACAAAGCAACAGTCGTGAAATAACAGGGGACAATAATAACTTTAAACTTAAAGAC
TTACCAGAGAACATGACGGTTGCTCCTGCTCCTACGAACGGTGTTATATTTACAAATGAA
GATGACAATGAGAGTCCCATAAAGGCTACTGAATTAAATCAAGAGCTCAATGCAAATTAT
AATATCAATGATCTACTAAAACCCACAGAAAGTTTACAAAGTGTTGTTAATCTTTCAAGC
CATAAAAGACGACGAAGAAGAAGACACGGGAACGGGAGACGCTTAAGAAATATTCAATCA
AGTGAACGAAGAGGTCACAATGCTGTGAAAAATTCGGGTGAGGAAAATGGATTAGAACTT
AAAAAATCAGTCCTGCGTCACGAGAAAGCAAACGATTTAAACCAACATGAACCAGCCATT
CTACTGCCTGAACATGAATTTTATGATAAATTCGAACTAAAATCTCCTCAAAAAAACGTA
TTCAATGAAACTAAGGACCAATCGTTTAATGGCTGGTTTTTTCGGGACTCTGAAGAAGAT
TTAGAAGCAAGCGAAACTCGTCACCACAGAAATCATTACTATAATCATACACAGATGAAA
AGGAGACATCGCACGGCCAGAGCCGCTACGAACAGGAAGGAGCGTATTTGGGAAAACGGT
GTCATTCCGTATGAAATCGACGGTAACTTCAGCGGCGCTCACAAATCTCTGTTCAAGCAG
GCTATGAGACATTGGGAGAATTTCACTTGCGTCAAATTCGTTGAAAGGGACGCTGAATTA
CATCGGGATTACATTGTGTTCACAGAACGACCGTGCGGATGTTGTTCATTCGTCGGAAAA
CGTGGCAACGGAGCCCAAGCGATATCGATCGGAAAGAACTGTGACAAGTTTGGGATTGTG
GTCCATGAGTTGGGACACGTGGTCGGCTTTTGGCACGAACATACTCGACCCGACCGAGAC
AGACATGTTCAAATAATCCGGGATAATATTATGACTGGGCAAGAGTATAATTTTAATAAG
CTAACAGAAGAAGAAGTAAATTCTTTGGGACAGACGTACGATTATGATTCAATCATGCAT
TATGCGAGAAACACTTTCAGTAAAGGGACGTTCTTAGATACGATTCTACCTCTTGAAGTT
CATGGGAAGAAGAGACCTGAGATAGGACAGAGAGTGAGATTGAGTGTGAGTGACATAGCT
CAGACTAACTTACTATATAAATGTGCAAAATGCGGAAAAACGTTCCTTGGTAACTCAGGC
TGGTTTAATTCTCCTGGCTGGGGTTCTGAAACACCACCAGAGACTCCGGAGAAATGCGAA
TGGAGGATAGTCGCCACTCACGGAGAACGAGTCGTCCTGAACATTACTGAGATAGATATT
CACAAAACGGATGGTTGTCGTTCGGAATGGGTGGAAGTCAGGGATGGTTATATGCCAAAC
GCACCAGTCCTTAGTCGTATCTGTGGTTCAGGGAAAGGACCAATGATGAGATCAACGGGA
TCCAGGCTAACGGTGGTCTACCAGCCGGGGACGAGGTCCAAACCTCACAGAGGATTTAGA
GCACATTACGAAGCTGTATGCGGCGGAGACATAGAAGTTGATAGTAGTGGTCATCTAGAG
TCACCGAACTATCCCGATGATTATCATCCAAATAAATTATGCATTTGGAGACTTTCCGTG
CCACAAGATTACCAAGTAGCATTACGATTCCATTCATTCGAAGTGGAAAACCATGACACC
TGCAATTATGATAAAGTAAAAGTAAGAGACGGAGACTCAATGGACAGTCCTCTGATAGGG
ATGTTTTGTGGACATAAGATTCCGCCTGACATAAGGTCAACATCAAACAAACTGCTCGTG
ATCTTCGAGTCTGACAGTTCGGTACAGAAAGCCGGTTTCTCCGCTACCTTCATGAAGGAA
TACGATGAATGCACCTCCATAGACCACGGCTGCAGTCACTCTTGCGTCAACACTCTCGGT
GGTTACGAATGCGCATGTGACATTGGCTATGAGTTGCATTCAGATGGAAAGAAATGCGAG
AATGCATGTGGCGGAGTGCTTTACGCTCCGAACGGTACAATAACCTCACCGTCTTTCCCG
GACTTGTATCCAGCATCCAAGAACTGTCTTTGGGAGATCGTAGCGCCGCCTCAACACAGA
ATCACTCTAAACTTCACTCATTTCGATTTAGAAGGCAGCAATAATATGTATCACCAGGAG
TGTGAATACGATAGTGTGACGGTCCATTCGCGACTTGGTGCTGACGTGTTACGGAGGCAC
GGCGCTTTCTGTGGTTCGGTCGTCCCGCCGCCTGTCACCTCAGACGGATCCGTGTTGCGA
GTACAGTTCACGTCGGACACATCCGTTCATCATTCGGGTTTCGCGGCAGCGTATTACATA
GACGTTGATGAGTGCGCAGACAATAATGGTGGCTGTGAACACGAGTGTCACAACACTCTC
GGCGGATATGAGTGCGCGTGTCACAGTGGGTTCACACTGCACCCTAACAAGCACGACTGT
AAGGAAGGCGGGTGCAAACATGACATCACGCACCCGCACGGAACCATTTTTAGTCCAAAC
TACCCAGACTTGTATCCATCACGGAAAGATTGCGTGTGGCAATTTTCTACCACCCCAGGG
CATCGTATCAAGCTCATATTTAACGTGTTTGAGTTGGAGCCGCATCAGGAATGCACGTAC
GACCACGTAACAATCTACGACGGAGCTTCAGCCGACGAAAAAACTTTGGGTAGATTCTGC
GGCAGCAAACTTCCGCATCCAGTGGTCGCGTCACAGAACCAGATGTACGTAGTGTTCAAA
TCCGACGCTTCTGTGCAGAGGAAGGGGTTCCTAGCTACTTATTCCACCGCTTGCGGGGGT
TACCTCTCGGCATCAGAGACAGTGAAGCACTTGTACTCCCACGCTAGATACGGGCATGAT
TCATACGAGTCGCGAGCTAACTGCGATTGGAGCATTGTGGCGCCATTGGGATATTTCGTA
CGACTTACATTCCTCACATTCGAGTTGGAACCGGAAGCTAATTGTGGTTATGACTTCGTT
CAAGTCTTTGGTGGTTTGGAAGGCAGTTCTGGTGATTACGGAAGCTTTTGTGGATCTAAG
ATGCCGCCACAAATAGTTTCTACAACAGAGGCTCTCCTACTGAGGTTCCGTACAGATGAT
TCTATAGTATTCAAAGGATTTTCTGCATCATACGAAGCTGTGAAACCTGACGTGTGGAGC
GGAGAAGATAGCTCCGAGGGCGGAGAAGATTTGGACGAAGAAGAAGACGAAGAAATGCCA
CCTCTAGTTTTAGGGAGGAGAGGTCTCCGCGCCCCTTTACCACGATTCGTTCGTCGGCCC
ACGTGA

Protein sequence:

FNEVTVITKVIPPGLDKSILMKNGKIPPGEDLDSIHMGNQSRDSHLNQSSVTDNLTESES
LEQDGVLVVNVSTDNILNLDEFYPNIQLSNFSNKMDEVTNKIQSNSREITGDNNNFKLKD
LPENMTVAPAPTNGVIFTNEDDNESPIKATELNQELNANYNINDLLKPTESLQSVVNLSS
HKRRRRRRHGNGRRLRNIQSSERRGHNAVKNSGEENGLELKKSVLRHEKANDLNQHEPAI
LLPEHEFYDKFELKSPQKNVFNETKDQSFNGWFFRDSEEDLEASETRHHRNHYYNHTQMK
RRHRTARAATNRKERIWENGVIPYEIDGNFSGAHKSLFKQAMRHWENFTCVKFVERDAEL
HRDYIVFTERPCGCCSFVGKRGNGAQAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRD
RHVQIIRDNIMTGQEYNFNKLTEEEVNSLGQTYDYDSIMHYARNTFSKGTFLDTILPLEV
HGKKRPEIGQRVRLSVSDIAQTNLLYKCAKCGKTFLGNSGWFNSPGWGSETPPETPEKCE
WRIVATHGERVVLNITEIDIHKTDGCRSEWVEVRDGYMPNAPVLSRICGSGKGPMMRSTG
SRLTVVYQPGTRSKPHRGFRAHYEAVCGGDIEVDSSGHLESPNYPDDYHPNKLCIWRLSV
PQDYQVALRFHSFEVENHDTCNYDKVKVRDGDSMDSPLIGMFCGHKIPPDIRSTSNKLLV
IFESDSSVQKAGFSATFMKEYDECTSIDHGCSHSCVNTLGGYECACDIGYELHSDGKKCE
NACGGVLYAPNGTITSPSFPDLYPASKNCLWEIVAPPQHRITLNFTHFDLEGSNNMYHQE
CEYDSVTVHSRLGADVLRRHGAFCGSVVPPPVTSDGSVLRVQFTSDTSVHHSGFAAAYYI
DVDECADNNGGCEHECHNTLGGYECACHSGFTLHPNKHDCKEGGCKHDITHPHGTIFSPN
YPDLYPSRKDCVWQFSTTPGHRIKLIFNVFELEPHQECTYDHVTIYDGASADEKTLGRFC
GSKLPHPVVASQNQMYVVFKSDASVQRKGFLATYSTACGGYLSASETVKHLYSHARYGHD
SYESRANCDWSIVAPLGYFVRLTFLTFELEPEANCGYDFVQVFGGLEGSSGDYGSFCGSK
MPPQIVSTTEALLLRFRTDDSIVFKGFSASYEAVKPDVWSGEDSSEGGEDLDEEEDEEMP
PLVLGRRGLRAPLPRFVRRPT