New model in OGS2.0 | DPOGS206492  |
---|---|
Genomic Position | scaffold820:+ 54483-69238 |
CDS Length | 3666 |
Paired RNAseq reads   | 1153 |
Single RNAseq reads   | 2627 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002519 (1e-65) |
Best Drosophila hit   | tolkin, isoform A (0.0) |
Best Human hit | tolloid-like protein 2 precursor (0.0) |
Best NR hit (blastp)   | bone morphogenetic protein [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to tolkin CG6863-PA, isoform A [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity; GO:0005509 calcium ion binding; GO:0008270 zinc ion binding; GO:0006508 proteolysis; GO:0008586 imaginal disc-derived wing vein morphogenesis; GO:0008045 motor axon guidance; GO:0007415 defasciculation of motor neuron axon; GO:0007411 axon guidance; GO:0051605 protein maturation by peptide bond cleavage |
InterPro families    | IPR001506 Peptidase M12A, astacin; IPR000859 CUB; IPR006026 Peptidase, metallopeptidase; IPR001881 EGF-like calcium-binding; IPR006210 Epidermal growth factor-like; IPR015446 Bone morphogenetic protein 1/tolloid-like protein; IPR013091 EGF calcium-binding; IPR000152 EGF-type aspartate/asparagine hydroxylation site; IPR013032 EGF-like region, conserved site; IPR018097 EGF-like calcium-binding, conserved site; IPR000742 Epidermal growth factor-like, type 3 |
Orthology group | MCL10409 |

Nucleotide sequence:
TTCAATGAAGTTACTGTAATAACTAAGGTGATTCCACCGGGCTTAGATAAATCGATATTG
ATGAAAAATGGTAAAATACCACCCGGCGAAGATTTAGATAGTATTCACATGGGCAATCAG
AGCAGGGATAGTCATTTAAACCAATCAAGTGTAACTGACAACTTAACTGAATCTGAAAGT
CTTGAACAGGATGGCGTGTTAGTTGTTAATGTCTCAACTGATAATATATTGAATCTAGAT
GAATTTTATCCTAATATACAACTGTCAAACTTTTCAAACAAAATGGATGAAGTTACAAAC
AAAATACAAAGCAACAGTCGTGAAATAACAGGGGACAATAATAACTTTAAACTTAAAGAC
TTACCAGAGAACATGACGGTTGCTCCTGCTCCTACGAACGGTGTTATATTTACAAATGAA
GATGACAATGAGAGTCCCATAAAGGCTACTGAATTAAATCAAGAGCTCAATGCAAATTAT
AATATCAATGATCTACTAAAACCCACAGAAAGTTTACAAAGTGTTGTTAATCTTTCAAGC
CATAAAAGACGACGAAGAAGAAGACACGGGAACGGGAGACGCTTAAGAAATATTCAATCA
AGTGAACGAAGAGGTCACAATGCTGTGAAAAATTCGGGTGAGGAAAATGGATTAGAACTT
AAAAAATCAGTCCTGCGTCACGAGAAAGCAAACGATTTAAACCAACATGAACCAGCCATT
CTACTGCCTGAACATGAATTTTATGATAAATTCGAACTAAAATCTCCTCAAAAAAACGTA
TTCAATGAAACTAAGGACCAATCGTTTAATGGCTGGTTTTTTCGGGACTCTGAAGAAGAT
TTAGAAGCAAGCGAAACTCGTCACCACAGAAATCATTACTATAATCATACACAGATGAAA
AGGAGACATCGCACGGCCAGAGCCGCTACGAACAGGAAGGAGCGTATTTGGGAAAACGGT
GTCATTCCGTATGAAATCGACGGTAACTTCAGCGGCGCTCACAAATCTCTGTTCAAGCAG
GCTATGAGACATTGGGAGAATTTCACTTGCGTCAAATTCGTTGAAAGGGACGCTGAATTA
CATCGGGATTACATTGTGTTCACAGAACGACCGTGCGGATGTTGTTCATTCGTCGGAAAA
CGTGGCAACGGAGCCCAAGCGATATCGATCGGAAAGAACTGTGACAAGTTTGGGATTGTG
GTCCATGAGTTGGGACACGTGGTCGGCTTTTGGCACGAACATACTCGACCCGACCGAGAC
AGACATGTTCAAATAATCCGGGATAATATTATGACTGGGCAAGAGTATAATTTTAATAAG
CTAACAGAAGAAGAAGTAAATTCTTTGGGACAGACGTACGATTATGATTCAATCATGCAT
TATGCGAGAAACACTTTCAGTAAAGGGACGTTCTTAGATACGATTCTACCTCTTGAAGTT
CATGGGAAGAAGAGACCTGAGATAGGACAGAGAGTGAGATTGAGTGTGAGTGACATAGCT
CAGACTAACTTACTATATAAATGTGCAAAATGCGGAAAAACGTTCCTTGGTAACTCAGGC
TGGTTTAATTCTCCTGGCTGGGGTTCTGAAACACCACCAGAGACTCCGGAGAAATGCGAA
TGGAGGATAGTCGCCACTCACGGAGAACGAGTCGTCCTGAACATTACTGAGATAGATATT
CACAAAACGGATGGTTGTCGTTCGGAATGGGTGGAAGTCAGGGATGGTTATATGCCAAAC
GCACCAGTCCTTAGTCGTATCTGTGGTTCAGGGAAAGGACCAATGATGAGATCAACGGGA
TCCAGGCTAACGGTGGTCTACCAGCCGGGGACGAGGTCCAAACCTCACAGAGGATTTAGA
GCACATTACGAAGCTGTATGCGGCGGAGACATAGAAGTTGATAGTAGTGGTCATCTAGAG
TCACCGAACTATCCCGATGATTATCATCCAAATAAATTATGCATTTGGAGACTTTCCGTG
CCACAAGATTACCAAGTAGCATTACGATTCCATTCATTCGAAGTGGAAAACCATGACACC
TGCAATTATGATAAAGTAAAAGTAAGAGACGGAGACTCAATGGACAGTCCTCTGATAGGG
ATGTTTTGTGGACATAAGATTCCGCCTGACATAAGGTCAACATCAAACAAACTGCTCGTG
ATCTTCGAGTCTGACAGTTCGGTACAGAAAGCCGGTTTCTCCGCTACCTTCATGAAGGAA
TACGATGAATGCACCTCCATAGACCACGGCTGCAGTCACTCTTGCGTCAACACTCTCGGT
GGTTACGAATGCGCATGTGACATTGGCTATGAGTTGCATTCAGATGGAAAGAAATGCGAG
AATGCATGTGGCGGAGTGCTTTACGCTCCGAACGGTACAATAACCTCACCGTCTTTCCCG
GACTTGTATCCAGCATCCAAGAACTGTCTTTGGGAGATCGTAGCGCCGCCTCAACACAGA
ATCACTCTAAACTTCACTCATTTCGATTTAGAAGGCAGCAATAATATGTATCACCAGGAG
TGTGAATACGATAGTGTGACGGTCCATTCGCGACTTGGTGCTGACGTGTTACGGAGGCAC
GGCGCTTTCTGTGGTTCGGTCGTCCCGCCGCCTGTCACCTCAGACGGATCCGTGTTGCGA
GTACAGTTCACGTCGGACACATCCGTTCATCATTCGGGTTTCGCGGCAGCGTATTACATA
GACGTTGATGAGTGCGCAGACAATAATGGTGGCTGTGAACACGAGTGTCACAACACTCTC
GGCGGATATGAGTGCGCGTGTCACAGTGGGTTCACACTGCACCCTAACAAGCACGACTGT
AAGGAAGGCGGGTGCAAACATGACATCACGCACCCGCACGGAACCATTTTTAGTCCAAAC
TACCCAGACTTGTATCCATCACGGAAAGATTGCGTGTGGCAATTTTCTACCACCCCAGGG
CATCGTATCAAGCTCATATTTAACGTGTTTGAGTTGGAGCCGCATCAGGAATGCACGTAC
GACCACGTAACAATCTACGACGGAGCTTCAGCCGACGAAAAAACTTTGGGTAGATTCTGC
GGCAGCAAACTTCCGCATCCAGTGGTCGCGTCACAGAACCAGATGTACGTAGTGTTCAAA
TCCGACGCTTCTGTGCAGAGGAAGGGGTTCCTAGCTACTTATTCCACCGCTTGCGGGGGT
TACCTCTCGGCATCAGAGACAGTGAAGCACTTGTACTCCCACGCTAGATACGGGCATGAT
TCATACGAGTCGCGAGCTAACTGCGATTGGAGCATTGTGGCGCCATTGGGATATTTCGTA
CGACTTACATTCCTCACATTCGAGTTGGAACCGGAAGCTAATTGTGGTTATGACTTCGTT
CAAGTCTTTGGTGGTTTGGAAGGCAGTTCTGGTGATTACGGAAGCTTTTGTGGATCTAAG
ATGCCGCCACAAATAGTTTCTACAACAGAGGCTCTCCTACTGAGGTTCCGTACAGATGAT
TCTATAGTATTCAAAGGATTTTCTGCATCATACGAAGCTGTGAAACCTGACGTGTGGAGC
GGAGAAGATAGCTCCGAGGGCGGAGAAGATTTGGACGAAGAAGAAGACGAAGAAATGCCA
CCTCTAGTTTTAGGGAGGAGAGGTCTCCGCGCCCCTTTACCACGATTCGTTCGTCGGCCC
ACGTGA

Protein sequence:
FNEVTVITKVIPPGLDKSILMKNGKIPPGEDLDSIHMGNQSRDSHLNQSSVTDNLTESES
LEQDGVLVVNVSTDNILNLDEFYPNIQLSNFSNKMDEVTNKIQSNSREITGDNNNFKLKD
LPENMTVAPAPTNGVIFTNEDDNESPIKATELNQELNANYNINDLLKPTESLQSVVNLSS
HKRRRRRRHGNGRRLRNIQSSERRGHNAVKNSGEENGLELKKSVLRHEKANDLNQHEPAI
LLPEHEFYDKFELKSPQKNVFNETKDQSFNGWFFRDSEEDLEASETRHHRNHYYNHTQMK
RRHRTARAATNRKERIWENGVIPYEIDGNFSGAHKSLFKQAMRHWENFTCVKFVERDAEL
HRDYIVFTERPCGCCSFVGKRGNGAQAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRD
RHVQIIRDNIMTGQEYNFNKLTEEEVNSLGQTYDYDSIMHYARNTFSKGTFLDTILPLEV
HGKKRPEIGQRVRLSVSDIAQTNLLYKCAKCGKTFLGNSGWFNSPGWGSETPPETPEKCE
WRIVATHGERVVLNITEIDIHKTDGCRSEWVEVRDGYMPNAPVLSRICGSGKGPMMRSTG
SRLTVVYQPGTRSKPHRGFRAHYEAVCGGDIEVDSSGHLESPNYPDDYHPNKLCIWRLSV
PQDYQVALRFHSFEVENHDTCNYDKVKVRDGDSMDSPLIGMFCGHKIPPDIRSTSNKLLV
IFESDSSVQKAGFSATFMKEYDECTSIDHGCSHSCVNTLGGYECACDIGYELHSDGKKCE
NACGGVLYAPNGTITSPSFPDLYPASKNCLWEIVAPPQHRITLNFTHFDLEGSNNMYHQE
CEYDSVTVHSRLGADVLRRHGAFCGSVVPPPVTSDGSVLRVQFTSDTSVHHSGFAAAYYI
DVDECADNNGGCEHECHNTLGGYECACHSGFTLHPNKHDCKEGGCKHDITHPHGTIFSPN
YPDLYPSRKDCVWQFSTTPGHRIKLIFNVFELEPHQECTYDHVTIYDGASADEKTLGRFC
GSKLPHPVVASQNQMYVVFKSDASVQRKGFLATYSTACGGYLSASETVKHLYSHARYGHD
SYESRANCDWSIVAPLGYFVRLTFLTFELEPEANCGYDFVQVFGGLEGSSGDYGSFCGSK
MPPQIVSTTEALLLRFRTDDSIVFKGFSASYEAVKPDVWSGEDSSEGGEDLDEEEDEEMP
PLVLGRRGLRAPLPRFVRRPT
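
Consistency check: the CDS length of 3666 nt corresponds to 1222 codons, that is, 1221 residues plus a terminal stop codon, which agrees with the length of the protein sequence above. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The helper names `build_codon_table`, `translate`, and `expected_protein_length` are illustrative only and are not part of the database or of any library. Only the last 66 nt of the CDS are translated here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table():
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    return dict(zip(codons, AMINO_ACIDS))


CODON_TABLE = build_codon_table()


def translate(nucleotides):
    """Translate an in-frame nucleotide string; stop codons appear as '*'."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def expected_protein_length(cds_length, has_stop=True):
    """Number of residues encoded by a CDS of the given length."""
    codons, remainder = divmod(cds_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Last two lines of the nucleotide sequence in this record (60 + 6 nt).
CDS_TAIL = (
    "CCTCTAGTTTTAGGGAGGAGAGGTCTCCGCGCCCCTTTACCACGATTCGTTCGTCGGCCC"
    "ACGTGA"
)

print(translate(CDS_TAIL))            # PLVLGRRGLRAPLPRFVRRPT*
print(expected_protein_length(3666))  # 1221
```

The translated tail matches the final line of the protein sequence, followed by the stop codon TGA.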