New model in OGS2.0 | DPOGS208618  |
---|---|
Genomic Position | scaffold862:- 42340-53804 |
See gene structure | |
CDS Length | 2571 |
Paired RNAseq reads   | 1146 |
Single RNAseq reads   | 2901 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005722 (7e-108) |
Best Drosophila hit   | neprilysin 3 (0.0) |
Best Human hit | endothelin-converting enzyme 1 isoform 4 (2e-140) |
Best NR hit (blastp)   | AGAP007796-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP007796-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity GO:0008237 metallopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR018497 Peptidase M13, neprilysin, C-terminal IPR008753 Peptidase M13 IPR000718 Peptidase M13, neprilysin |
Orthology group | MCL11713 |
Nucleotide sequence:
ATGTTAGCTCTGTATTTGGGAGTTTCCTTCATAGGTTGGCATCAAAGAACGAAGTTGGAA
AGAATTCTTCTCGTTATAACAGCTTTTCTTCTGGTGGCTATTCTTCTTACCACCTGTTTA
TTGCTGACCATCAAGGGACCGTCCTATATACCTGTGCCACCATGTTACAACCCTGAACCC
AAAGAAGAGAGCAGCGTATGTTTATCAGGTTCATGTATTTACACAGCCAGCGAGGTTATT
AGAGCTTTGGATGAGACACAGGATCCCTGTGAGGATTTCTACGATTTCGCATGTGGTGGA
TGGTTGAAAAACAATCCCATACCAGAAGGGAAATCGAGCTGGGGTATATTCAGCAAGATT
GAACTACAGAATCAGCTAATTATTCGCTCGGCAATCGAAAAAGTTAACGTATCTGATAAG
AACAGCGCTGAAACAAAAGCAAGAATATACTACGACGCGTGCATAGACGGGAATGAGACG
ATAGAGAAACTGGGTGAGAAACCTCTCATCAGTGTGATAAAGAAGCTGGGGGGGTGGCAT
CTTGTAACGAACACGCTGGTTAAGCAGAGGAAATGGGATCTACAGAGACTCCTACAGGAT
GTTCAGAATACTTATAATCTGGGAGGATTCTTCAATTGGGCGGTTACGGAAGACGATAGA
AATTCCTCAAAACACGTCATTGTGCTAGATCAAGGCGGACTAAATCTACCGACACGAGAC
AATTACCTAAACGCTACAGCCCACAAGAAAGTACTGGACGCCTATCTGGATTATATGACA
AAGATATGCACATTACTGGGAGCTAACGAAACAGAAGCTAGGGCACAAATGTCGAAGGTT
ATACAGTTTGAGACGGAACTCGCGAATATAACCATCCCATCCGAGGATAGGAGGGATGAA
GAGGGATTGTACAATCCGTATACCGTGAAGCAGTGGCAGAGGGAGGCGCCGTTCTTGAAC
TGGTCGATGTTCTTCAACGACGCCTTCAAACTCGTCAATAGGAGTATATCGGATAACGAG
AGAATAGTTGTCTACGCGCCGGAATATTTCAGAAATTTAACAAGACTAGTAAGAAAATAC
AGCAAGAGTGAAGAGGATCAGAAAACACTGACGAGTTACATGATGTGGCAAGTGTCCCGT
TCTTTATCGTCGTATTTGTCCAAATCTTTCCGTGACGCGACCAAAATATTGAGGAAGGCG
CTGTTTGGATCCGAGGGCACCGAGGAGTCCTGGAGATACTGCGTCACGGATACCAACAAC
GCTGTTGGCTTCGCTGTCGGCGCGATGTTTGTGCGCGAAGTGTTCCATGGTGAGGCGAAG
ACTCAGGGCGAGATCATGATAGACAACATCCGAGCGGCTTTCAAGAAGAATTTGAAGAAT
CTCATCTGGATGGACGAAGAGACGAGAGATGCTGCGGAGATTAAGGCGGATGCTATCACT
GATATGATAGGTTTCCCCGACTACATACTGAACAAAGACGAGCTGGACAAGCAGTACGAG
GAGCTGGACGTAAGACCGAACAAGTACTTCGAGAACAACATCGCCTTCAACACGTACAGC
CTGAAACATGATCTAAGGAAATTGGATAAACCCGTCAATAAAACTAAATGGGGCATGACA
CCGTCCACTGTGAACGCGTATTACACGCCCACCAAAAACCAGATAGTATTCCCCGCTGGT
ATTCTCCAACTGCCGTTCTATGATGGAGATAATCCCAAGAGCGTGAACTACGGAGCGATG
GGCGTTGTTATGGGCCACGAGTTAACCCACGCGTTCGACGACCAAGGACGAGAATACGAT
AGATTCGGCAATTTGAACCGTTGGTGGAACAACGCTACCATAGCACGTTTCAAGCAAAGG
ACTCAATGCATTCAGAAACAGTATTCAACATACGAGATCGAAGGCCAGCATTTGAATGGA
AAACAAACTCTCGGCATGACACCGTCCACTGTGAACGCGTATTACACGCCCACCAAAAAC
CAGATAGTATTCCCCGCTGGTATTCTCCAACTGCCGTTCTATGATGGAGATAATCCCAAG
AGCGTGAACTACGGAGCGATGGGCGTTGTTATGGGCCACGAGTTAACCCACGCGTTCGAC
GACCAAGGACGAGAATACGATAGATTCGGCAATTTGAACCGTTGGTGGAACAACGCTACC
ATAGCACGTTTCAAGCAAAGGACTCAATGCATTCAGAAACAGTATTCAACATACGAGATC
GAAGGCCAGCATTTGAATGGAAAACAAACTCTCGGCGAGAATATAGCAGACAACGGAGGT
TTAAAGGCGTCGTTCCACGCTTATAAGGAGTACAGTAAAAACTCCAAAGTTAACCTCACT
TTACCTGGATTGAAGTACAACCACAGACAATTGTTCTTCATATCTTTCGCTCAGGTATGG
TGTTCAGCAATGACAAAGGAGTCGACGAAAATGCAAATCGAAAAGGACGATCACACCGTG
GCCAAGTATAGAGTCATTGGACCAATATCGAACCTTCGAGAATTCTCTGAAGAATTCAAT
TGTCCCGTAGGAAGTAAAATGAACCCAAAACATAAATGCGAGGTATGGTAA
Protein sequence:
MLALYLGVSFIGWHQRTKLERILLVITAFLLVAILLTTCLLLTIKGPSYIPVPPCYNPEP
KEESSVCLSGSCIYTASEVIRALDETQDPCEDFYDFACGGWLKNNPIPEGKSSWGIFSKI
ELQNQLIIRSAIEKVNVSDKNSAETKARIYYDACIDGNETIEKLGEKPLISVIKKLGGWH
LVTNTLVKQRKWDLQRLLQDVQNTYNLGGFFNWAVTEDDRNSSKHVIVLDQGGLNLPTRD
NYLNATAHKKVLDAYLDYMTKICTLLGANETEARAQMSKVIQFETELANITIPSEDRRDE
EGLYNPYTVKQWQREAPFLNWSMFFNDAFKLVNRSISDNERIVVYAPEYFRNLTRLVRKY
SKSEEDQKTLTSYMMWQVSRSLSSYLSKSFRDATKILRKALFGSEGTEESWRYCVTDTNN
AVGFAVGAMFVREVFHGEAKTQGEIMIDNIRAAFKKNLKNLIWMDEETRDAAEIKADAIT
DMIGFPDYILNKDELDKQYEELDVRPNKYFENNIAFNTYSLKHDLRKLDKPVNKTKWGMT
PSTVNAYYTPTKNQIVFPAGILQLPFYDGDNPKSVNYGAMGVVMGHELTHAFDDQGREYD
RFGNLNRWWNNATIARFKQRTQCIQKQYSTYEIEGQHLNGKQTLGMTPSTVNAYYTPTKN
QIVFPAGILQLPFYDGDNPKSVNYGAMGVVMGHELTHAFDDQGREYDRFGNLNRWWNNAT
IARFKQRTQCIQKQYSTYEIEGQHLNGKQTLGENIADNGGLKASFHAYKEYSKNSKVNLT
LPGLKYNHRQLFFISFAQVWCSAMTKESTKMQIEKDDHTVAKYRVIGPISNLREFSEEFN
CPVGSKMNPKHKCEVW