New model in OGS2.0 | DPOGS209790  |
---|---|
Genomic Position | scaffold3190:- 24801-27964 |
CDS Length | 1251 |
Paired RNAseq reads   | 2065 |
Single RNAseq reads   | 5190 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008015 (6e-149) |
Best Drosophila hit   | CG40470 (5e-86) |
Best Human hit | thyrotropin-releasing hormone-degrading ectoenzyme (1e-27) |
Best NR hit (blastp)   | protease m1 zinc metalloprotease [Culex quinquefasciatus] (1e-124) |
Best NR hit (blastx)   | protease m1 zinc metalloprotease [Culex quinquefasciatus] (7e-123) |
GeneOntology terms    | GO:0008237 metallopeptidase activity; GO:0006508 proteolysis; GO:0008270 zinc ion binding |
InterPro families    | IPR014782 Peptidase M1, membrane alanine aminopeptidase, N-terminal; IPR001930 Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase |
Orthology group | MCL16944 |
Nucleotide sequence:
ATGTTCCCATGTTTTGATGAACCTGGATACAAAACTCCCTTTGAACTGAGCGTCGTGCGA
CCGAGGGATATGGTAGCACTTAGCAATGTTCCTGTCGCTAGGACAGAAGATATTAACGAT
GAACCAAACGCCGTCTGGGATCATTTCGAGAGGACTCCCCCAATGTCAACGTTCACGCTC
GGCCTTGTCATCGCTGACCTCAAACAATTTGGCAGTGCCATACATTATGAAGACGAAAAT
GGAAACAATATTGAAATACGTGTTTGGGGTCGTCCAGAATTTGTAGAAATGTTAGAAGGT
CTCAATGAGAAAGTGGCTCAAGTGTTTTCTGAAGTCGCAAACTTCTGGCAAGTTCCGCTA
CCATTACGCAGATTGGACATAGTGGCTTTACCAAACTATCAAGGGGTAAAGCCCGCCGAT
AATTGGGGTTTGATAGTTTTTAAGGAAAGCGATTTGTCCTCACGAGGCTACTTGCAGCTG
TCCCAGGAGTTGTCCTACCAGTGGCTAGGCGCTCTCGTCTCTCCAGCTTGGTGGAGCGAT
GCTCATCTCAACAAAGCACTAGTTGGATACCTCGCTGCGGAGATTGCATTTAAAATTAAC
AATGGTTCAGAGATGGAAGGAAAATGGCCGATGACGGTTCTTTATTCTCTGTACTACGAG
TTCAGTAAACGATATCCACACTCTCGGATCACCGGCATGAAACAAGAGACGGCTTGCACC
AAAATAGAGTTATTGTTCCGAATGTTCAATTATACTATTGGTGGAGACACTTTCAGAAAA
GGAATGAGGAAATTCATTGAGTCAAGGAAGTTTAAGACTTTTACTGGTGATGATATTTGG
AATGCCCTCAACGAAGCCGCATTAGCAGATGGCAAGATTCCGAAAGATATTAATATTAAA
ACAGTAGCCACCAGTTGGATAGAAAAAGACAGACTTCCAGTCATTACAGTTAAGAGGAAT
TACGAAACCAATACGGCTTTTGTAACTCAGAAAGTGTATCTTCGCGAACGTCCCCACGAT
CTGCCTTCATCTAATAAGATGTTGTGGAGCGCTCCGCTGGTCGTGTGTCGTTCTGACAGA
CTCTCCTTCGAAGACTTCACGCCTTCCTCCTGGATCAGACACACAGACCTCAACCTGCTC
AACATGCCGGATGACAAGCACTTCATCATCGTCAACCCTGAAGAAATTGGTAAGCGAAAA
TCTGTGAAGGGAGACGAAATCATCCACCGTGTGGGTGCCGGTTCCATCTGA
Protein sequence:
MFPCFDEPGYKTPFELSVVRPRDMVALSNVPVARTEDINDEPNAVWDHFERTPPMSTFTL
GLVIADLKQFGSAIHYEDENGNNIEIRVWGRPEFVEMLEGLNEKVAQVFSEVANFWQVPL
PLRRLDIVALPNYQGVKPADNWGLIVFKESDLSSRGYLQLSQELSYQWLGALVSPAWWSD
AHLNKALVGYLAAEIAFKINNGSEMEGKWPMTVLYSLYYEFSKRYPHSRITGMKQETACT
KIELLFRMFNYTIGGDTFRKGMRKFIESRKFKTFTGDDIWNALNEAALADGKIPKDINIK
TVATSWIEKDRLPVITVKRNYETNTAFVTQKVYLRERPHDLPSSNKMLWSAPLVVCRSDR
LSFEDFTPSSWIRHTDLNLLNMPDDKHFIIVNPEEIGKRKSVKGDEIIHRVGAGSI
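
Consistency check: the listed CDS length (1251 nt) and the protein sequence above (416 residues) agree if the CDS ends in a stop codon, since 1251 / 3 = 417 codons = 416 residues + 1 stop. The sketch below is only an illustration of that arithmetic. The function name `cds_matches_protein` is hypothetical and not part of any database tool, and the values passed in are read off this record by hand (first codon ATG, last codon TGA).

```python
# Minimal sanity check for a CDS/protein pair, assuming the CDS
# includes its terminal stop codon (as it appears to in this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def cds_matches_protein(cds_length, protein_length, first_codon, last_codon):
    """Return True if the CDS length, start codon, and stop codon
    are consistent with a protein of the given length."""
    return (
        cds_length % 3 == 0                         # whole number of codons
        and cds_length // 3 == protein_length + 1   # residues plus one stop
        and first_codon == "ATG"                    # canonical start codon
        and last_codon in STOP_CODONS               # standard-code stop codon
    )

# Values taken from this record: 1251 nt CDS, 416-residue protein.
print(cds_matches_protein(1251, 416, "ATG", "TGA"))
```

The same check can be run on any other record by substituting its CDS length, protein length, and terminal codons.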