New model in OGS2.0 | DPOGS215724  |
---|---|
Genomic Position | scaffold788:- 52490-57656 |
See gene structure | |
CDS Length | 1398 |
Paired RNAseq reads   | 281 |
Single RNAseq reads   | 952 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005815 (1e-06) |
Best Drosophila hit   | CG12130, isoform B (7e-75) |
Best Human hit | peptidyl-glycine alpha-amidating monooxygenase isoform e preproprotein (9e-68) |
Best NR hit (blastp)   | GF13565 [Drosophila ananassae] (1e-82) |
Best NR hit (blastx)   | AGAP007606-PA [Anopheles gambiae str. PEST] (2e-80) |
GeneOntology terms    | GO:0004504 peptidylglycine monooxygenase activity GO:0006518 peptide metabolic process GO:0005507 copper ion binding GO:0055114 oxidation reduction GO:0016020 membrane GO:0004598 peptidylamidoglycolate lyase activity GO:0043025 neuronal cell body |
InterPro families    | IPR001258 NHL repeat IPR013017 NHL repeat, subgroup IPR011042 Six-bladed beta-propeller, TolB-like IPR000720 Peptidyl-glycine alpha-amidating monooxygenase |
Orthology group | MCL14608 |
Nucleotide sequence:
ATGATACATTTTTATAGAAGTACACTTAAGGCCTTATTATTACTATTTGTGTCGTGTATT
GCAGCAAGAAGGTTGCCGGAAGAATACTATCCGGATAATTTTTTCGCCAACCAACAGTCT
TTAAAAATAAATGCGGCACTACAAAAATTAGAGCACGTTCCACAATGGGTACCCAACTGG
CCGGACTCCAAAATCAAGATGGGTCAAGTATCAGGGGTAGCACTTGATAATTCAGGGCAA
TTGCTGGTATTTCACCGAGCTGATAATACTTGGGATGCGAATACATTTTCCATCAGGAAT
GTATATCAAGCCATTGGAGAGCCGCCCATCTCACAACCTACAATACTAGTTTTTAATGAA
ACTGGAGTTATGGTCGACTCTTGGGGACAGAATCTTTTCCACATGCCACATGGAATAACA
GTCGACAGCGAGAGCAACGTCTGGGTGACGGATGTAGCTCTCCATCAGGTGTTCAAGTTC
ACACCAGACAACAGAACCGCGCCAGCTTTAGTGCTCGGAGAGAAGTTTGGGCCGCTACTG
GACAAACATTTCTGCAAGCCGAGCGCGGTGGCCGTGCTCAGCTCCGGAGACTTCTTTGTG
GCCGACGGCTACTGTAACACTCGCATCGTCAAGTACGCCGCAGACGGCACCAAGATACTG
CAGTGGGGGAAACGTCTGGGCGAGTCTCCGTTCGTGTTGTCAGTCCCGCACGCGCTGTCC
GTGTCGGAGGACCGCTCGCTGCTGGTGGTGGCGGACCGCGAGCGGGGCCGCGTCGCCTGC
TTCAGGACGGACTCGGGCGCCTTCGTCACAGCCTTCAGACACTGGCTCATAGGACCGAGA
CTGTTCAGTGTAGCGTACTCGCCGATACACGGAGGTCGTCTGTATATAGTAAACGGACCA
ACAATCGGCCCTCCGCCGGTTAGGGGTTACGTCATAGACTTCTCGTCAGGGAGGTTGATC
CAGACCTTCGCTACAGGCGACAGCTTCAGTAATCCTCACGATCTGGTGGTGTCTCCTGAT
GGGACGGTCTACGTCGCGGAACTGGATCCCCATAGGGTCCACAAATTCGTCGACGACACT
CTAAGAAACGAGACCAAAGTTAACGTCACTAGGACGAAACCAACTACCGTCGAGGTCGGT
GTAGCGGGAGGGTGGGAATGGGAGCGTTGGGGCTCGTGGGCCGGCGCCGCTGGCAGCGCG
CTAGGGGCCGCGTGCGGAGCCCTGCTCTTAGCACTGTGCCGCGCCCGCGACGGTCGCAAA
TCGGTCGGCCGACGTCGTTGGGAATACGATCACAGTCAGTTCAAGTTACGCCGGTTGCTG
GAAAGGCGGCGCTTTACACGAGTGCACTCAGATGATTCAGAAGACGAGCCCGCACCAATG
TTGCCACCAACAGTATAA
Protein sequence:
MIHFYRSTLKALLLLFVSCIAARRLPEEYYPDNFFANQQSLKINAALQKLEHVPQWVPNW
PDSKIKMGQVSGVALDNSGQLLVFHRADNTWDANTFSIRNVYQAIGEPPISQPTILVFNE
TGVMVDSWGQNLFHMPHGITVDSESNVWVTDVALHQVFKFTPDNRTAPALVLGEKFGPLL
DKHFCKPSAVAVLSSGDFFVADGYCNTRIVKYAADGTKILQWGKRLGESPFVLSVPHALS
VSEDRSLLVVADRERGRVACFRTDSGAFVTAFRHWLIGPRLFSVAYSPIHGGRLYIVNGP
TIGPPPVRGYVIDFSSGRLIQTFATGDSFSNPHDLVVSPDGTVYVAELDPHRVHKFVDDT
LRNETKVNVTRTKPTTVEVGVAGGWEWERWGSWAGAAGSALGAACGALLLALCRARDGRK
SVGRRRWEYDHSQFKLRRLLERRRFTRVHSDDSEDEPAPMLPPTV