New model in OGS2.0 | DPOGS202935  |
---|---|
Genomic Position | scaffold2456:7504-9164 (- strand) |
CDS Length | 1113 bp |
Paired RNAseq reads   | 98 |
Single RNAseq reads   | 299 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA001905 (4e-174) |
Best Drosophila hit   | Peptidyl-alpha-hydroxyglycine-alpha-amidating lyase, isoform D (2e-92) |
Best Human hit | peptidyl-glycine alpha-amidating monooxygenase isoform a preproprotein (3e-54) |
Best NR hit (blastp)   | PREDICTED: similar to peptidyl-glycine alpha-amidating monooxygenase [Tribolium castaneum] (7e-102) |
Best NR hit (blastx)   | PREDICTED: similar to peptidyl-glycine alpha-amidating monooxygenase [Tribolium castaneum] (1e-98) |
GeneOntology terms | GO:0004598 peptidylamidoglycolate lyase activity; GO:0005576 extracellular region; GO:0044237 cellular metabolic process |
InterPro families | IPR001258 NHL repeat; IPR011042 Six-bladed beta-propeller, TolB-like; IPR000720 Peptidyl-glycine alpha-amidating monooxygenase; IPR013017 NHL repeat, subgroup |
Orthology group | MCL15890 |
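
The genomic span and the CDS length in the table can be compared with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, which this record does not state. The helper name `genomic_span` is hypothetical. The difference between the span and the CDS length is the number of bases inside the gene model that are not coding sequence, presumably introns.

```python
def genomic_span(start, end):
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the table above.
span = genomic_span(7504, 9164)
cds_length = 1113

# Bases within the span that are not part of the CDS (assumed to be introns).
non_coding = span - cds_length

print(span, non_coding)  # 1661 548
```

A CDS shorter than the genomic span suggests a multi-exon gene model, but the exon boundaries themselves are not given in this record.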
Nucleotide sequence:
ATGTGGCCCCTAATTTTGTTGTTTCTCACTTTTCACGGAATTAAATGTGAACCGGAAACC
GTTAGAGACAATGCCGATTACTTCAGTTACGGTAATGAAGAAAATGTTTTAAAAAATCTG
GACCTACACGCGCCAAAAGATGAAATTGTGTTAAGACCGCAAGAAGTTCAAGACTGGCCA
CAACAGTCTCTGAATGTGGGACAAATAACTGCTGTCTCAATAAATTCTTTGGGACAGCCC
GTAATCTTCCACCGAGCCGAAAGAGTGTGGGATGAAAGTACTTTCAATGAATCGAATGTG
TATCAGAATCTCGATAAAGGTCCCATTATTGAAGATACCATATTGGTACTCGACCCTCAT
ACAGGTTCTGTGCTTCATAGTTGGGGGGCCTATGCCTTTTATATGCCCCATGGTTTAACT
GTAGACCATCATGACAACGTGTGGGTAACTGACGTGGCTAAACATCAAGTTTTCAAGTAT
ACACCGAACAATCACAAATATCCAAGCCTTACCATCGGAGAGGCCTTTACTGCTGGTTAC
CCTTATAGACGTAGGGTACTGTTATGTATGCCGACGTCAGTAGCTGTCGCTACAACGGGT
GAAATTTTTGTTGCCGATGGGTATTGCAACAATCAGATTTTAAAATTCAATGCCGCTGGA
ACTTTATTATTCGCCATACCCACATTCTCCGATACCCTGACCTTAAATCTGCCACACAGT
GTCACCTTGTTGGAAAGTTTGGATGTAGTTTGCGTGGCTGACAGAGAGAATATGAGAATT
GTATGCCCCAAAGCTGGGTTGAAGAGCTATGTGAATATGTTTGAAGCGGCGACTGTAATT
GAAGATCCCACTCTAGGTCGTGTTTTTGCCGTGGCTTCCCATAATGATATGATTTATGCT
GTTAATGGTCCGACCTCTCAAAACATCGCTGTACGGGGTTTTACTGTAAATGCCGTATAT
GGAAATATATTGGACACTTGGGAACCAAGCGCTGGTTTTACTAATCCTCATTCTCTGGCG
GTTACAAGAAACGGCTCCCATCTTTACGTTACGGAAATTGGACCTAATAAAATCTGGAAA
TTCGAATTAACTGATGTCTTTGACAAGAAATAA
Protein sequence:
MWPLILLFLTFHGIKCEPETVRDNADYFSYGNEENVLKNLDLHAPKDEIVLRPQEVQDWP
QQSLNVGQITAVSINSLGQPVIFHRAERVWDESTFNESNVYQNLDKGPIIEDTILVLDPH
TGSVLHSWGAYAFYMPHGLTVDHHDNVWVTDVAKHQVFKYTPNNHKYPSLTIGEAFTAGY
PYRRRVLLCMPTSVAVATTGEIFVADGYCNNQILKFNAAGTLLFAIPTFSDTLTLNLPHS
VTLLESLDVVCVADRENMRIVCPKAGLKSYVNMFEAATVIEDPTLGRVFAVASHNDMIYA
VNGPTSQNIAVRGFTVNAVYGNILDTWEPSAGFTNPHSLAVTRNGSHLYVTEIGPNKIWK
FELTDVFDKK
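
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and a CDS that starts in frame. The function name `translate` and the table layout are illustrative choices, not part of the source database. Only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTGGCCCCTAATTTTGTTGTTTCTCACTTTTCACGGAATTAAATGTGAACCGGAAACC"
last_line = "TTCGAATTAACTGATGTCTTTGACAAGAAATAA"

print(translate(first_line))  # MWPLILLFLTFHGIKCEPET
print(translate(last_line))   # FELTDVFDKK*

# A 1113 bp CDS holds 371 codons: 370 amino acids plus the stop codon.
print(1113 // 3 - 1)  # 370
```

The translated fragments match the start and end of the protein sequence, and the expected length of 370 residues agrees with the six full 60-residue lines plus the final 10 residues. Ambiguous bases such as `N` are not handled and would raise a `KeyError`.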