New model in OGS2.0 | DPOGS209428  |
---|---|
Genomic Position | scaffold3771:- 1021-15350 |
See gene structure | |
CDS Length | 2508 |
Paired RNAseq reads   | 3150 |
Single RNAseq reads   | 8634 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001638 (9e-32) |
Best Drosophila hit   | aminopeptidase P (4e-151) |
Best Human hit | xaa-Pro aminopeptidase 1 isoform 1 (4e-140) |
Best NR hit (blastp)   | PREDICTED: similar to X-prolyl aminopeptidase (aminopeptidase P) 1, soluble [Tribolium castaneum] (2e-164) |
Best NR hit (blastx)   | aminopeptidase P [Glossina morsitans morsitans] (2e-163) |
GeneOntology terms    | GO:0004177 aminopeptidase activity GO:0005829 cytosol GO:0009987 cellular process |
InterPro families    | IPR000587 Creatinase IPR000994 Peptidase M24, structural domain |
Orthology group | MCL14918 |
Nucleotide sequence:
ATGAGTCTACAAAGGTTGACGGCGCTACGAGCGCTGATGGCTGGACATCCGACAGCCTTA
GCTGCGTATATAATACCTACTGCGGATGCTCATAATTCGGAGTACATATCACCGGCGGAT
GCTCGTAGGGAGTGGATATCAGGGTTCACGGGTTCAGCCGGAACAGCTGTGGTTACAGCC
AACAAGGCCCTGGTGTGGACTGATGGTAGATATTACACGCAATTTGAGAAGGAAGCCGAT
CTCACGATGTGGACTCTAATGAAGCAATCTTTGCCCGAAACTCCAACTATGGAGAAGTGG
CTGGCGAGCAATCTGATAGCCGGTTCTGTTGTGGGGGTCGACCCCCACACTATGACGAGA
GAGGAATGGACCCCCTTGCAGACGGCGCTGTCTAAGGCAAAAATGCAACTAGTTGCTGTA
GAGAGTAATTTGGTTGATAAAGCCAGGATCTCACTGGATGATCCTCCGCCGAAGAGACCC
CAAAATGATATTATACACCTGCCTTTAGAATACACTGGAAAGACTGCTGGTGAAAAAATC
CATGATCTGAGAGTAGGGATGCTGGAGAAGAAAGCTTCAGCTCTCGTTATAACAGCCCTC
GATGAAGTCGCCTACACACTGAATCTGAGGGGTAGCGATATAAGATACAATCCAGTTTTT
TTCTCGTATCTATTGCTGACCCCCGACACGGTGACGCTGTTCTGGAGTGGGGGTCGCATT
CCGGACGACATAGAACGCAACTTATCTGACGAGGGGGTCAAGATAGTTGGTCGGCCGTAC
GATGACGTCATTGAGGGCTTGAGTAATTTGGCTCGCGAGTTATCCAACATGGGTGACGGT
GAGCATTCTGTGTGGATATCAAACGAAGCGAGCGAGGCGGTCCACAGAGCTGTGTCCGGG
GAAGGCGTGTTGAAAAACCCTCTTAATCTGATATCAGAAGTGTCTCCGGTGGCTTTAGCG
AAGTTGGTGAAGAACGACGTCGAGCTCGAGGGTTTCCGTAAATGTCACATCCGGGACGGT
ACAGCCGTCTGTAGATTCTTTAGATGGCTCCACCAGGAGGTGGACTCCGGAAATAAGATC
ACGGAAGTGGAAGCCGCTGAGAGATTATTGGAGTTCAGGAAGGATGAAAAAGACTTCATG
GGCCCCTCCTTCGAGACCATATCCGGGGCTGGTGAAAACGGCGCCGTCATACATTATACT
CCATCATCAGACTCGCCCAGGATCATAACGGCTGATGACGTGTACCTCCTGGACTCCGGC
GGACAGTACAAGGACGGTACGACTGATATCACCCGCACTCGTCACATGAGTGAGCCCACA
GACCTTCAGAAGGAAACCTTCACTAGAGTGCTCAAGGGTCAGATTGCTATCGGCGCTGCT
TTGTACCCCGTTGGGGTAAAGGGTAACGTCTTAGACTCGTTGGCACGTAAGTATCTGTGG
GACGTCGGCCTGGACTATGCGCATGGGACTGGCCATGGGGTAGGGCATTTCCTGAACGTC
CACGAGGGTCCCTCGGGGATCTCTTGGCGGCCGTACCCCCACGACCCGGGACTAAAGATG
GGTCAGATATTGAGTAACGAACCCGGTTATTACCGGGTCGGGGAATTCGGTATCCGGATA
GAGGATCTAGTCGAGACTATCAACGTCACAAACGACACGAACCACCCGAGGGCCAAAGAT
CTTCTGGGTGACTACAACGGGCGCGGCGTGCTGGGTTTCAACACGATAACTCTGGTACCG
AATCAGAGGAAGTTCATCAAAACTGAGCTGCTGGATGACTTCGAGCTCAAATACATAAAT
TCCTACCACAAGAGAGTGTTGGATACTCTCGGACCGATACTGAAGAACCGAGGCCTGATG
GAAGATTATGCCTGGCTGGAGAAGGAATGTTCTCCATTGATTGCAGCACAGCACCTGGTG
ATTGGGGACCTCATTTCCTTCGATGCAAACACTTATTCTCTATACCCTGAAAGTCATATA
CCCGGTAACATAAATGGCGGGAAACTCAGTGGATCTCAAAGCTTGGAGCGCGTGTCAGCT
GTTAGGAATGTTATGGCAGAGAGAGGAATCGACGCTTTTATAGTACCTACGTCTGACGCG
CATAACTCTCAATATATAGCGCCTACGGATGCTAGACGGGAGTGGCTCTCAGGTCTGTCG
GGGTCCGCCGGTACAGCCCTCGTAACAGCCGACCACGCCTTACTGTGGACTGACGGCAGA
TACTTCACGCAATTCGATATGCAAGTTGATCCTCGTATTTGGACTCTCATGAGGATCGGT
ACTGATGTAACGATCGAGAGTTGGCTAGCGTCTAACATGAGAGGTTCAAGAGTTGGTATT
GATCCAACGACCTACACACGCAGTTCTTGGACAACTTTGGAGGTAACCTTATTTAAATAT
TGTAACAAATCGCTGGTGGTAGAGCCTAGCTGTGGTCGAGTTACTGCAGCTCGAACATGG
GCTGTCCCGACCGGGGAAGTGCCACCCTCTCACAGAAGATCCGCGTGA
Protein sequence:
MSLQRLTALRALMAGHPTALAAYIIPTADAHNSEYISPADARREWISGFTGSAGTAVVTA
NKALVWTDGRYYTQFEKEADLTMWTLMKQSLPETPTMEKWLASNLIAGSVVGVDPHTMTR
EEWTPLQTALSKAKMQLVAVESNLVDKARISLDDPPPKRPQNDIIHLPLEYTGKTAGEKI
HDLRVGMLEKKASALVITALDEVAYTLNLRGSDIRYNPVFFSYLLLTPDTVTLFWSGGRI
PDDIERNLSDEGVKIVGRPYDDVIEGLSNLARELSNMGDGEHSVWISNEASEAVHRAVSG
EGVLKNPLNLISEVSPVALAKLVKNDVELEGFRKCHIRDGTAVCRFFRWLHQEVDSGNKI
TEVEAAERLLEFRKDEKDFMGPSFETISGAGENGAVIHYTPSSDSPRIITADDVYLLDSG
GQYKDGTTDITRTRHMSEPTDLQKETFTRVLKGQIAIGAALYPVGVKGNVLDSLARKYLW
DVGLDYAHGTGHGVGHFLNVHEGPSGISWRPYPHDPGLKMGQILSNEPGYYRVGEFGIRI
EDLVETINVTNDTNHPRAKDLLGDYNGRGVLGFNTITLVPNQRKFIKTELLDDFELKYIN
SYHKRVLDTLGPILKNRGLMEDYAWLEKECSPLIAAQHLVIGDLISFDANTYSLYPESHI
PGNINGGKLSGSQSLERVSAVRNVMAERGIDAFIVPTSDAHNSQYIAPTDARREWLSGLS
GSAGTALVTADHALLWTDGRYFTQFDMQVDPRIWTLMRIGTDVTIESWLASNMRGSRVGI
DPTTYTRSSWTTLEVTLFKYCNKSLVVEPSCGRVTAARTWAVPTGEVPPSHRRSA