New model in OGS2.0 | DPOGS211079  |
---|---|
Genomic Position | scaffold93:+ 14365-18591 |
CDS Length | 2703 |
Paired RNAseq reads   | 359 |
Single RNAseq reads   | 912 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002959 (0.0) |
Best Drosophila hit   | CG4751 (6e-120) |
Best Human hit | MPN domain-containing protein isoform 1 (3e-40) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL007827 [Aedes aegypti] (3e-142) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL007827 [Aedes aegypti] (6e-133) |
GeneOntology terms    | GO:0008150 biological_process; GO:0005575 cellular_component; GO:0016787 hydrolase activity; GO:0008233 peptidase activity |
InterPro families   | IPR000555 Mov34/MPN/PAD-1 |
Orthology group | MCL16712 |
Nucleotide sequence:
ATGGATTTACAAACATCGCATTTGGCAGCTGCACTGATGTCTAATAACAATTCATTTAAC
AATGAACAACCTGGGGCTGTCATAGCTCCTACCATCAACGATAAAGATGAACCTATGCAA
AATGATCCGGTACCAGCGGTTGGTGAACCAGCAAAACCAACCACATATGAAGAAAAAGAG
GAAATAGGCAGTGGTGAAGAGTTCTCCGATGAGGAGTCAGAAGCACAACCTGAAAACAAA
TGTATGCCAGGAAGAGGTGTTACTTTGCAAATGCTTTTGGAAGAGAAAATGTTAGAACCC
GGACATGCTGCTATGACAATAGAATATTTGGGTCAAAAGTTTGTTGGTGATTTACAAGCC
GATGGAAAAATAAAATCACATGAAACAGAAACAATATTCTGTTCACCGTCTGCTTGGGCC
ATTCACTGCAAGAGAATTATAAACCCCGACAAAAGATCGGGATGCGGGTGGGCATCAGTG
AAATACAGAGGCAAGAAATTGGATACCATTAAAGCTACTTATTTAAGAAAGAAACAACTA
CAGAGGGAGAACATGCATAGTGATGAAGAAACAGAAATGGAGGTGGAGAGTCCTCCTGAG
CCTCCCCCGCAGAGGATTGTAATGAAACATAATACTGTACCCAATAGAATGATGCAACAT
GATGCGAACATGCTGATTGAAGCTGTGTCATTCTCGACAGCGGGAAAGATTCAGCCGTTT
TTAGTATCAGTTAACTCCAATGCCTTACTGATACTAGATATACATTGTCATTTGAAAAAG
GAAGAAGTTTATGGCTATTTGGCTGGTACATGGGATCTGAATAATCATAATGTCTCAATC
ACTCATACATTCCCATGCTTGATAAGCAAGAATGACTCGAGACCAAGGGTTTTAGTTGAG
TTGGAAATACAAATGGAAATTGAAAAGTTAGGGCTGTCTTTATTAGGCTGGTACCACTCT
CACCCAACCAACCCGGCCATGCCCAGTCTTAGGGACTGTGATAATCAACTTGAATACCAG
ATAAAAATGAGGGGCCCTACAGAAATATCGTACATTCCTTGTATTGGAGTTATTTGTTCA
CCCTACAATCCGGAAAGTCCTGTAATGGAATCATCATTAACATTCTTCTGGGTTATGCCT
CCTCCAGAACAGAGACCCACAGAATACCCGAAACCTCTTCTTTTGCAATACAATATGATT
CATGATACACATTTATCAACTCACGCTATGGAACAGATAAAGAAAAGCATCAAATACTAT
GGCACATTCGCTGACGACTCGCTAGTCAGTTTTAAGGATAACTTTAAGCCTGATATCACT
TATTTGGATAAATTAAAATGTACCCTCACTCCGAAATTCCCGCGAGAGCAAAGTGATGGG
TTGTTATGGCATTTTATAAGGGATGAGTTAGGTTGTTCGTCAGAAAATGATGATAAGATG
GATTTAGATGCGTTATTGGCTGTTCCACAACCGATTCCCGCATCTAAACCGCAATCCACT
TCCATACCTAACTTCCCATCAGTTAGCACACTGCAACAGATGGTGAGTAGACCGGCCGGA
GTGCCACCAATCAATGTTTCTTCGGCTATAGGTTCAGTTTCCCCTCATAAATTTGAGACG
CCACCGCTTAATATACCCGTTTTACCGACATCCTCTAAACTTTCTAAATCGACAACTCCT
TCCCTTCCCCCAACATATCCAACCGGTCTGGATATGTTAACTAGTATGGCTTTAGGACTT
GGATCTACGAATATGCCATTACCTCTAGGTACATCAGGTCTAGAGAGTCTAGCGGCTGCT
AACAGTATGCTAACAGGCTTTAATCCAGCATTATCTTCGAACTTAGCATCCACATTATCT
TCAAGCAAACTCCCGGATTTGCCTTCATACGCTGCCTCTTTGCAAAATCTATCGAATAGT
ATGGTTAATTATGAGAAAACTTCTACTACGACTTCAACAAGTTGTACAACTTCCGTCGCC
CCTATACCAGCCAGCATCGCCTCTAACCTTATGATGAGTTCGGCTGATATAGCCAATGCA
TTATTTTCAGCTAGTAAATATTCTAGTGCTGGTATATTAGGAATACCAGATCCAATGTCG
AAATCCACTCTGGCTGCCAATAACATGTTTTTGTCGCCTTCTTTGCTTAAAATGCAAGAG
TCATTGATGAAGCCTTTGTCAAGTAGTAGTCCGATCCCGTCTAAAGTTGGTCTCGATCAG
AACATGTTAATGAAAAGTCCCCATGACCTCATTAAACCATCTAAGGACTATCTGCCCCCT
GATTTTGGAAGTATTAATAAAACAAAAAGTAGTTCCCACGATCCCATAAAACACCCGGAC
GTGAGCTCGTCTAAATCAGAGACTTCTAAGTCGGTTTTATCTGAATCACAACTACCAGAA
TACCCTCAAATTTGCTCCTCGCCAAGGGTTGGGGGTGATCCATTTTTGAATCAAATGTTG
GAATTGACTAAGAAGACGACGATTCCTGACTATCCGGCCGACTACAGCCAGCCGCGGAAG
ATGGATGAAGATGTCCAGAAGCTAACTCCGACAACCCTTTCATACTCAAGCGCCGCTAGC
ATAGCTGACACTATAGCCCAGGTTGCGATGGGAAATTTTAATAAAATGGAAGATGCCATG
GACTATTCTACTGGTCAAGATTATTCAACTACGAAAAATACAAGCAGTGAAACAGAAAAT
TAA
Protein sequence:
MDLQTSHLAAALMSNNNSFNNEQPGAVIAPTINDKDEPMQNDPVPAVGEPAKPTTYEEKE
EIGSGEEFSDEESEAQPENKCMPGRGVTLQMLLEEKMLEPGHAAMTIEYLGQKFVGDLQA
DGKIKSHETETIFCSPSAWAIHCKRIINPDKRSGCGWASVKYRGKKLDTIKATYLRKKQL
QRENMHSDEETEMEVESPPEPPPQRIVMKHNTVPNRMMQHDANMLIEAVSFSTAGKIQPF
LVSVNSNALLILDIHCHLKKEEVYGYLAGTWDLNNHNVSITHTFPCLISKNDSRPRVLVE
LEIQMEIEKLGLSLLGWYHSHPTNPAMPSLRDCDNQLEYQIKMRGPTEISYIPCIGVICS
PYNPESPVMESSLTFFWVMPPPEQRPTEYPKPLLLQYNMIHDTHLSTHAMEQIKKSIKYY
GTFADDSLVSFKDNFKPDITYLDKLKCTLTPKFPREQSDGLLWHFIRDELGCSSENDDKM
DLDALLAVPQPIPASKPQSTSIPNFPSVSTLQQMVSRPAGVPPINVSSAIGSVSPHKFET
PPLNIPVLPTSSKLSKSTTPSLPPTYPTGLDMLTSMALGLGSTNMPLPLGTSGLESLAAA
NSMLTGFNPALSSNLASTLSSSKLPDLPSYAASLQNLSNSMVNYEKTSTTTSTSCTTSVA
PIPASIASNLMMSSADIANALFSASKYSSAGILGIPDPMSKSTLAANNMFLSPSLLKMQE
SLMKPLSSSSPIPSKVGLDQNMLMKSPHDLIKPSKDYLPPDFGSINKTKSSSHDPIKHPD
VSSSKSETSKSVLSESQLPEYPQICSSPRVGGDPFLNQMLELTKKTTIPDYPADYSQPRK
MDEDVQKLTPTTLSYSSAASIADTIAQVAMGNFNKMEDAMDYSTGQDYSTTKNTSSETEN
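
The record's fields can be checked against each other: a CDS length of 2703 nt corresponds to 901 codons, that is 900 residues plus the terminal TAA stop codon, which matches the protein sequence listed above. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies to this gene model. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each
# of the three positions (assumption: standard nuclear code for this model).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int, has_stop: bool = True) -> int:
    """Number of residues implied by a CDS length in nucleotides."""
    codons, remainder = divmod(cds_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# First six codons of the nucleotide sequence in this record.
print(translate("ATGGATTTACAAACATCG"))  # MDLQTS
# CDS Length field from the table.
print(expected_protein_length(2703))    # 900
```

Passing the full nucleotide sequence to `translate` and stripping the trailing `*` should reproduce the listed protein if the record is internally consistent.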