New model in OGS2.0 | DPOGS209575  |
---|---|
Genomic Position | scaffold154:+ 116744-148123 |
See gene structure | |
CDS Length | 3342 |
Paired RNAseq reads   | 688 |
Single RNAseq reads   | 2101 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006636 (0.0) |
Best Drosophila hit   | CG3003 (3e-53) |
Best Human hit | E3 ubiquitin-protein ligase HECW2 (2e-60) |
Best NR hit (blastp)   | hect type E3 ubiquitin ligase [Aedes aegypti] (1e-96) |
Best NR hit (blastx)   | PREDICTED: similar to E3 ubiquitin-protein ligase HECW2 (HECT, C2 and WW domain-containing protein 2) (NEDD4-like E3 ubiquitin-protein ligase 2) [Tribolium castaneum] (6e-90) |
GeneOntology terms    | GO:0005515 protein binding GO:0005622 intracellular GO:0006464 protein modification process GO:0016874 ligase activity GO:0016881 acid-amino acid ligase activity |
InterPro families   | IPR001202 WW/Rsp5/WWP |
Orthology group | MCL19823 |
Nucleotide sequence:
ATGTCAGAGGAACATGATGGTGATGTCGTTGATAGTGAAAGAACCCTGGACAAAGAGTGT
CAGACGTGCGATGTGACTTGTGATGAAGGGAGTGCCAATAACAGTGCCACTAGCAAAGAG
AGTGAGACGAGCGAGGTGGAAGAGAAGGGTGACCTGTCCGAGAAAGGAAACGAACCCAGG
GGCAGCATCATATGGACGGATGAGAAGACGCACCTTCGAGTGTCATGGAACTTGAAGGAA
GGAACGGCCACAGACAAAGATTATGTGGCTCTTTGTTATACAGAGACCACAAGCATAGCC
GGTATCGCCAGGCTCGTGCCGGCGACCGGATGCGACACTGGACATATAATGTGGCTGCTT
GATGAGCCCAATCAGCCTTATGAAGATTGTGAACAACTGCTCTGTTTCCGTTACTACAAT
GGCGAAGAAGACGAATGCGTTGCAGAGTCTTCTACCCTACCACCACGATTTAAAATTGAT
CTTAAAAAGCTGCCACATCGGCTTAAAGAGGGAATCTCGAGGAAAAGAAGCGGTGATCAA
GTGTCACCCTTCTCATATAACAATGAGAGTTTTGAAATGAGTTCAGAAAACACACCTCGA
GTTGAATTAGTGAGTACGAGCCAGGACATCAAGTATCTTAGTGAAAATGTCAATAAATGC
TCAATTAGCACCTTAGGAAATAATGGGACCTTAAATCTCGAACAAATAAATTGTGATCAT
AGTAAATCTCTAGTTAGTAGTAGTACTGAATCTCCTAGTGAATGTCCCAATATTTTGGAC
TGCAAACCAACGTCATCTGGTGTCAAAAAGAAATGTCCTCCTCCTGTCGATACTGGTGGT
GGGCAAGTTTTGAATGGTGTCAAGAAAAAAACTAATATGAATATCGGGTTAGAATCTCCG
ACATCCCCCGATGGAACTGAATATTATAAGTTATGGTCACCGCAGAATCCTTGCAAAATT
ATGAAGTTTGTTTATGATGTGGAAGGTGCGAATGTGCCAAATGATGCTCAAAAATCTCCA
GTTCCGCCCTTACCACCTAGACAATCACATAAACCTCTAGAAAGAATGCACGCTTTACCT
CCGATTGTTCAGAGGCATAGGAAACCTAAAAAGTTAACAAAGCCAGAAGATGCGTTTACT
TTTGAGCTGATAGACACTGATGAACAATTTTTTACGGATAATAACATTGCCAATTCCGCA
GCTTTGCACGGAGATATTAATGATTTTAAACCTGGAGAATTTTATTCAGGCGTTCAAGCT
CCCATTTCTTCAACCACAACTTTTTTAAGAGGATGTCAAAATGTTGCACCTCTAGCAACT
CCTGAGACTGAAGGTAGCTTATGTGACTCAACTATATCTTCCATTAAATTATCTAAATTT
GTAGAGGATAAGAAGTCTGTCGATGTACCTGATAGCACGCAAAATGTTGGTAAAAGCAAC
ATCGTGTTTATAAAAGATATAGGTCCTGACAATTATATTACAACAACCTTCGCCAGAAAA
GATAAAAATGTTGAAGGTGATTTTCATTTGACGCCTAATCACGCAAAATCAGATGAGAAG
GCACAACATACATTCTTAAATGAAATATGTAATCGTGTAGATTTCGAAGACGATAATAGA
CATCAGGTCGAAGAAAAGGAAATTTCTGTAGTAAAAGGTCACAAGCCCCTCACAAGACAA
AATTCGGGGCGAGCTACGCCGACCATGATGACAGCATTTTTGAACTCGTCTCATATTGAT
GGGAGCAGTCGGGAAGGAGAGCAAGAAATCCACACCGCACCCAGTTCCTGTAATTCATCC
CCTTCTAGCTCTCCCATAGAAGAAGGCGGTAAGACGATACCAAATGTGGGGACATTGCTA
ATGAATCGAAAATATTCGTGTGACAGTTCCACTCCTAAACATTCAAGTTTACCACGTCAT
TTGTTGAAAAATCTCGGATGTGAAAACAATACCAAACATTCCCCTCACAAATCGGTACCA
TCGAAGGATGTCCCAGACAGTCCTATGAGGCCTCACCCGAGAGTCTTATCTAGAGTGGCT
GCGCTAGCAAATACGACCGTCCCACAATGTCCACCGACACCGACACATCACGCGAGAAGA
ACGCGGCCGTTGCCGCCACCAGATCTACACCCACCACCTATACAGAACTGTGAAAACTTT
GAAATGGTCGAATTCACAAATGAATTAAGAACATCGGAGATTAGATCGCCTAACTTAGAG
TTCGTTAATAGCAACCATTTCACTATCGTTCACGCTGGACCACCTGATGATGTGGTTTTG
CGGAGACCGCAGACCGTGGACGACACTGATGATAATGAAAACGCGGTAGCATCGCCGACG
CCGTTGCGGCATATGGCTGGGATAAGGCTACCCTCCATACCGGAGAGAGCGTCCAGGCAA
ATGGCCTTGACAGGAGATTTCCCGGCCAACATGATTGGAGGAATCATTGAATGCGAAGAA
CCTTTGCCACCTGGTTGGGAGGCTCGTATGGACAGCCACGGACGTGTCTTCTATATTGAT
CATATAAATCGCACAACAACGTGGCAGAGGCCGGCTGCCAACGGCGCAGCGCGTTCGCCT
GAACCAGAAGTACAGAGACGCCAGTTAGATAGGAGATATCAATCTATTCGTCGCACGTTG
ACGAGGGCGCCGCCTGAGGAGGAAGAGCCCCCCGCAGGGACGTCCAACGCGCCCGCCGCG
CCTCACGCCCCACATCCCGCCGCCGAGTTCCTCGCACGACCTGACTTCTACTCCATATTG
CATATGAACCTGGAAGCGTCATCGCTATACAACTGTAACTCGACCCTCAAACACATGATA
TCTAAGATCCGTCGGGACACCAGCTCCTTCGAGCGTTATCAACACAACCGCGATCTCGTA
GCTCTGGTCAACATGTTCAGTGAAACCGACAGGGAGTTGCCCTTAGGATGGGACTCCAAG
CTCGACAGGAACGGGAAGCGTTTCTTCGTGGACCACGTGATGCGTCGCACCACATTCGTG
GACCCCCGCCTGCCTCGGGCTCCGACCGCGGGACCGTTCTCTCCGCTGCTGCCTCCGAGG
CGGAGGCCCATTATGACCGACCAGGCTCCCACTCCGCCGCCGAGACCTCCGATATCCACA
GCCGACTCGTACTTACAGAACTCTCAGCAAGAGATACCCATAGCATACAATGATAAGGTG
GTAGCATTCCTCCGTCAGCCCAACATCCTGTCCATCCTGAAGGAGCGCTGTTCAGGCTGC
GGGACCGCCTTGAGGGACAAAGTTAACGCGGTTAGGGTTGAAGGAGCGTCCGCACTCGCG
CGGTACCAAAACGACGTCCAGCTGACATGCCTGTTGAGGTGA
Protein sequence:
MSEEHDGDVVDSERTLDKECQTCDVTCDEGSANNSATSKESETSEVEEKGDLSEKGNEPR
GSIIWTDEKTHLRVSWNLKEGTATDKDYVALCYTETTSIAGIARLVPATGCDTGHIMWLL
DEPNQPYEDCEQLLCFRYYNGEEDECVAESSTLPPRFKIDLKKLPHRLKEGISRKRSGDQ
VSPFSYNNESFEMSSENTPRVELVSTSQDIKYLSENVNKCSISTLGNNGTLNLEQINCDH
SKSLVSSSTESPSECPNILDCKPTSSGVKKKCPPPVDTGGGQVLNGVKKKTNMNIGLESP
TSPDGTEYYKLWSPQNPCKIMKFVYDVEGANVPNDAQKSPVPPLPPRQSHKPLERMHALP
PIVQRHRKPKKLTKPEDAFTFELIDTDEQFFTDNNIANSAALHGDINDFKPGEFYSGVQA
PISSTTTFLRGCQNVAPLATPETEGSLCDSTISSIKLSKFVEDKKSVDVPDSTQNVGKSN
IVFIKDIGPDNYITTTFARKDKNVEGDFHLTPNHAKSDEKAQHTFLNEICNRVDFEDDNR
HQVEEKEISVVKGHKPLTRQNSGRATPTMMTAFLNSSHIDGSSREGEQEIHTAPSSCNSS
PSSSPIEEGGKTIPNVGTLLMNRKYSCDSSTPKHSSLPRHLLKNLGCENNTKHSPHKSVP
SKDVPDSPMRPHPRVLSRVAALANTTVPQCPPTPTHHARRTRPLPPPDLHPPPIQNCENF
EMVEFTNELRTSEIRSPNLEFVNSNHFTIVHAGPPDDVVLRRPQTVDDTDDNENAVASPT
PLRHMAGIRLPSIPERASRQMALTGDFPANMIGGIIECEEPLPPGWEARMDSHGRVFYID
HINRTTTWQRPAANGAARSPEPEVQRRQLDRRYQSIRRTLTRAPPEEEEPPAGTSNAPAA
PHAPHPAAEFLARPDFYSILHMNLEASSLYNCNSTLKHMISKIRRDTSSFERYQHNRDLV
ALVNMFSETDRELPLGWDSKLDRNGKRFFVDHVMRRTTFVDPRLPRAPTAGPFSPLLPPR
RRPIMTDQAPTPPPRPPISTADSYLQNSQQEIPIAYNDKVVAFLRQPNILSILKERCSGC
GTALRDKVNAVRVEGASALARYQNDVQLTCLLR