New model in OGS2.0 | DPOGS208752  |
---|---|
Genomic Position | scaffold721:+ 11448-19818 |
See gene structure | |
CDS Length | 4362 |
Paired RNAseq reads   | 2952 |
Single RNAseq reads   | 6648 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003413 (3e-72) |
Best Drosophila hit   | MBD-R2, isoform A (3e-48) |
Best Human hit | PHD finger protein 20-like protein 1 isoform 1 (1e-18) |
Best NR hit (blastp)   | PREDICTED: similar to MBD-R2 CG10042-PA, isoform A [Apis mellifera] (7e-98) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC010453 [Tribolium castaneum] (5e-93) |
GeneOntology terms    | GO:0008270 zinc ion binding GO:0003677 DNA binding GO:0005515 protein binding GO:0005634 nucleus |
InterPro families    | IPR006612 Zinc finger, C2CH-type IPR002999 Tudor domain IPR001739 Methyl-CpG DNA binding IPR001965 Zinc finger, PHD-type IPR013083 Zinc finger, RING/FYVE/PHD-type IPR007087 Zinc finger, C2H2-type IPR019786 Zinc finger, PHD-type, conserved site IPR016177 DNA-binding, integrase-type IPR011011 Zinc finger, FYVE/PHD-type IPR016197 Chromo domain-like |
Orthology group | MCL16848 |
Nucleotide sequence:
ATGGCTGTAAAAAAATGTTGTGTAGAAAACTGTAATTCATCTTCAACAAGACCAGAAGAC
ATTGGTGTTACATACCACAAGTTCCCTAAAGATAAGACATTACGTGATTTATGGTCGTTG
GTAACGCACTACAAACAAACTAACATAGACTCTACGACATACGTGTGCTCGCGTCACTTT
TGCAAAATCGATTTTCAAATTTACGAGGACTCAAAATACATTCTTAGATCAGATTCCATT
CCCTCTATATTTTCATGGATCCAAAGAGATAAAGATACAAAGATACAGCAATTAGAATCT
AATATGGATGAACCTAATATTTCTGGGGCAGCATCCCCTGTAAATGAAAGCTCAGATGGA
GGTGCTGCAAACCTGAACACCTCATCCAGCAGTAAAGAATCCGAAGGTGAAAATGTTGAA
GCTATTATGAAATTCATTGAAGAACAAGAACAGGAAATAAAAAAGCAACAGAATGAGAGC
CAATCTGAAAAGATTGCAAGCGATCAAAATGTGCCGCTAACCCATAATGACAATATTGAC
AATAATGACAATAATCAAGTGTTCAGTGACATAGCGGAGCCTATGGTGATAGCGACGAGT
GTTATGGACATGATTCTCAGTGAATCAGAAGCTAAAATAGACACAAGGAAAAATATTAAA
CCAATACCACAGAAACTGAGTAAAAATGATAAAGGCAACAGTGTTTCACTGTCGGTCGGA
TCTAAGGTTGAGGCCAAAGATTATGGAGAATTTTGGCATTCGGCTCAGATTGTGGAAGTG
GACTATGACGAAATGGAAGTTCTGGTGCATTATGAGAACACACACAACAAACCCGATGAA
TGGATAAGTGTGAGCAGTCCCAGATTGAGGCTTACGAACAATCCTACACAAAGCACCCCA
GCGAGAAACGTCAGGACTGAAATAAAACCTGAAAAGGAAGAAGTTAAGGTGGAGGAGAAA
CCAAAACAACAGTTTGTTGTTGGTGAAAGATGTCTGGCTCGTTGGAGGGACAACAGACGT
TTCATAGCCACAATACTAACAGATCTCGGCAATGGCAATTACGAGATTATGTTCGACGAT
GGTTTCAAATGGAAATGCACTACGTCAAGGATGTGTAAGCTCAAGGAGTCTAAGACGGAA
CCGCTGGCCATCGATACGTCAGCGTCAGCATCATCTTCCAGTCTTTCACCGATACCAATT
CCTGGGACGGGTCCGACGGGAAATATCCCAAACAGCCAATACACGTTCCACACACATCTA
TTCGATCCCACCCGCGATTATTTGGGCTCTAAGAGCGAGAGGCGAGAAATGAAACGCAAA
TTGAACATAAAAGAGATATTTAATATAGGTCAGAAGAAACAAAAGCGAAAAGATAGCGAT
CAAGGGAAACCGAAAATTGGTAAGGTGAAGAAGGCAAGGGTTATTAAGAAGAAGGTTGAC
ATTAAACCGGAAGCTGAGACCGAAGTCAAGCTTGAAGTGTCGCAAATAACAATGGAGATA
AAAAAAGAGATTCCCGATGCAGTCGCTTCGATAATTGGTACTGTCAGTAAAGATGAAAGC
GATAAAACTAAAATTGATACAGAAATTAATATACCGATGGACGCTGCTAAGGACGTGGAA
GAATCTAGTGCTGAGACTAACATTGAAGACGCAAATGTATTAGACGTAAAAAATACAGGA
ACGGCTTTCGATTCAGAGGAGGTCGTTGAAAAAAATGACATCGGAGATAGTTTAGTTAGT
AGCTCAGATCTGTTAGTCCCTAGCGACTTAGGCTTCCAATCAGAGCCTACATCGGATCCT
GTCATGGAAGAAGAAGCGAAATTAGAACAATTCGAAAAACCGGATGAAAGTAAACAAGAG
GTTATAGAGAAAATGAAAGAGGTTATATGCAAATTAGAGGGCGGTTTAGATATACATAAG
ATAGATACAACGAAACGTGAAGTAGACACGAAGCCAGTAAGCGAAGTTGATACAACAGTA
AAAGACGACAGATCGAAAAGGAAGCTGTCGAAAATAAAGAGGAATAAAAGATTGAGAATG
TTACAGGAGAAGAAAGTTAAGAAGCAGGTGGAGAAAGTGAAGAACGAGCTGGTGGAGATG
AGGAAGCAGATGGAGGAGATGAGGAAACAGATGTTGATGAAGACGGAGGAGATGGCTCGC
CCGCACGAGATGCCCGAGAGCTTCCTGCTACCGGGAGAGTGGTGCTGCAAATGGCTCAAC
GGGCAACCCCTGGGCAATGTGTGCGAGTTTGAAGATAAAGTCGACGGCAAAGGGCTTAAG
AAGATGAGCGTTCAGGTCGAGGATAAAAGACTGCCTCCAGGGTGGACGAAACTTATGGTG
CGTCGAAGTTTTGGACAGTCCGCTGGGAAATGGGACGTCGTTTTAGTTGGACCGGAAAAT
CGTAGGTTTCATACTAAGACGGATATACGGAACTATCTCGAGCAGCACGACGACTCCCTC
AAGCAGTACGAACACGCGCTGTTAGATTTCGGTGTACATCTGAAGCTGTCCCGTAGGATG
GGATGGTATACGACGGATGGCGGCGTTGCACCGGCACTGGTGAAAAGAAAGAAATTAGGT
ATAAACAGGAAGGAAGGAAAGAAAAGAAAGAAAGAGAAGTCAGCCAAGCGTGATATATCC
TTGGAAAGTTTTTATAAACGTACATTTTACCCGGAAAGCCCACCGGTCTTCCTGGAAAAT
CCCGTGGAAGACGATGGTTCTGTGTACGTCGGTTCTATGAAGGTGGAAGTGATCGATAAT
CTCTTACGGTGTCCAGCTGAGGGCTGCCTGAAGAACTTCCGAAATACCACACTACTGCAG
ATGCACATAAAACATTACCACAGAGAAATGAGGGAAATGTTGGGAGCCACCCCAAAAGTT
TTGGACTTAGCGCGCGAAAGAACGAAACCCACTGATATCGAAGTCAAAAAGACGGAATTT
GAATCCAAAATTATTAAAGTCAAGCTACCAAAACTGCCGAAGAGATCCGAGGAACCCAAA
AGTCAAACGAACCCAGAAGTCAAAGAGCCCATTGTGCAAAAAGTAGAGCCTCGACCACAA
ACACCACCTAAACTGGATGTGCCCATACCTAGATCACAGGATTCTCCTAAACTAAGACAA
GCACTAATCACCAAACCGGCTAAAAGACCGAAAGTTCTACTCCCAGTTAGAAAACCAGAG
CCAGAAGAAAAAGAAGAAATCCCCGAGGAAGCTGATGTAGAAAAGATAGATTTCGACGAC
AGCTCCAATACTGCAGAGAAACCGTTTGAGGAGTTCCGAAGGAAGTGTGATAAAAAGCGC
AAATGTTTTTCAACTGTGTCAAGGAAGCCTATCAGCGAGGAGGACGAGTGGTTCGGTGTG
AACTCTGACCTTGACACTCGGTCCAGTTTCCCAGGGTCTGGCACACCGGACTCCAAAAAC
ATGGACAAGGCAGTACCACTTCCGGTTTCCTCCGAGTCCAATGAAGAACAGAAGGACGGC
AATATGTATATGTATACAGAGACTGGCGAACGTATAAAGATCGAGCACATGAAACGCGAG
GAGATCATAAACTGTCATTGCGGTTTCCGCGAGGAAGACGGGCTGATGGTGCAGTGTGAA
CTCTGCCTGTGTTGGCAACACGCGCTGTGTCACAACATACAGAAGGAGTCGGAGGTTCCA
GAGAAATACACTTGCAGTATATGTCTCAATCCTCGGCGTGGGAGACGCTCCAAGCGGTTC
TTGCACGATCAGGACAGACTGTACGAGGGGTTGCTGCCGGGGGCGAAGCCCTGCGAGACT
TTGCGACGCTCTCACGAATTATCAGCTAACCTATTGAAAATTCAGGATGCTCTGCATGCA
ATGCGAGTCAAACACTATGTAGCTACTAAGAAAGACCACCCAAAATTATATCTGTGGGCC
AAAGACTGGGAGAGTCCAGAGGTAAATTTCACCCAAGAAAGACTTAATTCAGATTACTCA
GATCTGAATATTATCATAAATAACATCGGCAAGGAGAATTTGCCGCTGAAACCCGATGAA
GTTAATCCACATCTGGATATAAGAATGCCCATAACTGAAGAGCCTGAAGATAGATTCACT
CAGAGAGACAAACAAGAAGTACAAAGAGTGGTCATCCCTCAGCCCGAGGCAGCCATTGAG
AACAGTGCATGCAGGGAACGCTTGCTGCGACATGTGCAGCGCTGTCAGGGCTTCATTGAC
GCCAGACTCGATTCTATAGAAGCTCAAGTAGCCGAACTCGAATCTCAAGATCCATCATTT
GAGGATGATGAGACAGCGGATTTCTTCCCAAGAACAAAACAAACTATCCAAATGCTGATG
AGGGACCTCGATACGATGGAAGAACTGGGAATTATATCTTGA
Protein sequence:
MAVKKCCVENCNSSSTRPEDIGVTYHKFPKDKTLRDLWSLVTHYKQTNIDSTTYVCSRHF
CKIDFQIYEDSKYILRSDSIPSIFSWIQRDKDTKIQQLESNMDEPNISGAASPVNESSDG
GAANLNTSSSSKESEGENVEAIMKFIEEQEQEIKKQQNESQSEKIASDQNVPLTHNDNID
NNDNNQVFSDIAEPMVIATSVMDMILSESEAKIDTRKNIKPIPQKLSKNDKGNSVSLSVG
SKVEAKDYGEFWHSAQIVEVDYDEMEVLVHYENTHNKPDEWISVSSPRLRLTNNPTQSTP
ARNVRTEIKPEKEEVKVEEKPKQQFVVGERCLARWRDNRRFIATILTDLGNGNYEIMFDD
GFKWKCTTSRMCKLKESKTEPLAIDTSASASSSSLSPIPIPGTGPTGNIPNSQYTFHTHL
FDPTRDYLGSKSERREMKRKLNIKEIFNIGQKKQKRKDSDQGKPKIGKVKKARVIKKKVD
IKPEAETEVKLEVSQITMEIKKEIPDAVASIIGTVSKDESDKTKIDTEINIPMDAAKDVE
ESSAETNIEDANVLDVKNTGTAFDSEEVVEKNDIGDSLVSSSDLLVPSDLGFQSEPTSDP
VMEEEAKLEQFEKPDESKQEVIEKMKEVICKLEGGLDIHKIDTTKREVDTKPVSEVDTTV
KDDRSKRKLSKIKRNKRLRMLQEKKVKKQVEKVKNELVEMRKQMEEMRKQMLMKTEEMAR
PHEMPESFLLPGEWCCKWLNGQPLGNVCEFEDKVDGKGLKKMSVQVEDKRLPPGWTKLMV
RRSFGQSAGKWDVVLVGPENRRFHTKTDIRNYLEQHDDSLKQYEHALLDFGVHLKLSRRM
GWYTTDGGVAPALVKRKKLGINRKEGKKRKKEKSAKRDISLESFYKRTFYPESPPVFLEN
PVEDDGSVYVGSMKVEVIDNLLRCPAEGCLKNFRNTTLLQMHIKHYHREMREMLGATPKV
LDLARERTKPTDIEVKKTEFESKIIKVKLPKLPKRSEEPKSQTNPEVKEPIVQKVEPRPQ
TPPKLDVPIPRSQDSPKLRQALITKPAKRPKVLLPVRKPEPEEKEEIPEEADVEKIDFDD
SSNTAEKPFEEFRRKCDKKRKCFSTVSRKPISEEDEWFGVNSDLDTRSSFPGSGTPDSKN
MDKAVPLPVSSESNEEQKDGNMYMYTETGERIKIEHMKREEIINCHCGFREEDGLMVQCE
LCLCWQHALCHNIQKESEVPEKYTCSICLNPRRGRRSKRFLHDQDRLYEGLLPGAKPCET
LRRSHELSANLLKIQDALHAMRVKHYVATKKDHPKLYLWAKDWESPEVNFTQERLNSDYS
DLNIIINNIGKENLPLKPDEVNPHLDIRMPITEEPEDRFTQRDKQEVQRVVIPQPEAAIE
NSACRERLLRHVQRCQGFIDARLDSIEAQVAELESQDPSFEDDETADFFPRTKQTIQMLM
RDLDTMEELGIIS