New model in OGS2.0 | DPOGS214648  |
---|---|
Genomic Position | scaffold1106:+ 11596-16974 |
CDS Length (bp) | 1485 |
Paired RNAseq reads   | 428 |
Single RNAseq reads   | 1102 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005064 (0.0) |
Best Drosophila hit   | CG17446 (6e-133) |
Best Human hit | CpG-binding protein isoform 1 (2e-84) |
Best NR hit (blastp)   | PREDICTED: similar to cpg binding protein [Nasonia vitripennis] (6e-175) |
Best NR hit (blastx)   | PREDICTED: similar to cpg binding protein [Nasonia vitripennis] (2e-170) |
GeneOntology terms    | GO:0008270 zinc ion binding; GO:0005515 protein binding; GO:0003677 DNA binding |
InterPro families    | IPR019786 Zinc finger, PHD-type, conserved site; IPR001965 Zinc finger, PHD-type; IPR019787 Zinc finger, PHD-finger; IPR002857 Zinc finger, CXXC-type; IPR011011 Zinc finger, FYVE/PHD-type; IPR022056 CpG binding protein, C-terminal; IPR013083 Zinc finger, RING/FYVE/PHD-type |
Orthology group | MCL14789 |
Nucleotide sequence:
ATGAGTGAAAAGAAGTCTAAACAAAGTAAAGCAGATATTGCTAAACAGTTTGATCTACCC
GAGCGTCAATCGAAGATTACGTCGCTTTTAAATCAAGCCGGGCAAGCTTACTGTATATGC
CGATCTTCAGATAGTTCGCGGTTTATGATAGCTTGTGATGCTTGCGAGGAATGGTATCAC
GGAGATTGTATAAACATTTCTGAAAGAGAGGCGAAGTATATTAAAAACTATTTCTGTGAA
CGCTGCCGTGAGGAAGATCCCACTTTAAAAACAAGATTCAGGCCACAGAAACGAGAAAAT
GATGGAGATTCTGGTAGAGATGATAGAAAGAAAAAACGCAAAGAAAAAGATCATTCAGAA
AACAAGTCATCAAAGAGACCAAATAAAGATGGTTGTGGAGATTGTGGTGGCTGTTCACAA
ACACACGACTGTGGCCACTGTGACGCATGCGAGGATATGTACAAGTATGGAGGTAATAAC
AAGTTAAAGTTGACCTGCCGTCAAAGATTGTGCGTTAAAAGCAAAAAGACCTCTAGAATA
TCAAGTAGTAACCGCATAAAGAAGAAGCATGAACGTGAGGTTTATGAACCCGAAGAGTCA
ATATCCCACCTTCAGACATCAGAGCCTCGGCAATGCTATGGACCGCAATGCTGTAAGGCA
GCTCAGTATGGCTCCAAATACTGTTCCCAGCAGTGCGGAATGAGACTTGCTACCGCTAGG
ATTTATCAGGTGTTGCCGCAGCGCATTCAGGAGTGGTCGCTCTCCAGCTGTGTGGCGGAG
CAGCATAACCGTCGTTCATTGGAGGTGGTCAGGTCGGGGCTGGCTAAAGCGCAGGCAGCG
CTGAGGGCGTTAGACAAACAGCACGCTGAGATAGACGAGATGCTGCAGCGAGCTAAGCAT
GCCACCATAGAACACACTGATGAGAAGGAAGCAGACGACGAGACCTCAATGTACTGCATC
ACGTGCGGCCACGAGATACACTCGCGCTCAGCTGTCAAACATATGGAGAAATGCTTCATA
AAGTACGAGGCCCAGGCCTCGTTTGGCAGCAGACATCGCACACGGATAGACGGACAGAGC
ATGTTCTGTGATTACTACAACCAAATCAATGGCACCTATTGTAAGAGGTTGCGGGTAATG
TGCCCCGAGCACTTCAAGGATCCCAAGGTGAGCGACACGGATGTATGCGGCTGTCCGCTG
GTGAAGAACGTCTTCGATCCCACCGGAGAGTTCTGTAGGGCGCCGAAGAAGTCCTGTCTG
AAGCACTACCAGTGGGAGAAGCTGCGGCGGGCGGAGGTTGACATGGAGAGGGTCCGCCAG
TGGCTGAGGCTGGACGAGCTGGTCGAGCAGGAGAGGAATATACGCCTCGCTATGGCCTCC
AGGGCCGGCGTTTTAGGTTTGATGCTTCACTCGACGTACAACCACGAGGTCATGGAGAGG
ATAACGAAGGCGAACGAAAACGGAAAGGTCAAAGAGGGGTCATGA
Protein sequence:
MSEKKSKQSKADIAKQFDLPERQSKITSLLNQAGQAYCICRSSDSSRFMIACDACEEWYH
GDCINISEREAKYIKNYFCERCREEDPTLKTRFRPQKRENDGDSGRDDRKKKRKEKDHSE
NKSSKRPNKDGCGDCGGCSQTHDCGHCDACEDMYKYGGNNKLKLTCRQRLCVKSKKTSRI
SSSNRIKKKHEREVYEPEESISHLQTSEPRQCYGPQCCKAAQYGSKYCSQQCGMRLATAR
IYQVLPQRIQEWSLSSCVAEQHNRRSLEVVRSGLAKAQAALRALDKQHAEIDEMLQRAKH
ATIEHTDEKEADDETSMYCITCGHEIHSRSAVKHMEKCFIKYEAQASFGSRHRTRIDGQS
MFCDYYNQINGTYCKRLRVMCPEHFKDPKVSDTDVCGCPLVKNVFDPTGEFCRAPKKSCL
KHYQWEKLRRAEVDMERVRQWLRLDELVEQERNIRLAMASRAGVLGLMLHSTYNHEVMER
ITKANENGKVKEGS
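
Consistency check (illustrative): the CDS length of 1485 nucleotides corresponds to 495 codons, that is, 494 amino acids plus one stop codon, which matches the protein sequence above. A minimal Python sketch of that check follows. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE`, `translate`, and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of this record or of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS that ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence listed above.
first_line = "ATGAGTGAAAAGAAGTCTAAACAAAGTAAAGCAGATATTGCTAAACAGTTTGATCTACCC"
last_line = "ATAACGAAGGCGAACGAAAACGGAAAGGTCAAAGAGGGGTCATGA"

print(translate(first_line))          # MSEKKSKQSKADIAKQFDLP
print(translate(last_line))           # ITKANENGKVKEGS
print(expected_protein_length(1485))  # 494
```

Passing the full nucleotide sequence (all lines joined) to `translate` and comparing the result with the joined protein lines extends the same check to the whole record.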