New model in OGS2.0 | DPOGS215005  |
---|---|
Genomic Position | scaffold2570:- 7922-16056 |
CDS Length | 3147 |
Paired RNAseq reads   | 1192 |
Single RNAseq reads   | 3355 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002351 (4e-06) |
Best Drosophila hit   | meiotic central spindle (1e-17) |
Best Human hit | zinc finger protein 615 isoform 1 (6e-24) |
Best NR hit (blastp)   | PREDICTED: similar to mCG7830 [Acyrthosiphon pisum] (7e-40) |
Best NR hit (blastx)   | novel KRAB box and zinc finger, C2H2 type domain containing protein [Mus musculus] (2e-37) |
GeneOntology terms    | GO:0005575 cellular_component; GO:0003674 molecular_function; GO:0008150 biological_process |
InterPro families    | IPR015880 Zinc finger, C2H2-like; IPR007087 Zinc finger, C2H2-type; IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL39660 |
Nucleotide sequence:
ATGAATGAAGACATGACAGCTGATGTGGACCCGTTGAGTACGTACTTAAGCTATCCTCTA
AACAACTATTCCTCACCGCTTGAATACCAGAATGCTGTTAAACAGGAGAACTATGGATTC
AGTCTCTATGACAACACTATCACTCAACAGGCACCTGTTGACAATGGTAGCGTAATTAAC
GGAACTGCTGACGACAAAATAGATATATCTAGTGGTAATAATGAACAAAATAACACAAAT
ACAACACCATTAGTTGATTTTAAACCATGTTTACCGACGAGACGGAGAAAGATAAAGAAA
GAAAAGAGTTCATATTTCTCCGAGAAGATCATGGACAAAGATTTCCCGTTCTATGGCTGT
GCTGTTTGCAATATAAATTTTAAGACCCTCCATGAATTAGATTCCCATGTGCCCATTCAT
AAAGACAGGATAACTAGCTACGATCTGAGAATAAAGAATCAGATCAAAAAGAAGAAACTA
CAGAAAGAAATGAGGAAGAATAAGAAAGCGAAGAAAAACATCAAGAAGGAGTTCTCCGTC
GAAATTGACATAAAACCCGAGGACGGGTACATTGGTGATAAGAAAGCCTCGGAGTTTGTA
ACAGAATCCGACAACCAGAGCTCGAAAGAAAACAATGTGAATATGGAACACAATGAGGAA
AACCAGCAGAATGGACCCTCGAAGTTGAGAACTCTGGACAAAAATGATGAGGATCTGGCC
AAGAGACAGGAGAGAATGAATTTACAAAAGATATATAAATGTTTCGCATGCCAAAAGCAG
TTCATGCTGAGCTATTACTTGAAGCTGCATGTACGATCTCATACAGATGAGAAGCCCTAC
TCGTGCAGCCAGTGCGGCCAGGCCTTCATCACCGCGAGCAAGCTCGGGAGGCACAATAAG
AGGTATCACCTCGCGGTCAGGCACCAGTGCAGGATATGCTATAGGTTCTTCTCCAGATTC
GAATTCCTAACTCGTCATTTCGATAAAAAACACCCTGACGATAAATTGGAGGGTGAGCCG
TACGACTACAACGCCATCCTGCCGTACCTGAAAGAATTGGAAGCGGAGCTAAAAGAGAAA
TCGGAATCAAAGAAAGAAGATGACACAGAAGAATCGTGGTCGGAGCCCGGGAAGGACAAC
AAGGATTATATTATCAAAGAAGAATTGCAAATTGAAGAGGTGAAAGTCGATATGGATGTT
GAAATTAAGTTCGAGACGGAAGTAGAGGAGGCGGAGGAGAAGGAGGGAGTCAAGGAGGAG
CTGCCGGGGGAGGGGGACGTGAAGGAGGAGGGGAGCGGCGGGGAGAGTCTGGGTGGAGGG
GAGGAGAGGGGGGGTCAGGATGACAACTCTGATTCCGACTACTTCCCTCCATCCACGTGG
GCGGCGCCCCCGGCCTCCGCCCCCGGTCTCAGCTGTCACGTGTGCAACAAGACGCTCAGC
ACGAGAAGCTACATGCGAATACACATGCGGACACACACCGGGGAGAGGCCCTACAAGTGC
TACGTGTGCGGCGCGGGCTTCATCACCAGCAGCAAGATGAACAGGCACGTCCTCACGCAC
CCGGAGACGTGGGACGAGGACGGAGTGAAACAAGAAAATAAAGAAGGAATGAAGACGGAG
AGGAAAGAAGAAGATGATGACGTCAACGACAAGAAGAACGAAGTGTCTTTGAGCCCTGTC
ATGAGGGATCTACACGCCTCGTGGTCTCTCCTCCGTCTGCTAGACATCTTCCACTTTTCC
TGCGGTGAGGCTCGTCGCTGGGCGTGTCTGGCGTTCGCCTCGGGGTATATGCTGCCCAGC
TTGCAAGCCTTTTCCCATGCAGCTGTGGAATATTTAGGAGCAGAGACCCTGATATTTCCA
CCAGGTCTCTGTGTCCAATTATCAGTACTATCTGGATCGCCTCGTACAAGTCTGTCCGAG
CCTAAATTTAAAGTCAACGCGGACGAAGTCGCGGCCACTGCTAGTATAACGCGCATTGAA
AGTATCGTTAGTGGTGGTAAGTGGTGGAGCTCGGACACGGACTGGCTCGACCGGGGAAGT
ACCACTCTCTCCCAGAAGATCGGCGTGAAAGTGAAGATGAAAGCTGTTCTGGCTAAGTTC
TCGAAGAAACAGAAGAATGGAGAGAAGAAAAGAAGTTCCCAGAAGAGACCGCACGCCTGC
GAGTACTGCCAGAAGAGGTTCCTACACCTGGAGACCTTGCAGGTACATAAGAAGTCCCAC
TCCGGGGAGCTCCTGCGCCACGAGTGTCACTACTGCCTGGCCGAGCTGGGAGACGCGGCC
GCGCTCAGGGTGCACGAGGAGGAACACGAGGGTACACAGGGGACGCAGGGTACACCCGCC
CGACCCTACCTGTGTACCATATGCGGGAACACGTACCAGAAGAGAGAGAGCATGATCTAC
CATCGCAAGGGCCACACGTCCTCCAAGTCGTTCCCGTGCCCGCTGTGCCCGGCCAGCTTC
TCCGCGTCCTGCAAGCTGTCTCGGCACGCCCTCACCCACCGGACGGCGCGCTACACGATG
CGCTTCGAGTGCCCCGTGTGCGCGCACATGTTCAACACCAAGTACCACATCCAGATGCAC
CTCACCACCCACCAGAAGGAGGGCCTTATCCAGGAGGAGAACAGGAACGAGATCCTGGCG
ATGGTGCTGCAGAACGCCCGCAAGATACCCAAGACGCCGGAGGTAGCGGTCAGCGACGCG
CTCCAAACGGACGAGAGGAGCCGCGTGTGCAACATATGCGGCGAGGTGTTCCAGCACTTC
TACTTCCTGGAGGAGCACCTCAAGAGCCACGGCTCCAAGATAGCCATCGACGACAAGGAA
GACGCCAAGAAGCACATCTGCACGGTCTGCAACAAGGGCTTCAAGCTGCACTACTACCTT
AAGCTGCACAGCTTCACTCACACCAAGGAGAAGCCGTTCATCTGCCAGCAGTGCGGGAAG
GGCTTCATCACGAGGGGGAAGCTCAAGCGGCACCTCGAGACCCACACGGGCCTCAAGAAG
TACCAGTGCCACATATGCTACAAGTTCTTCACCAGGCCCAGCTACCTGAGGATCCACGTG
CGGACCATCCACGGCACCCAGGACTACAACTTCCGCTTGGAGAAGCAGTACGGCCTCACC
TCGCTCCCGGTCGCGGACCGGATGTGA
Protein sequence:
MNEDMTADVDPLSTYLSYPLNNYSSPLEYQNAVKQENYGFSLYDNTITQQAPVDNGSVIN
GTADDKIDISSGNNEQNNTNTTPLVDFKPCLPTRRRKIKKEKSSYFSEKIMDKDFPFYGC
AVCNINFKTLHELDSHVPIHKDRITSYDLRIKNQIKKKKLQKEMRKNKKAKKNIKKEFSV
EIDIKPEDGYIGDKKASEFVTESDNQSSKENNVNMEHNEENQQNGPSKLRTLDKNDEDLA
KRQERMNLQKIYKCFACQKQFMLSYYLKLHVRSHTDEKPYSCSQCGQAFITASKLGRHNK
RYHLAVRHQCRICYRFFSRFEFLTRHFDKKHPDDKLEGEPYDYNAILPYLKELEAELKEK
SESKKEDDTEESWSEPGKDNKDYIIKEELQIEEVKVDMDVEIKFETEVEEAEEKEGVKEE
LPGEGDVKEEGSGGESLGGGEERGGQDDNSDSDYFPPSTWAAPPASAPGLSCHVCNKTLS
TRSYMRIHMRTHTGERPYKCYVCGAGFITSSKMNRHVLTHPETWDEDGVKQENKEGMKTE
RKEEDDDVNDKKNEVSLSPVMRDLHASWSLLRLLDIFHFSCGEARRWACLAFASGYMLPS
LQAFSHAAVEYLGAETLIFPPGLCVQLSVLSGSPRTSLSEPKFKVNADEVAATASITRIE
SIVSGGKWWSSDTDWLDRGSTTLSQKIGVKVKMKAVLAKFSKKQKNGEKKRSSQKRPHAC
EYCQKRFLHLETLQVHKKSHSGELLRHECHYCLAELGDAAALRVHEEEHEGTQGTQGTPA
RPYLCTICGNTYQKRESMIYHRKGHTSSKSFPCPLCPASFSASCKLSRHALTHRTARYTM
RFECPVCAHMFNTKYHIQMHLTTHQKEGLIQEENRNEILAMVLQNARKIPKTPEVAVSDA
LQTDERSRVCNICGEVFQHFYFLEEHLKSHGSKIAIDDKEDAKKHICTVCNKGFKLHYYL
KLHSFTHTKEKPFICQQCGKGFITRGKLKRHLETHTGLKKYQCHICYKFFTRPSYLRIHV
RTIHGTQDYNFRLEKQYGLTSLPVADRM
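
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that ends in a single stop codon. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration, and only the first and last lines of the nucleotide sequence are embedded to keep the example short.

```python
# Sketch: translate a CDS with the standard genetic code and check that
# the CDS length agrees with the protein length plus one stop codon.
# Assumption: NCBI translation table 1 applies to this gene model.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def lengths_consistent(cds_length: int, protein_length: int) -> bool:
    """True if the CDS encodes exactly protein_length residues plus one stop codon."""
    return cds_length == 3 * (protein_length + 1)


# First and last lines of the nucleotide sequence in this record.
FIRST_SEGMENT = "ATGAATGAAGACATGACAGCTGATGTGGACCCGTTGAGTACGTACTTAAGCTATCCTCTA"
LAST_SEGMENT = "TCGCTCCCGGTCGCGGACCGGATGTGA"

if __name__ == "__main__":
    print(translate(FIRST_SEGMENT))        # MNEDMTADVDPLSTYLSYPL
    print(translate(LAST_SEGMENT))         # SLPVADRM
    print(lengths_consistent(3147, 1048))  # True
```

Each full sequence line holds 60 nucleotides, so line breaks fall on codon boundaries and individual lines can be translated in frame. To check the whole record, join all nucleotide lines into one string before calling `translate`.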