New model in OGS2.0 | DPOGS213643  |
---|---|
Genomic Position | scaffold543:- 50335-52712 |
See gene structure | |
CDS Length | 1542 |
Paired RNAseq reads   | 58 |
Single RNAseq reads   | 166 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004584 (2e-169) |
Best Drosophila hit   | lame duck (3e-58) |
Best Human hit | zinc finger protein GLIS3 isoform b (4e-55) |
Best NR hit (blastp)   | zinc finger protein transcription factor lame duck, putative [Pediculus humanus corporis] (4e-62) |
Best NR hit (blastx)   | GF18275 [Drosophila ananassae] (1e-61) |
GeneOntology terms    | GO:0005622 intracellular GO:0008270 zinc ion binding GO:0003676 nucleic acid binding GO:0008150 biological_process |
InterPro families    | IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR015880 Zinc finger, C2H2-like IPR007087 Zinc finger, C2H2-type |
Orthology group | MCL39870 |
Nucleotide sequence:
ATGGCGGAATACGACCTCATGGCGTACGACCACGACATGTTCCAGGACCGAGGAGACGTC
ACCGACCTTCACGCTTACTACATGACGCAGTTTAACCTGGAACTAGAAAACGGGAAAGGA
AAGCCGAAAAGTGACGGCTTCAAGTTTGGAATCGAAGAGAGCTTCGACTTCGAAGTTGTG
GACAACTGGAACGGCTGGAACTGCAACACCAAAACAGACTTCGACATCATAGACGAAGTC
ATCGACAACAACGTCAGCGAGCAAAATATCAAAGAGGTACTTCTAGACCTGGACACCATT
GAGTTTGATGATAACAGCTTCAAGTCAGCTTCTTGCGAAGGAAGTAAGAGAAGTGAAGCA
AATTATGAAACGAACGACTACATAGACGACGAGTGTCTCATCGACGAGCTGTGCAGAGAG
GAAGGAGAGTCGTGTCGTCTGACGCCAGACGTGTTTAGCGATGACTGCACCAACATCTTG
AACAACGACACGCAGCTGCCATCCATAGAGACCGCCTTCTCTAAAAGATACGGAGCTTTT
AACAACTTGGACAGTTATAACACACAAAATGCTTACGAAGCGAACCCATCTCAGAACACA
AACACACCGACTGTCAGCAGTTTGGAACATTACAGTTATCCTAATAATATTCTTCACAAC
TTGGATAATCCAAAGAACTACGAGCTCCCGGACACACCGACAAGCTGCCAGGATTTTAAT
TTTGACAGAAATATTAGAAAAGTATCGATATCAGATTCTATCGAGAGTGACGTCCAGAGC
GCCGGTTATTATGACGACAACTCAGAAAACTTAGACGAAGACGACCTGTTCATAAACCTC
GATGACTTCGGAATCGCCTTTGAAACGGAGAATGAAGGAAACAACTGCGAAAAGAGCCAT
CACGCCGAGAAAAGAAATGACAAAGACAAAACTCAAGGCGAAAGGGTTTGTTTATGGGAG
CACTGTTTTGAAAGATATCCAAATCAAAACACACTCGTGGAGCACATAGAGCGCGCACAC
GTCAACACTTACAAAGGTGACGAGTTCAGTTGTTTGTGGCGGGACTGTGCGCGCGGTCGC
CGTCCGTTCAACGCGCGTTACAAGCTACTCATACATATGAGAGTACACTCGGGACACAAA
CCCAATAGATGTCATCATCCCGGCTGCGGCAAAGCGTTCTCTCGCCTGGAAAACCTTAAG
ATCCACGTGAGATCCCACACGGGCGAGCGACCCTACGCCTGTCCCGCTCCTCACTGCAGG
AAGGCCTTCTCTAATTCCTCAGACCGAGCCAAGCATCAGCGAACTCACTTTAATGCCAGA
CCGTACGCGTGCGGCGCTGCAGGCTGTAACAAGCGCTACACGGACCCCTCCTCGCTGCGG
AAGCACGTCAAGTCCCACCCGCATGCTCCCCCTCGCGCGCGCCTCCCCCCGGCCAGACCT
CCTCCTCACGAGGAACAGCTTGTGCCGAGTTCACCCGCCAAACTCGACACACTCAGATGT
ATCAGAGACAAACTCACCGTTCCCCGACTACGGAACATGTAG
Protein sequence:
MAEYDLMAYDHDMFQDRGDVTDLHAYYMTQFNLELENGKGKPKSDGFKFGIEESFDFEVV
DNWNGWNCNTKTDFDIIDEVIDNNVSEQNIKEVLLDLDTIEFDDNSFKSASCEGSKRSEA
NYETNDYIDDECLIDELCREEGESCRLTPDVFSDDCTNILNNDTQLPSIETAFSKRYGAF
NNLDSYNTQNAYEANPSQNTNTPTVSSLEHYSYPNNILHNLDNPKNYELPDTPTSCQDFN
FDRNIRKVSISDSIESDVQSAGYYDDNSENLDEDDLFINLDDFGIAFETENEGNNCEKSH
HAEKRNDKDKTQGERVCLWEHCFERYPNQNTLVEHIERAHVNTYKGDEFSCLWRDCARGR
RPFNARYKLLIHMRVHSGHKPNRCHHPGCGKAFSRLENLKIHVRSHTGERPYACPAPHCR
KAFSNSSDRAKHQRTHFNARPYACGAAGCNKRYTDPSSLRKHVKSHPHAPPRARLPPARP
PPHEEQLVPSSPAKLDTLRCIRDKLTVPRLRNM