New model in OGS2.0 | DPOGS202118  |
---|---|
Genomic Position | scaffold495:+ 63325-72850 |
CDS Length | 3243 |
Paired RNAseq reads   | 302 |
Single RNAseq reads   | 704 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006961 (3e-145) |
Best Drosophila hit   | CG32778 (2e-40) |
Best Human hit | myelin transcription factor 1-like protein (6e-45) |
Best NR hit (blastp)   | PREDICTED: similar to CG32778-PA [Nasonia vitripennis] (3e-147) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC004325 [Tribolium castaneum] (2e-124) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0008270 zinc ion binding; GO:0030154 cell differentiation; GO:0005515 protein binding; GO:0007275 multicellular organismal development; GO:0016563 transcription activator activity; GO:0046872 metal ion binding; GO:0006355 regulation of transcription, DNA-dependent; GO:0005634 nucleus; GO:0007399 nervous system development |
InterPro families   | IPR002515 Zinc finger, C2HC-type |
Orthology group | MCL11015 |
Nucleotide sequence:
AGTCACACAACGTTGTCGCAAATTAATAAAAGACGTTTAGAATTTGGAGTTGATATAGAA
ACGTTCAGAGTGAACGATCGAAGACAAGATCATAGTGACAGTGAAGCCTCAATGGACGAA
GAAGCAGCGAAACGGCGACGACGACTCCTTACAGAAGAAAGAAGAGAAGAAACAGATTCG
TACTATAGAAATTATAACACTCATCGAACTAACATCCTACGTCACTCCGACGAGGATGAC
GATTCACTTGTTCAAAAAACTCAAAACGCTCTAAAAACTTTAAGTTGCTGGGCACGAGAC
CACGCTGCTATCCCTAACGATCAAAGCGAAGACGCTAACGACTTTGATAACATTTTCAAC
GATAAACGTTCCGACAAAATGTTGCCATCATCACCATCTTTGTCTTCTGACAGTGCTGAG
AACTCACGCCGAAGCTTCTCTTTGCATTGTCAAAACGACGAACTAAATGCTGCCATGAAT
GAAAGGATGGATCCCAGGGAAAATACTGAACGGCCACCAAAAATGGAAAATGACAACAGT
ATGGACAATTACAAGATAGACAAAAGTCCTAATCCTTATGATGCTTCTAATTTCGATGAA
TTAGCGGATTCTTCATCAAACGAACTAGAAATAGACATGTCAGACAGAAACGAAGACAAA
TACGAAGAGGACAGAGAAGCGAAACGTAGGCAGGCTGAAATAATCTCAGTTAACAAGCAA
TCTCTTTACAACGCTTACAAAGCTGCCACAGCTTCATTGCCGTATTCGACCCAATCCGCC
TTCAAGCCACCAGCGGAAGTGAAACACAAGATCCACGGCTCCAGTTTTCCCAGCGAGCCA
TTCGGTGGCTATTCTAATGATCACGAGAAGAGTGCGATTCATAAGGGTTCGAAGCAGTAC
ACGGTGCTGCAGCCAGCCGCTGCTGGGTCAAGGGCTGCTACAGCGTTGCAGGAGGCTCGC
ACTGTACCATCAGCTCAACCGGCGCGAGACTTGCGACCCATCAACCCTCTCTCCCCACCA
GCCGTCCGAGAAGGCAACAAGTGTCCGACTCCTGGTTGCAATGGGCAGGGCCACGTTACA
GGCCTCTATACGCATCATCGAAGTATTTATACATTTCATCCAGTTCAATTGCCCGCACCG
GCTCCTGTGACTCAAGTGTCAGCCAGCACGTCAGATAGCCACACTTCAGACGGTTCCCGC
GATCGGGTTCCAGCGGCGTCGCCGCAAACTCCGGCCGTGAAGCGCGAAGCCCCCGAACTG
CTGGTGCCGAAGCGTGAGGCCGCCGAGCCCGAGCGCGACTCCCCCGGCATGGAGACGCGC
CACGCCGGCTACGGCGCGCCGCCCGATCAGCGCTCACCATACGAACGACCGCCTGACGAC
CACGTACGGTCATACAGTCAAATGAACGAAGCTCGGTACGGATACGAAGCTAGGTGCTAT
GAGGGCGCCCCAGCTTTTGAGAGATATGACCCAGCTCAATGCCCTCAGAGGCCTTACGGT
TGGGAAGAAGAACGATACCATGACCCTCATTTGCCAACGCCAATGAAAACGGACCAATCA
GAACAGGAAACTAATTCTGGACCTATATATCCTAGACCAATGTACCATTACGAAGCTGGC
GGTGTAGGCGCCGTGGGCGGGGTGGGCGCTATGGGCCCAGGCGTTCCCCCCGGCTTCTCC
GCTATCAATCTCTCAGTGAAGATAGCCGCAGCTCAGGCTCAACGTCCTCGAAGTCCCACA
CCCAGGGATCCTCGTGATCCGCGTCCGGCTATAGATCTATCCACATCTAGTGGCAGTCCA
CAGGGTCCATATGCGTCACCGGTATACACGAGCGCCGGTGGTGGCAGTGGGGGCGGTGCA
CGGGGAAGCCCGCAGCCGGGCGCTTCGCCCCAACTTACGGCAAGTCCCCAAGTTCCCAGT
CCACAAGGCCAGACCCTCGACCTTAGTGTGTCCCGTTTACCACATAGTAGAAGTTTTCCG
GGTGGTGTTTCATACAGTCGAGAATCAACGCCGGATAGCGGTGGAAGCCATCCATATCTT
GAAGCATACCATCGCGACACAGCCGGGTACGGTGGTGTAAGCCCTCACCCGGTAGCCGGA
TACGGTCTTGCGCAGCCGGATTACGCAGCTGCTGCTGCCGCTGCCGGATACGGTGGCTAT
CAGTACCAATGCGGGGCATACCCACCCCCGCCCGCGTACCCCCCGCACGCGCCGCCGTAT
TCACCACCGTGCTATATGCCGCCGCCGCACGCACCGCACGACAAGCCCAAGGATAGTCTG
TCCGGGTGTCCTCGTGCTGATCGCTCTCAACTGCAGCCACATTCTCAAGAACTGAAGTGT
CCCACACCAGGTTGCGACGGCTCCGGGCACGTTACCGGGAACTACTCCTCCCATCGATCA
CTATCAGGTTGCCCCAGGGCTAATAAACCGAAAAGCAAGCCCAGGGATGGCCAAGATTCT
GAACCGCTCAGCGCGTCGGGCTGCCCTATTGCAAATCGGAACAAAATGCGGGTTCTAGAA
AGCGGCGGCACAGTTGAGCAGCACAAAGCGGCAGTGGCGGCAGCGGCATCCGCTATTAAA
TTCGATGGCGTGAACTGTCCTACCCCGGGATGTGATGGATCGGGACATATAAACGGTTCG
TTTCTAACCCATCGTTCGCTATCCGGCTGTCCCGTAGCCGGTGCAACCACACCGACGCCT
CAACCAAAGAAACCGAAATATCCTGATGATATCACTCCGCTATACCCCAAGCCCTATTCA
GGTATGGATATTAACATGCAGACAGGAAACGGCGAAGATTTAATGACACTGGAGCAAGAA
ATTACTGAACTCCAGCGTGAAAATGCAAGAGTGGAATCACAGATGATGCGTCTGAAATCG
GACATAAACGCGATGGAGTCACACTTGAGCCATGGAGAAAGGGAGAATCAGCTCATCATT
CATCGCAACAGCAATCTGAATGAATATTACGAAAGCCTTCGGAACAATGTGATCACGTTG
CTGGAGCACGTTAAGATACCAGGAGGAGGTACGGTGCCCGTATCAACTGCCCCCGGAACT
CCCGGAGCTGCACCCCCGACTGGCCCTGGTGATAAACCCGCCCACGATAACTTCGACTCT
TATCTCACCAAGCTGCAGACCCTATGCTCCCCGGAAGGATACTGCCCCGATGAGAATCGA
CCGATCTATGAGACCGTTAAAAACGCGCTCCAAGACTTCACAGTGCTACCAACGCCGATA
TAA
Protein sequence:
SHTTLSQINKRRLEFGVDIETFRVNDRRQDHSDSEASMDEEAAKRRRRLLTEERREETDS
YYRNYNTHRTNILRHSDEDDDSLVQKTQNALKTLSCWARDHAAIPNDQSEDANDFDNIFN
DKRSDKMLPSSPSLSSDSAENSRRSFSLHCQNDELNAAMNERMDPRENTERPPKMENDNS
MDNYKIDKSPNPYDASNFDELADSSSNELEIDMSDRNEDKYEEDREAKRRQAEIISVNKQ
SLYNAYKAATASLPYSTQSAFKPPAEVKHKIHGSSFPSEPFGGYSNDHEKSAIHKGSKQY
TVLQPAAAGSRAATALQEARTVPSAQPARDLRPINPLSPPAVREGNKCPTPGCNGQGHVT
GLYTHHRSIYTFHPVQLPAPAPVTQVSASTSDSHTSDGSRDRVPAASPQTPAVKREAPEL
LVPKREAAEPERDSPGMETRHAGYGAPPDQRSPYERPPDDHVRSYSQMNEARYGYEARCY
EGAPAFERYDPAQCPQRPYGWEEERYHDPHLPTPMKTDQSEQETNSGPIYPRPMYHYEAG
GVGAVGGVGAMGPGVPPGFSAINLSVKIAAAQAQRPRSPTPRDPRDPRPAIDLSTSSGSP
QGPYASPVYTSAGGGSGGGARGSPQPGASPQLTASPQVPSPQGQTLDLSVSRLPHSRSFP
GGVSYSRESTPDSGGSHPYLEAYHRDTAGYGGVSPHPVAGYGLAQPDYAAAAAAAGYGGY
QYQCGAYPPPPAYPPHAPPYSPPCYMPPPHAPHDKPKDSLSGCPRADRSQLQPHSQELKC
PTPGCDGSGHVTGNYSSHRSLSGCPRANKPKSKPRDGQDSEPLSASGCPIANRNKMRVLE
SGGTVEQHKAAVAAAASAIKFDGVNCPTPGCDGSGHINGSFLTHRSLSGCPVAGATTPTP
QPKKPKYPDDITPLYPKPYSGMDINMQTGNGEDLMTLEQEITELQRENARVESQMMRLKS
DINAMESHLSHGERENQLIIHRNSNLNEYYESLRNNVITLLEHVKIPGGGTVPVSTAPGT
PGAAPPTGPGDKPAHDNFDSYLTKLQTLCSPEGYCPDENRPIYETVKNALQDFTVLPTPI
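
The protein sequence above corresponds to an in-frame translation of the nucleotide sequence (3243 nt, of which the final TAA is the stop codon, giving 1080 residues). The following is a minimal sketch of how that correspondence could be checked. It assumes the standard genetic code and a CDS read in frame from its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; drop one terminal stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in the record
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First line of the nucleotide sequence in this record.
first_line = "AGTCACACAACGTTGTCGCAAATTAATAAAAGACGTTTAGAATTTGGAGTTGATATAGAA"
print(translate(first_line))  # SHTTLSQINKRRLEFGVDIE
```

Passing the full nucleotide block (line breaks included) to `translate` should reproduce the full protein block, since whitespace is stripped before codons are read.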