New model in OGS2.0 | DPOGS200341  |
---|---|
Genomic Position | scaffold640:- 84927-92891 |
See gene structure | |
CDS Length | 3498 |
Paired RNAseq reads   | 3159 |
Single RNAseq reads   | 8174 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005634 (0.0) |
Best Drosophila hit   | shuttle craft, isoform A (4e-159) |
Best Human hit | transcriptional repressor NF-X1 isoform 1 (2e-124) |
Best NR hit (blastp)   | putative shuttle craft [Heliconius melpomene] (0.0) |
Best NR hit (blastx)   | putative shuttle craft [Heliconius melpomene] (0.0) |
GeneOntology terms    | GO:0003702 RNA polymerase II transcription factor activity GO:0005634 nucleus GO:0045449 regulation of transcription GO:0003697 single-stranded DNA binding GO:0007399 nervous system development GO:0006355 regulation of transcription, DNA-dependent GO:0008270 zinc ion binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005515 protein binding |
InterPro families    | IPR019787 Zinc finger, PHD-finger IPR001841 Zinc finger, RING-type IPR001374 Single-stranded nucleic acid binding R3H IPR019786 Zinc finger, PHD-type, conserved site IPR000967 Zinc finger, NF-X1-type |
Orthology group | MCL13322 |
Nucleotide sequence:
ATGTCTCAGTGGAATAATTCTTACTCCTACAATAATCAGTACCACACACCTAATAATTGG
AATGGGGACTATAACAACCAATATCAGGCATACTATCCAAACGCTCAATATAATGCGAAC
CAATATGTAAGCTTCGATGAATTTTTATCCCAAATGCACATTTCCAACCCTCAAACAAAC
CCATATAATACTCAATATCCAAACTATCCCAATAGTCAGTATTCACAGTTACCTAATTAT
CAAAATGATTCACCCAACCAAAATTCTCAAGCTGTGTATAATTATGAAACTAGTTCAAGC
AATTATAATTACAACAATGAAACATACCAGGGGAATACAGAAGAACAGATTCAGCAAACA
TCAATAGATCAACAATTACCAAGAGAAGTTGTGAAATCCAAACTCATGCCCACTGCCACT
GAGTTTGTCCCTAAGCAATCTAGCACTAGTAATAAAGAACAACATTCAAGCAATACAAAT
AGAAATGCAGGTGATAGTAACAATTCCAAGCCATCAGGTTCAACAAACTGGAGAGAAAGA
CCGCAGAACTCAAAGAATTCTTTTACTTCAGAATCTAGCAACTTTTACCAAAAAAATGTG
AGACCTCAAGAATTAAATAACCGCCATAACAAATATGATTCAAAATACCGCAATCAAGAT
AATCAAAATACCAATGGTGAAAGTAGTGGCCAAAATTCTGCAAACCCTGTGAACAAAGAT
CGTCCAAGTGATGCTAATAATCGCAAAGGCAAATCTAAGAGCAGACCCTTTGAAAACAAC
CAAAATTCGGAACCTAGTTTCCGCAACCAATATAATCAAGGCTACAATAACCAGTCAAAA
CCTAAAAACCGTACTTACAATGGCTGTAATCATGAATCTAACCACAACGAAGATATAAAC
GATATCAGTTCAGCATCCGAATTACCTGAAAATTCTAACAGTGACGAAGGGGGCCAATCA
AAAAGTAATTCTAAGTTTAAAAGCAAAGACTCTGACCCAAGTCGGACTTTTTATAACAGT
GGAATGCCAAAAGAAAGCCAAGATGTAAGAAATGGTAGAAGTGAAGGGTCAGGAAGGAAT
CGTAGGTGGATAGGAAGTCAAAGGTTAAAAGGTGCGGAAAGAGATATTTATGATGATGAA
CAGTATGCAAAGTCTTATTTCCATGCCAAAGAAGAAAGAAATAGGGATAATCTATCAAGT
CCGGCCAAAGGGAAGAGTAAAAACTTGTCTAACCCGGGAGCTAACATAGATATGACACAA
CGTGAGCGCTTAAGCGACCAACTAGACAGGGGCACCCTTGAGTGTCTTGTATGCTGCGAG
AGAGTTAAACAAACTGATCCAGTATGGTATTGCGGTAACTGTTATCATGTATTGCATCTC
CGCTGTATAAGGAAATGGGCTATAAGTAGCATGATTGAAACAAAATGGCGATGTCCAGCA
TGTCAAAATGTGAATCAAGACATACCTCATCCATGCACACTATTGTGCCACCCTGGACCA
TGCCCTCCGTGCCAGGCCACTATAAGCAAGCAATGCGGCTGTGGGGCGGAGACGCGTTCA
GTGTTATGTAGTAGTAAATTACCGCAAGTCTGTGGAAGAGTGTGTAATAAAAAATTAGAA
TGCGGGGTTCATTCATGCACTAAACAGTGCCATGAAGAACAGTGCGACCCCTGCGAGGAA
ATTGTCACACAAGTGTGTCACTGCCCCGCGGCCAAGTCTCGCTCTGTGGCGTGCACGTCA
CACACGGACCGCACCAGCTGGTCCTGCGGCGAGCAGTGCGGGCGTGTGTTGTCATGCGGA
GCGCACGTGTGTCGCGCGACCTGCCACGCCCCGCCCTGCCCTGCGTGTCCCCTCACACCC
GACAACGTACCGGCCTGTCCGTGTGGCAAGACTAGGATCAATAAAGATCAGCGCAAGACT
TGCGTGGACCCCATACCGCTCTGTGGCAACATTTGTGCTAAGCCTCTACCGTGCGGGCCG
GCGGGCGACAAGCATATATGTAAAGAGAGCTGTCACGAAGGTGGTTGCCGCGTCTGTCCC
GACACAACTCTGCTGCAGTGCCGTTGCGGTCACTCGAGCCAAGAGGTGCCGTGTGCTGAT
CTGCCGCAGATGATCAACAACGTATTGTGTCAGAGAAAATGTAACAAGAAGTTGTCATGC
GGCCGTCACCGCTGTCATACCCGGTGCTGTGACTCTGCCACTCATCGCTGCGCCGTCGTC
TGCGGCCGTTCTCTCTCATGTCAGCTCCACAGATGTGAGGAGTTCTGTCACACTGGACAC
TGCGCGCCCTGTCCGAGAGTCAGTTTCGACGAGCTCCATTGTGAGTGTGGCGCGGAGGTG
CTGATGCCGCCGGTCCGCTGCGGCACCAAACCACCCGCCTGCAGCGCCCCCTGCCGCAGG
AGCAGACCCTGCGGTCACCCGCCGCACCACTCGTGCCACTCCGGCGATTGCCCGCCATGC
GTCGTGCTTACAACTAAAATGTGTTACGGAAAGCATGAAGAACGGAAAACAATACCCTGT
TCTCAAGAAGAATTCTCCTGCGGCCTGCCGTGCGGGAAACCTCTTCCTTGCGGTAAACAC
ACTTGTATCAAAATATGTCACAAGGGATCTTGTGATATAAGCACATGTAGTCAACCCTGC
ACATCCAAACGGCCGAGTTGTGGTCACCCGTGTGCTGCGAGGTGTCACTCTAGCGGCGGG
GGCTCCTGTCCCAGTCCGGCGCCCTGTCGCCGGCCAGTACGAGCCACCTGCCAGTGTGGA
CGAAAACAAACAGAGCGATCTTGCTGCGATAACGCCAGGGACTACGCCAAGATGATGAGT
ACCCTAGCGGCAACGAAAATGCAAGAGGGTGGCACTGTGGATATATCAGATGTTCAACGA
CCCGGATCAATGCTCAAAACATTGGAGTGTGACGAGGAGTGCTTCGTAGAGGCTCGGAGC
CGTCGCCTGGCTTTGGCACTTCAGCTGCGAAATCCTGACGTATCGGCCAAGCTCGCGCCT
CGATATAGCGATCATCTACGAACAACGGCCGCCAGGGAACCGACCTTCGCACAACAAATA
CACGACAAGCTGACGGAACTCGTCCAATTAGCCAAAAAGTCCAAACAGAAGACGAGAGCA
CATTCATTCCCATCTATGAACCGCCAAAAGCGTCAGTTCATCCACGAGCTGTGCGAACAT
TTCGGATGCGAAAGTGTTGCGTATGACGCTGAACCTAACAGAAACGTCGTTGCTACAGCC
GACAAGGAAAAGTCTTGGCTGCCGGCTATGAGTGTACTAGAGGTGTTATCCCGGGAGGCT
GGTAAGAGACGTGTACCCGGGCCGGTACTACGAGCGCCCGCCGCCGCCCTACCACCAACA
AAGGAAATACCTTCCACCACCTCAAAGTCCTCATCGGGTGGTTGGGCAACGCTCACATCT
ACTAACGCGTGGGCGGCCCGCAGTCAGCCCAAGAAGGAAGAAACCAAAATTGACTATTTC
GATAACCCTCCAGAGTAA
Protein sequence:
MSQWNNSYSYNNQYHTPNNWNGDYNNQYQAYYPNAQYNANQYVSFDEFLSQMHISNPQTN
PYNTQYPNYPNSQYSQLPNYQNDSPNQNSQAVYNYETSSSNYNYNNETYQGNTEEQIQQT
SIDQQLPREVVKSKLMPTATEFVPKQSSTSNKEQHSSNTNRNAGDSNNSKPSGSTNWRER
PQNSKNSFTSESSNFYQKNVRPQELNNRHNKYDSKYRNQDNQNTNGESSGQNSANPVNKD
RPSDANNRKGKSKSRPFENNQNSEPSFRNQYNQGYNNQSKPKNRTYNGCNHESNHNEDIN
DISSASELPENSNSDEGGQSKSNSKFKSKDSDPSRTFYNSGMPKESQDVRNGRSEGSGRN
RRWIGSQRLKGAERDIYDDEQYAKSYFHAKEERNRDNLSSPAKGKSKNLSNPGANIDMTQ
RERLSDQLDRGTLECLVCCERVKQTDPVWYCGNCYHVLHLRCIRKWAISSMIETKWRCPA
CQNVNQDIPHPCTLLCHPGPCPPCQATISKQCGCGAETRSVLCSSKLPQVCGRVCNKKLE
CGVHSCTKQCHEEQCDPCEEIVTQVCHCPAAKSRSVACTSHTDRTSWSCGEQCGRVLSCG
AHVCRATCHAPPCPACPLTPDNVPACPCGKTRINKDQRKTCVDPIPLCGNICAKPLPCGP
AGDKHICKESCHEGGCRVCPDTTLLQCRCGHSSQEVPCADLPQMINNVLCQRKCNKKLSC
GRHRCHTRCCDSATHRCAVVCGRSLSCQLHRCEEFCHTGHCAPCPRVSFDELHCECGAEV
LMPPVRCGTKPPACSAPCRRSRPCGHPPHHSCHSGDCPPCVVLTTKMCYGKHEERKTIPC
SQEEFSCGLPCGKPLPCGKHTCIKICHKGSCDISTCSQPCTSKRPSCGHPCAARCHSSGG
GSCPSPAPCRRPVRATCQCGRKQTERSCCDNARDYAKMMSTLAATKMQEGGTVDISDVQR
PGSMLKTLECDEECFVEARSRRLALALQLRNPDVSAKLAPRYSDHLRTTAAREPTFAQQI
HDKLTELVQLAKKSKQKTRAHSFPSMNRQKRQFIHELCEHFGCESVAYDAEPNRNVVATA
DKEKSWLPAMSVLEVLSREAGKRRVPGPVLRAPAAALPPTKEIPSTTSKSSSGGWATLTS
TNAWAARSQPKKEETKIDYFDNPPE