DPGLEAN04561 in OGS1.0

New model in OGS2.0  DPOGS200341
Genomic Position  scaffold640:- 84927-92891
CDS Length  3498
Paired RNAseq reads  3159
Single RNAseq reads  8174
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005634 (0.0)
Best Drosophila hit  shuttle craft, isoform A (4e-159)
Best Human hit  transcriptional repressor NF-X1 isoform 1 (2e-124)
Best NR hit (blastp)  putative shuttle craft [Heliconius melpomene] (0.0)
Best NR hit (blastx)  putative shuttle craft [Heliconius melpomene] (0.0)
GeneOntology terms
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0003697 single-stranded DNA binding
GO:0007399 nervous system development
GO:0006355 regulation of transcription, DNA-dependent
GO:0008270 zinc ion binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005515 protein binding
InterPro families
IPR019787 Zinc finger, PHD-finger
IPR001841 Zinc finger, RING-type
IPR001374 Single-stranded nucleic acid binding R3H
IPR019786 Zinc finger, PHD-type, conserved site
IPR000967 Zinc finger, NF-X1-type
Orthology group  MCL13322

Nucleotide sequence:

ATGTCTCAGTGGAATAATTCTTACTCCTACAATAATCAGTACCACACACCTAATAATTGG
AATGGGGACTATAACAACCAATATCAGGCATACTATCCAAACGCTCAATATAATGCGAAC
CAATATGTAAGCTTCGATGAATTTTTATCCCAAATGCACATTTCCAACCCTCAAACAAAC
CCATATAATACTCAATATCCAAACTATCCCAATAGTCAGTATTCACAGTTACCTAATTAT
CAAAATGATTCACCCAACCAAAATTCTCAAGCTGTGTATAATTATGAAACTAGTTCAAGC
AATTATAATTACAACAATGAAACATACCAGGGGAATACAGAAGAACAGATTCAGCAAACA
TCAATAGATCAACAATTACCAAGAGAAGTTGTGAAATCCAAACTCATGCCCACTGCCACT
GAGTTTGTCCCTAAGCAATCTAGCACTAGTAATAAAGAACAACATTCAAGCAATACAAAT
AGAAATGCAGGTGATAGTAACAATTCCAAGCCATCAGGTTCAACAAACTGGAGAGAAAGA
CCGCAGAACTCAAAGAATTCTTTTACTTCAGAATCTAGCAACTTTTACCAAAAAAATGTG
AGACCTCAAGAATTAAATAACCGCCATAACAAATATGATTCAAAATACCGCAATCAAGAT
AATCAAAATACCAATGGTGAAAGTAGTGGCCAAAATTCTGCAAACCCTGTGAACAAAGAT
CGTCCAAGTGATGCTAATAATCGCAAAGGCAAATCTAAGAGCAGACCCTTTGAAAACAAC
CAAAATTCGGAACCTAGTTTCCGCAACCAATATAATCAAGGCTACAATAACCAGTCAAAA
CCTAAAAACCGTACTTACAATGGCTGTAATCATGAATCTAACCACAACGAAGATATAAAC
GATATCAGTTCAGCATCCGAATTACCTGAAAATTCTAACAGTGACGAAGGGGGCCAATCA
AAAAGTAATTCTAAGTTTAAAAGCAAAGACTCTGACCCAAGTCGGACTTTTTATAACAGT
GGAATGCCAAAAGAAAGCCAAGATGTAAGAAATGGTAGAAGTGAAGGGTCAGGAAGGAAT
CGTAGGTGGATAGGAAGTCAAAGGTTAAAAGGTGCGGAAAGAGATATTTATGATGATGAA
CAGTATGCAAAGTCTTATTTCCATGCCAAAGAAGAAAGAAATAGGGATAATCTATCAAGT
CCGGCCAAAGGGAAGAGTAAAAACTTGTCTAACCCGGGAGCTAACATAGATATGACACAA
CGTGAGCGCTTAAGCGACCAACTAGACAGGGGCACCCTTGAGTGTCTTGTATGCTGCGAG
AGAGTTAAACAAACTGATCCAGTATGGTATTGCGGTAACTGTTATCATGTATTGCATCTC
CGCTGTATAAGGAAATGGGCTATAAGTAGCATGATTGAAACAAAATGGCGATGTCCAGCA
TGTCAAAATGTGAATCAAGACATACCTCATCCATGCACACTATTGTGCCACCCTGGACCA
TGCCCTCCGTGCCAGGCCACTATAAGCAAGCAATGCGGCTGTGGGGCGGAGACGCGTTCA
GTGTTATGTAGTAGTAAATTACCGCAAGTCTGTGGAAGAGTGTGTAATAAAAAATTAGAA
TGCGGGGTTCATTCATGCACTAAACAGTGCCATGAAGAACAGTGCGACCCCTGCGAGGAA
ATTGTCACACAAGTGTGTCACTGCCCCGCGGCCAAGTCTCGCTCTGTGGCGTGCACGTCA
CACACGGACCGCACCAGCTGGTCCTGCGGCGAGCAGTGCGGGCGTGTGTTGTCATGCGGA
GCGCACGTGTGTCGCGCGACCTGCCACGCCCCGCCCTGCCCTGCGTGTCCCCTCACACCC
GACAACGTACCGGCCTGTCCGTGTGGCAAGACTAGGATCAATAAAGATCAGCGCAAGACT
TGCGTGGACCCCATACCGCTCTGTGGCAACATTTGTGCTAAGCCTCTACCGTGCGGGCCG
GCGGGCGACAAGCATATATGTAAAGAGAGCTGTCACGAAGGTGGTTGCCGCGTCTGTCCC
GACACAACTCTGCTGCAGTGCCGTTGCGGTCACTCGAGCCAAGAGGTGCCGTGTGCTGAT
CTGCCGCAGATGATCAACAACGTATTGTGTCAGAGAAAATGTAACAAGAAGTTGTCATGC
GGCCGTCACCGCTGTCATACCCGGTGCTGTGACTCTGCCACTCATCGCTGCGCCGTCGTC
TGCGGCCGTTCTCTCTCATGTCAGCTCCACAGATGTGAGGAGTTCTGTCACACTGGACAC
TGCGCGCCCTGTCCGAGAGTCAGTTTCGACGAGCTCCATTGTGAGTGTGGCGCGGAGGTG
CTGATGCCGCCGGTCCGCTGCGGCACCAAACCACCCGCCTGCAGCGCCCCCTGCCGCAGG
AGCAGACCCTGCGGTCACCCGCCGCACCACTCGTGCCACTCCGGCGATTGCCCGCCATGC
GTCGTGCTTACAACTAAAATGTGTTACGGAAAGCATGAAGAACGGAAAACAATACCCTGT
TCTCAAGAAGAATTCTCCTGCGGCCTGCCGTGCGGGAAACCTCTTCCTTGCGGTAAACAC
ACTTGTATCAAAATATGTCACAAGGGATCTTGTGATATAAGCACATGTAGTCAACCCTGC
ACATCCAAACGGCCGAGTTGTGGTCACCCGTGTGCTGCGAGGTGTCACTCTAGCGGCGGG
GGCTCCTGTCCCAGTCCGGCGCCCTGTCGCCGGCCAGTACGAGCCACCTGCCAGTGTGGA
CGAAAACAAACAGAGCGATCTTGCTGCGATAACGCCAGGGACTACGCCAAGATGATGAGT
ACCCTAGCGGCAACGAAAATGCAAGAGGGTGGCACTGTGGATATATCAGATGTTCAACGA
CCCGGATCAATGCTCAAAACATTGGAGTGTGACGAGGAGTGCTTCGTAGAGGCTCGGAGC
CGTCGCCTGGCTTTGGCACTTCAGCTGCGAAATCCTGACGTATCGGCCAAGCTCGCGCCT
CGATATAGCGATCATCTACGAACAACGGCCGCCAGGGAACCGACCTTCGCACAACAAATA
CACGACAAGCTGACGGAACTCGTCCAATTAGCCAAAAAGTCCAAACAGAAGACGAGAGCA
CATTCATTCCCATCTATGAACCGCCAAAAGCGTCAGTTCATCCACGAGCTGTGCGAACAT
TTCGGATGCGAAAGTGTTGCGTATGACGCTGAACCTAACAGAAACGTCGTTGCTACAGCC
GACAAGGAAAAGTCTTGGCTGCCGGCTATGAGTGTACTAGAGGTGTTATCCCGGGAGGCT
GGTAAGAGACGTGTACCCGGGCCGGTACTACGAGCGCCCGCCGCCGCCCTACCACCAACA
AAGGAAATACCTTCCACCACCTCAAAGTCCTCATCGGGTGGTTGGGCAACGCTCACATCT
ACTAACGCGTGGGCGGCCCGCAGTCAGCCCAAGAAGGAAGAAACCAAAATTGACTATTTC
GATAACCCTCCAGAGTAA

Protein sequence:

MSQWNNSYSYNNQYHTPNNWNGDYNNQYQAYYPNAQYNANQYVSFDEFLSQMHISNPQTN
PYNTQYPNYPNSQYSQLPNYQNDSPNQNSQAVYNYETSSSNYNYNNETYQGNTEEQIQQT
SIDQQLPREVVKSKLMPTATEFVPKQSSTSNKEQHSSNTNRNAGDSNNSKPSGSTNWRER
PQNSKNSFTSESSNFYQKNVRPQELNNRHNKYDSKYRNQDNQNTNGESSGQNSANPVNKD
RPSDANNRKGKSKSRPFENNQNSEPSFRNQYNQGYNNQSKPKNRTYNGCNHESNHNEDIN
DISSASELPENSNSDEGGQSKSNSKFKSKDSDPSRTFYNSGMPKESQDVRNGRSEGSGRN
RRWIGSQRLKGAERDIYDDEQYAKSYFHAKEERNRDNLSSPAKGKSKNLSNPGANIDMTQ
RERLSDQLDRGTLECLVCCERVKQTDPVWYCGNCYHVLHLRCIRKWAISSMIETKWRCPA
CQNVNQDIPHPCTLLCHPGPCPPCQATISKQCGCGAETRSVLCSSKLPQVCGRVCNKKLE
CGVHSCTKQCHEEQCDPCEEIVTQVCHCPAAKSRSVACTSHTDRTSWSCGEQCGRVLSCG
AHVCRATCHAPPCPACPLTPDNVPACPCGKTRINKDQRKTCVDPIPLCGNICAKPLPCGP
AGDKHICKESCHEGGCRVCPDTTLLQCRCGHSSQEVPCADLPQMINNVLCQRKCNKKLSC
GRHRCHTRCCDSATHRCAVVCGRSLSCQLHRCEEFCHTGHCAPCPRVSFDELHCECGAEV
LMPPVRCGTKPPACSAPCRRSRPCGHPPHHSCHSGDCPPCVVLTTKMCYGKHEERKTIPC
SQEEFSCGLPCGKPLPCGKHTCIKICHKGSCDISTCSQPCTSKRPSCGHPCAARCHSSGG
GSCPSPAPCRRPVRATCQCGRKQTERSCCDNARDYAKMMSTLAATKMQEGGTVDISDVQR
PGSMLKTLECDEECFVEARSRRLALALQLRNPDVSAKLAPRYSDHLRTTAAREPTFAQQI
HDKLTELVQLAKKSKQKTRAHSFPSMNRQKRQFIHELCEHFGCESVAYDAEPNRNVVATA
DKEKSWLPAMSVLEVLSREAGKRRVPGPVLRAPAAALPPTKEIPSTTSKSSSGGWATLTS
TNAWAARSQPKKEETKIDYFDNPPE
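
Consistency check (illustrative sketch, not part of the original record):

The reported CDS length and the two sequences can be cross-checked with a few lines of Python. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame 0 and ends with a single stop codon. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced here. The protein length of 1165 residues is counted from the sequence as displayed above (19 lines of 60 plus a final line of 25).

```python
# Standard genetic code in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def lengths_consistent(cds_length: int, protein_length: int) -> bool:
    """True if the CDS is exactly the protein's codons plus one stop codon."""
    return cds_length == 3 * (protein_length + 1)


# First five codons and the final line of the nucleotide sequence above.
first_codons = translate("ATGTCTCAGTGGAAT")   # "MSQWN"
last_codons = translate("GATAACCCTCCAGAGTAA")  # "DNPPE", then the TAA stop
# Reported CDS length (3498) against the counted protein length (1165).
length_ok = lengths_consistent(3498, 1165)     # True
```

Under these assumptions, the start and end of the translation match the first and last residues of the protein sequence, and 3 x (1165 + 1) = 3498 agrees with the reported CDS length. To check the whole sequence, paste the full nucleotide string into `translate` and compare the result with the protein string.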