DPGLEAN10586 in OGS1.0

New model in OGS2.0: DPOGS206164
Genomic Position: scaffold2357:- 18193-38648
CDS Length: 5022
Paired RNAseq reads: 1842
Single RNAseq reads: 5103
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA009297 (2e-14)
Best Drosophila hit: brahma associated protein 170kD (2e-65)
Best Human hit: AT-rich interactive domain-containing protein 2 (6e-67)
Best NR hit (blastp): Brahma associated protein 170kD, putative [Aedes aegypti] (1e-110)
Best NR hit (blastx): PREDICTED: similar to AGAP006990-PA [Tribolium castaneum] (2e-119)
Gene Ontology terms:

GO:0008134 transcription factor binding
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0005667 transcription factor complex
GO:0016563 transcription activator activity

InterPro families:

IPR001606 ARID/BRIGHT DNA-binding domain
IPR016024 Armadillo-type fold
IPR007087 Zinc finger, C2H2-type
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like

Orthology group: MCL13908

Nucleotide sequence:

ATGGCAAAATCTCAAATAAATACGAAATCTAGAAATTATGTTCAAGACAAAGAAGCATTT
TTGAAAGAACTTAAGCAGTTTAATGAAAGCAAAAATATCCCTTATAAAATACCGGTTGTT
AATGGTGTAGATATAGATTTGTATCTCTTGTATTCATTAGTTCAACAAAGAGGAGGTCTT
AGCAAAGTTAATCAAAATGATACCTGGGAGACATTTTTGCGTCAGCTCCACTTGCCACAT
CCATGTGTTAATGGGTCTACATTATTGAGGAGAATATATGGAATGTATTTGGAGAAATAT
GAAAGAGCTAAAGGTCCACCAGGTAGAGATGATGACTTAGATATGGACGACGACCCTCGT
CGTGGCAGGGGCGGAGGAATGCCACGCATATCATTTGGTAGCGGCACTTATATATCAGCT
TCGGGTGAACCGCTTCGTACAGGAAATCGTGTGGCTGGTCCATCTGAACGCCTTACATTA
TCATTGTTGTCTCCGATGCCGAATGAGCAGGATTTTGCTGTTAATGTTTGCACAGTACTA
GCCGCTGATCATTCTAACCGGCTTCCGTTGAGCACAACACCTCATATACTGGATTTCTTG
CTAGCTCATGCTGGAGTTTATAATCACTCAAGCCTCCGCGATACAATCGGCCGCTCATAC
TTCGAATCTCGCGGTCGGTATCCCCACGAGTTCTGGTCGGAGCGAGCTGGTGGTGGCGGG
GCGAGGGAACTGGCCGACGAGACGAAGTTCACCGGGGATCAACCTGAACTGGTCGTGCAG
GCATTGGCTGCACATAACACACTCACGGATTGTCTGATGCTGGCTGGTGGTGAAGAAGAG
AATATGGAGAAGATTGTTGAGGATGACACCGAGGATTGGGTGACGGAACCTTCAGAGGAA
GATCAGCTGTTTGCTCCGACGTTACCAGGAGGCGCCACATGTGTTTACACACAACGTGTA
CTACAGATAGCTAGCATCGTACGAAGCCTGTCCTTTCACGAGGAGAATGTACAGTACTTA
GCCAGGAATACCACTCTTATAAGGTTCTTACTGCTATGTGCTAACTGTTGGGTGGGAACT
CTTCGTCAGAGCGGTCTAGACACGCTCGGTAACGTTGCTGCCGAGCTCATTATTAAAGAC
CCCGCGACATGTTTGATATCTCGTCATGTTCTATCAACCATACAATCTGCGCTCGTATCT
CAAGATCGTGCTAGAGTGTTGGCAGCACTGGAGCTCTTGAATAAGTTGGCACAGAACGAA
GTCAACGAGGAAGCATTACTCAAAGCATTGGAATCAAAAGTGTATAGCGACGTGTGTGCT
CTGCTCACCCTCCGTGATATAATGGTGTTGGTCTGCACCCTGGAGTGTGTATACGCCCTT
ACCGGTCTCGGAGACCGCGCGTGTGAGGCGGTCGCACGTGTACCGGGACTGCTACACACA
CTCGTGTCACTGGTTACTGTTGAGGCCCAGAGCTACGGTCCCCGCGCGTGTATCCTCATG
CGTGTGGTGGAGACGGTTAGCGGTCCGCCCGCGGTGGACCACGTGCAACCACACACAGTA
CAGAACAATATCCCCTCCCAACAGGTTCAAGCCCCAAAGCCTCAAGTGGAACCCCCCGTG
GCGTCCCCCGCGGCCGCCACCCACACACAGCCTACTACACTACAACAATCCCACATGCAA
CAACGTACTGTACAAGAAAACGAGCACTTCGCCCAAGCGTGGCTCCGCGCTACGTACGAA
GCTCTGCCCGCGTCGGACAACAGCGCGTGCGATGCTGCGGACGTGTACAGGCAGTACCTC
GCGTGCTGCACCAAACTGGCTCGCAAGGGAGTCATCGCACCCGCGCACTTCCCGCGACTT
GTCAGGACGGTGTTCGGCGGCACGGTTGGGCCAAACACAGTGAGCACTTCTACGGGTGAA
ACACAACATGTGTACATCGGCATACGAGCGAAGAATATAGCAAATAGAAGTAATCCGCCT
GTTGGTCCGTCGTCACCTATATTAAAAGCTCAACTCACTAACAAGCCGAGCGCGACCGTT
GAAACAAAGCCGGTCGTGACGCAACTGCAGACGCCAGCGCAGCCCGCGGACAACAGCAAC
ACGTCGCTTATCAAACACCTGTTAGCGCACAAAGTAAGCGCTGCTCACACACACGTCGCC
CAGAGACAGCAAAGCCAACAACGTCTACCGACCTCTGGAACAGTGGTTGTACAAACATCT
ACAGCGACGTCGCTCCAGAATATGGAGGTGGATCCAGAAGCGCTCATCAAATTACAAAAA
GACGAACCAGTTCAAATACAGATAGACGATCAGGCTCAATTGACTATAAAAACAGCACAG
AACAAGATGCTGGCTGATCTCCTTGAGAAAAAATCAAACCCACCAGTACAGGTTGTACAG
ATGGGACAACAAATAAATGCACCAACTATACAAATAACGGAAACGGGACAAATAGTTCAA
GTTAAATCGGAAAATATGATACAGTTATCGGATTCCGTGCAACCGAGCGCGCCGTTTTTT
CAAATTAAGAACGAGCAAGGACAACTGATACAGATCAAAAACGACCAAGGACAGATTATA
CAACTCAAAAGCGACCAATTACAGGGCATGATTCAAATTAAGAACGACCAAGGTCAGATC
GTACAGATTAAAAATGACAATCTAGCACAGTTATTACAGTCTGGTGTTCTACAGAAGAAT
GAGAAGGATATAGCGGAAAGTGTTGTGACGGATCACTCGTATACGGAACCACCGAACAAG
AAAATCAAAGTCGAAGACAAGGCAGAGAATCCCCCGGAAAGCGTTTCAAAGACTGCTGCC
AATCTGTACGCGGCCTTAGCTGCCAGCCTCCAGGATGAAGACGATCTGCTTCCACCGAAA
CAAGAACCCGTGGATGTTATTCAGCCATCAGTATTAGTCGGTACGCCGGAGAACCAATCA
GTTTTGATACAAGAACCTATATTACAGGTGCAGCAACCAACATTACAAGTGCAACAACCA
ACATTACAAGTGCAGCAACCGTCGTTACAAGTGCAGCAACCCACGATACAGATGCAGCAG
CCGGCGTTACAGGTACAGGTACAGCAGCCCTTACAAGTTCAACAGCCGATGCTGCAAGTT
CAACCAATGGATGTACAGAATATCATGTCCCAGGCTGGACAGATTATATTGCAGGAAAAA
CAGGTCGCTACTCAGCAGACGCAGTTTGTACAACAGCCCATGCAACTTATAGCAGCACCA
AGCACATCACAAGGTGGTTTGAGTTACATAGCGCAAAACATACCCGGTAATATGATGCAG
AAAACTATCATAATAGTTCAGGGTACTGGAGGTGGTCCTCTCACACTAACGGTTAACAAT
CCCTCTGGTTTGGACGAGGCCACGCTAAACTCGCTCATAGCGCAGGCGACTGAGGCGATA
ACACAGCAGCAAATTATTCAGGTGCAGCAACCAACATTACAAGTGCAACAACCAACATTA
CAAGTGCAGCAACCGTCGTTACAAGTGCAGCAACCCACGATACAGATGCAGCAGCCGGCG
TTACAGGTACAGGTACAGCAGCCCTTACAAGTTCAACAGCCGATGCTGCAAGTTCAACCA
ATGGATGTACAGAATATCATGTCCCAGGCTGGACAGATTATATTGCAGGAAAAACAGGTC
GCTACTCAGCAGACGCAGTTTGTACAACAGCCCATGCAACTTATAGCAGCACCAAGCACA
TCACAAGGTGGTTTGAGTTACATAGCGCAAAACATACCCGGTAATATGATGCAGAAAACT
ATCATAATAGTTCAGGGTACTGGAGGTGGTCCTCTCACACTAACGGTTAACAATCCCTCT
GGTTTGGACGAGGCCACGCTAAACTCGCTCATAGCGCAGGCGACTGAGGCGATAACACAG
CAGCAAATTATTCAGAATCCACCTCAACTGACGCCAAGCCAGCAACCTATAATAACATCT
CAACCACAACCTCACAAAGCACAAATCGTTAACCCTCAACAAATCGTCGTCACCCAGAAA
CAACCACCTGGTATAATAAGTACGTCATCTGGCAACCAGATCGTCAGCACTATAGTTGGT
AGCAACCAGCAAATAATCCAAGGGAATCAGCAGTTACTGCAGGGTAACCAACAAATAATA
GCGGTTTCCAACAACCAGCAAATAATAGTTAACACTCCAATGAAACCAACTCATAGAGTT
GTCCAAGCGTCAAGGAACCAGGTTACAACAGTTGTGACCAGTAACCAGGCTGTCGTCACA
ACTGATACAAAAACTGTTCAGAGTTCAGCGAAACCTCAATCGGTGATGCGACAGGTTATA
ACTCGACAACCAGTCATGGTCGGCAATACCAAGATCGGTGACAAAGAAATGGTGGTCACG
CAACCTGTAACTGAGAAGATTCAACAACCAAAGAAGATAGAAACTCCACCGCCACAGACG
CCACTTCAGACACAGACGCCTACGACGCCAGGGTCTGAGGACACGCCCTGGATCTGTCAC
TGGCGGGGATGTGGGAAAACGTTCTCCAGTTCGTCCGAGGTGTTCACTCACGTGGCTCGG
ACCCACTGTCCCAGTACAGCCGGCGGTGAAGCCCCCTGTATGTGGCTAGACTGTGATCGA
GTCCCACGGAAGACATTTGCCTTACTAAACCATCTCACTGACAAACATTGCACTCCAAAT
GCTCTCAAAGCAATATTCAATTCCCGTCGTCACACCGCGAGCGAGGCCGAGTCTGGTAAG
CCCATGTCAGTGGGATATCCGCCGAACGCAGCGTTGGCGGCCTTGAACAAACACGCGGCG
GATATGTTCAATCCCAGGGAGCTTATGGATGAAAACGAAGGCCCAGTTACGAAAAGCATT
CGACTAACAGCGGCACTTATTCTCAGAAACATAGTTATTTACTCAAACACTGGTAGAAGA
TTACTACGTTCATACGAAGCGCATTTGGCGTCAATAGCCCTCAGCAACGTGGAGGCATCG
CGAACTATCTCCCAAGTTCTGTACGATATGAACAATATATGA

Protein sequence:

MAKSQINTKSRNYVQDKEAFLKELKQFNESKNIPYKIPVVNGVDIDLYLLYSLVQQRGGL
SKVNQNDTWETFLRQLHLPHPCVNGSTLLRRIYGMYLEKYERAKGPPGRDDDLDMDDDPR
RGRGGGMPRISFGSGTYISASGEPLRTGNRVAGPSERLTLSLLSPMPNEQDFAVNVCTVL
AADHSNRLPLSTTPHILDFLLAHAGVYNHSSLRDTIGRSYFESRGRYPHEFWSERAGGGG
ARELADETKFTGDQPELVVQALAAHNTLTDCLMLAGGEEENMEKIVEDDTEDWVTEPSEE
DQLFAPTLPGGATCVYTQRVLQIASIVRSLSFHEENVQYLARNTTLIRFLLLCANCWVGT
LRQSGLDTLGNVAAELIIKDPATCLISRHVLSTIQSALVSQDRARVLAALELLNKLAQNE
VNEEALLKALESKVYSDVCALLTLRDIMVLVCTLECVYALTGLGDRACEAVARVPGLLHT
LVSLVTVEAQSYGPRACILMRVVETVSGPPAVDHVQPHTVQNNIPSQQVQAPKPQVEPPV
ASPAAATHTQPTTLQQSHMQQRTVQENEHFAQAWLRATYEALPASDNSACDAADVYRQYL
ACCTKLARKGVIAPAHFPRLVRTVFGGTVGPNTVSTSTGETQHVYIGIRAKNIANRSNPP
VGPSSPILKAQLTNKPSATVETKPVVTQLQTPAQPADNSNTSLIKHLLAHKVSAAHTHVA
QRQQSQQRLPTSGTVVVQTSTATSLQNMEVDPEALIKLQKDEPVQIQIDDQAQLTIKTAQ
NKMLADLLEKKSNPPVQVVQMGQQINAPTIQITETGQIVQVKSENMIQLSDSVQPSAPFF
QIKNEQGQLIQIKNDQGQIIQLKSDQLQGMIQIKNDQGQIVQIKNDNLAQLLQSGVLQKN
EKDIAESVVTDHSYTEPPNKKIKVEDKAENPPESVSKTAANLYAALAASLQDEDDLLPPK
QEPVDVIQPSVLVGTPENQSVLIQEPILQVQQPTLQVQQPTLQVQQPSLQVQQPTIQMQQ
PALQVQVQQPLQVQQPMLQVQPMDVQNIMSQAGQIILQEKQVATQQTQFVQQPMQLIAAP
STSQGGLSYIAQNIPGNMMQKTIIIVQGTGGGPLTLTVNNPSGLDEATLNSLIAQATEAI
TQQQIIQVQQPTLQVQQPTLQVQQPSLQVQQPTIQMQQPALQVQVQQPLQVQQPMLQVQP
MDVQNIMSQAGQIILQEKQVATQQTQFVQQPMQLIAAPSTSQGGLSYIAQNIPGNMMQKT
IIIVQGTGGGPLTLTVNNPSGLDEATLNSLIAQATEAITQQQIIQNPPQLTPSQQPIITS
QPQPHKAQIVNPQQIVVTQKQPPGIISTSSGNQIVSTIVGSNQQIIQGNQQLLQGNQQII
AVSNNQQIIVNTPMKPTHRVVQASRNQVTTVVTSNQAVVTTDTKTVQSSAKPQSVMRQVI
TRQPVMVGNTKIGDKEMVVTQPVTEKIQQPKKIETPPPQTPLQTQTPTTPGSEDTPWICH
WRGCGKTFSSSSEVFTHVARTHCPSTAGGEAPCMWLDCDRVPRKTFALLNHLTDKHCTPN
ALKAIFNSRRHTASEAESGKPMSVGYPPNAALAALNKHAADMFNPRELMDENEGPVTKSI
RLTAALILRNIVIYSNTGRRLLRSYEAHLASIALSNVEASRTISQVLYDMNNI
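
The nucleotide and protein sequences in this record can be cross-checked against each other and against the stated CDS length. The sketch below is one way to do that and is not part of the original record. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the function name translate_cds is hypothetical. A 5022-nucleotide CDS corresponds to 1674 codons, that is, 1673 residues plus a stop codon.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard code is appropriate for this nuclear gene model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present.

    Whitespace and line breaks are ignored, so the 60-column sequence block
    can be pasted in directly. Ambiguous bases such as N raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCAAAATCTCAAATAAATACGAAATCTAGAAATTATGTTCAAGACAAAGAAGCATTT"
last_line = "CGAACTATCTCCCAAGTTCTGTACGATATGAACAATATATGA"

print(translate_cds(first_line))  # start of the protein sequence
print(translate_cds(last_line))   # end of the protein sequence, stop removed
```

Passing the full nucleotide block to translate_cds and comparing the result with the protein block is a quick way to confirm that the two sequences belong to the same gene model.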