New model in OGS2.0 | DPOGS206164  |
---|---|
Genomic Position | scaffold2357:- 18193-38648 |
See gene structure | |
CDS Length | 5022 |
Paired RNAseq reads   | 1842 |
Single RNAseq reads   | 5103 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009297 (2e-14) |
Best Drosophila hit   | brahma associated protein 170kD (2e-65) |
Best Human hit | AT-rich interactive domain-containing protein 2 (6e-67) |
Best NR hit (blastp)   | Brahma associated protein 170kD, putative [Aedes aegypti] (1e-110) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP006990-PA [Tribolium castaneum] (2e-119) |
GeneOntology terms    | GO:0008134 transcription factor binding GO:0045944 positive regulation of transcription from RNA polymerase II promoter GO:0005667 transcription factor complex GO:0016563 transcription activator activity |
InterPro families    | IPR001606 ARID/BRIGHT DNA-binding domain IPR016024 Armadillo-type fold IPR007087 Zinc finger, C2H2-type IPR011991 Winged helix-turn-helix transcription repressor DNA-binding IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL13908 |
Nucleotide sequence:
ATGGCAAAATCTCAAATAAATACGAAATCTAGAAATTATGTTCAAGACAAAGAAGCATTT
TTGAAAGAACTTAAGCAGTTTAATGAAAGCAAAAATATCCCTTATAAAATACCGGTTGTT
AATGGTGTAGATATAGATTTGTATCTCTTGTATTCATTAGTTCAACAAAGAGGAGGTCTT
AGCAAAGTTAATCAAAATGATACCTGGGAGACATTTTTGCGTCAGCTCCACTTGCCACAT
CCATGTGTTAATGGGTCTACATTATTGAGGAGAATATATGGAATGTATTTGGAGAAATAT
GAAAGAGCTAAAGGTCCACCAGGTAGAGATGATGACTTAGATATGGACGACGACCCTCGT
CGTGGCAGGGGCGGAGGAATGCCACGCATATCATTTGGTAGCGGCACTTATATATCAGCT
TCGGGTGAACCGCTTCGTACAGGAAATCGTGTGGCTGGTCCATCTGAACGCCTTACATTA
TCATTGTTGTCTCCGATGCCGAATGAGCAGGATTTTGCTGTTAATGTTTGCACAGTACTA
GCCGCTGATCATTCTAACCGGCTTCCGTTGAGCACAACACCTCATATACTGGATTTCTTG
CTAGCTCATGCTGGAGTTTATAATCACTCAAGCCTCCGCGATACAATCGGCCGCTCATAC
TTCGAATCTCGCGGTCGGTATCCCCACGAGTTCTGGTCGGAGCGAGCTGGTGGTGGCGGG
GCGAGGGAACTGGCCGACGAGACGAAGTTCACCGGGGATCAACCTGAACTGGTCGTGCAG
GCATTGGCTGCACATAACACACTCACGGATTGTCTGATGCTGGCTGGTGGTGAAGAAGAG
AATATGGAGAAGATTGTTGAGGATGACACCGAGGATTGGGTGACGGAACCTTCAGAGGAA
GATCAGCTGTTTGCTCCGACGTTACCAGGAGGCGCCACATGTGTTTACACACAACGTGTA
CTACAGATAGCTAGCATCGTACGAAGCCTGTCCTTTCACGAGGAGAATGTACAGTACTTA
GCCAGGAATACCACTCTTATAAGGTTCTTACTGCTATGTGCTAACTGTTGGGTGGGAACT
CTTCGTCAGAGCGGTCTAGACACGCTCGGTAACGTTGCTGCCGAGCTCATTATTAAAGAC
CCCGCGACATGTTTGATATCTCGTCATGTTCTATCAACCATACAATCTGCGCTCGTATCT
CAAGATCGTGCTAGAGTGTTGGCAGCACTGGAGCTCTTGAATAAGTTGGCACAGAACGAA
GTCAACGAGGAAGCATTACTCAAAGCATTGGAATCAAAAGTGTATAGCGACGTGTGTGCT
CTGCTCACCCTCCGTGATATAATGGTGTTGGTCTGCACCCTGGAGTGTGTATACGCCCTT
ACCGGTCTCGGAGACCGCGCGTGTGAGGCGGTCGCACGTGTACCGGGACTGCTACACACA
CTCGTGTCACTGGTTACTGTTGAGGCCCAGAGCTACGGTCCCCGCGCGTGTATCCTCATG
CGTGTGGTGGAGACGGTTAGCGGTCCGCCCGCGGTGGACCACGTGCAACCACACACAGTA
CAGAACAATATCCCCTCCCAACAGGTTCAAGCCCCAAAGCCTCAAGTGGAACCCCCCGTG
GCGTCCCCCGCGGCCGCCACCCACACACAGCCTACTACACTACAACAATCCCACATGCAA
CAACGTACTGTACAAGAAAACGAGCACTTCGCCCAAGCGTGGCTCCGCGCTACGTACGAA
GCTCTGCCCGCGTCGGACAACAGCGCGTGCGATGCTGCGGACGTGTACAGGCAGTACCTC
GCGTGCTGCACCAAACTGGCTCGCAAGGGAGTCATCGCACCCGCGCACTTCCCGCGACTT
GTCAGGACGGTGTTCGGCGGCACGGTTGGGCCAAACACAGTGAGCACTTCTACGGGTGAA
ACACAACATGTGTACATCGGCATACGAGCGAAGAATATAGCAAATAGAAGTAATCCGCCT
GTTGGTCCGTCGTCACCTATATTAAAAGCTCAACTCACTAACAAGCCGAGCGCGACCGTT
GAAACAAAGCCGGTCGTGACGCAACTGCAGACGCCAGCGCAGCCCGCGGACAACAGCAAC
ACGTCGCTTATCAAACACCTGTTAGCGCACAAAGTAAGCGCTGCTCACACACACGTCGCC
CAGAGACAGCAAAGCCAACAACGTCTACCGACCTCTGGAACAGTGGTTGTACAAACATCT
ACAGCGACGTCGCTCCAGAATATGGAGGTGGATCCAGAAGCGCTCATCAAATTACAAAAA
GACGAACCAGTTCAAATACAGATAGACGATCAGGCTCAATTGACTATAAAAACAGCACAG
AACAAGATGCTGGCTGATCTCCTTGAGAAAAAATCAAACCCACCAGTACAGGTTGTACAG
ATGGGACAACAAATAAATGCACCAACTATACAAATAACGGAAACGGGACAAATAGTTCAA
GTTAAATCGGAAAATATGATACAGTTATCGGATTCCGTGCAACCGAGCGCGCCGTTTTTT
CAAATTAAGAACGAGCAAGGACAACTGATACAGATCAAAAACGACCAAGGACAGATTATA
CAACTCAAAAGCGACCAATTACAGGGCATGATTCAAATTAAGAACGACCAAGGTCAGATC
GTACAGATTAAAAATGACAATCTAGCACAGTTATTACAGTCTGGTGTTCTACAGAAGAAT
GAGAAGGATATAGCGGAAAGTGTTGTGACGGATCACTCGTATACGGAACCACCGAACAAG
AAAATCAAAGTCGAAGACAAGGCAGAGAATCCCCCGGAAAGCGTTTCAAAGACTGCTGCC
AATCTGTACGCGGCCTTAGCTGCCAGCCTCCAGGATGAAGACGATCTGCTTCCACCGAAA
CAAGAACCCGTGGATGTTATTCAGCCATCAGTATTAGTCGGTACGCCGGAGAACCAATCA
GTTTTGATACAAGAACCTATATTACAGGTGCAGCAACCAACATTACAAGTGCAACAACCA
ACATTACAAGTGCAGCAACCGTCGTTACAAGTGCAGCAACCCACGATACAGATGCAGCAG
CCGGCGTTACAGGTACAGGTACAGCAGCCCTTACAAGTTCAACAGCCGATGCTGCAAGTT
CAACCAATGGATGTACAGAATATCATGTCCCAGGCTGGACAGATTATATTGCAGGAAAAA
CAGGTCGCTACTCAGCAGACGCAGTTTGTACAACAGCCCATGCAACTTATAGCAGCACCA
AGCACATCACAAGGTGGTTTGAGTTACATAGCGCAAAACATACCCGGTAATATGATGCAG
AAAACTATCATAATAGTTCAGGGTACTGGAGGTGGTCCTCTCACACTAACGGTTAACAAT
CCCTCTGGTTTGGACGAGGCCACGCTAAACTCGCTCATAGCGCAGGCGACTGAGGCGATA
ACACAGCAGCAAATTATTCAGGTGCAGCAACCAACATTACAAGTGCAACAACCAACATTA
CAAGTGCAGCAACCGTCGTTACAAGTGCAGCAACCCACGATACAGATGCAGCAGCCGGCG
TTACAGGTACAGGTACAGCAGCCCTTACAAGTTCAACAGCCGATGCTGCAAGTTCAACCA
ATGGATGTACAGAATATCATGTCCCAGGCTGGACAGATTATATTGCAGGAAAAACAGGTC
GCTACTCAGCAGACGCAGTTTGTACAACAGCCCATGCAACTTATAGCAGCACCAAGCACA
TCACAAGGTGGTTTGAGTTACATAGCGCAAAACATACCCGGTAATATGATGCAGAAAACT
ATCATAATAGTTCAGGGTACTGGAGGTGGTCCTCTCACACTAACGGTTAACAATCCCTCT
GGTTTGGACGAGGCCACGCTAAACTCGCTCATAGCGCAGGCGACTGAGGCGATAACACAG
CAGCAAATTATTCAGAATCCACCTCAACTGACGCCAAGCCAGCAACCTATAATAACATCT
CAACCACAACCTCACAAAGCACAAATCGTTAACCCTCAACAAATCGTCGTCACCCAGAAA
CAACCACCTGGTATAATAAGTACGTCATCTGGCAACCAGATCGTCAGCACTATAGTTGGT
AGCAACCAGCAAATAATCCAAGGGAATCAGCAGTTACTGCAGGGTAACCAACAAATAATA
GCGGTTTCCAACAACCAGCAAATAATAGTTAACACTCCAATGAAACCAACTCATAGAGTT
GTCCAAGCGTCAAGGAACCAGGTTACAACAGTTGTGACCAGTAACCAGGCTGTCGTCACA
ACTGATACAAAAACTGTTCAGAGTTCAGCGAAACCTCAATCGGTGATGCGACAGGTTATA
ACTCGACAACCAGTCATGGTCGGCAATACCAAGATCGGTGACAAAGAAATGGTGGTCACG
CAACCTGTAACTGAGAAGATTCAACAACCAAAGAAGATAGAAACTCCACCGCCACAGACG
CCACTTCAGACACAGACGCCTACGACGCCAGGGTCTGAGGACACGCCCTGGATCTGTCAC
TGGCGGGGATGTGGGAAAACGTTCTCCAGTTCGTCCGAGGTGTTCACTCACGTGGCTCGG
ACCCACTGTCCCAGTACAGCCGGCGGTGAAGCCCCCTGTATGTGGCTAGACTGTGATCGA
GTCCCACGGAAGACATTTGCCTTACTAAACCATCTCACTGACAAACATTGCACTCCAAAT
GCTCTCAAAGCAATATTCAATTCCCGTCGTCACACCGCGAGCGAGGCCGAGTCTGGTAAG
CCCATGTCAGTGGGATATCCGCCGAACGCAGCGTTGGCGGCCTTGAACAAACACGCGGCG
GATATGTTCAATCCCAGGGAGCTTATGGATGAAAACGAAGGCCCAGTTACGAAAAGCATT
CGACTAACAGCGGCACTTATTCTCAGAAACATAGTTATTTACTCAAACACTGGTAGAAGA
TTACTACGTTCATACGAAGCGCATTTGGCGTCAATAGCCCTCAGCAACGTGGAGGCATCG
CGAACTATCTCCCAAGTTCTGTACGATATGAACAATATATGA
Protein sequence:
MAKSQINTKSRNYVQDKEAFLKELKQFNESKNIPYKIPVVNGVDIDLYLLYSLVQQRGGL
SKVNQNDTWETFLRQLHLPHPCVNGSTLLRRIYGMYLEKYERAKGPPGRDDDLDMDDDPR
RGRGGGMPRISFGSGTYISASGEPLRTGNRVAGPSERLTLSLLSPMPNEQDFAVNVCTVL
AADHSNRLPLSTTPHILDFLLAHAGVYNHSSLRDTIGRSYFESRGRYPHEFWSERAGGGG
ARELADETKFTGDQPELVVQALAAHNTLTDCLMLAGGEEENMEKIVEDDTEDWVTEPSEE
DQLFAPTLPGGATCVYTQRVLQIASIVRSLSFHEENVQYLARNTTLIRFLLLCANCWVGT
LRQSGLDTLGNVAAELIIKDPATCLISRHVLSTIQSALVSQDRARVLAALELLNKLAQNE
VNEEALLKALESKVYSDVCALLTLRDIMVLVCTLECVYALTGLGDRACEAVARVPGLLHT
LVSLVTVEAQSYGPRACILMRVVETVSGPPAVDHVQPHTVQNNIPSQQVQAPKPQVEPPV
ASPAAATHTQPTTLQQSHMQQRTVQENEHFAQAWLRATYEALPASDNSACDAADVYRQYL
ACCTKLARKGVIAPAHFPRLVRTVFGGTVGPNTVSTSTGETQHVYIGIRAKNIANRSNPP
VGPSSPILKAQLTNKPSATVETKPVVTQLQTPAQPADNSNTSLIKHLLAHKVSAAHTHVA
QRQQSQQRLPTSGTVVVQTSTATSLQNMEVDPEALIKLQKDEPVQIQIDDQAQLTIKTAQ
NKMLADLLEKKSNPPVQVVQMGQQINAPTIQITETGQIVQVKSENMIQLSDSVQPSAPFF
QIKNEQGQLIQIKNDQGQIIQLKSDQLQGMIQIKNDQGQIVQIKNDNLAQLLQSGVLQKN
EKDIAESVVTDHSYTEPPNKKIKVEDKAENPPESVSKTAANLYAALAASLQDEDDLLPPK
QEPVDVIQPSVLVGTPENQSVLIQEPILQVQQPTLQVQQPTLQVQQPSLQVQQPTIQMQQ
PALQVQVQQPLQVQQPMLQVQPMDVQNIMSQAGQIILQEKQVATQQTQFVQQPMQLIAAP
STSQGGLSYIAQNIPGNMMQKTIIIVQGTGGGPLTLTVNNPSGLDEATLNSLIAQATEAI
TQQQIIQVQQPTLQVQQPTLQVQQPSLQVQQPTIQMQQPALQVQVQQPLQVQQPMLQVQP
MDVQNIMSQAGQIILQEKQVATQQTQFVQQPMQLIAAPSTSQGGLSYIAQNIPGNMMQKT
IIIVQGTGGGPLTLTVNNPSGLDEATLNSLIAQATEAITQQQIIQNPPQLTPSQQPIITS
QPQPHKAQIVNPQQIVVTQKQPPGIISTSSGNQIVSTIVGSNQQIIQGNQQLLQGNQQII
AVSNNQQIIVNTPMKPTHRVVQASRNQVTTVVTSNQAVVTTDTKTVQSSAKPQSVMRQVI
TRQPVMVGNTKIGDKEMVVTQPVTEKIQQPKKIETPPPQTPLQTQTPTTPGSEDTPWICH
WRGCGKTFSSSSEVFTHVARTHCPSTAGGEAPCMWLDCDRVPRKTFALLNHLTDKHCTPN
ALKAIFNSRRHTASEAESGKPMSVGYPPNAALAALNKHAADMFNPRELMDENEGPVTKSI
RLTAALILRNIVIYSNTGRRLLRSYEAHLASIALSNVEASRTISQVLYDMNNI