New model in OGS2.0 | DPOGS204678  |
---|---|
Genomic Position | scaffold4494:+ 10119-21304 |
See gene structure | |
CDS Length | 2871 |
Paired RNAseq reads   | 3586 |
Single RNAseq reads   | 8458 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010249 (0.0) |
Best Drosophila hit   | moira (1e-113) |
Best Human hit | SWI/SNF complex subunit SMARCC2 isoform b (1e-109) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL004358 [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL004358 [Aedes aegypti] (0.0) |
GeneOntology terms    | GO:0048477 oogenesis GO:0016585 chromatin remodeling complex GO:0016251 general RNA polymerase II transcription factor activity GO:0006338 chromatin remodeling GO:0035060 brahma complex GO:0003713 transcription coactivator activity GO:0005515 protein binding GO:0045893 positive regulation of transcription, DNA-dependent GO:0003677 DNA binding GO:0008586 imaginal disc-derived wing vein morphogenesis GO:0008587 imaginal disc-derived wing margin morphogenesis |
InterPro families    | IPR009057 Homeodomain-like IPR001357 BRCT IPR001005 SANT domain, DNA binding IPR007526 SWIRM IPR014778 Myb, DNA-binding IPR011991 Winged helix-turn-helix transcription repressor DNA-binding IPR012287 Homeodomain-related IPR017884 SANT, eukarya |
Orthology group | MCL11632 |
Nucleotide sequence:
ATGGCTGCGCTTAGTCCTAAGAAGGATGGAGGCCCGAATATAGAGTTTTTCCAATCACCG
GAGTCCTTAGCTCAGTTTGATCAAATCCGTGTTTGGTTACAAAAAAACTGCAAAAAGCAT
GTACAAACTGATCCACCAACAAAAGAAGGCTTGGCACAACTTGTCATTCAGCTCATACAG
TATCAAGAGAACAAATTGGGAAAGAATGCCACTGATCCCCCTTTTATGAGGCTTCCAATG
AAAGTGTTCATGGACATGAAAGCCGGTGGTTCGTTGTGCACAGTGCTAGCCACCATGTTC
CGTTTCAAGTCGGAGCAGCGGTGGCGCAAGTTCGACTTCCAGGTCGGTAAGAACCCGTCC
CGCAAGGATCTTAACGTGCAGATGATGATGGAAATAGAGTCGGCTCTGCTAACAGCTGAA
TTACTCCGCTCCCCCTGCATCTACATCCGCCCCGACGTCGACAAAGCTACGGCGAACAAA
ATCAAAGATATCATCGTGAATCACCAGGGAGAGATATGTGAAGACGAAGAGGATGCCACT
CATATAATATACCCAGCTGTTGATCCTCTAGAGGAAGAATACGCCAGGCCGGTGTTCAGG
AGGGGAAATAATGTCCTGGTGCATTGGTACTACTTACCAGACAGTCACGACACTTGGGCT
CAAGCCGACCTCCCCGTGGATGTTCCGGAGACAGCCAACTGGGACTGTAATAGATCGGAG
CCGTGGCGAGTGTCGGCCACGTGGGCGCTGGATCTGACCCAGTACAACGAGTGGATGAAC
GAGGAGGACTACGAGTTAGACCAGCATGGAAAGAAGAAGGTCCACAAACTGCGGTTGTCC
GTGGACGAGCTGATGCCGGGAGCGGAGAGTTCAGGGAAAAGTAAGAAGAGCAAGAGGAAG
AGGTCGCCGTCACCCCCTCCACAGAAACATGGGAAGAGAAAGAGTCGAGTTGCTAAACGT
CGAGACAACGACGGCGATGACGAGGACGATAACGCGTCCAGAGACAACACGGACGTGGCG
CCCGCCACCGACAGCGAGCGCTCCACCGAGGCACCTGTGTCTGTACCGTCTGCGGGTCCG
TCGGGTGCGGGTGGGAGCGGCAGTGGTGGGGGCAGCGGCGGCGGTGGTTGTAGCGAGGTG
GTGCAGGAAGCCCCCGCCACGCCCGCCCCGGCCATGGACGCGCACGACGACTCACAAGGG
AAGCACAGTGATTCCAATACACAGGAGATGACTAAGGAAGAGCTGGAAGACAACGTAACA
GACCAAACCCATCACATAGTGGTGCCGTCTTACTCCGCCTGGTTTGATTACAACTCGATA
CACACCATAGAGAAGAGGGCCTTGCCGGAGTTCTTCAATAATAAGAATAAGTCCAAAACA
CCGGAGATATATCTGGCTTACAGAAATTTCATGCTAGACACGTATCGTTTGAACCCTACT
GAATATTTAACAAGCACGGCCTGTCGGCGGAACCTCGCCGGGGACGTGTGCGCCATCATG
AGGGTACATGGATTCCTGGAACAATGGGGACTTATTAATTATCAGGTGGAGGCGGAAGCT
CGTCCGACCGCGATGGGTCCTCCTCCGACATCTCACTTCCACGTGCTCTCAGACACTCCC
TCCGGGCTGCAGCCGCTCCAGGCGAGGTCCACCCAACAGAGACCAGCAGAGAACGCGGCG
GTGCCGAAGATCGAGGCCGGTCTGCCGAACGGCACTGAGGCACCCATCAAGGCCGAGCCC
AGTGTTAAGACTGAACCCATAGAGCTGGGGACGGCCCCTGGGCTTAAAATGGATCAGTAC
CGCGGCGGTGCGAGGGGTCGCGAGTGGACGGAACAGGAGACGCTTCTGCTGCTGGAAGCT
CTGGAACTCCACCGGGACGACTGGAACAGGGTTGCAGCACACGTCGGCTCCAGGACACAC
GACGAGTGCATCCTACACTTCCTCAGGCTACCCATCGAGGATCCCTACCTAAACGACACA
TCCGCGGGTGGAGTTTTGGGTCCGCTAGCCTACCAGCCTGTGCCGTTCAGTAAGGCCGGT
AACCCTGTCATGAGTACAGTAGCCTTCCTCGCCTCGGTCGTTGACCCCCGAATCGCCTCT
AAAGCTACAAGGGCCGCTATGGATGAATTCGCTGCTATTAAGGATGAAGTTCCGGCGGCC
ATGATGGAGGCTCACGTGAAGGCAGCCGGCGCCCACGGACCCGCCGCCGCCCTAGCAGCC
ACCGGCATAGCGGGGACTGCGCCCCCCGCACCCCCTGCCGGGGACACGCCCAGCGCCGGC
GAAAAGAAAGAAGGAGGCAGCGATGTTAAGACTGAGGCGATGGAGGTTGATAACGAAGAG
GCCAAGGTGAAGGAGGAACCGGCTGAGGCAGAGGAGGCTAAGGACTCCAAAGAAGAAGAC
ACAAGCACGCCAGAGACCCCAGCTGTAGTGGACGCCAAGCTGCAATCAGCTGCAGCGGCA
GCACTAGCAGCTGCAGCTGTTAAAGCGAAACACCTGGCGGGGGTCGAGGAGAGAAAGATC
AAATCCCTGGTGGCATTACTGGTGGAGACACAGATGAAGAAGCTGGAGATCAAGCTGCGG
CACTTCGAAGAGCTGGAGGCTACCATGGAGAGGGAAAGAGAGGGTCTAGAATATCAACGG
CAGCAGTTGATTCAGGAACGGCAGCAGTTCCACCTGGAACAGCTGAAGGCAGCTGAATTC
CGAGCGAGGAACCACGCCATCCAGAGATTACAGGCCGAGAGCGGTGGAGTGGTGGGCGTC
GTGCCTGGCGTGGTGGGCGTCCCGGGGGGCGGACCGCCCCTCGCAGCCGGCGGACCTGCC
ATGGAAGCCCCTCAGGAGCCCCCGCCACAACCCGCGCCGCATCACGCATAA
Protein sequence:
MAALSPKKDGGPNIEFFQSPESLAQFDQIRVWLQKNCKKHVQTDPPTKEGLAQLVIQLIQ
YQENKLGKNATDPPFMRLPMKVFMDMKAGGSLCTVLATMFRFKSEQRWRKFDFQVGKNPS
RKDLNVQMMMEIESALLTAELLRSPCIYIRPDVDKATANKIKDIIVNHQGEICEDEEDAT
HIIYPAVDPLEEEYARPVFRRGNNVLVHWYYLPDSHDTWAQADLPVDVPETANWDCNRSE
PWRVSATWALDLTQYNEWMNEEDYELDQHGKKKVHKLRLSVDELMPGAESSGKSKKSKRK
RSPSPPPQKHGKRKSRVAKRRDNDGDDEDDNASRDNTDVAPATDSERSTEAPVSVPSAGP
SGAGGSGSGGGSGGGGCSEVVQEAPATPAPAMDAHDDSQGKHSDSNTQEMTKEELEDNVT
DQTHHIVVPSYSAWFDYNSIHTIEKRALPEFFNNKNKSKTPEIYLAYRNFMLDTYRLNPT
EYLTSTACRRNLAGDVCAIMRVHGFLEQWGLINYQVEAEARPTAMGPPPTSHFHVLSDTP
SGLQPLQARSTQQRPAENAAVPKIEAGLPNGTEAPIKAEPSVKTEPIELGTAPGLKMDQY
RGGARGREWTEQETLLLLEALELHRDDWNRVAAHVGSRTHDECILHFLRLPIEDPYLNDT
SAGGVLGPLAYQPVPFSKAGNPVMSTVAFLASVVDPRIASKATRAAMDEFAAIKDEVPAA
MMEAHVKAAGAHGPAAALAATGIAGTAPPAPPAGDTPSAGEKKEGGSDVKTEAMEVDNEE
AKVKEEPAEAEEAKDSKEEDTSTPETPAVVDAKLQSAAAAALAAAAVKAKHLAGVEERKI
KSLVALLVETQMKKLEIKLRHFEELEATMEREREGLEYQRQQLIQERQQFHLEQLKAAEF
RARNHAIQRLQAESGGVVGVVPGVVGVPGGGPPLAAGGPAMEAPQEPPPQPAPHHA