DPGLEAN06579 in OGS1.0

New model in OGS2.0  DPOGS204678
Genomic Position  scaffold4494:+ 10119-21304
CDS Length  2871
Paired RNAseq reads  3586
Single RNAseq reads  8458
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010249 (0.0)
Best Drosophila hit  moira (1e-113)
Best Human hit  SWI/SNF complex subunit SMARCC2 isoform b (1e-109)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL004358 [Aedes aegypti] (0.0)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL004358 [Aedes aegypti] (0.0)
GeneOntology terms
GO:0048477 oogenesis
GO:0016585 chromatin remodeling complex
GO:0016251 general RNA polymerase II transcription factor activity
GO:0006338 chromatin remodeling
GO:0035060 brahma complex
GO:0003713 transcription coactivator activity
GO:0005515 protein binding
GO:0045893 positive regulation of transcription, DNA-dependent
GO:0003677 DNA binding
GO:0008586 imaginal disc-derived wing vein morphogenesis
GO:0008587 imaginal disc-derived wing margin morphogenesis
InterPro families
IPR009057 Homeodomain-like
IPR001357 BRCT
IPR001005 SANT domain, DNA binding
IPR007526 SWIRM
IPR014778 Myb, DNA-binding
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR012287 Homeodomain-related
IPR017884 SANT, eukarya
Orthology group  MCL11632

Nucleotide sequence:

ATGGCTGCGCTTAGTCCTAAGAAGGATGGAGGCCCGAATATAGAGTTTTTCCAATCACCG
GAGTCCTTAGCTCAGTTTGATCAAATCCGTGTTTGGTTACAAAAAAACTGCAAAAAGCAT
GTACAAACTGATCCACCAACAAAAGAAGGCTTGGCACAACTTGTCATTCAGCTCATACAG
TATCAAGAGAACAAATTGGGAAAGAATGCCACTGATCCCCCTTTTATGAGGCTTCCAATG
AAAGTGTTCATGGACATGAAAGCCGGTGGTTCGTTGTGCACAGTGCTAGCCACCATGTTC
CGTTTCAAGTCGGAGCAGCGGTGGCGCAAGTTCGACTTCCAGGTCGGTAAGAACCCGTCC
CGCAAGGATCTTAACGTGCAGATGATGATGGAAATAGAGTCGGCTCTGCTAACAGCTGAA
TTACTCCGCTCCCCCTGCATCTACATCCGCCCCGACGTCGACAAAGCTACGGCGAACAAA
ATCAAAGATATCATCGTGAATCACCAGGGAGAGATATGTGAAGACGAAGAGGATGCCACT
CATATAATATACCCAGCTGTTGATCCTCTAGAGGAAGAATACGCCAGGCCGGTGTTCAGG
AGGGGAAATAATGTCCTGGTGCATTGGTACTACTTACCAGACAGTCACGACACTTGGGCT
CAAGCCGACCTCCCCGTGGATGTTCCGGAGACAGCCAACTGGGACTGTAATAGATCGGAG
CCGTGGCGAGTGTCGGCCACGTGGGCGCTGGATCTGACCCAGTACAACGAGTGGATGAAC
GAGGAGGACTACGAGTTAGACCAGCATGGAAAGAAGAAGGTCCACAAACTGCGGTTGTCC
GTGGACGAGCTGATGCCGGGAGCGGAGAGTTCAGGGAAAAGTAAGAAGAGCAAGAGGAAG
AGGTCGCCGTCACCCCCTCCACAGAAACATGGGAAGAGAAAGAGTCGAGTTGCTAAACGT
CGAGACAACGACGGCGATGACGAGGACGATAACGCGTCCAGAGACAACACGGACGTGGCG
CCCGCCACCGACAGCGAGCGCTCCACCGAGGCACCTGTGTCTGTACCGTCTGCGGGTCCG
TCGGGTGCGGGTGGGAGCGGCAGTGGTGGGGGCAGCGGCGGCGGTGGTTGTAGCGAGGTG
GTGCAGGAAGCCCCCGCCACGCCCGCCCCGGCCATGGACGCGCACGACGACTCACAAGGG
AAGCACAGTGATTCCAATACACAGGAGATGACTAAGGAAGAGCTGGAAGACAACGTAACA
GACCAAACCCATCACATAGTGGTGCCGTCTTACTCCGCCTGGTTTGATTACAACTCGATA
CACACCATAGAGAAGAGGGCCTTGCCGGAGTTCTTCAATAATAAGAATAAGTCCAAAACA
CCGGAGATATATCTGGCTTACAGAAATTTCATGCTAGACACGTATCGTTTGAACCCTACT
GAATATTTAACAAGCACGGCCTGTCGGCGGAACCTCGCCGGGGACGTGTGCGCCATCATG
AGGGTACATGGATTCCTGGAACAATGGGGACTTATTAATTATCAGGTGGAGGCGGAAGCT
CGTCCGACCGCGATGGGTCCTCCTCCGACATCTCACTTCCACGTGCTCTCAGACACTCCC
TCCGGGCTGCAGCCGCTCCAGGCGAGGTCCACCCAACAGAGACCAGCAGAGAACGCGGCG
GTGCCGAAGATCGAGGCCGGTCTGCCGAACGGCACTGAGGCACCCATCAAGGCCGAGCCC
AGTGTTAAGACTGAACCCATAGAGCTGGGGACGGCCCCTGGGCTTAAAATGGATCAGTAC
CGCGGCGGTGCGAGGGGTCGCGAGTGGACGGAACAGGAGACGCTTCTGCTGCTGGAAGCT
CTGGAACTCCACCGGGACGACTGGAACAGGGTTGCAGCACACGTCGGCTCCAGGACACAC
GACGAGTGCATCCTACACTTCCTCAGGCTACCCATCGAGGATCCCTACCTAAACGACACA
TCCGCGGGTGGAGTTTTGGGTCCGCTAGCCTACCAGCCTGTGCCGTTCAGTAAGGCCGGT
AACCCTGTCATGAGTACAGTAGCCTTCCTCGCCTCGGTCGTTGACCCCCGAATCGCCTCT
AAAGCTACAAGGGCCGCTATGGATGAATTCGCTGCTATTAAGGATGAAGTTCCGGCGGCC
ATGATGGAGGCTCACGTGAAGGCAGCCGGCGCCCACGGACCCGCCGCCGCCCTAGCAGCC
ACCGGCATAGCGGGGACTGCGCCCCCCGCACCCCCTGCCGGGGACACGCCCAGCGCCGGC
GAAAAGAAAGAAGGAGGCAGCGATGTTAAGACTGAGGCGATGGAGGTTGATAACGAAGAG
GCCAAGGTGAAGGAGGAACCGGCTGAGGCAGAGGAGGCTAAGGACTCCAAAGAAGAAGAC
ACAAGCACGCCAGAGACCCCAGCTGTAGTGGACGCCAAGCTGCAATCAGCTGCAGCGGCA
GCACTAGCAGCTGCAGCTGTTAAAGCGAAACACCTGGCGGGGGTCGAGGAGAGAAAGATC
AAATCCCTGGTGGCATTACTGGTGGAGACACAGATGAAGAAGCTGGAGATCAAGCTGCGG
CACTTCGAAGAGCTGGAGGCTACCATGGAGAGGGAAAGAGAGGGTCTAGAATATCAACGG
CAGCAGTTGATTCAGGAACGGCAGCAGTTCCACCTGGAACAGCTGAAGGCAGCTGAATTC
CGAGCGAGGAACCACGCCATCCAGAGATTACAGGCCGAGAGCGGTGGAGTGGTGGGCGTC
GTGCCTGGCGTGGTGGGCGTCCCGGGGGGCGGACCGCCCCTCGCAGCCGGCGGACCTGCC
ATGGAAGCCCCTCAGGAGCCCCCGCCACAACCCGCGCCGCATCACGCATAA

Protein sequence:

MAALSPKKDGGPNIEFFQSPESLAQFDQIRVWLQKNCKKHVQTDPPTKEGLAQLVIQLIQ
YQENKLGKNATDPPFMRLPMKVFMDMKAGGSLCTVLATMFRFKSEQRWRKFDFQVGKNPS
RKDLNVQMMMEIESALLTAELLRSPCIYIRPDVDKATANKIKDIIVNHQGEICEDEEDAT
HIIYPAVDPLEEEYARPVFRRGNNVLVHWYYLPDSHDTWAQADLPVDVPETANWDCNRSE
PWRVSATWALDLTQYNEWMNEEDYELDQHGKKKVHKLRLSVDELMPGAESSGKSKKSKRK
RSPSPPPQKHGKRKSRVAKRRDNDGDDEDDNASRDNTDVAPATDSERSTEAPVSVPSAGP
SGAGGSGSGGGSGGGGCSEVVQEAPATPAPAMDAHDDSQGKHSDSNTQEMTKEELEDNVT
DQTHHIVVPSYSAWFDYNSIHTIEKRALPEFFNNKNKSKTPEIYLAYRNFMLDTYRLNPT
EYLTSTACRRNLAGDVCAIMRVHGFLEQWGLINYQVEAEARPTAMGPPPTSHFHVLSDTP
SGLQPLQARSTQQRPAENAAVPKIEAGLPNGTEAPIKAEPSVKTEPIELGTAPGLKMDQY
RGGARGREWTEQETLLLLEALELHRDDWNRVAAHVGSRTHDECILHFLRLPIEDPYLNDT
SAGGVLGPLAYQPVPFSKAGNPVMSTVAFLASVVDPRIASKATRAAMDEFAAIKDEVPAA
MMEAHVKAAGAHGPAAALAATGIAGTAPPAPPAGDTPSAGEKKEGGSDVKTEAMEVDNEE
AKVKEEPAEAEEAKDSKEEDTSTPETPAVVDAKLQSAAAAALAAAAVKAKHLAGVEERKI
KSLVALLVETQMKKLEIKLRHFEELEATMEREREGLEYQRQQLIQERQQFHLEQLKAAEF
RARNHAIQRLQAESGGVVGVVPGVVGVPGGGPPLAAGGPAMEAPQEPPPQPAPHHA
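
The relationship between the CDS and the protein sequence above can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model; the function name `translate_cds` and the constants are hypothetical helpers introduced for illustration and are not part of the database. A CDS of 2871 nt corresponds to 957 codons, that is, 956 residues plus one stop codon.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into protein, dropping one trailing stop codon if present.

    Hypothetical helper: whitespace and line breaks are removed first, so the
    60-column sequence block can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last codons of the nucleotide sequence in this record.
print(translate_cds("ATGGCTGCGCTTAGTCCTAAGAAG"))
print(translate_cds("CATCACGCATAA"))
```

Passing the full nucleotide sequence to `translate_cds` and comparing the result with the protein sequence is a quick way to confirm that the two blocks are consistent with each other and with the stated CDS length.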