DPGLEAN12579 in OGS1.0

Genomic Position  scaffold474:+ 89874-115018
CDS Length  3711
Paired RNAseq reads  6105
Single RNAseq reads  15831
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010273 (1e-72)
Best Drosophila hit  osa, isoform D (1e-89)
Best Human hit  AT-rich interactive domain-containing protein 1B isoform 2 (5e-40)
Best NR hit (blastp)  PREDICTED: similar to osa CG7467-PA [Tribolium castaneum] (9e-133)
Best NR hit (blastx)  conserved hypothetical protein [Culex quinquefasciatus] (1e-99)
GeneOntology terms
GO:0016514 SWI/SNF complex
GO:0005515 protein binding
GO:0003713 transcription coactivator activity
GO:0005622 intracellular
GO:0007399 nervous system development
GO:0005730 nucleolus
GO:0045449 regulation of transcription
GO:0016568 chromatin modification
GO:0005634 nucleus
GO:0048096 chromatin-mediated maintenance of transcription
GO:0005737 cytoplasm
GO:0003677 DNA binding
GO:0043231 intracellular membrane-bounded organelle
InterPro families
IPR001606 ARID/BRIGHT DNA-binding domain
IPR021906 Protein of unknown function DUF3518
Orthology group  MCL11634

Nucleotide sequence:

ATGTGCGCTAACTCTAATCTGTGTATCCGTGTCATTCTGAGAAGGATTCAAGACGGCCGT
CATCATAAGGCTGCCTGCGAATACAGTAAGTATCGGCCGCAACACTACTTAGTTAAATGG
GACGATTCTTATCTACTGGCGATTCTCGTTGGTGTGAGTGTGTCGGGACACGACCGTGCC
GCTGTGTGCGTGACATCCGACCGTGTCCCCTTCTATGATCTGACCGGTCAGAACAGCAAT
GACTCCGGCGGCAGCGGTGGCGCGGGGCGCGCTTCCACGCCGCACTTGAGGCCCACACCG
AGCCCCACCGGCTCCAGTGGATCGCGATCCATGTCACCGGCTGTCGGTACCCAGAATCTA
CCGATGCCACCGCGGCCGTCGTCGTCTCTATCGGACGGCAGCGGGCCGACCGTGCGAGCA
GGCACCGGCGCCCCCGCTGCTGGCCCTCCGCCCTCGGGCGCTCCGCCCCCCGGGGCCATG
CTGCCGCAGCCTTACCCGCATCACGGCCCGTACAAGACGGCGCCCTACCCGCCGCAGCCC
TACGGCTACCCCCCGCGCAACCACCACCCCTATCCTTACGGAGGATACAGGCCGACACCT
CCGCCACATCCCTCACAACATTACCCGCCACTCAAGCAGGCGGGTCGTCACATGGGTCCG
CCGGCGGGAGCTGGCGAGGCGATGCCGCCGCCGACCGCACCCGGGGAGCCCCATGACAAC
GGCCCCGCGGCGCCCGCCACCGCCCTCGTCACCACCGGCCCCGACGGAGCGCCGCTGGAC
GAGGGCAGCCAGCAGAGCACGCTCAGCAACGCCTCAGCTGCTTCCGGCGAGGAGACGTGC
GGTACACCCAAGAGTCGCAAGGAGTACGGCGCGGGCAGTGCGGCGCCCTCGCCCTCGCCC
GGCGGGGCCTCGCACTCGTCCGTACACGACGAATACGACGCCTCCGCCTCGCCCTGGCCG
AGACCGCCATCTAGTCCCGTATTTAACAGTCACATAGCGCCGGAGTCCTACAGATCAAAG
AAGTCGGACTCGCTGGGCAAGCTGTACGAGATGGACGACGCGCCGGAGAGGAGGGGCTGG
GTGGAGAGACTGCTGGCCTTCATGGACGAGAGGCGCACGCCCATCGCCGCCTGCCCCACC
ATCAGCAAGCAGCCGCTCGACCTCTACAGGCTGTACCTGCTGGTGCGGGACCGCGGGGGA
TTCGTCGAGGTCACTAAAAATAAAACGTGGAAAGACATAGCCGGTTTACTCGGCATCGGC
GCGTCGTCGTCGGCCGCTTACACCCTGAGGAAGCATTACACGAAGAACCTGTTGGCGTAC
GAGTGTCACTTCGACCGCGGCGGCATCGACCCCCAGCCCATCATCAGCCAGGTGGAGGCG
TCCACGAAGAAGAAGAGCGGGAAGGCCAACAGCACCTCCAGCGCAGGGTCGTCAAACTCC
CAGGAGCAGTTCCCGGGCGGCGCGGCGGACGGCTATCCGTCACACGGCGCGCACCCCGCA
CACTACGCACCCTACCCCCCGCAGCCGAGCCAACCGCAGGGCGGCGGGCCGGGCGGCGAC
AACCTCGCCGCCTCCAACCCATTCGACGAGCCGCCGGGGCCCAGGCGACCCCCAGGTTAC
CAACAAGGTTACGGGTACGAGTACGGCTCGCCCTACCCCACCAACAGGCCGGTGTATCCG
CCCTACGGTCCGGAAGGAGACAGGGGTTACGGCGGTAGCGGGGAATACCGCTACGGGTAT
GGCGGCTATCGTGCGGGGGCGCCCGCGGCCGGAGCTCCGCCCTCACAGCCCGCGCCCGCG
CAGCCCGCACCGGCTCAGCCGGCGCAGCCCTACCCGGACTACTACCGCGCGCCTCACCCT
CACGCGCACCCTCACCCGCCGCACCCGCCGCACCCCCCGCACCCTCCACACCCGCCGCAC
TCGCCGCAGCAGCAGCACGACGGTGGCGGAGCCAGCGCAGGTGGACCGGCCGGAGCCCCC
CGGCGGCACCCGGACTTCGCCAAGCAAGAGGGGTACGCGGGTCCCGGAGGCCCGGGCGGC
CCAGCGGGAGCGGCGCGGTTCGCAGGTGGCGGCTGGGCGGGCGGCTTCCCCCGTGGCGGC
CCACCCGCGCCTGCCTGGCGCCCTCCCGCCCCCCTACCACACGCGCCTCACCACCCACAC
CAGCCGGCCTGGCCTCACCACCAGCCGCACCAGCCACATCAACCTTACCACCCGCCGACG
CCGGGCGGCGTGGCGTGGGGCGCCCCGCGACCGCCGCAGGAACTACCGCCCGCCGCGTCT
TCACCCGGTGCAGCGGGCGTAGGTCAATTAAAGAGGGAATTAACCTTCCCCGCCGAGTGT
ATAGAGGCGACCGTCCCCGCCGCGGAGAAGCGTCGCCGACTGACCAAGGCCGACGTGGCA
CCCGTCGACGCCTGGAGGATCATGATGGCGCTCAAGTCCGGTCTGCTGGCCGAGACCTGC
TGGGCCCTCGACATACTCAACATACTACTCTTCGACGACAATTGCATAGGATACTTCGGC
CTCCAGCACATGCCCGGCCTACTCGACCTCCTGCTCGAGCACTTCCAGAAAAGTCTCGGA
GACGTGTTCGACGCTCCCGCTACCGAAAGCGAACCCTGGCGACCGGCGCTGCAAGTAAGG
GACCCCGCCGGCGTGCTTAAACGACGCCGCCTAGAGGACTACGAGGACGAGTGTTACACG
CGCGACGAACCAAGCCTAAACTTAGTGAACGAATCCCGGGACGCCCTCGCGAGACGATGT
ATCGCGTTATCCAATATATTACGTGGACTCACGTTCGTGCCAGGAAACGAGGCGGAGTTC
TCCAGGTCCGGGGCGTTCCTCGCCCTCGCCGGGAAACTGTTGCTGCTTCACCACGAGCAC
GCGCCCAGAGCCGCGAGAGCGAGGGCTTACGAGCGAGCGGCGAGAGACGAAGTCGACGTG
GACTCTTGCTGTTCGAGTCTTCGGGGAGAAGGGGAGTGGTGGTGGGACACGCTGGCCCAG
CTGCGGGAGGACGCGCTCGTCTGCTGCGCGAACATCGCGGGCAGCGTGGAGCTCGGCGGC
CAGCCGGAGGCCGTGGCGCGGCCGCTGCTGGACGGCCTGCTGCACTGGAGTGTGTGTCCG
GCTGCCGTGGCGGGAGACCCCCCGCCCGCCGCCGCGCCCGGCTCTCCGCTGTCTCCGCGC
CGCCTCGCTCTGGAGGCGCTGTGCAAGCTGTGCGTGACGGACGCCAACGTGGACTTGGTG
CTGGCGACGCCGCCGCGCGGTCGCATGGCGGCTCTGTGTGCGGGACTGGCGCGAGACCTG
TGTCGGCCGGAACGGCCCGTGGTGCGAGAGTTCGCCGTCAACCTGCTGCACTACCTGGCA
GGAGCCGGAGGCGCGGCGGCGCGGGAGGTTGCCATGCACGCGCCGGCCGTGGCGCAGCTG
GTGGCGTTCATCGAGCGCGCGGAGCAAACCGCGCTGGGCGTCGCCAACCAGCACGGGGTG
GCGGCCCTGCGAGACAACCCGGATGCGATGGGCACCTCACTAGACATGCTGCGGCGCGCC
GCGGCCACGCTGCTGCGGCTGGCGGAGCACCCCGAGAACAGGCCGCTGATCCGCCGCCAC
GAGCGCCGCCTGCTGTCGCTTGTCATGAGCCAGATCCTCGACCAGAAGGTGGCGCACGAG
CTGGCCGACGTGTTGTTCCACTGCAGCCAGGCGGCCGGCCAGGCGGACTGA

Protein sequence:

MCANSNLCIRVILRRIQDGRHHKAACEYSKYRPQHYLVKWDDSYLLAILVGVSVSGHDRA
AVCVTSDRVPFYDLTGQNSNDSGGSGGAGRASTPHLRPTPSPTGSSGSRSMSPAVGTQNL
PMPPRPSSSLSDGSGPTVRAGTGAPAAGPPPSGAPPPGAMLPQPYPHHGPYKTAPYPPQP
YGYPPRNHHPYPYGGYRPTPPPHPSQHYPPLKQAGRHMGPPAGAGEAMPPPTAPGEPHDN
GPAAPATALVTTGPDGAPLDEGSQQSTLSNASAASGEETCGTPKSRKEYGAGSAAPSPSP
GGASHSSVHDEYDASASPWPRPPSSPVFNSHIAPESYRSKKSDSLGKLYEMDDAPERRGW
VERLLAFMDERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVTKNKTWKDIAGLLGIG
ASSSAAYTLRKHYTKNLLAYECHFDRGGIDPQPIISQVEASTKKKSGKANSTSSAGSSNS
QEQFPGGAADGYPSHGAHPAHYAPYPPQPSQPQGGGPGGDNLAASNPFDEPPGPRRPPGY
QQGYGYEYGSPYPTNRPVYPPYGPEGDRGYGGSGEYRYGYGGYRAGAPAAGAPPSQPAPA
QPAPAQPAQPYPDYYRAPHPHAHPHPPHPPHPPHPPHPPHSPQQQHDGGGASAGGPAGAP
RRHPDFAKQEGYAGPGGPGGPAGAARFAGGGWAGGFPRGGPPAPAWRPPAPLPHAPHHPH
QPAWPHHQPHQPHQPYHPPTPGGVAWGAPRPPQELPPAASSPGAAGVGQLKRELTFPAEC
IEATVPAAEKRRRLTKADVAPVDAWRIMMALKSGLLAETCWALDILNILLFDDNCIGYFG
LQHMPGLLDLLLEHFQKSLGDVFDAPATESEPWRPALQVRDPAGVLKRRRLEDYEDECYT
RDEPSLNLVNESRDALARRCIALSNILRGLTFVPGNEAEFSRSGAFLALAGKLLLLHHEH
APRAARARAYERAARDEVDVDSCCSSLRGEGEWWWDTLAQLREDALVCCANIAGSVELGG
QPEAVARPLLDGLLHWSVCPAAVAGDPPPAAAPGSPLSPRRLALEALCKLCVTDANVDLV
LATPPRGRMAALCAGLARDLCRPERPVVREFAVNLLHYLAGAGGAAAREVAMHAPAVAQL
VAFIERAEQTALGVANQHGVAALRDNPDAMGTSLDMLRRAAATLLRLAEHPENRPLIRRH
ERRLLSLVMSQILDQKVAHELADVLFHCSQAAGQAD
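
The CDS length and the protein sequence above are related by simple arithmetic: 3711 nucleotides make 1237 codons, the last of which is the TGA stop, leaving 1236 residues. The following is a minimal sketch of how that relation could be checked, assuming the standard genetic code (NCBI translation table 1). The helper name `translate_cds` and its `strip_stop` parameter are illustrative and not part of this record or of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds, strip_stop=True):
    """Translate a coding sequence to protein (hypothetical helper).

    Whitespace and line breaks are ignored, so a sequence pasted as
    60-column lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as X.
    protein = "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )
    if strip_stop and protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 60 nucleotides of the CDS listed above.
first_line = "ATGTGCGCTAACTCTAATCTGTGTATCCGTGTCATTCTGAGAAGGATTCAAGACGGCCGT"
print(translate_cds(first_line))
```

Passing the full nucleotide sequence to the same function and comparing the result with the listed protein sequence would confirm that the two are consistent.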