Genomic Position | scaffold474:+ 89874-115018 |
---|---|
See gene structure | |
CDS Length | 3711 |
Paired RNAseq reads   | 6105 |
Single RNAseq reads   | 15831 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010273 (1e-72) |
Best Drosophila hit   | osa, isoform D (1e-89) |
Best Human hit | AT-rich interactive domain-containing protein 1B isoform 2 (5e-40) |
Best NR hit (blastp)   | PREDICTED: similar to osa CG7467-PA [Tribolium castaneum] (9e-133) |
Best NR hit (blastx)   | conserved hypothetical protein [Culex quinquefasciatus] (1e-99) |
GeneOntology terms    | GO:0016514 SWI/SNF complex GO:0005515 protein binding GO:0003713 transcription coactivator activity GO:0005622 intracellular GO:0007399 nervous system development GO:0005730 nucleolus GO:0045449 regulation of transcription GO:0016568 chromatin modification GO:0005634 nucleus GO:0048096 chromatin-mediated maintenance of transcription GO:0005737 cytoplasm GO:0003677 DNA binding GO:0043231 intracellular membrane-bounded organelle |
InterPro families    | IPR001606 ARID/BRIGHT DNA-binding domain IPR021906 Protein of unknown function DUF3518 |
Orthology group | MCL11634 |
Nucleotide sequence:
ATGTGCGCTAACTCTAATCTGTGTATCCGTGTCATTCTGAGAAGGATTCAAGACGGCCGT
CATCATAAGGCTGCCTGCGAATACAGTAAGTATCGGCCGCAACACTACTTAGTTAAATGG
GACGATTCTTATCTACTGGCGATTCTCGTTGGTGTGAGTGTGTCGGGACACGACCGTGCC
GCTGTGTGCGTGACATCCGACCGTGTCCCCTTCTATGATCTGACCGGTCAGAACAGCAAT
GACTCCGGCGGCAGCGGTGGCGCGGGGCGCGCTTCCACGCCGCACTTGAGGCCCACACCG
AGCCCCACCGGCTCCAGTGGATCGCGATCCATGTCACCGGCTGTCGGTACCCAGAATCTA
CCGATGCCACCGCGGCCGTCGTCGTCTCTATCGGACGGCAGCGGGCCGACCGTGCGAGCA
GGCACCGGCGCCCCCGCTGCTGGCCCTCCGCCCTCGGGCGCTCCGCCCCCCGGGGCCATG
CTGCCGCAGCCTTACCCGCATCACGGCCCGTACAAGACGGCGCCCTACCCGCCGCAGCCC
TACGGCTACCCCCCGCGCAACCACCACCCCTATCCTTACGGAGGATACAGGCCGACACCT
CCGCCACATCCCTCACAACATTACCCGCCACTCAAGCAGGCGGGTCGTCACATGGGTCCG
CCGGCGGGAGCTGGCGAGGCGATGCCGCCGCCGACCGCACCCGGGGAGCCCCATGACAAC
GGCCCCGCGGCGCCCGCCACCGCCCTCGTCACCACCGGCCCCGACGGAGCGCCGCTGGAC
GAGGGCAGCCAGCAGAGCACGCTCAGCAACGCCTCAGCTGCTTCCGGCGAGGAGACGTGC
GGTACACCCAAGAGTCGCAAGGAGTACGGCGCGGGCAGTGCGGCGCCCTCGCCCTCGCCC
GGCGGGGCCTCGCACTCGTCCGTACACGACGAATACGACGCCTCCGCCTCGCCCTGGCCG
AGACCGCCATCTAGTCCCGTATTTAACAGTCACATAGCGCCGGAGTCCTACAGATCAAAG
AAGTCGGACTCGCTGGGCAAGCTGTACGAGATGGACGACGCGCCGGAGAGGAGGGGCTGG
GTGGAGAGACTGCTGGCCTTCATGGACGAGAGGCGCACGCCCATCGCCGCCTGCCCCACC
ATCAGCAAGCAGCCGCTCGACCTCTACAGGCTGTACCTGCTGGTGCGGGACCGCGGGGGA
TTCGTCGAGGTCACTAAAAATAAAACGTGGAAAGACATAGCCGGTTTACTCGGCATCGGC
GCGTCGTCGTCGGCCGCTTACACCCTGAGGAAGCATTACACGAAGAACCTGTTGGCGTAC
GAGTGTCACTTCGACCGCGGCGGCATCGACCCCCAGCCCATCATCAGCCAGGTGGAGGCG
TCCACGAAGAAGAAGAGCGGGAAGGCCAACAGCACCTCCAGCGCAGGGTCGTCAAACTCC
CAGGAGCAGTTCCCGGGCGGCGCGGCGGACGGCTATCCGTCACACGGCGCGCACCCCGCA
CACTACGCACCCTACCCCCCGCAGCCGAGCCAACCGCAGGGCGGCGGGCCGGGCGGCGAC
AACCTCGCCGCCTCCAACCCATTCGACGAGCCGCCGGGGCCCAGGCGACCCCCAGGTTAC
CAACAAGGTTACGGGTACGAGTACGGCTCGCCCTACCCCACCAACAGGCCGGTGTATCCG
CCCTACGGTCCGGAAGGAGACAGGGGTTACGGCGGTAGCGGGGAATACCGCTACGGGTAT
GGCGGCTATCGTGCGGGGGCGCCCGCGGCCGGAGCTCCGCCCTCACAGCCCGCGCCCGCG
CAGCCCGCACCGGCTCAGCCGGCGCAGCCCTACCCGGACTACTACCGCGCGCCTCACCCT
CACGCGCACCCTCACCCGCCGCACCCGCCGCACCCCCCGCACCCTCCACACCCGCCGCAC
TCGCCGCAGCAGCAGCACGACGGTGGCGGAGCCAGCGCAGGTGGACCGGCCGGAGCCCCC
CGGCGGCACCCGGACTTCGCCAAGCAAGAGGGGTACGCGGGTCCCGGAGGCCCGGGCGGC
CCAGCGGGAGCGGCGCGGTTCGCAGGTGGCGGCTGGGCGGGCGGCTTCCCCCGTGGCGGC
CCACCCGCGCCTGCCTGGCGCCCTCCCGCCCCCCTACCACACGCGCCTCACCACCCACAC
CAGCCGGCCTGGCCTCACCACCAGCCGCACCAGCCACATCAACCTTACCACCCGCCGACG
CCGGGCGGCGTGGCGTGGGGCGCCCCGCGACCGCCGCAGGAACTACCGCCCGCCGCGTCT
TCACCCGGTGCAGCGGGCGTAGGTCAATTAAAGAGGGAATTAACCTTCCCCGCCGAGTGT
ATAGAGGCGACCGTCCCCGCCGCGGAGAAGCGTCGCCGACTGACCAAGGCCGACGTGGCA
CCCGTCGACGCCTGGAGGATCATGATGGCGCTCAAGTCCGGTCTGCTGGCCGAGACCTGC
TGGGCCCTCGACATACTCAACATACTACTCTTCGACGACAATTGCATAGGATACTTCGGC
CTCCAGCACATGCCCGGCCTACTCGACCTCCTGCTCGAGCACTTCCAGAAAAGTCTCGGA
GACGTGTTCGACGCTCCCGCTACCGAAAGCGAACCCTGGCGACCGGCGCTGCAAGTAAGG
GACCCCGCCGGCGTGCTTAAACGACGCCGCCTAGAGGACTACGAGGACGAGTGTTACACG
CGCGACGAACCAAGCCTAAACTTAGTGAACGAATCCCGGGACGCCCTCGCGAGACGATGT
ATCGCGTTATCCAATATATTACGTGGACTCACGTTCGTGCCAGGAAACGAGGCGGAGTTC
TCCAGGTCCGGGGCGTTCCTCGCCCTCGCCGGGAAACTGTTGCTGCTTCACCACGAGCAC
GCGCCCAGAGCCGCGAGAGCGAGGGCTTACGAGCGAGCGGCGAGAGACGAAGTCGACGTG
GACTCTTGCTGTTCGAGTCTTCGGGGAGAAGGGGAGTGGTGGTGGGACACGCTGGCCCAG
CTGCGGGAGGACGCGCTCGTCTGCTGCGCGAACATCGCGGGCAGCGTGGAGCTCGGCGGC
CAGCCGGAGGCCGTGGCGCGGCCGCTGCTGGACGGCCTGCTGCACTGGAGTGTGTGTCCG
GCTGCCGTGGCGGGAGACCCCCCGCCCGCCGCCGCGCCCGGCTCTCCGCTGTCTCCGCGC
CGCCTCGCTCTGGAGGCGCTGTGCAAGCTGTGCGTGACGGACGCCAACGTGGACTTGGTG
CTGGCGACGCCGCCGCGCGGTCGCATGGCGGCTCTGTGTGCGGGACTGGCGCGAGACCTG
TGTCGGCCGGAACGGCCCGTGGTGCGAGAGTTCGCCGTCAACCTGCTGCACTACCTGGCA
GGAGCCGGAGGCGCGGCGGCGCGGGAGGTTGCCATGCACGCGCCGGCCGTGGCGCAGCTG
GTGGCGTTCATCGAGCGCGCGGAGCAAACCGCGCTGGGCGTCGCCAACCAGCACGGGGTG
GCGGCCCTGCGAGACAACCCGGATGCGATGGGCACCTCACTAGACATGCTGCGGCGCGCC
GCGGCCACGCTGCTGCGGCTGGCGGAGCACCCCGAGAACAGGCCGCTGATCCGCCGCCAC
GAGCGCCGCCTGCTGTCGCTTGTCATGAGCCAGATCCTCGACCAGAAGGTGGCGCACGAG
CTGGCCGACGTGTTGTTCCACTGCAGCCAGGCGGCCGGCCAGGCGGACTGA
Protein sequence:
MCANSNLCIRVILRRIQDGRHHKAACEYSKYRPQHYLVKWDDSYLLAILVGVSVSGHDRA
AVCVTSDRVPFYDLTGQNSNDSGGSGGAGRASTPHLRPTPSPTGSSGSRSMSPAVGTQNL
PMPPRPSSSLSDGSGPTVRAGTGAPAAGPPPSGAPPPGAMLPQPYPHHGPYKTAPYPPQP
YGYPPRNHHPYPYGGYRPTPPPHPSQHYPPLKQAGRHMGPPAGAGEAMPPPTAPGEPHDN
GPAAPATALVTTGPDGAPLDEGSQQSTLSNASAASGEETCGTPKSRKEYGAGSAAPSPSP
GGASHSSVHDEYDASASPWPRPPSSPVFNSHIAPESYRSKKSDSLGKLYEMDDAPERRGW
VERLLAFMDERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVTKNKTWKDIAGLLGIG
ASSSAAYTLRKHYTKNLLAYECHFDRGGIDPQPIISQVEASTKKKSGKANSTSSAGSSNS
QEQFPGGAADGYPSHGAHPAHYAPYPPQPSQPQGGGPGGDNLAASNPFDEPPGPRRPPGY
QQGYGYEYGSPYPTNRPVYPPYGPEGDRGYGGSGEYRYGYGGYRAGAPAAGAPPSQPAPA
QPAPAQPAQPYPDYYRAPHPHAHPHPPHPPHPPHPPHPPHSPQQQHDGGGASAGGPAGAP
RRHPDFAKQEGYAGPGGPGGPAGAARFAGGGWAGGFPRGGPPAPAWRPPAPLPHAPHHPH
QPAWPHHQPHQPHQPYHPPTPGGVAWGAPRPPQELPPAASSPGAAGVGQLKRELTFPAEC
IEATVPAAEKRRRLTKADVAPVDAWRIMMALKSGLLAETCWALDILNILLFDDNCIGYFG
LQHMPGLLDLLLEHFQKSLGDVFDAPATESEPWRPALQVRDPAGVLKRRRLEDYEDECYT
RDEPSLNLVNESRDALARRCIALSNILRGLTFVPGNEAEFSRSGAFLALAGKLLLLHHEH
APRAARARAYERAARDEVDVDSCCSSLRGEGEWWWDTLAQLREDALVCCANIAGSVELGG
QPEAVARPLLDGLLHWSVCPAAVAGDPPPAAAPGSPLSPRRLALEALCKLCVTDANVDLV
LATPPRGRMAALCAGLARDLCRPERPVVREFAVNLLHYLAGAGGAAAREVAMHAPAVAQL
VAFIERAEQTALGVANQHGVAALRDNPDAMGTSLDMLRRAAATLLRLAEHPENRPLIRRH
ERRLLSLVMSQILDQKVAHELADVLFHCSQAAGQAD