New model in OGS2.0 | DPOGS208176  |
---|---|
Genomic Position | scaffold472:- 17576-33228 |
See gene structure | |
CDS Length | 4974 |
Paired RNAseq reads   | 4255 |
Single RNAseq reads   | 10195 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010263 (5e-67) |
Best Drosophila hit   | CG5060, isoform B (3e-35) |
Best Human hit | protein capicua homolog (2e-33) |
Best NR hit (blastp)   | capicua protein, putative [Pediculus humanus corporis] (9e-98) |
Best NR hit (blastx)   | PREDICTED: similar to capicua protein [Acyrthosiphon pisum] (7e-93) |
GeneOntology terms    | GO:0005515 protein binding GO:0005634 nucleus GO:0045449 regulation of transcription GO:0003677 DNA binding |
InterPro families    | IPR000910 High mobility group, HMG1/HMG2 IPR009071 High mobility group, superfamily |
Orthology group | MCL11313 |
Nucleotide sequence:
ATGCAGCGGCAGGCGGGCGGTGCGTCCTCGGCGCAGAGCCCGCGAGCCGTGTACCGCACG
CACGCGTCGCACATATACGACATTGATGATCAGATACCAGCGTCTGTTATCAGTAGTGTG
GGTACAGTGAGCCTGTCACAATCGACAACAAACCCAACGTCGAACACAACCATGGCCAGT
CGCGGGAATACGATATCTAATAACAACACACCCCAAACACAAGCACCAGTACGGAACCTT
CCCAAGAAGCGTAAATTTGATCCGTCTGAACTCGAGGAGATAGAACGTAATTGCGTCAAC
AGTATCGCTGAGAGGAACAGCCTGAATATACCCACAGCTGTCACGAATTCAATGGATTAC
ACGTCAAGCTATCAGCCAATAGCTCAGCCATCCGTAGTACCGAGATCTTCCCCCCACGAT
ACGAAACAATACATCCAATACCCTAACATAGATCTATCTGAGTGGCGTGATCACCGAGTG
CTGGCTAAACATCGCGGGTTATACCTCCCAGGGGTGATAAGGCAGGCTGACGGCTGTAAG
GTCACCGTGGAATTGGATGGACAGGATATTGAACCGATAGAGTACAGTGATTTGTTCGGT
GTCAATAGATATGACGTCATAAGTGACGCGAGTCCTCAGCTCAGTCATTTACCGGTGGGG
TCCGCCTGCGTGTTCAAGACCACAGACCCCGCCAGAGATGGAGGGCACAACGTGTTCGTC
GAGGGTCAAGTCTTCGAAGTTAATAATTCTCCTATCAGGATTCGTGTTAAGGTTATAGAG
GGCGACACTTGTAAGGAGGTGGTAGAGGTGAAGCGTGCTGATATCCGTCTTCGTCAACCG
CCTTGGGCCGACGAGCTGGAGGACGCCGGGTCACACGCACCCGCTGTCCCTCATATGAGA
CAGCAGTGCGTCTCTTACTCTATGGGTGATCACTTCGCGACGTCGTCTCCTATGCCGGGC
GCGTCTCACGTGTCTGTGGGGGCGCTGTCTGCTGGGTCGCGTCCCTTCGACGACTACGGC
AGCGACGACGACCTGCCTAGCGAGAACATTATGTTCCCCACTGACGCCTCGCATATGGAT
TGTAATAACAGTAAGAGGAGCAGTCTACAGAGCAGAGGCAGCACATCCAGTCTGGTCGAG
GGTAGCCTGACGCCTCGCTCCCAGCCACCTACACCTAGATCTCAAGCCGCCACACCTCAT
AAGTACAAGAAGGGTGACGTGGTGTCCACTCCCACTGGGATAAGGAAAAAGTTTAACGGG
AAACAGTGGCGAAGGCTGTGCTCCAAGAACGGGTGCGGCAAGGAGAGTCAGAGGCGAGGG
TACTGTTCACGACATCTCTCGCAGGGAGGCGTCAACAGGTCATCCAACACGCCGCTAACC
CAGGGATCCACACATACTCCGCAGCAGAGAAGCAGCAGTAAGTCGTTATCATCGAACGGT
ACTGGTGTAGAGGGAGATGATACGTCCCGCGAGTCGGACACCACGCCGCCCAACTACCGC
GTCACCGGCAGGTTCGACCAGGACGAGACAGAGGCCGCCAATATGCTTGTGTCACTCGGT
AGTTCTCGTTCAGGCAGTCCGGGCGCGTCTCCAGTGAGCGGGTCGCCGGTGCTCCGCGGT
AACGTGTTCGTGCCGATATCATCGCCACAGCCTCCGCTCAATAATCCTCCGCACAAGAAC
TACCATCACCACCTTATCAGACCTGAGTCACTACGTCCAGCCATAGTGAGTCCACCGGTG
GGGGGCGTGGCCACTAGTGTCATAAGAGTCTCCCCCGCCCCCACCCATCACTATCAGGTA
GATAATCGCAACGGACAAAATATTCAATCGAGCCAACCGAATATGATGGGACTACAAACG
ACGCCATACAACATACAGAGCAACATGCCATCCAATCTGAACGCGCCCACGACGATGCAA
TCGTCATTAAACTTCCCTACCATTATAAACAATCTGAATCAAAAATTTCAAACGTACGCC
AGTTCAGTGCGACCGACCAAAATAGAGGATTCGTTACACAACGTGGTCGTCCACCGCATG
CCCAGCAACGGCACGGACTCGGACTACAGGAACAAAGCCTACCGCAGGAACGGTATACAG
GAGCAGTTCAGGCGAGACGCGGATATGTCACCGCCTTTGAATAACTATGAAAATTTTCTG
AATAGAAGAGTCTCGGATTACGATGAGGAGGACCACTCGGTCCCTCAGCCAGATAGCGGC
CACTTAGAACTGTCCGAGGCTCGCTTGATAGACGACAAGAGGATCGTCAAGCCGGCGCCG
CTACCTGGCCGGTACATCTCACTGGTCGACGACACCAAGGACACGCTGCGGAAACTGTAC
GTCATACCGCAGAACACCATCGACAAGAAGATAGTACTCATCAAGAACGAACCCACAGAC
ATACAGATAGAACACAAGCCGCAGTCGCAGCAGCTGAACAGCAGCGACCAGGACATGGAG
CATCGCAGCACGGACAACGGGGACACTGGTAATAAGCTCAACAACAGCGCCGTTATAGTA
CATCCAAGTCAACTACTGCCGGTGTTGCCGCCGCCTTCCTCGGCTATTATAGTGTCATCC
AGCGGTGTGCCCAGCGTGTTCTCTTGGCAGTCACTGGTGCCTCTGCTAAGAGCGGCGTCT
CCCCCGGCAGTGCCGCACTCGCCACGCACGCCACACACGCCACACACGCCCCACACGCCG
CACACGCCGCACACGCCACATACACCACACACGCCACACGTCAAGACAGAGGATATCAAG
ACTGAGAATGAGTTATATGTCATAATGTATTATCTTCGGCAGAAGGAGCGTCGCATCCGC
CGGCCGATGAACGCTTTCATGATATTTTCCAAGCGCCACCGCCAGATAGTCCACCAGCTG
CACCCCAACCAGGACAACAGGACCGTCAGCAAGATACTGGGAGAGTGGTGGTACTCGCTC
AAGCCCGACGAGAAGAAGAAATACAACGAACTGGCCAGCGAGGTGAAAGAGGCTCACTTC
AAAGCGCATCCGGAGTGGAAGTGGTGTAATAAGGATCGCCGCAAGTCATCGAGCAGCAGA
GATCCTACGGGCTCTACGCCGCAGAGTCCTCGAACTCCATCCGAGGGGCCAAATCCCATG
ATGGCCAGTGCGGACATGTCTGTGAACTCACAGACATACACACACATCGGCTCGCCGCAG
CTCAGCGACGACGAGCCTATGCAAATTAGTCAAACAGTAGAAGAACCGTCGGCGCCGGCG
CAGAACATCGAGATCGATCTCAAATGTGGCGAGAAAGTGACGGACTCGGACTCCGAGGGG
ATCGACGCGAGAGAGTATCTCACGCATCATGACACGAGGCGGCCAAAACCTATTAAAGCT
AGGGCGGGATCGTCTGATAATCTGTTGGGTATAACAGCGTCCAGCCCGGGAGGCTTCAAG
GTGTTCCAGCCGACGGGAGGAGCGTTTAAATCAACGCATGCTGATAGCGGTGATAACCAT
AGACAATGGACGGCGTTTACATCGGTAAATAAACCGAACATCAATCAGGATCTGAATTCG
CCTCACCCTAACACTCAGAGCCTAACGAACAGCGTTCAGGGTATATCGATAAGCGCTCCG
AATCTGTCGACACAGGCGGCCCTAGACAACGCGATCGCATCGATAATAAGTCCCACCACT
AGTGGTGTGCAAGTTATATCCAGTGGTATATCGATGCCGCATACTATCTCCCAGTCGCAG
GCTCCAACGTCCACCACTACAGCCCTGACGAATACTTTGTTGAAGAGTGTCACATTGGTG
AAACGAAATATTGGAGACAATACTGCGGTTCCAATAACCCTGTCAGTTGATACATCCGGC
AACATAGTTATAAAGGCGAGTCAAGCGAGCGACTCCCCCGCTACCAGCGACTCTCAGCCT
CTACATTACGTACAATTACAGAGACTATATGTGTCATCGGTCAATACTGCAGAATCGGAA
CCAGCTAAGACACCCGTCTCGAACCCTCAAACCGGTCCATCTGTTATAGTGTCACAAAGT
AACAACCACATATCACCCAGTAACGCAACAATGGAACCGATGGAGACCTGGGATACTCCG
ATGTATGAGGCCCGGCCATTCCCTCTTGCACCCACACCAGCGCAATTGGGACGGGCACCA
CTACAGAAGAGACTCAGTAGAGGTACGTCAACTGGTTCGACTGGTAGCAACGAGGCTACG
ATCCCTCGGTCGGAGAGCGGGCCCACCACGCCGTTGGACGTCGGCGAGGTGGGCGTACAC
TCACCCAAGAAAGAAAACCTGCCCAGTCCATCGCTGAAGAAAAGCCTCTTCAAGAAAGGC
AACGAGGATGGAAGGGACAAAGTTCTAGAGACGGTGAACTTCTCAGAGAAGTTCAATACG
TTGCCTCAGTTCAAACCGGAAGCGTGCAGTCCCAGTGCGATGGCGGTGCCGCGCTCACCG
CAGCTCTACCTTAGAAAGAAACACCACAAAATCAGTATGGAGGAGGATCAGACGGTGGTG
ACGCCGCAGATTGAAAACGAAATCATGAATGGTAACGGTATGCCGACACCACACTCATAC
GGAACACCTCACTCTACCACCAAGCTAGTTGGTACCACCTTCTTCGGACCTGACTTCAAT
CCTGAGAATTTTAGAGTGCCATGTTCGGAGGCTTCAGAGGAGATGTCTCCCCGCACACCC
TGTTCGGCTCGCGGCGAGGCTGGTCACCGGCGGGTGTTGGAGCAGAGACGACATCTGGTG
ATGAAGCTGTTCCACGACCACGGCATGTTCCCCTCCACACAGGCCACTACACACTTCCAG
GCTGCTCATGCCGATATCTTCCCCAGCAAGGGCTCCCTGCAGCTGAAGATCCGTGAAGTC
CGTCAGAAACTGATGGCTCAGTCCAACCTCACACCGCACTCCGATCTCAACACTCCCACT
AATGTGAACTCCCCTATAGTATCGTCATTGCTACCGACCTCTACAGCCAGTTAG
Protein sequence:
MQRQAGGASSAQSPRAVYRTHASHIYDIDDQIPASVISSVGTVSLSQSTTNPTSNTTMAS
RGNTISNNNTPQTQAPVRNLPKKRKFDPSELEEIERNCVNSIAERNSLNIPTAVTNSMDY
TSSYQPIAQPSVVPRSSPHDTKQYIQYPNIDLSEWRDHRVLAKHRGLYLPGVIRQADGCK
VTVELDGQDIEPIEYSDLFGVNRYDVISDASPQLSHLPVGSACVFKTTDPARDGGHNVFV
EGQVFEVNNSPIRIRVKVIEGDTCKEVVEVKRADIRLRQPPWADELEDAGSHAPAVPHMR
QQCVSYSMGDHFATSSPMPGASHVSVGALSAGSRPFDDYGSDDDLPSENIMFPTDASHMD
CNNSKRSSLQSRGSTSSLVEGSLTPRSQPPTPRSQAATPHKYKKGDVVSTPTGIRKKFNG
KQWRRLCSKNGCGKESQRRGYCSRHLSQGGVNRSSNTPLTQGSTHTPQQRSSSKSLSSNG
TGVEGDDTSRESDTTPPNYRVTGRFDQDETEAANMLVSLGSSRSGSPGASPVSGSPVLRG
NVFVPISSPQPPLNNPPHKNYHHHLIRPESLRPAIVSPPVGGVATSVIRVSPAPTHHYQV
DNRNGQNIQSSQPNMMGLQTTPYNIQSNMPSNLNAPTTMQSSLNFPTIINNLNQKFQTYA
SSVRPTKIEDSLHNVVVHRMPSNGTDSDYRNKAYRRNGIQEQFRRDADMSPPLNNYENFL
NRRVSDYDEEDHSVPQPDSGHLELSEARLIDDKRIVKPAPLPGRYISLVDDTKDTLRKLY
VIPQNTIDKKIVLIKNEPTDIQIEHKPQSQQLNSSDQDMEHRSTDNGDTGNKLNNSAVIV
HPSQLLPVLPPPSSAIIVSSSGVPSVFSWQSLVPLLRAASPPAVPHSPRTPHTPHTPHTP
HTPHTPHTPHTPHVKTEDIKTENELYVIMYYLRQKERRIRRPMNAFMIFSKRHRQIVHQL
HPNQDNRTVSKILGEWWYSLKPDEKKKYNELASEVKEAHFKAHPEWKWCNKDRRKSSSSR
DPTGSTPQSPRTPSEGPNPMMASADMSVNSQTYTHIGSPQLSDDEPMQISQTVEEPSAPA
QNIEIDLKCGEKVTDSDSEGIDAREYLTHHDTRRPKPIKARAGSSDNLLGITASSPGGFK
VFQPTGGAFKSTHADSGDNHRQWTAFTSVNKPNINQDLNSPHPNTQSLTNSVQGISISAP
NLSTQAALDNAIASIISPTTSGVQVISSGISMPHTISQSQAPTSTTTALTNTLLKSVTLV
KRNIGDNTAVPITLSVDTSGNIVIKASQASDSPATSDSQPLHYVQLQRLYVSSVNTAESE
PAKTPVSNPQTGPSVIVSQSNNHISPSNATMEPMETWDTPMYEARPFPLAPTPAQLGRAP
LQKRLSRGTSTGSTGSNEATIPRSESGPTTPLDVGEVGVHSPKKENLPSPSLKKSLFKKG
NEDGRDKVLETVNFSEKFNTLPQFKPEACSPSAMAVPRSPQLYLRKKHHKISMEEDQTVV
TPQIENEIMNGNGMPTPHSYGTPHSTTKLVGTTFFGPDFNPENFRVPCSEASEEMSPRTP
CSARGEAGHRRVLEQRRHLVMKLFHDHGMFPSTQATTHFQAAHADIFPSKGSLQLKIREV
RQKLMAQSNLTPHSDLNTPTNVNSPIVSSLLPTSTAS