DPGLEAN16941 in OGS1.0

New model in OGS2.0DPOGS208176 
Genomic Positionscaffold472:- 17576-33228
See gene structure
CDS Length4974
Paired RNAseq reads  4255
Single RNAseq reads  10195
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010263 (5e-67)
Best Drosophila hit  CG5060, isoform B (3e-35)
Best Human hitprotein capicua homolog (2e-33)
Best NR hit (blastp)  capicua protein, putative [Pediculus humanus corporis] (9e-98)
Best NR hit (blastx)  PREDICTED: similar to capicua protein [Acyrthosiphon pisum] (7e-93)
GeneOntology terms


  
GO:0005515 protein binding
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0003677 DNA binding
InterPro families
  
IPR000910 High mobility group, HMG1/HMG2
IPR009071 High mobility group, superfamily
Orthology groupMCL11313

Nucleotide sequence:

ATGCAGCGGCAGGCGGGCGGTGCGTCCTCGGCGCAGAGCCCGCGAGCCGTGTACCGCACG
CACGCGTCGCACATATACGACATTGATGATCAGATACCAGCGTCTGTTATCAGTAGTGTG
GGTACAGTGAGCCTGTCACAATCGACAACAAACCCAACGTCGAACACAACCATGGCCAGT
CGCGGGAATACGATATCTAATAACAACACACCCCAAACACAAGCACCAGTACGGAACCTT
CCCAAGAAGCGTAAATTTGATCCGTCTGAACTCGAGGAGATAGAACGTAATTGCGTCAAC
AGTATCGCTGAGAGGAACAGCCTGAATATACCCACAGCTGTCACGAATTCAATGGATTAC
ACGTCAAGCTATCAGCCAATAGCTCAGCCATCCGTAGTACCGAGATCTTCCCCCCACGAT
ACGAAACAATACATCCAATACCCTAACATAGATCTATCTGAGTGGCGTGATCACCGAGTG
CTGGCTAAACATCGCGGGTTATACCTCCCAGGGGTGATAAGGCAGGCTGACGGCTGTAAG
GTCACCGTGGAATTGGATGGACAGGATATTGAACCGATAGAGTACAGTGATTTGTTCGGT
GTCAATAGATATGACGTCATAAGTGACGCGAGTCCTCAGCTCAGTCATTTACCGGTGGGG
TCCGCCTGCGTGTTCAAGACCACAGACCCCGCCAGAGATGGAGGGCACAACGTGTTCGTC
GAGGGTCAAGTCTTCGAAGTTAATAATTCTCCTATCAGGATTCGTGTTAAGGTTATAGAG
GGCGACACTTGTAAGGAGGTGGTAGAGGTGAAGCGTGCTGATATCCGTCTTCGTCAACCG
CCTTGGGCCGACGAGCTGGAGGACGCCGGGTCACACGCACCCGCTGTCCCTCATATGAGA
CAGCAGTGCGTCTCTTACTCTATGGGTGATCACTTCGCGACGTCGTCTCCTATGCCGGGC
GCGTCTCACGTGTCTGTGGGGGCGCTGTCTGCTGGGTCGCGTCCCTTCGACGACTACGGC
AGCGACGACGACCTGCCTAGCGAGAACATTATGTTCCCCACTGACGCCTCGCATATGGAT
TGTAATAACAGTAAGAGGAGCAGTCTACAGAGCAGAGGCAGCACATCCAGTCTGGTCGAG
GGTAGCCTGACGCCTCGCTCCCAGCCACCTACACCTAGATCTCAAGCCGCCACACCTCAT
AAGTACAAGAAGGGTGACGTGGTGTCCACTCCCACTGGGATAAGGAAAAAGTTTAACGGG
AAACAGTGGCGAAGGCTGTGCTCCAAGAACGGGTGCGGCAAGGAGAGTCAGAGGCGAGGG
TACTGTTCACGACATCTCTCGCAGGGAGGCGTCAACAGGTCATCCAACACGCCGCTAACC
CAGGGATCCACACATACTCCGCAGCAGAGAAGCAGCAGTAAGTCGTTATCATCGAACGGT
ACTGGTGTAGAGGGAGATGATACGTCCCGCGAGTCGGACACCACGCCGCCCAACTACCGC
GTCACCGGCAGGTTCGACCAGGACGAGACAGAGGCCGCCAATATGCTTGTGTCACTCGGT
AGTTCTCGTTCAGGCAGTCCGGGCGCGTCTCCAGTGAGCGGGTCGCCGGTGCTCCGCGGT
AACGTGTTCGTGCCGATATCATCGCCACAGCCTCCGCTCAATAATCCTCCGCACAAGAAC
TACCATCACCACCTTATCAGACCTGAGTCACTACGTCCAGCCATAGTGAGTCCACCGGTG
GGGGGCGTGGCCACTAGTGTCATAAGAGTCTCCCCCGCCCCCACCCATCACTATCAGGTA
GATAATCGCAACGGACAAAATATTCAATCGAGCCAACCGAATATGATGGGACTACAAACG
ACGCCATACAACATACAGAGCAACATGCCATCCAATCTGAACGCGCCCACGACGATGCAA
TCGTCATTAAACTTCCCTACCATTATAAACAATCTGAATCAAAAATTTCAAACGTACGCC
AGTTCAGTGCGACCGACCAAAATAGAGGATTCGTTACACAACGTGGTCGTCCACCGCATG
CCCAGCAACGGCACGGACTCGGACTACAGGAACAAAGCCTACCGCAGGAACGGTATACAG
GAGCAGTTCAGGCGAGACGCGGATATGTCACCGCCTTTGAATAACTATGAAAATTTTCTG
AATAGAAGAGTCTCGGATTACGATGAGGAGGACCACTCGGTCCCTCAGCCAGATAGCGGC
CACTTAGAACTGTCCGAGGCTCGCTTGATAGACGACAAGAGGATCGTCAAGCCGGCGCCG
CTACCTGGCCGGTACATCTCACTGGTCGACGACACCAAGGACACGCTGCGGAAACTGTAC
GTCATACCGCAGAACACCATCGACAAGAAGATAGTACTCATCAAGAACGAACCCACAGAC
ATACAGATAGAACACAAGCCGCAGTCGCAGCAGCTGAACAGCAGCGACCAGGACATGGAG
CATCGCAGCACGGACAACGGGGACACTGGTAATAAGCTCAACAACAGCGCCGTTATAGTA
CATCCAAGTCAACTACTGCCGGTGTTGCCGCCGCCTTCCTCGGCTATTATAGTGTCATCC
AGCGGTGTGCCCAGCGTGTTCTCTTGGCAGTCACTGGTGCCTCTGCTAAGAGCGGCGTCT
CCCCCGGCAGTGCCGCACTCGCCACGCACGCCACACACGCCACACACGCCCCACACGCCG
CACACGCCGCACACGCCACATACACCACACACGCCACACGTCAAGACAGAGGATATCAAG
ACTGAGAATGAGTTATATGTCATAATGTATTATCTTCGGCAGAAGGAGCGTCGCATCCGC
CGGCCGATGAACGCTTTCATGATATTTTCCAAGCGCCACCGCCAGATAGTCCACCAGCTG
CACCCCAACCAGGACAACAGGACCGTCAGCAAGATACTGGGAGAGTGGTGGTACTCGCTC
AAGCCCGACGAGAAGAAGAAATACAACGAACTGGCCAGCGAGGTGAAAGAGGCTCACTTC
AAAGCGCATCCGGAGTGGAAGTGGTGTAATAAGGATCGCCGCAAGTCATCGAGCAGCAGA
GATCCTACGGGCTCTACGCCGCAGAGTCCTCGAACTCCATCCGAGGGGCCAAATCCCATG
ATGGCCAGTGCGGACATGTCTGTGAACTCACAGACATACACACACATCGGCTCGCCGCAG
CTCAGCGACGACGAGCCTATGCAAATTAGTCAAACAGTAGAAGAACCGTCGGCGCCGGCG
CAGAACATCGAGATCGATCTCAAATGTGGCGAGAAAGTGACGGACTCGGACTCCGAGGGG
ATCGACGCGAGAGAGTATCTCACGCATCATGACACGAGGCGGCCAAAACCTATTAAAGCT
AGGGCGGGATCGTCTGATAATCTGTTGGGTATAACAGCGTCCAGCCCGGGAGGCTTCAAG
GTGTTCCAGCCGACGGGAGGAGCGTTTAAATCAACGCATGCTGATAGCGGTGATAACCAT
AGACAATGGACGGCGTTTACATCGGTAAATAAACCGAACATCAATCAGGATCTGAATTCG
CCTCACCCTAACACTCAGAGCCTAACGAACAGCGTTCAGGGTATATCGATAAGCGCTCCG
AATCTGTCGACACAGGCGGCCCTAGACAACGCGATCGCATCGATAATAAGTCCCACCACT
AGTGGTGTGCAAGTTATATCCAGTGGTATATCGATGCCGCATACTATCTCCCAGTCGCAG
GCTCCAACGTCCACCACTACAGCCCTGACGAATACTTTGTTGAAGAGTGTCACATTGGTG
AAACGAAATATTGGAGACAATACTGCGGTTCCAATAACCCTGTCAGTTGATACATCCGGC
AACATAGTTATAAAGGCGAGTCAAGCGAGCGACTCCCCCGCTACCAGCGACTCTCAGCCT
CTACATTACGTACAATTACAGAGACTATATGTGTCATCGGTCAATACTGCAGAATCGGAA
CCAGCTAAGACACCCGTCTCGAACCCTCAAACCGGTCCATCTGTTATAGTGTCACAAAGT
AACAACCACATATCACCCAGTAACGCAACAATGGAACCGATGGAGACCTGGGATACTCCG
ATGTATGAGGCCCGGCCATTCCCTCTTGCACCCACACCAGCGCAATTGGGACGGGCACCA
CTACAGAAGAGACTCAGTAGAGGTACGTCAACTGGTTCGACTGGTAGCAACGAGGCTACG
ATCCCTCGGTCGGAGAGCGGGCCCACCACGCCGTTGGACGTCGGCGAGGTGGGCGTACAC
TCACCCAAGAAAGAAAACCTGCCCAGTCCATCGCTGAAGAAAAGCCTCTTCAAGAAAGGC
AACGAGGATGGAAGGGACAAAGTTCTAGAGACGGTGAACTTCTCAGAGAAGTTCAATACG
TTGCCTCAGTTCAAACCGGAAGCGTGCAGTCCCAGTGCGATGGCGGTGCCGCGCTCACCG
CAGCTCTACCTTAGAAAGAAACACCACAAAATCAGTATGGAGGAGGATCAGACGGTGGTG
ACGCCGCAGATTGAAAACGAAATCATGAATGGTAACGGTATGCCGACACCACACTCATAC
GGAACACCTCACTCTACCACCAAGCTAGTTGGTACCACCTTCTTCGGACCTGACTTCAAT
CCTGAGAATTTTAGAGTGCCATGTTCGGAGGCTTCAGAGGAGATGTCTCCCCGCACACCC
TGTTCGGCTCGCGGCGAGGCTGGTCACCGGCGGGTGTTGGAGCAGAGACGACATCTGGTG
ATGAAGCTGTTCCACGACCACGGCATGTTCCCCTCCACACAGGCCACTACACACTTCCAG
GCTGCTCATGCCGATATCTTCCCCAGCAAGGGCTCCCTGCAGCTGAAGATCCGTGAAGTC
CGTCAGAAACTGATGGCTCAGTCCAACCTCACACCGCACTCCGATCTCAACACTCCCACT
AATGTGAACTCCCCTATAGTATCGTCATTGCTACCGACCTCTACAGCCAGTTAG

Protein sequence:

MQRQAGGASSAQSPRAVYRTHASHIYDIDDQIPASVISSVGTVSLSQSTTNPTSNTTMAS
RGNTISNNNTPQTQAPVRNLPKKRKFDPSELEEIERNCVNSIAERNSLNIPTAVTNSMDY
TSSYQPIAQPSVVPRSSPHDTKQYIQYPNIDLSEWRDHRVLAKHRGLYLPGVIRQADGCK
VTVELDGQDIEPIEYSDLFGVNRYDVISDASPQLSHLPVGSACVFKTTDPARDGGHNVFV
EGQVFEVNNSPIRIRVKVIEGDTCKEVVEVKRADIRLRQPPWADELEDAGSHAPAVPHMR
QQCVSYSMGDHFATSSPMPGASHVSVGALSAGSRPFDDYGSDDDLPSENIMFPTDASHMD
CNNSKRSSLQSRGSTSSLVEGSLTPRSQPPTPRSQAATPHKYKKGDVVSTPTGIRKKFNG
KQWRRLCSKNGCGKESQRRGYCSRHLSQGGVNRSSNTPLTQGSTHTPQQRSSSKSLSSNG
TGVEGDDTSRESDTTPPNYRVTGRFDQDETEAANMLVSLGSSRSGSPGASPVSGSPVLRG
NVFVPISSPQPPLNNPPHKNYHHHLIRPESLRPAIVSPPVGGVATSVIRVSPAPTHHYQV
DNRNGQNIQSSQPNMMGLQTTPYNIQSNMPSNLNAPTTMQSSLNFPTIINNLNQKFQTYA
SSVRPTKIEDSLHNVVVHRMPSNGTDSDYRNKAYRRNGIQEQFRRDADMSPPLNNYENFL
NRRVSDYDEEDHSVPQPDSGHLELSEARLIDDKRIVKPAPLPGRYISLVDDTKDTLRKLY
VIPQNTIDKKIVLIKNEPTDIQIEHKPQSQQLNSSDQDMEHRSTDNGDTGNKLNNSAVIV
HPSQLLPVLPPPSSAIIVSSSGVPSVFSWQSLVPLLRAASPPAVPHSPRTPHTPHTPHTP
HTPHTPHTPHTPHVKTEDIKTENELYVIMYYLRQKERRIRRPMNAFMIFSKRHRQIVHQL
HPNQDNRTVSKILGEWWYSLKPDEKKKYNELASEVKEAHFKAHPEWKWCNKDRRKSSSSR
DPTGSTPQSPRTPSEGPNPMMASADMSVNSQTYTHIGSPQLSDDEPMQISQTVEEPSAPA
QNIEIDLKCGEKVTDSDSEGIDAREYLTHHDTRRPKPIKARAGSSDNLLGITASSPGGFK
VFQPTGGAFKSTHADSGDNHRQWTAFTSVNKPNINQDLNSPHPNTQSLTNSVQGISISAP
NLSTQAALDNAIASIISPTTSGVQVISSGISMPHTISQSQAPTSTTTALTNTLLKSVTLV
KRNIGDNTAVPITLSVDTSGNIVIKASQASDSPATSDSQPLHYVQLQRLYVSSVNTAESE
PAKTPVSNPQTGPSVIVSQSNNHISPSNATMEPMETWDTPMYEARPFPLAPTPAQLGRAP
LQKRLSRGTSTGSTGSNEATIPRSESGPTTPLDVGEVGVHSPKKENLPSPSLKKSLFKKG
NEDGRDKVLETVNFSEKFNTLPQFKPEACSPSAMAVPRSPQLYLRKKHHKISMEEDQTVV
TPQIENEIMNGNGMPTPHSYGTPHSTTKLVGTTFFGPDFNPENFRVPCSEASEEMSPRTP
CSARGEAGHRRVLEQRRHLVMKLFHDHGMFPSTQATTHFQAAHADIFPSKGSLQLKIREV
RQKLMAQSNLTPHSDLNTPTNVNSPIVSSLLPTSTAS