DPGLEAN21773 in OGS1.0

New model in OGS2.0DPOGS200133 
Genomic Positionscaffold789:- 10592-19346
See gene structure
CDS Length3480
Paired RNAseq reads  4558
Single RNAseq reads  10394
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002779 (0.0)
Best Drosophila hit  dre4, isoform A (0.0)
Best Human hitFACT complex subunit SPT16 (0.0)
Best NR hit (blastp)  PREDICTED: similar to dre4 CG1828-PA, isoform A [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to dre4 CG1828-PA, isoform A [Apis mellifera] (0.0)
GeneOntology terms


  
GO:0006268 DNA unwinding involved in replication
GO:0003677 DNA binding
GO:0001672 regulation of chromatin assembly or disassembly
GO:0035101 FACT complex
InterPro families

  
IPR000994 Peptidase M24, structural domain
IPR013953 FACT complex subunit Spt16p/Cdc68p
IPR013719 Domain of unknown function DUF1747, eukaryote
Orthology groupMCL12140

Nucleotide sequence:

ATGTCCAATATATCTTTGGATAAGGAAACGTTTTATAGGCGTATGAAAAGGCTATATGCG
GCGTGGAAGGCTGCTGCTGCTGATTCTAAGAGTGATGATGTTTTGGCCAAATGTGATTGC
TTGGTGTCTTGCGTTGGCGTGGATGAAGATACTCTATACAGCAAGTCAACCGCATTACAG
ACTTGGCTCTTTGGATATGAACTTCCAGACACCATAACAGTTCTGACTGAACAGAGCATG
TGTTTTCTGGCAAGTAAAAAGAAGATTGAATTCCTCCGCCAAATCGAAAATGGTAAAGAA
GAAACAGATCTACCTCCAGTAAAACTTTTAATAAGAGATAGAAATGACCATGATAAAGAA
AACTTTAATAAGCTTATACAAGAAATAAAGAAATCTAAATCCGGCAAGACTCTGGGTGTG
TTTGCCAAAGACAATTATCCAGGGGAGTTCTGCGAGAGTTGGAAATCTGCAATGAAGGCG
GAGAAGTTTGAAAATGTGGATATCAGTTCATCCGTAGCTACATTCATGGCACCAAAAGAA
GATTCAGAAATAATCACCATCAAAAAGGCCTGCCTTGTCACCGTTGATGTTTTCACAAAG
TACTTAAAAGATCAAATTATGGAAATTATTGATTCGGACAAGAAAGTAAAACATTCGAAA
CTAGCGGAAGGTGTGGAAGCTGCTATATCAGATAAAAAATATGTAACCGGTGTAGACACG
AGCCAAGTAGATATGTGCTATCCACCGATCATACAGTCTGGAGGGAATTATAGTCTGAAA
TTCAGTGCTGTGTCAGATAAGAATCACCTACATTTTGGTGCAATAGTATGTTCTCTAGGA
GCCAGATACAAGTCATACTGTTCAAATATTGTCCGCACATTACTTGTCAATCCGACGGAC
AATGTCCAAAGCAATTATAATTTTCTTTTGAATTTGGAAGAAGAGGTCATGAAGCATCTT
GTGTCTGGTGCCAAGCTGTCAGCCGTTTATGAAGCTGGTTTGGCATTGGCAAAGAAAGAA
AAACCTGAATTAGTGGACAACCTCACAAAGACGTTTGGATTTGCAATGGGAATAGAATTT
CGTGAAAGTGCCATAGTTATTGGACCGAAAACCGCAGTTGTTGCAAAGAAAGGCATGGTC
TTTAACATTAATATTGGTTTGGCAAATTTAACCAACTCTGCAGCAACGGATAAAGAAGGA
AAGACTTATGCCCTATTCATTGGTGATACTGTGCTCGTGAATGATGAACAGCCAGCATCG
CTGCTAACACAATCCAAGAAGAAGATTAAAAACATAGGAATATTCCTTAAAGATGACGAT
GAAGAGGAAGAGGAGGAGAAAGAGAATAAAACAGAAATTTTGGGTCGCGGTAAAAGGACG
GCAGTTATTGAGTCGAAGCTTCGGACTGAACATTCTTCAGAGGACAAACGTAAGGAGCAT
CAGAGAGAATTGGCGATAGCTCTCAACGAGAAAGCTAAGGAGAGACTGGCGAAACAGTCG
AGTGGAAAAGAGGGAGAGAAGATAAGGAAGAGTACAGTCTCGTACAAAAGTGTCAGTCAA
ATGCCCAGAGAGAACGAAGTTAAAGAGTTGAAATTATACGTCGATCGTAAATATGAAACA
GTAATATTGCCGATATTCGGCGTGCCGGTACCATTCCATATATCTACAATTAAAAATATA
TCTCAGTCTGTGGAGGGCGACTATACATATTTGAGAATCAATTTCTTCCACCCGGGTGCC
ACTATGGGCAGAAACGAGGGTGGCAACTACGCGCAGCCTGACGCGACCTTCGTTAAAGAA
GTTACATACCGCAGTACAAACACTAAAGAGCCAGGAGAAATTTCACCTCCATCATCAAAC
CTAAACACTGGATTCCGGTTAATAAAGGAAGTTCAGAAGAAGTTCAAAACGCGAGAGGCG
GAGGAGAGGGAGAAGGAGGACTTAGTTAAACAAGATACTCTCGTTTTATCCCAGAACAAA
GGAAATCCCAAACTGAAGGATTTATACATCAGACCTAATATAGTCACAAAGAGAATGAGC
GGGTCTCTAGAAGCGCATTCGAACGGTTTCAGATTCACGTCAGTGAGAGGAGACAAAGTT
GATATTTTATATAACAACATCAAAAACGCATTCTTCCAACCGTGCGATGGAGAGATGATC
ATTCTATTGCATTTCCATCTGAAGCACGCTATTATGTTCGGGAAGAAGAAACATGTCGAC
GTGCAGTTCTATACCGAGGTCGGTGAGATTACTACAGACCTGGGCAAACACCAGCATATG
CACGACCGTGACGACCTCGCCGCCGAGCAGAGCGAACGGGAACTGAGACACAAACTGAAG
ATAGCTTTCAAAAGTTTCTGCGAGCGCGTCGAGAACATGACCAAACAGGAAGTCGAGTTT
GACACGCCGTACAGAGAACTCGGTTTCCCCGGAGCGCCGTTCCGTAGTACTGTCCTCCTA
CAACCGACCTCTGGAGCCCTCGTCAACCTGACCGAGTGGCCGCCCTTCGTCATCTCGCTG
GAGGACGTTGAACTCGTTCACTTCGAAAGAGTACAGTTCCACCTCAAGAACTTCGATATG
GTTTTCGTGTTTAAGGATTACGCCAAGAAAGTCGCCATGGTCAATGCTGTCCCCATGAAC
ATGCTCGATCACGTCAAGGAGTGGCTGAACTCGTGCGATATCCGGTATTCGGAAGGTATC
CAGTCTCTCAACTGGACAAAAGTCATGAAAACCATTACTGATGATATCGAAGGTTTCTTC
GACAACGGCGGCTGGTCTTTCCTGGACCCCGAGTCTGATGCCGAGAACGAGGAACAGCAC
GACGATGAATCTGAAGAGGAGGATGATGCGTATGAACCGACGGATGCTGAGTCGGAAGAG
GAATCCGAAGATGACTCGGAGTACGACTCCGAGGCTTCGGAAATGTCCGACGACTCCGGC
GACAGCGACGGTGGTGAAGAGGACGAAGAATCTGGGAAAGATTGGTCAGATCTTGAACGC
GAGGCCGCCGAAGAGGATAAGAAGGAACGCAATTACGACAGACCGTCGACGGACTTTGAT
CGGAAACGCAAAGGCGGGAGAGACAGACACCGCTATGACGAGGACCAAGGCAGCAAGAAG
AGCAAACACGACAAAAGTTCACACCACAAAAGCTCCAGTTCAAACCACAAAAGCTCGAGC
TCAAACCACAAAAGCTCGAGTTCAAACCACAAAAGCTCAAGTTCAAATCATAAAAGTTCA
AACCACAAGAGCCCGTCAAAGCACAGCAGTGACAGCCCTTCGAAGAGCAATAAGCACAAG
TCCTCCCACGACCGTTCCCGTGACCACAAATCTAATGGCAAGTCGAACGGTGATCACAAG
TCGCACAAGAGATCACGTGACGACAGTCGCGAACACGAACGATCCTCTAAGAAACACAAA
ATAAATTATAACAAAACTCGCAAAGGATTGATTTATCTTACAAAAAAAGTCTATAAATAA

Protein sequence:

MSNISLDKETFYRRMKRLYAAWKAAAADSKSDDVLAKCDCLVSCVGVDEDTLYSKSTALQ
TWLFGYELPDTITVLTEQSMCFLASKKKIEFLRQIENGKEETDLPPVKLLIRDRNDHDKE
NFNKLIQEIKKSKSGKTLGVFAKDNYPGEFCESWKSAMKAEKFENVDISSSVATFMAPKE
DSEIITIKKACLVTVDVFTKYLKDQIMEIIDSDKKVKHSKLAEGVEAAISDKKYVTGVDT
SQVDMCYPPIIQSGGNYSLKFSAVSDKNHLHFGAIVCSLGARYKSYCSNIVRTLLVNPTD
NVQSNYNFLLNLEEEVMKHLVSGAKLSAVYEAGLALAKKEKPELVDNLTKTFGFAMGIEF
RESAIVIGPKTAVVAKKGMVFNINIGLANLTNSAATDKEGKTYALFIGDTVLVNDEQPAS
LLTQSKKKIKNIGIFLKDDDEEEEEEKENKTEILGRGKRTAVIESKLRTEHSSEDKRKEH
QRELAIALNEKAKERLAKQSSGKEGEKIRKSTVSYKSVSQMPRENEVKELKLYVDRKYET
VILPIFGVPVPFHISTIKNISQSVEGDYTYLRINFFHPGATMGRNEGGNYAQPDATFVKE
VTYRSTNTKEPGEISPPSSNLNTGFRLIKEVQKKFKTREAEEREKEDLVKQDTLVLSQNK
GNPKLKDLYIRPNIVTKRMSGSLEAHSNGFRFTSVRGDKVDILYNNIKNAFFQPCDGEMI
ILLHFHLKHAIMFGKKKHVDVQFYTEVGEITTDLGKHQHMHDRDDLAAEQSERELRHKLK
IAFKSFCERVENMTKQEVEFDTPYRELGFPGAPFRSTVLLQPTSGALVNLTEWPPFVISL
EDVELVHFERVQFHLKNFDMVFVFKDYAKKVAMVNAVPMNMLDHVKEWLNSCDIRYSEGI
QSLNWTKVMKTITDDIEGFFDNGGWSFLDPESDAENEEQHDDESEEEDDAYEPTDAESEE
ESEDDSEYDSEASEMSDDSGDSDGGEEDEESGKDWSDLEREAAEEDKKERNYDRPSTDFD
RKRKGGRDRHRYDEDQGSKKSKHDKSSHHKSSSSNHKSSSSNHKSSSSNHKSSSSNHKSS
NHKSPSKHSSDSPSKSNKHKSSHDRSRDHKSNGKSNGDHKSHKRSRDDSREHERSSKKHK
INYNKTRKGLIYLTKKVYK