New model in OGS2.0 | DPOGS200133  |
---|---|
Genomic Position | scaffold789:- 10592-19346 |
See gene structure | |
CDS Length | 3480 |
Paired RNAseq reads   | 4558 |
Single RNAseq reads   | 10394 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002779 (0.0) |
Best Drosophila hit   | dre4, isoform A (0.0) |
Best Human hit | FACT complex subunit SPT16 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to dre4 CG1828-PA, isoform A [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to dre4 CG1828-PA, isoform A [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0006268 DNA unwinding involved in replication GO:0003677 DNA binding GO:0001672 regulation of chromatin assembly or disassembly GO:0035101 FACT complex |
InterPro families    | IPR000994 Peptidase M24, structural domain IPR013953 FACT complex subunit Spt16p/Cdc68p IPR013719 Domain of unknown function DUF1747, eukaryote |
Orthology group | MCL12140 |
Nucleotide sequence:
ATGTCCAATATATCTTTGGATAAGGAAACGTTTTATAGGCGTATGAAAAGGCTATATGCG
GCGTGGAAGGCTGCTGCTGCTGATTCTAAGAGTGATGATGTTTTGGCCAAATGTGATTGC
TTGGTGTCTTGCGTTGGCGTGGATGAAGATACTCTATACAGCAAGTCAACCGCATTACAG
ACTTGGCTCTTTGGATATGAACTTCCAGACACCATAACAGTTCTGACTGAACAGAGCATG
TGTTTTCTGGCAAGTAAAAAGAAGATTGAATTCCTCCGCCAAATCGAAAATGGTAAAGAA
GAAACAGATCTACCTCCAGTAAAACTTTTAATAAGAGATAGAAATGACCATGATAAAGAA
AACTTTAATAAGCTTATACAAGAAATAAAGAAATCTAAATCCGGCAAGACTCTGGGTGTG
TTTGCCAAAGACAATTATCCAGGGGAGTTCTGCGAGAGTTGGAAATCTGCAATGAAGGCG
GAGAAGTTTGAAAATGTGGATATCAGTTCATCCGTAGCTACATTCATGGCACCAAAAGAA
GATTCAGAAATAATCACCATCAAAAAGGCCTGCCTTGTCACCGTTGATGTTTTCACAAAG
TACTTAAAAGATCAAATTATGGAAATTATTGATTCGGACAAGAAAGTAAAACATTCGAAA
CTAGCGGAAGGTGTGGAAGCTGCTATATCAGATAAAAAATATGTAACCGGTGTAGACACG
AGCCAAGTAGATATGTGCTATCCACCGATCATACAGTCTGGAGGGAATTATAGTCTGAAA
TTCAGTGCTGTGTCAGATAAGAATCACCTACATTTTGGTGCAATAGTATGTTCTCTAGGA
GCCAGATACAAGTCATACTGTTCAAATATTGTCCGCACATTACTTGTCAATCCGACGGAC
AATGTCCAAAGCAATTATAATTTTCTTTTGAATTTGGAAGAAGAGGTCATGAAGCATCTT
GTGTCTGGTGCCAAGCTGTCAGCCGTTTATGAAGCTGGTTTGGCATTGGCAAAGAAAGAA
AAACCTGAATTAGTGGACAACCTCACAAAGACGTTTGGATTTGCAATGGGAATAGAATTT
CGTGAAAGTGCCATAGTTATTGGACCGAAAACCGCAGTTGTTGCAAAGAAAGGCATGGTC
TTTAACATTAATATTGGTTTGGCAAATTTAACCAACTCTGCAGCAACGGATAAAGAAGGA
AAGACTTATGCCCTATTCATTGGTGATACTGTGCTCGTGAATGATGAACAGCCAGCATCG
CTGCTAACACAATCCAAGAAGAAGATTAAAAACATAGGAATATTCCTTAAAGATGACGAT
GAAGAGGAAGAGGAGGAGAAAGAGAATAAAACAGAAATTTTGGGTCGCGGTAAAAGGACG
GCAGTTATTGAGTCGAAGCTTCGGACTGAACATTCTTCAGAGGACAAACGTAAGGAGCAT
CAGAGAGAATTGGCGATAGCTCTCAACGAGAAAGCTAAGGAGAGACTGGCGAAACAGTCG
AGTGGAAAAGAGGGAGAGAAGATAAGGAAGAGTACAGTCTCGTACAAAAGTGTCAGTCAA
ATGCCCAGAGAGAACGAAGTTAAAGAGTTGAAATTATACGTCGATCGTAAATATGAAACA
GTAATATTGCCGATATTCGGCGTGCCGGTACCATTCCATATATCTACAATTAAAAATATA
TCTCAGTCTGTGGAGGGCGACTATACATATTTGAGAATCAATTTCTTCCACCCGGGTGCC
ACTATGGGCAGAAACGAGGGTGGCAACTACGCGCAGCCTGACGCGACCTTCGTTAAAGAA
GTTACATACCGCAGTACAAACACTAAAGAGCCAGGAGAAATTTCACCTCCATCATCAAAC
CTAAACACTGGATTCCGGTTAATAAAGGAAGTTCAGAAGAAGTTCAAAACGCGAGAGGCG
GAGGAGAGGGAGAAGGAGGACTTAGTTAAACAAGATACTCTCGTTTTATCCCAGAACAAA
GGAAATCCCAAACTGAAGGATTTATACATCAGACCTAATATAGTCACAAAGAGAATGAGC
GGGTCTCTAGAAGCGCATTCGAACGGTTTCAGATTCACGTCAGTGAGAGGAGACAAAGTT
GATATTTTATATAACAACATCAAAAACGCATTCTTCCAACCGTGCGATGGAGAGATGATC
ATTCTATTGCATTTCCATCTGAAGCACGCTATTATGTTCGGGAAGAAGAAACATGTCGAC
GTGCAGTTCTATACCGAGGTCGGTGAGATTACTACAGACCTGGGCAAACACCAGCATATG
CACGACCGTGACGACCTCGCCGCCGAGCAGAGCGAACGGGAACTGAGACACAAACTGAAG
ATAGCTTTCAAAAGTTTCTGCGAGCGCGTCGAGAACATGACCAAACAGGAAGTCGAGTTT
GACACGCCGTACAGAGAACTCGGTTTCCCCGGAGCGCCGTTCCGTAGTACTGTCCTCCTA
CAACCGACCTCTGGAGCCCTCGTCAACCTGACCGAGTGGCCGCCCTTCGTCATCTCGCTG
GAGGACGTTGAACTCGTTCACTTCGAAAGAGTACAGTTCCACCTCAAGAACTTCGATATG
GTTTTCGTGTTTAAGGATTACGCCAAGAAAGTCGCCATGGTCAATGCTGTCCCCATGAAC
ATGCTCGATCACGTCAAGGAGTGGCTGAACTCGTGCGATATCCGGTATTCGGAAGGTATC
CAGTCTCTCAACTGGACAAAAGTCATGAAAACCATTACTGATGATATCGAAGGTTTCTTC
GACAACGGCGGCTGGTCTTTCCTGGACCCCGAGTCTGATGCCGAGAACGAGGAACAGCAC
GACGATGAATCTGAAGAGGAGGATGATGCGTATGAACCGACGGATGCTGAGTCGGAAGAG
GAATCCGAAGATGACTCGGAGTACGACTCCGAGGCTTCGGAAATGTCCGACGACTCCGGC
GACAGCGACGGTGGTGAAGAGGACGAAGAATCTGGGAAAGATTGGTCAGATCTTGAACGC
GAGGCCGCCGAAGAGGATAAGAAGGAACGCAATTACGACAGACCGTCGACGGACTTTGAT
CGGAAACGCAAAGGCGGGAGAGACAGACACCGCTATGACGAGGACCAAGGCAGCAAGAAG
AGCAAACACGACAAAAGTTCACACCACAAAAGCTCCAGTTCAAACCACAAAAGCTCGAGC
TCAAACCACAAAAGCTCGAGTTCAAACCACAAAAGCTCAAGTTCAAATCATAAAAGTTCA
AACCACAAGAGCCCGTCAAAGCACAGCAGTGACAGCCCTTCGAAGAGCAATAAGCACAAG
TCCTCCCACGACCGTTCCCGTGACCACAAATCTAATGGCAAGTCGAACGGTGATCACAAG
TCGCACAAGAGATCACGTGACGACAGTCGCGAACACGAACGATCCTCTAAGAAACACAAA
ATAAATTATAACAAAACTCGCAAAGGATTGATTTATCTTACAAAAAAAGTCTATAAATAA
Protein sequence:
MSNISLDKETFYRRMKRLYAAWKAAAADSKSDDVLAKCDCLVSCVGVDEDTLYSKSTALQ
TWLFGYELPDTITVLTEQSMCFLASKKKIEFLRQIENGKEETDLPPVKLLIRDRNDHDKE
NFNKLIQEIKKSKSGKTLGVFAKDNYPGEFCESWKSAMKAEKFENVDISSSVATFMAPKE
DSEIITIKKACLVTVDVFTKYLKDQIMEIIDSDKKVKHSKLAEGVEAAISDKKYVTGVDT
SQVDMCYPPIIQSGGNYSLKFSAVSDKNHLHFGAIVCSLGARYKSYCSNIVRTLLVNPTD
NVQSNYNFLLNLEEEVMKHLVSGAKLSAVYEAGLALAKKEKPELVDNLTKTFGFAMGIEF
RESAIVIGPKTAVVAKKGMVFNINIGLANLTNSAATDKEGKTYALFIGDTVLVNDEQPAS
LLTQSKKKIKNIGIFLKDDDEEEEEEKENKTEILGRGKRTAVIESKLRTEHSSEDKRKEH
QRELAIALNEKAKERLAKQSSGKEGEKIRKSTVSYKSVSQMPRENEVKELKLYVDRKYET
VILPIFGVPVPFHISTIKNISQSVEGDYTYLRINFFHPGATMGRNEGGNYAQPDATFVKE
VTYRSTNTKEPGEISPPSSNLNTGFRLIKEVQKKFKTREAEEREKEDLVKQDTLVLSQNK
GNPKLKDLYIRPNIVTKRMSGSLEAHSNGFRFTSVRGDKVDILYNNIKNAFFQPCDGEMI
ILLHFHLKHAIMFGKKKHVDVQFYTEVGEITTDLGKHQHMHDRDDLAAEQSERELRHKLK
IAFKSFCERVENMTKQEVEFDTPYRELGFPGAPFRSTVLLQPTSGALVNLTEWPPFVISL
EDVELVHFERVQFHLKNFDMVFVFKDYAKKVAMVNAVPMNMLDHVKEWLNSCDIRYSEGI
QSLNWTKVMKTITDDIEGFFDNGGWSFLDPESDAENEEQHDDESEEEDDAYEPTDAESEE
ESEDDSEYDSEASEMSDDSGDSDGGEEDEESGKDWSDLEREAAEEDKKERNYDRPSTDFD
RKRKGGRDRHRYDEDQGSKKSKHDKSSHHKSSSSNHKSSSSNHKSSSSNHKSSSSNHKSS
NHKSPSKHSSDSPSKSNKHKSSHDRSRDHKSNGKSNGDHKSHKRSRDDSREHERSSKKHK
INYNKTRKGLIYLTKKVYK