New model in OGS2.0 | DPOGS214550  |
---|---|
Genomic Position | scaffold1254:- 71043-78405 |
See gene structure | |
CDS Length | 2565 |
Paired RNAseq reads   | 848 |
Single RNAseq reads   | 1864 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003278 (0.0) |
Best Drosophila hit   | D12 (7e-31) |
Best Human hit | YEATS domain-containing protein 2 (8e-31) |
Best NR hit (blastp)   | PREDICTED: similar to YEATS domain containing 2 [Apis mellifera] (4e-48) |
Best NR hit (blastx)   | PREDICTED: similar to YEATS domain containing 2 [Apis mellifera] (6e-48) |
GeneOntology terms    | GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0005575 cellular_component GO:0008150 biological_process GO:0003674 molecular_function |
InterPro families   | IPR005033 YEATS |
Orthology group | MCL15968 |
Nucleotide sequence:
ATGGAACATAAAGAAGAGTACCACGATCCAGACTATCCAGAAGCACCGGCGGTTGAAAAA
CCAGACAAGCCAAAAGTCTCTCAGGAAGATAACATTCAAACAATAAAGACTATAATACGC
CGCGAGTTTCAAAACGAATTAGATGTTCGAGAGAGGGAAGTTAATCTGATCGACCAAAGG
ATGTCCCTAGCAAGGCGGTACCTCCACGAGTTGAGGTATGCTGTCGTGAACAGCTACTAC
AACAATCAAAAGCTACAATTATCAGCTACCCAGGTGGAAGACGAGGTTGCAGCACAAACG
GAACCACGAGCTAGATCTGAGGTGTCCTCTATACTCCGTAACACACAGCCCAGGATACAT
CCGTCAGTACAGAAGCTGCTCGGCAAGAAATCTGTTGCCATCGAGGAGATATTCAAATCA
AGAGCACCAAGGAAAACCAGGAGGGACTATGGGGCTATGGTGCAGAAGAGGAATTACACG
ATATCAGCTGATGAGACGAAGTCGCTCCGGCCGGACAAGAATGAGCCCGGCCTGAATGTG
GTGAAGACGGAAAGCAACGAGCACGAGGACAGGTCGGAGGCCAAAGGTCAAGTCCCAAGC
AGCAGCAGGCCAAAGAAGATCCCTCGCCAGATAGACCCGAAGGTGAACAATGTGATCACA
GTGGACGAGGTCACTAGGAACCAAATGAAACACAGATATAGAGTCATTATAGGCAACACG
TCAAAGTACGCGCCCCCGGCGTCCCGCTGTGACCGTTCCACCCACAAATGGTTGTTGTAT
GTCAGAGGAGCGCCCGTAGTGGAAGCCATCACTGTTAGGTTACACCACTCGTACGCGCCT
CACGACACTGTACATATAGACAAGCCTCCATTTCAAGTGTGTCGCCGTGGTTGGGGCGAG
TTCCCAGCGCTGGTTACTCTCCACTTCCTCAAGTCATATCTGAACAGACCGGCAACCATC
ACACACACCATCAAACTAGACAGACAGTACACCGGCCTGCAGACTCTAGGTGCGGAGACA
GTTGTGGATGTATGGTTATACAGCACACCGGATATGATAGAACACCAGCAGAGGGACGAA
GAAGTGAAGGAGATCAAAGAGGAAGTGAAAGAAGAAGTGAGAGAAGAAGAGAACAGAGTC
AGCGGAGACGATAAACAAGACAGCTGGCTGGAGTTCTTTGCAAAAGACACGAGTCAAGTG
AACGTTGATGAGATGTTAGTTAAGAATGAAATAAAAACCGAGACGGTGATGGACAAGCAC
GGTGATGACAATGATGAAGTGAGCGATGAAGTGAAGAACACACAGAACAAGAGGATAATG
AAGTACATAGAGCCGACCACAGGGAAAATATACTATCTGGAAATGGACAGGGCCCTAGAC
CTGACCAAGGTGCAAGAAATAGTAATAAACTCGGAGGGGAATGTGAAGACAGCAAAAATA
AGCCCGCTGAAGACAAACGGCCTGAAGACGACCAAGAACAAGGAGTCCATCTTAAGGTCG
CTATTGAAAACGGAGGATTGTGACGAGTATACTTACGATCACATAGAGAACGATCACTGT
TACCTAGCCAGCGACTGGTACAAGAGGGACCATAGACAAGCTAGAGTGGAGGAGGCCAGA
GACAAGTCGAAGAGTCTAGTCTACAGCAAATACAAAGATATTATATCCAAATTCACGTGT
GTCAAGTCTATGGTCAGTTACCTGTTGAAACACATGCCGTTGGTGAGCGAGGCGGCCGGC
GACGCGGGCTACGTCTCAATGTTCCCGTTCGTTGTCACATCCGACGACAGATACTGGAAG
TTGGACTTCGCTAAACGAAGGAATATGGAGTGGTCACGGGCCAAGTTGATCAACAAGTTA
CTCACAGAGACCTTCAAGGCTGATCCCGGTAAGGTCTGGAGGACGAAACAGATCCTGGTA
TACTCGAGATTACACGGATACTATCCAATAAGGCGCGAGAAGGCGGACCTCAGAACCGAC
GAGTGGTCCTCGTGGAACGATCTGGATGAAGGGAAATCAGAATCGAATATAAGAGAGGTG
TTCCCTAACGAGAGCGACCTGTCCACGTTGAGCGTGTTCAATAAAAGTGATTACGTCACC
GAGGGTGCGGTTGTGGATTTAGATGTGAGCGGTTCCGACGAAGAGATAGAAATAGTCGGT
GATGTGAGCGGCCAGAAGAAGCCTGTGCTGGTGGAGCGGCCTGTGAGTGATGACGTGCTG
CCCGTGGACAGCAGCGACCGGCTCAGGTTCCTGTTCATAGAGAAAGTTTGTGAAGACATC
GGCATCGTATTGAGGAATGAGGACATAGGTCACGGTTACTCTTACAGTTCGGTCCACTCA
GTCTTGTTGTCGGCCACCAAGTGTTTCGCTGAAGAGCTGATCAGGTCGTCTCTCGCCAGA
CAACTCACCTCAGAGCTGGGAGAGGGACGCGTCTGGGTCGGCTGGTCCAGGCCTCGCGTG
TGTCTCCAGCACGTGTTCCTCGCCACCAGCGACTCCAGGTTACAGCTGGTGACGTCATCA
CACCTCGCAGCCGCCGCGCACACACACACACCGCCGCCGCTATAA
Protein sequence:
MEHKEEYHDPDYPEAPAVEKPDKPKVSQEDNIQTIKTIIRREFQNELDVREREVNLIDQR
MSLARRYLHELRYAVVNSYYNNQKLQLSATQVEDEVAAQTEPRARSEVSSILRNTQPRIH
PSVQKLLGKKSVAIEEIFKSRAPRKTRRDYGAMVQKRNYTISADETKSLRPDKNEPGLNV
VKTESNEHEDRSEAKGQVPSSSRPKKIPRQIDPKVNNVITVDEVTRNQMKHRYRVIIGNT
SKYAPPASRCDRSTHKWLLYVRGAPVVEAITVRLHHSYAPHDTVHIDKPPFQVCRRGWGE
FPALVTLHFLKSYLNRPATITHTIKLDRQYTGLQTLGAETVVDVWLYSTPDMIEHQQRDE
EVKEIKEEVKEEVREEENRVSGDDKQDSWLEFFAKDTSQVNVDEMLVKNEIKTETVMDKH
GDDNDEVSDEVKNTQNKRIMKYIEPTTGKIYYLEMDRALDLTKVQEIVINSEGNVKTAKI
SPLKTNGLKTTKNKESILRSLLKTEDCDEYTYDHIENDHCYLASDWYKRDHRQARVEEAR
DKSKSLVYSKYKDIISKFTCVKSMVSYLLKHMPLVSEAAGDAGYVSMFPFVVTSDDRYWK
LDFAKRRNMEWSRAKLINKLLTETFKADPGKVWRTKQILVYSRLHGYYPIRREKADLRTD
EWSSWNDLDEGKSESNIREVFPNESDLSTLSVFNKSDYVTEGAVVDLDVSGSDEEIEIVG
DVSGQKKPVLVERPVSDDVLPVDSSDRLRFLFIEKVCEDIGIVLRNEDIGHGYSYSSVHS
VLLSATKCFAEELIRSSLARQLTSELGEGRVWVGWSRPRVCLQHVFLATSDSRLQLVTSS
HLAAAAHTHTPPPL