New model in OGS2.0 | DPOGS203122  |
---|---|
Genomic Position | scaffold299:+ 32960-65043 |
See gene structure | |
CDS Length | 2046 |
Paired RNAseq reads   | 1590 |
Single RNAseq reads   | 5704 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001447 (3e-35) |
Best Drosophila hit   | Su(Tpl), isoform B (1e-35) |
Best Human hit | RNA polymerase II elongation factor ELL (2e-15) |
Best NR hit (blastp)   | AGAP004795-PA [Anopheles gambiae str. PEST] (3e-38) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC002072 [Tribolium castaneum] (1e-59) |
GeneOntology terms    | GO:0016944 RNA polymerase II transcription elongation factor activity GO:0003711 transcription elongation regulator activity GO:0006368 RNA elongation from RNA polymerase II promoter GO:0008023 transcription elongation factor complex GO:0048190 wing disc dorsal/ventral pattern formation GO:0005700 polytene chromosome GO:0043620 regulation of transcription in response to stress |
InterPro families    | IPR019464 RNA polymerase II elongation factor ELL IPR010844 Occludin/RNA polymerase II elongation factor, ELL domain |
Orthology group | MCL14654 |
Nucleotide sequence:
ATGGCGGCCTTGCCAGCAGGTGTTCAGTATGCGTTATCATCGGAGGCTAGTTATAAAGAA
AACAAGGAACTAGTGTTTGTGAAATTAACGGATTCAGCACTAAAAGCTATAGAAGACTTC
ATTAGAAATAATAGGGACAAATTGGCAAAACCTAAAATAGAGTTCCTACCCGGAAATCAA
GGGAAAATTTCCATTCCGACTCCAAGTAGTAGCAATGGGACATCGTCGGAGTCAACGTTT
CGTTTCAGCATCAACAGCAATGCGGAAATGGAGGGTCCACAGGGTTCATTCGAGTGCGTT
AGGAGCGGTGGCGCCAAACGCCTGGAATCGTGCGGGCCGCTGCCGAGACGGATGCGAGTA
CAGGCCAACGACGACTCATACGAGGCCACCAAGGACCGTATGTCCAGGACCATAGCGGCC
GAACAGAGCAAATGCACCCGCGTCATTAAACCAAATCAAACGGATATCGGGAGACGCGTC
AAGGTACGCTCCTGTGGGTTCTCGAATGGACCAGCGGCTCAGCTCGCCGCGCGGTTAGAG
AGGGCTGAGCGTCCTGAGCGTCCTGAGCGTCCTGAGCGTGCTGAGCGTCCTGAGCGACGA
CTCGCTGGACTTGCGGGACTCGCTGGACTTGCGGGACTCGCACCCGAGCGACCTGAACGC
CCTGAGCGACAGGATCGTCCCGAACGTCAGGAATGGCCCGAAAGACCCGAGCGACAGGAA
CGTCCCGAGAGGCCCGAGAGGCCTGAACGGCAAGAGAGGCCGGAGAGAGTAGAAAGGGTT
GAGAGACCGGCGCCGGCGCCCCGGCCGCAGCCCCCCGCCCCGGTCGCCGCGCCCCCGCAC
ACACAGCCGCAGCGACCACCCCCCAATCCGGACCTCACCAGGCGGCCTCTTAAAGAAAGA
CTTATACAATTATTAGCACTGAAACCCTTCAAGAAGCCAGAACTATACGCAAGACTCATA
AGCGAGGGTATCAAAGAAAAGGAACGCAGTATGGTTAACAAGATACTGCCAGAGATCGGA
ACGCTCAAGGACAATTGTTACCACTTACGTAGACATATATGGAACGATGTTAACGAAGAT
TGGCCTTTCTACACCGAAGAAGAGAAACATATGCTGAAGAGGCGGAAACCTCAAAATCTA
ACACCGCCTTTGAGCAGCGACTCCGCCAATTCTCTGTCACCTCGCGCGTCCCCCGGCAAG
CGGCCGTCGGTCCCAGGAGACGAGCTGCCGGCGAAGAAACAGCGCATCTCTCACTACAGA
CGACCTTCACCACCCTCCTCAGGGTACGCCACCACCTCCTCCGGCGAGCGACACGCCTCT
GACAATGAGGACGACCGCACCGCGAGCGAGGAGGCGCCTGTAAAGAAGCAGCGCGTGTCT
CAATCAAGACGGTCTTCACCACCCTCCTCAGGGTACGCGACCACCTCCTCCGGCGAGCGA
CAGGCCTCTGACAATGAGGACGAACGGAACGTTAAAAAGGACAATGGGTATACTCTCAAC
TTCACAACCGTTAAAGATCTCTGTCCAAGTCCTGTTAAAACGAATGGTTTCAGTAGAAGT
AGTCCTCCTGTAGAGCAAACGAGTATCACAGTAAAAGACATCACAACGGAACCATTAGAA
AATACAGCTTTAACGTCCGTGCCTGAAGAAAATAATACGACTGATCTGGTAGACATTGAA
AGGCAATATCCTCCAATAACAAGTTCCAGTACCCGTCGCGCGTACAAGAACGAGTTTGCG
AATCTGTACACGGAGTATCAATCGTTGTACGGTCGCGTGGCACAGGTGGCAGCACTGTTC
ACACAGTTAGAACAACAACTCAAAAGAGCCGAGCCTAAGAGCCCACACCATAGGAGCATA
GAACAACGTATTGTAGAGGAGTATCATCGTATGCGTAACGACGCCGACTATCAACGTGAG
AGGCGGCGCGTCAATTATTTACATCGTAAACTAAACCACATCAAGAGAATGGTGCATCAG
TACGACCAGCTGCGTAACCTGAAACCGGAGCGCGTGTCAGCTGCCAGTACTACACAGGCG
TACTGA
Protein sequence:
MAALPAGVQYALSSEASYKENKELVFVKLTDSALKAIEDFIRNNRDKLAKPKIEFLPGNQ
GKISIPTPSSSNGTSSESTFRFSINSNAEMEGPQGSFECVRSGGAKRLESCGPLPRRMRV
QANDDSYEATKDRMSRTIAAEQSKCTRVIKPNQTDIGRRVKVRSCGFSNGPAAQLAARLE
RAERPERPERPERAERPERRLAGLAGLAGLAGLAPERPERPERQDRPERQEWPERPERQE
RPERPERPERQERPERVERVERPAPAPRPQPPAPVAAPPHTQPQRPPPNPDLTRRPLKER
LIQLLALKPFKKPELYARLISEGIKEKERSMVNKILPEIGTLKDNCYHLRRHIWNDVNED
WPFYTEEEKHMLKRRKPQNLTPPLSSDSANSLSPRASPGKRPSVPGDELPAKKQRISHYR
RPSPPSSGYATTSSGERHASDNEDDRTASEEAPVKKQRVSQSRRSSPPSSGYATTSSGER
QASDNEDERNVKKDNGYTLNFTTVKDLCPSPVKTNGFSRSSPPVEQTSITVKDITTEPLE
NTALTSVPEENNTTDLVDIERQYPPITSSSTRRAYKNEFANLYTEYQSLYGRVAQVAALF
TQLEQQLKRAEPKSPHHRSIEQRIVEEYHRMRNDADYQRERRRVNYLHRKLNHIKRMVHQ
YDQLRNLKPERVSAASTTQAY