DPGLEAN11843 in OGS1.0

New model in OGS2.0DPOGS203122 
Genomic Positionscaffold299:+ 32960-65043
See gene structure
CDS Length2046
Paired RNAseq reads  1590
Single RNAseq reads  5704
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001447 (3e-35)
Best Drosophila hit  Su(Tpl), isoform B (1e-35)
Best Human hitRNA polymerase II elongation factor ELL (2e-15)
Best NR hit (blastp)  AGAP004795-PA [Anopheles gambiae str. PEST] (3e-38)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC002072 [Tribolium castaneum] (1e-59)
GeneOntology terms





  
GO:0016944 RNA polymerase II transcription elongation factor activity
GO:0003711 transcription elongation regulator activity
GO:0006368 RNA elongation from RNA polymerase II promoter
GO:0008023 transcription elongation factor complex
GO:0048190 wing disc dorsal/ventral pattern formation
GO:0005700 polytene chromosome
GO:0043620 regulation of transcription in response to stress
InterPro families
  
IPR019464 RNA polymerase II elongation factor ELL
IPR010844 Occludin/RNA polymerase II elongation factor, ELL domain
Orthology groupMCL14654

Nucleotide sequence:

ATGGCGGCCTTGCCAGCAGGTGTTCAGTATGCGTTATCATCGGAGGCTAGTTATAAAGAA
AACAAGGAACTAGTGTTTGTGAAATTAACGGATTCAGCACTAAAAGCTATAGAAGACTTC
ATTAGAAATAATAGGGACAAATTGGCAAAACCTAAAATAGAGTTCCTACCCGGAAATCAA
GGGAAAATTTCCATTCCGACTCCAAGTAGTAGCAATGGGACATCGTCGGAGTCAACGTTT
CGTTTCAGCATCAACAGCAATGCGGAAATGGAGGGTCCACAGGGTTCATTCGAGTGCGTT
AGGAGCGGTGGCGCCAAACGCCTGGAATCGTGCGGGCCGCTGCCGAGACGGATGCGAGTA
CAGGCCAACGACGACTCATACGAGGCCACCAAGGACCGTATGTCCAGGACCATAGCGGCC
GAACAGAGCAAATGCACCCGCGTCATTAAACCAAATCAAACGGATATCGGGAGACGCGTC
AAGGTACGCTCCTGTGGGTTCTCGAATGGACCAGCGGCTCAGCTCGCCGCGCGGTTAGAG
AGGGCTGAGCGTCCTGAGCGTCCTGAGCGTCCTGAGCGTGCTGAGCGTCCTGAGCGACGA
CTCGCTGGACTTGCGGGACTCGCTGGACTTGCGGGACTCGCACCCGAGCGACCTGAACGC
CCTGAGCGACAGGATCGTCCCGAACGTCAGGAATGGCCCGAAAGACCCGAGCGACAGGAA
CGTCCCGAGAGGCCCGAGAGGCCTGAACGGCAAGAGAGGCCGGAGAGAGTAGAAAGGGTT
GAGAGACCGGCGCCGGCGCCCCGGCCGCAGCCCCCCGCCCCGGTCGCCGCGCCCCCGCAC
ACACAGCCGCAGCGACCACCCCCCAATCCGGACCTCACCAGGCGGCCTCTTAAAGAAAGA
CTTATACAATTATTAGCACTGAAACCCTTCAAGAAGCCAGAACTATACGCAAGACTCATA
AGCGAGGGTATCAAAGAAAAGGAACGCAGTATGGTTAACAAGATACTGCCAGAGATCGGA
ACGCTCAAGGACAATTGTTACCACTTACGTAGACATATATGGAACGATGTTAACGAAGAT
TGGCCTTTCTACACCGAAGAAGAGAAACATATGCTGAAGAGGCGGAAACCTCAAAATCTA
ACACCGCCTTTGAGCAGCGACTCCGCCAATTCTCTGTCACCTCGCGCGTCCCCCGGCAAG
CGGCCGTCGGTCCCAGGAGACGAGCTGCCGGCGAAGAAACAGCGCATCTCTCACTACAGA
CGACCTTCACCACCCTCCTCAGGGTACGCCACCACCTCCTCCGGCGAGCGACACGCCTCT
GACAATGAGGACGACCGCACCGCGAGCGAGGAGGCGCCTGTAAAGAAGCAGCGCGTGTCT
CAATCAAGACGGTCTTCACCACCCTCCTCAGGGTACGCGACCACCTCCTCCGGCGAGCGA
CAGGCCTCTGACAATGAGGACGAACGGAACGTTAAAAAGGACAATGGGTATACTCTCAAC
TTCACAACCGTTAAAGATCTCTGTCCAAGTCCTGTTAAAACGAATGGTTTCAGTAGAAGT
AGTCCTCCTGTAGAGCAAACGAGTATCACAGTAAAAGACATCACAACGGAACCATTAGAA
AATACAGCTTTAACGTCCGTGCCTGAAGAAAATAATACGACTGATCTGGTAGACATTGAA
AGGCAATATCCTCCAATAACAAGTTCCAGTACCCGTCGCGCGTACAAGAACGAGTTTGCG
AATCTGTACACGGAGTATCAATCGTTGTACGGTCGCGTGGCACAGGTGGCAGCACTGTTC
ACACAGTTAGAACAACAACTCAAAAGAGCCGAGCCTAAGAGCCCACACCATAGGAGCATA
GAACAACGTATTGTAGAGGAGTATCATCGTATGCGTAACGACGCCGACTATCAACGTGAG
AGGCGGCGCGTCAATTATTTACATCGTAAACTAAACCACATCAAGAGAATGGTGCATCAG
TACGACCAGCTGCGTAACCTGAAACCGGAGCGCGTGTCAGCTGCCAGTACTACACAGGCG
TACTGA

Protein sequence:

MAALPAGVQYALSSEASYKENKELVFVKLTDSALKAIEDFIRNNRDKLAKPKIEFLPGNQ
GKISIPTPSSSNGTSSESTFRFSINSNAEMEGPQGSFECVRSGGAKRLESCGPLPRRMRV
QANDDSYEATKDRMSRTIAAEQSKCTRVIKPNQTDIGRRVKVRSCGFSNGPAAQLAARLE
RAERPERPERPERAERPERRLAGLAGLAGLAGLAPERPERPERQDRPERQEWPERPERQE
RPERPERPERQERPERVERVERPAPAPRPQPPAPVAAPPHTQPQRPPPNPDLTRRPLKER
LIQLLALKPFKKPELYARLISEGIKEKERSMVNKILPEIGTLKDNCYHLRRHIWNDVNED
WPFYTEEEKHMLKRRKPQNLTPPLSSDSANSLSPRASPGKRPSVPGDELPAKKQRISHYR
RPSPPSSGYATTSSGERHASDNEDDRTASEEAPVKKQRVSQSRRSSPPSSGYATTSSGER
QASDNEDERNVKKDNGYTLNFTTVKDLCPSPVKTNGFSRSSPPVEQTSITVKDITTEPLE
NTALTSVPEENNTTDLVDIERQYPPITSSSTRRAYKNEFANLYTEYQSLYGRVAQVAALF
TQLEQQLKRAEPKSPHHRSIEQRIVEEYHRMRNDADYQRERRRVNYLHRKLNHIKRMVHQ
YDQLRNLKPERVSAASTTQAY