DPGLEAN13529 in OGS1.0

New model in OGS2.0  DPOGS205945
Genomic Position  scaffold20:+ 182682-187712
CDS Length  2283
Paired RNAseq reads  962
Single RNAseq reads  2289
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002831 (9e-80)
Best Drosophila hit  StIP (8e-178)
Best Human hit  elongator complex protein 2 (1e-147)
Best NR hit (blastp)  GJ22219 [Drosophila virilis] (0.0)
Best NR hit (blastx)  StIP [Drosophila melanogaster] (0.0)
GeneOntology terms
GO:0006368 RNA elongation from RNA polymerase II promoter
GO:0000502 proteasome complex
GO:0043248 proteasome assembly
GO:0061133 endopeptidase activator activity
InterPro families
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
IPR011046 WD40 repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR001680 WD40 repeat
Orthology group  MCL14807

Nucleotide sequence:

ATGAAAGTATCTGTAGAGCAAGTTTATACTTCAGTTGCTTGTAACAGAGTTCCAAAGATT
GTTGATTGGAATAAGGAAGGGCTTATTGTGTACGGCGCTTCAAATGCTGTTGTTATATAC
GACACGAAAATTAAAGGCCAAGACCCTCTTACAGTACTTAGTCATCATCAGTCAAAAGTA
AACTCAGTGAAATGGCTGTATAAATCTGGTGGCTCATGTACAGAATTTCTATCATGTTCT
GCTGACAAAACCGCTGCAATATGGTCATCAGTGGACGGGGTCTGGAGTATCACAAGCAGT
CTCGTCGGTCATAATGATGGTGTAACATGTGTTTATGGAGTTTATGTTGAGGAAAATTTA
GTGGTATATACCGCCTCCATTGATTCTACTGTGAGAGTTTGGGAGAGAAAAAATGGTATC
ACAACATTGAAACAGATAATTTCTTTAAACTCTGGACTATGTCTCACTCTGCATGCACAC
ATTCTGCCAACATGTAATCAACCTCTTTTGTTCTGTGCCCTTGATGACCATAAAGTGCAC
ATATTTGCTGATGACGATGGGTATCATAGGGTCCACGCGCTCGTGGGACATGAGGACTGG
GTTCGAGGGCTGGATGTTGTAGATGTAGACGAAAGCACAGTAATGTTGGCGTCGGCATCA
CAAGACACATACATTCGATTGTGGAAAATTGCACAACACAAAGAGACAGTGGCTAGCGGT
GTCAGGGTGGAAGAGAAAACGTTCATGGCATACAATCAGGAATGGTCGGTGAAACTGGAC
GCAGTTCTAGCCGGTCATGAAGGCTGGGTTTACGGTGTTCAATGGGAAAAAAATATAAAT
GAAGATGGCGCCACCTACCGTCTGTTGACATCGTCCCTTGATAAGACACTCATCATCTGG
CAGCTCTCCGAGGTGTGGGTGGAGAGTGTTCGTGTGGGAGACGTGGGAGGGAACGGACTC
GGGTTCTATGGCAGTAGGTTTGGCCGGGACGCCGTACTGGGCCACGGATATAATGGATCA
CTACATATATGGAGACTGGATAAGGAGAGCAAACAATGGCAGCCGTCTGTGGTGGTGGGC
GGTCACTTCGCTGGCGTGGAAGACATCCGGTGGGAGTCTCGAGGACGGTACCTCGTCAGC
GTGGCCCTAGATCAGACCACGAGGTTGCACGCGCCTTGGAGGAGAGCAGACGGTGCCGGA
GTGGAGTGGCACGAGATATCCCGGCCCCAGGTGCACGGGTACGACCTGTGCTCGGTGTCG
CTGGTGTCGTCGACCCTGGTCTCCGCCGCCGAGGAGAAGGTGCTGCGAGTGTTCGCTCCG
CCCCAGAACTTCCTCCACAACTTCCAGAGGATCACTGGAGAGGAGCTGCACTGTACTGAA
GTGTGTAACCCCGAGGGGGCGTCGGTGCCGTCCCTGGGGCTATCCAACAAGGCCGTGTTC
AGCGGAGACGCGGCGGGGGACGGGGACGACAGCGACGGGTACTTTGTACCCGTGGAATTG
CGCGAGCCACCGACCGAGGAGATTTTAATGCAGAACACTCTGTGGCCGGAGACTCACAAG
CTGTACGGTCACGGATACGAGCTGTTCTGTGTGGACTCCAGCCCGGACGGAGACCTGGTC
GCCTCCGCCTGCCGCAGCACCACGCAGGAGCACGCGGCCGTCATAGTGTGGGAAACCAAG
ACCTGGCAGCAGATCCAGAAGTTGGTGTCGCACACGCTGACGGTTACACAGCTGGCCTTC
TCACCGGACAGTCGACACCTACTCTCCGTTTCTCGCGACCGAAAGTGGACTTTATACACG
AGACGAGAAGGCTCAAACTCGTTCACCATCGCGGCACACACGGACAAGACAAACGGCGTT
CACACCAGAATCATCTGGTGCTGTGCCTGGGCGGTGACTGGACACGCCTTCGCCACTGGA
TCCAGGGAAGGAAAGGTCTGCGTTTGGACCAAAACGGAAGTAACGAGCGACTCATCGCTG
AGAGACTACAGTTTGTTGGGTAAGCCACTGGAGGTCCCGAACTCGTCGGTGACCGCTCTG
TCGTTCGCCCCCCTCGCTGACGTGCAAGTGGTGGCCGTGGGGACGGACTGCGGCCGGATC
AGGATCTACAGCTTCGACCTCGCCTGGAGCTTACTGCACGAAATGAACAACAGCTCGGCC
CACCACCTCACGGTGAAGAGACTGCTCTTCAAGCCGAGGACATCAGGTGACAAAGAATTG
GTGCTGGCGAGTTGTGGGAGCGACAACTTTGTTAGAATAAACACCCTGTATATAGAATAT
TGA

Protein sequence:

MKVSVEQVYTSVACNRVPKIVDWNKEGLIVYGASNAVVIYDTKIKGQDPLTVLSHHQSKV
NSVKWLYKSGGSCTEFLSCSADKTAAIWSSVDGVWSITSSLVGHNDGVTCVYGVYVEENL
VVYTASIDSTVRVWERKNGITTLKQIISLNSGLCLTLHAHILPTCNQPLLFCALDDHKVH
IFADDDGYHRVHALVGHEDWVRGLDVVDVDESTVMLASASQDTYIRLWKIAQHKETVASG
VRVEEKTFMAYNQEWSVKLDAVLAGHEGWVYGVQWEKNINEDGATYRLLTSSLDKTLIIW
QLSEVWVESVRVGDVGGNGLGFYGSRFGRDAVLGHGYNGSLHIWRLDKESKQWQPSVVVG
GHFAGVEDIRWESRGRYLVSVALDQTTRLHAPWRRADGAGVEWHEISRPQVHGYDLCSVS
LVSSTLVSAAEEKVLRVFAPPQNFLHNFQRITGEELHCTEVCNPEGASVPSLGLSNKAVF
SGDAAGDGDDSDGYFVPVELREPPTEEILMQNTLWPETHKLYGHGYELFCVDSSPDGDLV
ASACRSTTQEHAAVIVWETKTWQQIQKLVSHTLTVTQLAFSPDSRHLLSVSRDRKWTLYT
RREGSNSFTIAAHTDKTNGVHTRIIWCCAWAVTGHAFATGSREGKVCVWTKTEVTSDSSL
RDYSLLGKPLEVPNSSVTALSFAPLADVQVVAVGTDCGRIRIYSFDLAWSLLHEMNNSSA
HHLTVKRLLFKPRTSGDKELVLASCGSDNFVRINTLYIEY
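
Consistency check (sketch):

The nucleotide and protein sequences above can be cross-checked against the stated CDS length. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (the record does not state which translation table was used), and the names translate, CDS_LENGTH and PROTEIN_LENGTH are illustrative. PROTEIN_LENGTH = 760 is counted from the protein sequence as printed here and is not a field of the record.

```python
# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 60 nt of the nucleotide sequence in this record.
first_line = "ATGAAAGTATCTGTAGAGCAAGTTTATACTTCAGTTGCTTGTAACAGAGTTCCAAAGATT"

CDS_LENGTH = 2283      # "CDS Length" field of the record
PROTEIN_LENGTH = 760   # residues counted in the protein sequence above

print(translate(first_line))                   # MKVSVEQVYTSVACNRVPKI
print(3 * (PROTEIN_LENGTH + 1) == CDS_LENGTH)  # True: 760 codons + stop
```

The first 20 translated residues match the start of the protein sequence, and the final codon TGA is a stop, so 760 residues plus one stop codon account for all 2283 nt. The genomic interval 182682-187712 spans 5031 bp, which is longer than the CDS and consistent with a model that contains introns.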