DPGLEAN13556 in OGS1.0

New model in OGS2.0DPOGS207405 
Genomic Positionscaffold518:- 110731-121612
See gene structure
CDS Length4917
Paired RNAseq reads  3389
Single RNAseq reads  7843
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009372 (5e-94)
Best Drosophila hit  Pcf11, isoform D (2e-93)
Best Human hitpre-mRNA cleavage complex 2 protein Pcf11 (7e-39)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC008263 [Tribolium castaneum] (2e-175)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC008263 [Tribolium castaneum] (1e-139)
GeneOntology terms


  
GO:0005849 mRNA cleavage factor complex
GO:0006379 mRNA cleavage
GO:0003723 RNA binding
GO:0008148 negative transcription elongation factor activity
InterPro families

  
IPR008942 ENTH/VHS
IPR006903 Domain of unknown function DUF618
IPR006569 RNA polymerase II, large subunit, CTD
Orthology groupMCL15194

Nucleotide sequence:

ATGGCGAAGGAAATTGCGGACGAGTACGCATCAAGCCTGGCGGATTTGACTGTAAACAGC
AAACCCCTTATTAATATGCTGACTATACTCGCGGAAGAGAACATCGAGCATGCAGGAGTT
ATCGTTGAGACAGTGGAGAAACATTTGGAGAAGGTGCCCCCGGACATCAAGCTGCCTGTA
CTGTATTTAGTAGATTCCATTATCAAGAATGTTGGTGGTGCATACACACAGAAGTTCTCA
CAGATCATCGTTAATATGTTCACAAAGACCTTCAAACAGGTCGATGAAAAGATTCGTTCC
CAAATGTTCAAGTTACGGGAGACGTGGCATGAAGTATTCCCATCCACTAAGTTGTACCAA
CTGGATGTGAAGGTGAACCTGATTGATCCCGCTTGGCCCATCCAGGCTCAACCACATCAG
TCTAACATCCATGTAAATCCAAGTTTTCTGAAAAAGCCAACTCCGACAACCAGTGCTGCA
GCCCTATCAGAAGAAGAGGAGAAAATGCGTGCCATACTAGCTAAGAAAGAACAAGAACTA
CTAATGCTGCAGAGGAAGAAGGTCGAGATGGAGCTGGAGCAGACTCGGAGACAGTTACAG
ATGGCTGAAAAAGTTCACAAGAAGCCGCCCTTGGTGCATCCCCCTGGGCTCCCCGCGCCC
CAAGCGGAGGCCCCGCCGGTCCCTCTGGCGCCTACATCTCTTCCTCCTCAAGCGCCCATA
TCGTCGCCAGTACTGTCCATCAAGCAACGGCTTGGACCCCCTGTTAACAAGTCGGCGGGT
CGTATCGCTCCCGTCACGGGTCCGTTGAGCGGAGCGCGGCGTGACCCGCGGCTCGCAAGG
AGACAGCCCCCACCACAACGCCAACCACCTGCCACCGCCCACACCACCCCCACCACCGCC
GCTACCACAATCGCCCCCAACGTTTTCGATATCAAGCCACTCCACCGACCGGCCAAGCGG
AGCGCCGTCATCACCATCGAGGTGGCGACCGACTCGCGCCGCAAGGACAGAAAGGAAACA
CGCACCAAAATAGAGAACGGCAACGGGGACTCCGTACACACATACATCGACAGACTCAGA
ATACCCGACCCCAAGAAGATAAACAAGCTGCCACCCATACCCAAGATACGGCGGGACAGA
GACGATCCGCCCTCCAAGAAAAAGAAGGAACTCAGAGAGAGGAAGAAGCGACGGGACAAG
GACGGGTCGGCCTCCTCCTCGCCCGACAAGAAGCGAGGCAAGGACAAAGACAAGAAAGTT
TATAATAAAGAAAGGATGGACGTCGACGACCAGCAGTACGCGCCCGAGACGGTCTCCTTC
AAGGAATTGAAGAACTTCAACAAGAAGCACTACATGCGGAGAAATAAGGAAAAGTCGGAG
AGTCCAGAGAGATTAGACAAAGAGGAAAAGAAAACGTCAGAAGAGGCTACAAATCCGCCT
GAGATTATCCCTGAGACTAAAGATGTAGATCTTCGTGTTTTGCATCCCGTTATTCCAGAA
ACTGCCAAGGCGATCGCCCAGAAAAGACCGTCCACTGAGATGCTCGAGGGCAAACCTAAG
AAAAATAAACTCGATAAATTCGATATCCTTTTTGGAAACGAGGACGTAGATCTTCGTCAG
TTGCCTCAAGTTGAAGAGGCGAACGCACCACCGCCACCGTCTATATCTGAACCAAATTCT
GTATCTAAGGAGGAAGTAGATCCAGAAGAATCTGACGATGTAATTGCGTCGCCCGTACGT
TCTCCCAAAAAAGATTGGCAAGAAGTCAAAGAGAAGGAAGAAACTAAAAAGACACCTTCG
AAGTTGGATCTCGTGAGAGCAAAGTTGGCGGAAGCTACAAAAGTTAAAGATGGCTTAGGA
CGTCCGTTACTGTTCAGTAAGTCACCAAGCATAGAAAGAGAAAGACGTCGAACATTCAGC
TCTGATGAAACTGAAGTTAAAACAGATAGCAACGAAGAATTTGATGCCGATGATCACAAG
AAAACTATCTCTATTATCATGAATCAGGCCAAGGAACAGTTCAGTGACGGTCAGCTGGAT
AAAAATCAGTACAACACATTGATGTACCAAGTGTTGCAGCTCAATGAGAAGCTCAAATTA
AAGGAAGCTAAGCAAAGGGAATCCCTAGAAATATCAAAACGAAAATTGAAAGCCCACGTC
TCAGAAGATAAAAAGGTCCCTTCACCGAAATCTTCCCCGTCGGAGAGGAATAAATTCGGG
GACATCGATGAACGAGTTCCCGTCGGTGGATTTTCTGACTCAGAAAACAACAATCAAGAC
TCTGATATGCGTCAAAATCATCACGACAATATAGATGGGAAGAGTGGTTTTCCAATGCCG
CCTATGATGCCGCCGATGTACATGGGGCCTTTCCCGATGTGGAGAGGTCAACCGCGGCCT
AGAATGGAAGAGTTCGGTCCTCGAAGGTTTAGAGGTCCATATTTTAGAGGAAAGTTTGAT
AAAAGAGGCCCAAGACCACCTTTCGATATGAGAATGCCTCTACTTCCCACTCCTAAACTC
GGTATGTGCCAAGGCGAATGTCCTCTAAAGCCTTATGAGCGGTCAATTTCCCCTCCACCA
CTCGGCGCTCCCGGGTATACATTGCCTCCCACTGATTACAAAATATTGGAATATATCGAT
CAGGATCCGGTCAAAACTATCCAAATAGACGGTATTCCCAGAGAAATAAGATTCTACGGC
GAGACGGCGATCATCATGCTCGACTGGGACGATCCTAGAGTTATCAAATTCTTACCAGGT
TGTCGGAGAGTCACATTCGATAATAAAGATTCAGTGGTATTGACTTTCAACGAGGGTTAC
AAGAAGGTGGAAATAGACGATCAAGTCTTCGATATCAGGTTCGGTGCGCCGACCAGGGAA
CTGTTCATAAATGGGAGGTGGTATGAATGTTTCTTCGGGGGTCAGCCATTGGGCGTCATA
ATAGACGGCAAACCGCGATTAGTGCATCTAGAGGGGCCCCTGCCACAGGTGGATATAGGG
AAGACGAAACGAACAGACTTAGTAGCGGGTAAAATAAATGTTATAATAAACGCGACAAAC
ATTTGTCCGGTTTACTTGGATGCTAAAGTTCAGAAGTTCCACGTAAATGGACATTTCTTT
ACGATACGTTTTGTCGATTCCTTGAAGACCGTTCTAATTAATGAGCAGCCGTTCAAAGTG
GAGTATGGGGATCTTCCGAAGCCGATATTCGTAAATGGGGAGAAATACTTTGTACGATTT
TCTGCACTGCCCAAGAACATAAAACCGGGCCAAGTAGAGATCGCTGACATGGAAGGCTGT
AAACCATCGACGGAATCCGAGAAGCTGCCCACAGTGCCGGAGAACGAAGACGTTCCGATG
GAAACTGATTCTGAACAACCGCTCGAAGCTCCAGTGAAATCCCCAAGTCCTGAAGGTGAA
ATGAAAGGTTTGGATATGCTAGCAAATTTCATGCCGAGTGATATGGCCCCGGCTTCCAGC
TCAGAATACAGTTCTGCTGAGCCGCTGTTCACAAAACCGGAAGTGATCCCAGGATTGGAA
ACTCCCGCTGAAGAGAAGCCAGCCAGTTCCCTGCCTCTTTTGGGCGGTATCAATGTCAAT
GACCTGTACGCCAAATTAGTTGCGACAGGTATAGTGCCAATGTTGAACGAAGTGAAACCA
GAGAAGAAAACTGAAGTTCCTGAGATCGAGGAGACGAAAATGAAACCCAAAGATGATAAG
AACGTCATCCACAAAGTCGATATACTGAAACCGGAAACACTTCGGATTAAGCAGCGCGGT
CTGGTGCTGAAACTGTACAGCGGTATGCAGTGCAGCGGTTGCGGGGCGCGGTTCCCTCCC
GAGCACACGGTCCGGTACTCGCAGCATCTCGACTGGCACTTCAGACAGAACAGGCGGGAG
AGGGACTCTGCGCGGCGGGCTCACTCGCGACACTGGCACTACGATCTGTCTGACTGGCTG
CTGTACGAGGAGCTAGAAGACCTAGAAGAACGAGAGAAGAGTTGGTTTGAGACGGGAGGT
TCTGAAGAAACACCAGCTCAGGTAGAAGCTGTTGTAGAAGAAAGTCCAAGCACCGCGGCT
GGTGGAGCTCCGCAACATAACTGTGCCTTGTGTGGAGACAGATTCCATCAGTTCTACAAT
GAGGACCAAGAGGAGTGGCATCTCAGGAATGCTGTCAAACATCAGGACAGTTACTATCAT
CCACTGTGTCTGCAGGATTATAAGGCTTCTCTAACAAAGGAAGAGCCACAGGCTGAGGAA
GCGGCGGTGGATGTAGACGAGGAACCTCCGGCTGCGATAGAGATAGGAGACACCGCTGAG
CTGTCGGACACTGAGTCTGTGGTTGAAGTGTTGGAGACTGAACCCTTGGAACCTGTTGAG
ATTGAAGCTGATGACGGTGATGATGATGTGGTCCTGAACGCTGAGCCCGTGGAACAGCTG
GAGGTGGACGACGGAGACACAGACGATGAAACTACTGAGACGAGGAGGCAGAGAGATCAC
CTCGCACAAGTTGATTTTGCTAACATAAAGATCAAACAGGAACCTATCGATCCAGACGAT
GAACCAATTATAACAGCAGAAGTAGAAAGCATTCCGCCGACAATCGACACAACACATACG
ACTGTTACATCATCCATAGACGGGAACGTTCAGCTCGACGACGCCACGCTGACTCCAGCT
CTACCCATCGGTGGCATCAGAATCAACATATCCAAAACCATAACCAGCTTTGCTACCAAT
CAAGATAGTCCGGATAAGTCCCTCGAAGACATCAGCACCGAGGACGAGCCCTTGCCTCCC
GGAGAGGAACCAGAAATGGAGTACGAGTTGAAGCCTTCAATGAAAGACGTGAAGTTCAGC
AGACAACCTCCCGTCCAAAAAGGAAGTGAATTGTCGGGATTGTGTTCTATAATGTGA

Protein sequence:

MAKEIADEYASSLADLTVNSKPLINMLTILAEENIEHAGVIVETVEKHLEKVPPDIKLPV
LYLVDSIIKNVGGAYTQKFSQIIVNMFTKTFKQVDEKIRSQMFKLRETWHEVFPSTKLYQ
LDVKVNLIDPAWPIQAQPHQSNIHVNPSFLKKPTPTTSAAALSEEEEKMRAILAKKEQEL
LMLQRKKVEMELEQTRRQLQMAEKVHKKPPLVHPPGLPAPQAEAPPVPLAPTSLPPQAPI
SSPVLSIKQRLGPPVNKSAGRIAPVTGPLSGARRDPRLARRQPPPQRQPPATAHTTPTTA
ATTIAPNVFDIKPLHRPAKRSAVITIEVATDSRRKDRKETRTKIENGNGDSVHTYIDRLR
IPDPKKINKLPPIPKIRRDRDDPPSKKKKELRERKKRRDKDGSASSSPDKKRGKDKDKKV
YNKERMDVDDQQYAPETVSFKELKNFNKKHYMRRNKEKSESPERLDKEEKKTSEEATNPP
EIIPETKDVDLRVLHPVIPETAKAIAQKRPSTEMLEGKPKKNKLDKFDILFGNEDVDLRQ
LPQVEEANAPPPPSISEPNSVSKEEVDPEESDDVIASPVRSPKKDWQEVKEKEETKKTPS
KLDLVRAKLAEATKVKDGLGRPLLFSKSPSIERERRRTFSSDETEVKTDSNEEFDADDHK
KTISIIMNQAKEQFSDGQLDKNQYNTLMYQVLQLNEKLKLKEAKQRESLEISKRKLKAHV
SEDKKVPSPKSSPSERNKFGDIDERVPVGGFSDSENNNQDSDMRQNHHDNIDGKSGFPMP
PMMPPMYMGPFPMWRGQPRPRMEEFGPRRFRGPYFRGKFDKRGPRPPFDMRMPLLPTPKL
GMCQGECPLKPYERSISPPPLGAPGYTLPPTDYKILEYIDQDPVKTIQIDGIPREIRFYG
ETAIIMLDWDDPRVIKFLPGCRRVTFDNKDSVVLTFNEGYKKVEIDDQVFDIRFGAPTRE
LFINGRWYECFFGGQPLGVIIDGKPRLVHLEGPLPQVDIGKTKRTDLVAGKINVIINATN
ICPVYLDAKVQKFHVNGHFFTIRFVDSLKTVLINEQPFKVEYGDLPKPIFVNGEKYFVRF
SALPKNIKPGQVEIADMEGCKPSTESEKLPTVPENEDVPMETDSEQPLEAPVKSPSPEGE
MKGLDMLANFMPSDMAPASSSEYSSAEPLFTKPEVIPGLETPAEEKPASSLPLLGGINVN
DLYAKLVATGIVPMLNEVKPEKKTEVPEIEETKMKPKDDKNVIHKVDILKPETLRIKQRG
LVLKLYSGMQCSGCGARFPPEHTVRYSQHLDWHFRQNRRERDSARRAHSRHWHYDLSDWL
LYEELEDLEEREKSWFETGGSEETPAQVEAVVEESPSTAAGGAPQHNCALCGDRFHQFYN
EDQEEWHLRNAVKHQDSYYHPLCLQDYKASLTKEEPQAEEAAVDVDEEPPAAIEIGDTAE
LSDTESVVEVLETEPLEPVEIEADDGDDDVVLNAEPVEQLEVDDGDTDDETTETRRQRDH
LAQVDFANIKIKQEPIDPDDEPIITAEVESIPPTIDTTHTTVTSSIDGNVQLDDATLTPA
LPIGGIRINISKTITSFATNQDSPDKSLEDISTEDEPLPPGEEPEMEYELKPSMKDVKFS
RQPPVQKGSELSGLCSIM