New model in OGS2.0 | DPOGS207405  |
---|---|
Genomic Position | scaffold518:- 110731-121612 |
See gene structure | |
CDS Length | 4917 |
Paired RNAseq reads   | 3389 |
Single RNAseq reads   | 7843 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009372 (5e-94) |
Best Drosophila hit   | Pcf11, isoform D (2e-93) |
Best Human hit | pre-mRNA cleavage complex 2 protein Pcf11 (7e-39) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC008263 [Tribolium castaneum] (2e-175) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC008263 [Tribolium castaneum] (1e-139) |
GeneOntology terms    | GO:0005849 mRNA cleavage factor complex GO:0006379 mRNA cleavage GO:0003723 RNA binding GO:0008148 negative transcription elongation factor activity |
InterPro families    | IPR008942 ENTH/VHS IPR006903 Domain of unknown function DUF618 IPR006569 RNA polymerase II, large subunit, CTD |
Orthology group | MCL15194 |
Nucleotide sequence:
ATGGCGAAGGAAATTGCGGACGAGTACGCATCAAGCCTGGCGGATTTGACTGTAAACAGC
AAACCCCTTATTAATATGCTGACTATACTCGCGGAAGAGAACATCGAGCATGCAGGAGTT
ATCGTTGAGACAGTGGAGAAACATTTGGAGAAGGTGCCCCCGGACATCAAGCTGCCTGTA
CTGTATTTAGTAGATTCCATTATCAAGAATGTTGGTGGTGCATACACACAGAAGTTCTCA
CAGATCATCGTTAATATGTTCACAAAGACCTTCAAACAGGTCGATGAAAAGATTCGTTCC
CAAATGTTCAAGTTACGGGAGACGTGGCATGAAGTATTCCCATCCACTAAGTTGTACCAA
CTGGATGTGAAGGTGAACCTGATTGATCCCGCTTGGCCCATCCAGGCTCAACCACATCAG
TCTAACATCCATGTAAATCCAAGTTTTCTGAAAAAGCCAACTCCGACAACCAGTGCTGCA
GCCCTATCAGAAGAAGAGGAGAAAATGCGTGCCATACTAGCTAAGAAAGAACAAGAACTA
CTAATGCTGCAGAGGAAGAAGGTCGAGATGGAGCTGGAGCAGACTCGGAGACAGTTACAG
ATGGCTGAAAAAGTTCACAAGAAGCCGCCCTTGGTGCATCCCCCTGGGCTCCCCGCGCCC
CAAGCGGAGGCCCCGCCGGTCCCTCTGGCGCCTACATCTCTTCCTCCTCAAGCGCCCATA
TCGTCGCCAGTACTGTCCATCAAGCAACGGCTTGGACCCCCTGTTAACAAGTCGGCGGGT
CGTATCGCTCCCGTCACGGGTCCGTTGAGCGGAGCGCGGCGTGACCCGCGGCTCGCAAGG
AGACAGCCCCCACCACAACGCCAACCACCTGCCACCGCCCACACCACCCCCACCACCGCC
GCTACCACAATCGCCCCCAACGTTTTCGATATCAAGCCACTCCACCGACCGGCCAAGCGG
AGCGCCGTCATCACCATCGAGGTGGCGACCGACTCGCGCCGCAAGGACAGAAAGGAAACA
CGCACCAAAATAGAGAACGGCAACGGGGACTCCGTACACACATACATCGACAGACTCAGA
ATACCCGACCCCAAGAAGATAAACAAGCTGCCACCCATACCCAAGATACGGCGGGACAGA
GACGATCCGCCCTCCAAGAAAAAGAAGGAACTCAGAGAGAGGAAGAAGCGACGGGACAAG
GACGGGTCGGCCTCCTCCTCGCCCGACAAGAAGCGAGGCAAGGACAAAGACAAGAAAGTT
TATAATAAAGAAAGGATGGACGTCGACGACCAGCAGTACGCGCCCGAGACGGTCTCCTTC
AAGGAATTGAAGAACTTCAACAAGAAGCACTACATGCGGAGAAATAAGGAAAAGTCGGAG
AGTCCAGAGAGATTAGACAAAGAGGAAAAGAAAACGTCAGAAGAGGCTACAAATCCGCCT
GAGATTATCCCTGAGACTAAAGATGTAGATCTTCGTGTTTTGCATCCCGTTATTCCAGAA
ACTGCCAAGGCGATCGCCCAGAAAAGACCGTCCACTGAGATGCTCGAGGGCAAACCTAAG
AAAAATAAACTCGATAAATTCGATATCCTTTTTGGAAACGAGGACGTAGATCTTCGTCAG
TTGCCTCAAGTTGAAGAGGCGAACGCACCACCGCCACCGTCTATATCTGAACCAAATTCT
GTATCTAAGGAGGAAGTAGATCCAGAAGAATCTGACGATGTAATTGCGTCGCCCGTACGT
TCTCCCAAAAAAGATTGGCAAGAAGTCAAAGAGAAGGAAGAAACTAAAAAGACACCTTCG
AAGTTGGATCTCGTGAGAGCAAAGTTGGCGGAAGCTACAAAAGTTAAAGATGGCTTAGGA
CGTCCGTTACTGTTCAGTAAGTCACCAAGCATAGAAAGAGAAAGACGTCGAACATTCAGC
TCTGATGAAACTGAAGTTAAAACAGATAGCAACGAAGAATTTGATGCCGATGATCACAAG
AAAACTATCTCTATTATCATGAATCAGGCCAAGGAACAGTTCAGTGACGGTCAGCTGGAT
AAAAATCAGTACAACACATTGATGTACCAAGTGTTGCAGCTCAATGAGAAGCTCAAATTA
AAGGAAGCTAAGCAAAGGGAATCCCTAGAAATATCAAAACGAAAATTGAAAGCCCACGTC
TCAGAAGATAAAAAGGTCCCTTCACCGAAATCTTCCCCGTCGGAGAGGAATAAATTCGGG
GACATCGATGAACGAGTTCCCGTCGGTGGATTTTCTGACTCAGAAAACAACAATCAAGAC
TCTGATATGCGTCAAAATCATCACGACAATATAGATGGGAAGAGTGGTTTTCCAATGCCG
CCTATGATGCCGCCGATGTACATGGGGCCTTTCCCGATGTGGAGAGGTCAACCGCGGCCT
AGAATGGAAGAGTTCGGTCCTCGAAGGTTTAGAGGTCCATATTTTAGAGGAAAGTTTGAT
AAAAGAGGCCCAAGACCACCTTTCGATATGAGAATGCCTCTACTTCCCACTCCTAAACTC
GGTATGTGCCAAGGCGAATGTCCTCTAAAGCCTTATGAGCGGTCAATTTCCCCTCCACCA
CTCGGCGCTCCCGGGTATACATTGCCTCCCACTGATTACAAAATATTGGAATATATCGAT
CAGGATCCGGTCAAAACTATCCAAATAGACGGTATTCCCAGAGAAATAAGATTCTACGGC
GAGACGGCGATCATCATGCTCGACTGGGACGATCCTAGAGTTATCAAATTCTTACCAGGT
TGTCGGAGAGTCACATTCGATAATAAAGATTCAGTGGTATTGACTTTCAACGAGGGTTAC
AAGAAGGTGGAAATAGACGATCAAGTCTTCGATATCAGGTTCGGTGCGCCGACCAGGGAA
CTGTTCATAAATGGGAGGTGGTATGAATGTTTCTTCGGGGGTCAGCCATTGGGCGTCATA
ATAGACGGCAAACCGCGATTAGTGCATCTAGAGGGGCCCCTGCCACAGGTGGATATAGGG
AAGACGAAACGAACAGACTTAGTAGCGGGTAAAATAAATGTTATAATAAACGCGACAAAC
ATTTGTCCGGTTTACTTGGATGCTAAAGTTCAGAAGTTCCACGTAAATGGACATTTCTTT
ACGATACGTTTTGTCGATTCCTTGAAGACCGTTCTAATTAATGAGCAGCCGTTCAAAGTG
GAGTATGGGGATCTTCCGAAGCCGATATTCGTAAATGGGGAGAAATACTTTGTACGATTT
TCTGCACTGCCCAAGAACATAAAACCGGGCCAAGTAGAGATCGCTGACATGGAAGGCTGT
AAACCATCGACGGAATCCGAGAAGCTGCCCACAGTGCCGGAGAACGAAGACGTTCCGATG
GAAACTGATTCTGAACAACCGCTCGAAGCTCCAGTGAAATCCCCAAGTCCTGAAGGTGAA
ATGAAAGGTTTGGATATGCTAGCAAATTTCATGCCGAGTGATATGGCCCCGGCTTCCAGC
TCAGAATACAGTTCTGCTGAGCCGCTGTTCACAAAACCGGAAGTGATCCCAGGATTGGAA
ACTCCCGCTGAAGAGAAGCCAGCCAGTTCCCTGCCTCTTTTGGGCGGTATCAATGTCAAT
GACCTGTACGCCAAATTAGTTGCGACAGGTATAGTGCCAATGTTGAACGAAGTGAAACCA
GAGAAGAAAACTGAAGTTCCTGAGATCGAGGAGACGAAAATGAAACCCAAAGATGATAAG
AACGTCATCCACAAAGTCGATATACTGAAACCGGAAACACTTCGGATTAAGCAGCGCGGT
CTGGTGCTGAAACTGTACAGCGGTATGCAGTGCAGCGGTTGCGGGGCGCGGTTCCCTCCC
GAGCACACGGTCCGGTACTCGCAGCATCTCGACTGGCACTTCAGACAGAACAGGCGGGAG
AGGGACTCTGCGCGGCGGGCTCACTCGCGACACTGGCACTACGATCTGTCTGACTGGCTG
CTGTACGAGGAGCTAGAAGACCTAGAAGAACGAGAGAAGAGTTGGTTTGAGACGGGAGGT
TCTGAAGAAACACCAGCTCAGGTAGAAGCTGTTGTAGAAGAAAGTCCAAGCACCGCGGCT
GGTGGAGCTCCGCAACATAACTGTGCCTTGTGTGGAGACAGATTCCATCAGTTCTACAAT
GAGGACCAAGAGGAGTGGCATCTCAGGAATGCTGTCAAACATCAGGACAGTTACTATCAT
CCACTGTGTCTGCAGGATTATAAGGCTTCTCTAACAAAGGAAGAGCCACAGGCTGAGGAA
GCGGCGGTGGATGTAGACGAGGAACCTCCGGCTGCGATAGAGATAGGAGACACCGCTGAG
CTGTCGGACACTGAGTCTGTGGTTGAAGTGTTGGAGACTGAACCCTTGGAACCTGTTGAG
ATTGAAGCTGATGACGGTGATGATGATGTGGTCCTGAACGCTGAGCCCGTGGAACAGCTG
GAGGTGGACGACGGAGACACAGACGATGAAACTACTGAGACGAGGAGGCAGAGAGATCAC
CTCGCACAAGTTGATTTTGCTAACATAAAGATCAAACAGGAACCTATCGATCCAGACGAT
GAACCAATTATAACAGCAGAAGTAGAAAGCATTCCGCCGACAATCGACACAACACATACG
ACTGTTACATCATCCATAGACGGGAACGTTCAGCTCGACGACGCCACGCTGACTCCAGCT
CTACCCATCGGTGGCATCAGAATCAACATATCCAAAACCATAACCAGCTTTGCTACCAAT
CAAGATAGTCCGGATAAGTCCCTCGAAGACATCAGCACCGAGGACGAGCCCTTGCCTCCC
GGAGAGGAACCAGAAATGGAGTACGAGTTGAAGCCTTCAATGAAAGACGTGAAGTTCAGC
AGACAACCTCCCGTCCAAAAAGGAAGTGAATTGTCGGGATTGTGTTCTATAATGTGA
Protein sequence:
MAKEIADEYASSLADLTVNSKPLINMLTILAEENIEHAGVIVETVEKHLEKVPPDIKLPV
LYLVDSIIKNVGGAYTQKFSQIIVNMFTKTFKQVDEKIRSQMFKLRETWHEVFPSTKLYQ
LDVKVNLIDPAWPIQAQPHQSNIHVNPSFLKKPTPTTSAAALSEEEEKMRAILAKKEQEL
LMLQRKKVEMELEQTRRQLQMAEKVHKKPPLVHPPGLPAPQAEAPPVPLAPTSLPPQAPI
SSPVLSIKQRLGPPVNKSAGRIAPVTGPLSGARRDPRLARRQPPPQRQPPATAHTTPTTA
ATTIAPNVFDIKPLHRPAKRSAVITIEVATDSRRKDRKETRTKIENGNGDSVHTYIDRLR
IPDPKKINKLPPIPKIRRDRDDPPSKKKKELRERKKRRDKDGSASSSPDKKRGKDKDKKV
YNKERMDVDDQQYAPETVSFKELKNFNKKHYMRRNKEKSESPERLDKEEKKTSEEATNPP
EIIPETKDVDLRVLHPVIPETAKAIAQKRPSTEMLEGKPKKNKLDKFDILFGNEDVDLRQ
LPQVEEANAPPPPSISEPNSVSKEEVDPEESDDVIASPVRSPKKDWQEVKEKEETKKTPS
KLDLVRAKLAEATKVKDGLGRPLLFSKSPSIERERRRTFSSDETEVKTDSNEEFDADDHK
KTISIIMNQAKEQFSDGQLDKNQYNTLMYQVLQLNEKLKLKEAKQRESLEISKRKLKAHV
SEDKKVPSPKSSPSERNKFGDIDERVPVGGFSDSENNNQDSDMRQNHHDNIDGKSGFPMP
PMMPPMYMGPFPMWRGQPRPRMEEFGPRRFRGPYFRGKFDKRGPRPPFDMRMPLLPTPKL
GMCQGECPLKPYERSISPPPLGAPGYTLPPTDYKILEYIDQDPVKTIQIDGIPREIRFYG
ETAIIMLDWDDPRVIKFLPGCRRVTFDNKDSVVLTFNEGYKKVEIDDQVFDIRFGAPTRE
LFINGRWYECFFGGQPLGVIIDGKPRLVHLEGPLPQVDIGKTKRTDLVAGKINVIINATN
ICPVYLDAKVQKFHVNGHFFTIRFVDSLKTVLINEQPFKVEYGDLPKPIFVNGEKYFVRF
SALPKNIKPGQVEIADMEGCKPSTESEKLPTVPENEDVPMETDSEQPLEAPVKSPSPEGE
MKGLDMLANFMPSDMAPASSSEYSSAEPLFTKPEVIPGLETPAEEKPASSLPLLGGINVN
DLYAKLVATGIVPMLNEVKPEKKTEVPEIEETKMKPKDDKNVIHKVDILKPETLRIKQRG
LVLKLYSGMQCSGCGARFPPEHTVRYSQHLDWHFRQNRRERDSARRAHSRHWHYDLSDWL
LYEELEDLEEREKSWFETGGSEETPAQVEAVVEESPSTAAGGAPQHNCALCGDRFHQFYN
EDQEEWHLRNAVKHQDSYYHPLCLQDYKASLTKEEPQAEEAAVDVDEEPPAAIEIGDTAE
LSDTESVVEVLETEPLEPVEIEADDGDDDVVLNAEPVEQLEVDDGDTDDETTETRRQRDH
LAQVDFANIKIKQEPIDPDDEPIITAEVESIPPTIDTTHTTVTSSIDGNVQLDDATLTPA
LPIGGIRINISKTITSFATNQDSPDKSLEDISTEDEPLPPGEEPEMEYELKPSMKDVKFS
RQPPVQKGSELSGLCSIM