New model in OGS2.0 | DPOGS202104  |
---|---|
Genomic Position | scaffold495:- 134066-146483 |
See gene structure | |
CDS Length | 1488 |
Paired RNAseq reads   | 520 |
Single RNAseq reads   | 1524 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006896 (5e-132) |
Best Drosophila hit   | CG6876 (7e-163) |
Best Human hit | U4/U6 small nuclear ribonucleoprotein Prp31 (7e-133) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP012142-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG6876-PA isoform 1 [Apis mellifera] (6e-171) |
GeneOntology terms    | GO:0005681 spliceosomal complex GO:0000398 nuclear mRNA splicing, via spliceosome GO:0071011 precatalytic spliceosome |
InterPro families    | IPR019175 Prp31 C-terminal IPR002687 Pre-mRNA processing ribonucleoprotein, snoRNA-binding domain IPR012976 NOSIC |
Orthology group | MCL13257 |
Nucleotide sequence:
ATGTCTCTTGCTGATGAGCTATTGGCTGACTTAGAAGAAAATGATGACGGAGAGCTTGAA
GCTATAATTGAGAATAAAACTGCAGATTCTCACGAGTTTGCTGTACCCTTTCCTGTGATA
CCTAAAGAAGAAGAAATAAAAAATGTATCAATTCGAGAATTGGCTAAATTAAGAGATTCA
GATCGGCTTAAACGGGTTGTAGCAGAAGTAGAGCAAAATGCGGGTAATAAAAGAAAGAAA
ATTGAGGTTGTTGGTTTAATGGAATCTGATCCTGAATATCAATTAATAGTTGAAGCTAAT
AATATAGCAGTTGAAATTGATGGTGAAATTGCTACTATTCACAGGTTTGTTCGGGATAAA
TATCAGAAAAGGTTTCCAGAGCTGGAGTCATTGATTGTAACACCATTAGAATATATCCGT
ACTGTAAAGGAGTTAGGAAATGACCTTGACAAAGCTAAGAATAATGAGATTCTTCAAAGT
TTTCTCACTCAGGCAACTATTATGATAGTGTCCGTCACTGCTTCCACAACACAAGGAAAA
TTATTGTCAGATCATGAACTGAGTGAAATCTTTGAAGCATGTGATATGGCTGCAGAGTTG
AATAATTTTAAATCAAATATCTACGAGTACGTTGAGAGCAGGATGACTTTCATAGCTCCA
AACATAACAGCTATTGTTGGTGCATCAACAGCAGCGAAAATTCTTGGAGTGGCAGGTGGT
CTATCCAAGCTGTCCAAAATGCCAGCATGCAATGTTCTGCCACTTGGACAGCAAAAGAAG
ACGCTGTCTGGCTTCTCCCAAGCCGCTTCACTACCTCATACTGGCTTTATATACTTTTCT
CAAATAGTACAAGATACAACTCCTGAATTGAGATACAAAGCAGCTAAGCTTGTATCAACA
AAATTAACTCTGGCGGCTAGAGTTGATGCTTGCCATGAAAGTACAGATGGTGCCATTGGT
CGGTCATTGAGGGAAGGAATAGAGAAGAAATTAGACAAATTACAGGAACCGCCTCCAGTG
AAGTTCGTGAAGCCGCTTCCAAAGCCGATTGAACAGAGTAGGAAGAAACGTGGCGGGAAA
CGTGTGAGGAAGATGAAGGAGCGATACGCCATGACGGAGTTCAGGAAGAACGCCAACAGA
CTCAACTTCGCTGACATCGAAGACGACGCTTATCAAGAAGACCTGGGGTACACTCGTGGT
ACGATCGGGAAATCTAGAACGGGTCGCGTCCGCCTGCCTCAAATAGACGAGAAGACCAAA
GTTCGCATCAGCAAAACCTTGCAAAAGAACCTGCAAAAACAAAACCAGCAGTACGGCGGG
GCTACGAGTATAAGAAGACAAGTGTCAGGAACGGCCTCCTCGGTGGCCTTCACGCCTTTG
CAGGGTCTCGAGATAGTGAATCCTCAGGCCGCTGAGACGAGAGTGAATGAAGCGAACGCG
AAATACTTTTCAAATACCTCTGGATTCCTATCGGTTGGAAAGACTTAA
Protein sequence:
MSLADELLADLEENDDGELEAIIENKTADSHEFAVPFPVIPKEEEIKNVSIRELAKLRDS
DRLKRVVAEVEQNAGNKRKKIEVVGLMESDPEYQLIVEANNIAVEIDGEIATIHRFVRDK
YQKRFPELESLIVTPLEYIRTVKELGNDLDKAKNNEILQSFLTQATIMIVSVTASTTQGK
LLSDHELSEIFEACDMAAELNNFKSNIYEYVESRMTFIAPNITAIVGASTAAKILGVAGG
LSKLSKMPACNVLPLGQQKKTLSGFSQAASLPHTGFIYFSQIVQDTTPELRYKAAKLVST
KLTLAARVDACHESTDGAIGRSLREGIEKKLDKLQEPPPVKFVKPLPKPIEQSRKKRGGK
RVRKMKERYAMTEFRKNANRLNFADIEDDAYQEDLGYTRGTIGKSRTGRVRLPQIDEKTK
VRISKTLQKNLQKQNQQYGGATSIRRQVSGTASSVAFTPLQGLEIVNPQAAETRVNEANA
KYFSNTSGFLSVGKT