DPGLEAN13813 in OGS1.0

New model in OGS2.0  DPOGS202104
Genomic Position  scaffold495:- 134066-146483
CDS Length  1488
Paired RNAseq reads  520
Single RNAseq reads  1524
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006896 (5e-132)
Best Drosophila hit  CG6876 (7e-163)
Best Human hit  U4/U6 small nuclear ribonucleoprotein Prp31 (7e-133)
Best NR hit (blastp)  PREDICTED: similar to AGAP012142-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG6876-PA isoform 1 [Apis mellifera] (6e-171)
Gene Ontology terms
GO:0005681 spliceosomal complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071011 precatalytic spliceosome
InterPro families
IPR019175 Prp31 C-terminal
IPR002687 Pre-mRNA processing ribonucleoprotein, snoRNA-binding domain
IPR012976 NOSIC
Orthology group  MCL13257

Nucleotide sequence:

ATGTCTCTTGCTGATGAGCTATTGGCTGACTTAGAAGAAAATGATGACGGAGAGCTTGAA
GCTATAATTGAGAATAAAACTGCAGATTCTCACGAGTTTGCTGTACCCTTTCCTGTGATA
CCTAAAGAAGAAGAAATAAAAAATGTATCAATTCGAGAATTGGCTAAATTAAGAGATTCA
GATCGGCTTAAACGGGTTGTAGCAGAAGTAGAGCAAAATGCGGGTAATAAAAGAAAGAAA
ATTGAGGTTGTTGGTTTAATGGAATCTGATCCTGAATATCAATTAATAGTTGAAGCTAAT
AATATAGCAGTTGAAATTGATGGTGAAATTGCTACTATTCACAGGTTTGTTCGGGATAAA
TATCAGAAAAGGTTTCCAGAGCTGGAGTCATTGATTGTAACACCATTAGAATATATCCGT
ACTGTAAAGGAGTTAGGAAATGACCTTGACAAAGCTAAGAATAATGAGATTCTTCAAAGT
TTTCTCACTCAGGCAACTATTATGATAGTGTCCGTCACTGCTTCCACAACACAAGGAAAA
TTATTGTCAGATCATGAACTGAGTGAAATCTTTGAAGCATGTGATATGGCTGCAGAGTTG
AATAATTTTAAATCAAATATCTACGAGTACGTTGAGAGCAGGATGACTTTCATAGCTCCA
AACATAACAGCTATTGTTGGTGCATCAACAGCAGCGAAAATTCTTGGAGTGGCAGGTGGT
CTATCCAAGCTGTCCAAAATGCCAGCATGCAATGTTCTGCCACTTGGACAGCAAAAGAAG
ACGCTGTCTGGCTTCTCCCAAGCCGCTTCACTACCTCATACTGGCTTTATATACTTTTCT
CAAATAGTACAAGATACAACTCCTGAATTGAGATACAAAGCAGCTAAGCTTGTATCAACA
AAATTAACTCTGGCGGCTAGAGTTGATGCTTGCCATGAAAGTACAGATGGTGCCATTGGT
CGGTCATTGAGGGAAGGAATAGAGAAGAAATTAGACAAATTACAGGAACCGCCTCCAGTG
AAGTTCGTGAAGCCGCTTCCAAAGCCGATTGAACAGAGTAGGAAGAAACGTGGCGGGAAA
CGTGTGAGGAAGATGAAGGAGCGATACGCCATGACGGAGTTCAGGAAGAACGCCAACAGA
CTCAACTTCGCTGACATCGAAGACGACGCTTATCAAGAAGACCTGGGGTACACTCGTGGT
ACGATCGGGAAATCTAGAACGGGTCGCGTCCGCCTGCCTCAAATAGACGAGAAGACCAAA
GTTCGCATCAGCAAAACCTTGCAAAAGAACCTGCAAAAACAAAACCAGCAGTACGGCGGG
GCTACGAGTATAAGAAGACAAGTGTCAGGAACGGCCTCCTCGGTGGCCTTCACGCCTTTG
CAGGGTCTCGAGATAGTGAATCCTCAGGCCGCTGAGACGAGAGTGAATGAAGCGAACGCG
AAATACTTTTCAAATACCTCTGGATTCCTATCGGTTGGAAAGACTTAA

Protein sequence:

MSLADELLADLEENDDGELEAIIENKTADSHEFAVPFPVIPKEEEIKNVSIRELAKLRDS
DRLKRVVAEVEQNAGNKRKKIEVVGLMESDPEYQLIVEANNIAVEIDGEIATIHRFVRDK
YQKRFPELESLIVTPLEYIRTVKELGNDLDKAKNNEILQSFLTQATIMIVSVTASTTQGK
LLSDHELSEIFEACDMAAELNNFKSNIYEYVESRMTFIAPNITAIVGASTAAKILGVAGG
LSKLSKMPACNVLPLGQQKKTLSGFSQAASLPHTGFIYFSQIVQDTTPELRYKAAKLVST
KLTLAARVDACHESTDGAIGRSLREGIEKKLDKLQEPPPVKFVKPLPKPIEQSRKKRGGK
RVRKMKERYAMTEFRKNANRLNFADIEDDAYQEDLGYTRGTIGKSRTGRVRLPQIDEKTK
VRISKTLQKNLQKQNQQYGGATSIRRQVSGTASSVAFTPLQGLEIVNPQAAETRVNEANA
KYFSNTSGFLSVGKT
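
Consistency check (illustrative sketch, not part of the database record): the CDS length of 1488 nt corresponds to 496 codons, i.e. the 495 residues listed above plus the TAA stop codon. The Python snippet below shows one way to confirm that the nucleotide and protein sequences agree. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here, and only the first and last lines of the nucleotide sequence are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are emitted as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTCTCTTGCTGATGAGCTATTGGCTGACTTAGAAGAAAATGATGACGGAGAGCTTGAA"
last_line = "AAATACTTTTCAAATACCTCTGGATTCCTATCGGTTGGAAAGACTTAA"

print(translate(first_line))  # start of the protein sequence
print(translate(last_line))   # end of the protein sequence plus the stop '*'
```

Passing the full 1488 nt sequence to `translate` and stripping the trailing `*` should give a string that can be compared directly against the protein sequence in this record.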