DPGLEAN21397 in OGS1.0

New model in OGS2.0DPOGS203358 
Genomic Positionscaffold2123:+ 2117-8556
See gene structure
CDS Length2205
Paired RNAseq reads  270
Single RNAseq reads  818
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014334 (9e-91)
Best Drosophila hit  CG2063 (1e-27)
Best Human hitSAP30-binding protein (5e-27)
Best NR hit (blastp)  PREDICTED: similar to CG2063-PA [Apis mellifera] (2e-60)
Best NR hit (blastx)  PREDICTED: similar to CG2063 CG2063-PA [Tribolium castaneum] (3e-44)
GeneOntology terms

  
GO:0071013 catalytic step 2 spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071011 precatalytic spliceosome
InterPro families  IPR012479 HCNGP-like
Orthology groupMCL14600

Nucleotide sequence:

ATGACTTCGCAAGCGTTAGCCACATTGACCGCGACGTACACAGACTCCGAAGGTGAGGAG
GAAATGGAAGATGGAGATCCTACACCTGAGAAATCGGTCACTCATCACACACAGTCAGCT
CCAACCAGTCCCAAGAACATCGACGACACCAAACAATCTGCTTCCGCACCAGTTTCTCCA
AAACGAAGTTTGGTCTCGTACGTAGACGACACTATCGTATCCGATGACGAACAATTGTCT
CCTAACGCGGAAACTCAGGACGATATGAGAAGATTATCGATGGAAACCGACACAGATGAA
GCTGTCCCACGATCAGATCCCGACGACTCAGAGGATAGTGTCCTTATACCTCCGGAACCA
ACAGCCAAATGTCCCAAGGAATTACAAGACAAAATAACAAAATTCTACACAAGAATGGTC
AACGAAGGTTACGACATGAACAAAATAATTCAGGATAAAAAGAATTTCAGAAATCCAAGC
ATATACGAGAAGTTGATACAATTCTGCGACATCAACGAGCTAGACACGAACTACCCACCA
GAAATATACGATCCTCTAAAATGGGGCAAGGAATCCTACTACGATGAGCTCGCTAAAGTC
CAAAAACTAGAGATGGAGAAACGGGAAAAGGATCGCAAAGAGAAGTCCAAAATAGATTTC
ATCACCGGAGTGGCAAAGAAGTCGGACAGCGACGATGACAAGAAACGGAAGTCCAAGTGG
GACCAAGCGGCGCCCAACGTAGCCAACAAACCCAGCATCAAACAACCGGGGCTCCTCCAG
CAACCGCTGACGAGCAACGTCACCGGCACCAAGGGCACTGTATCAGACTTTGAAGATTCA
TTATCGAAGTATGGCATCAAGAAGGTGCTCCCGGTCGCTACGTCGACTCCAAAGAAACCA
GTCAAGAGTCATTCTTTACACAGCAGCATACAGAAGAAAGAAAAACAGAAACTCTCTAAT
AAAAGTGACAGCACGGTCACCTGTGATAGCTTCGTCAAAACAGACTCTACAAGCTCCTAC
AGGCAGGCCGTGAGCACAGAGCTCGATTACTCTTCAATGTCTCCGAACTCCGCGGCGTCA
TCGGACAGCGCGGGCGCCGTCCAGGGCACTTTGGTGTCTCCACATACCCACGATCTTCAA
GACTCCAGAAACGAAGGATTACACCTAACACCGAGAGAAACACAGAACATCATTAAATGC
GCCCACATCCTTGGGAATGTCCTCACTAAAGCGATCGAAAGACAATCAAAAGAGTATGAA
TACAGCCAAGAGAAAATACAAGAGCTGTACGTTGAAAAACCGATGCCTGAAACTGAGATA
AAAAAAAAGAATCTGACATTAGATCTGAAAGAGACAGTCTTACCTCTGGAGGTTAAAGAG
GAAAAGAGATGGGAGAGCGTCCCGACACAAACTGACATTTCGCTGCCGAATACGAAAAGC
GCGCCGAAAATATTTGAAAGCATTTTAAGACAATTATCAAGGAGTTCAATAGACGAAGCC
GAAAAGACGATAATAGAATGCCAAGAAGAGAATACAGAGGAGAATGAGACGGGACAGTGG
AAAACCAGCACGGGTATCAGTACTTTCCGCGAGGACAACTCCGGTGAGTGGGGCCAGTTC
TGGGCGAACTACAACAACTCGCTGGCGAGCGTGCCCAGCAGATACTACGACCAATGTCCA
ACGCCGTACAGGACTGAGGACATCGACCTTGCGGATTTAGAATTCTCAACAGAGGGTTCA
AGGAAACGTTCACCAGAAAACATTAAAACAATCAACAATATCATAAGAAACGAAGGATTA
CACCTAACACCGAGAGAAACACAGAACATCATTAAATGCGCCCACATCCTTGGGAATGTC
CTCACTAAAGCGATCGAAAGACAATCAAAAGAGTATGAATACAGCCAAGAGAAAATACAA
GAGCTGTACGTTGAAAAACCGATGCCTGAAACTGAGATAAAAAAAAAGAATCTGACATTA
GATCTGAAAGAGACAGTCTTACCTCTGGAGGTTAAAGAGGAAAAGAGATGGGAGAGCGTC
CCGACACAAACTGACATTTCGCTGCCGAATACGAAAAGCGCGCCGAAAATATTTGAAAGC
ATTTTAAGACAATTATCAAGGAGTTCAATAGACGAAGCCGAAAAGACGATAATAGAATGC
CAAGAAGAGAATACAGAGGAGAATGAGAGTAAGGAGACAAAATAG

Protein sequence:

MTSQALATLTATYTDSEGEEEMEDGDPTPEKSVTHHTQSAPTSPKNIDDTKQSASAPVSP
KRSLVSYVDDTIVSDDEQLSPNAETQDDMRRLSMETDTDEAVPRSDPDDSEDSVLIPPEP
TAKCPKELQDKITKFYTRMVNEGYDMNKIIQDKKNFRNPSIYEKLIQFCDINELDTNYPP
EIYDPLKWGKESYYDELAKVQKLEMEKREKDRKEKSKIDFITGVAKKSDSDDDKKRKSKW
DQAAPNVANKPSIKQPGLLQQPLTSNVTGTKGTVSDFEDSLSKYGIKKVLPVATSTPKKP
VKSHSLHSSIQKKEKQKLSNKSDSTVTCDSFVKTDSTSSYRQAVSTELDYSSMSPNSAAS
SDSAGAVQGTLVSPHTHDLQDSRNEGLHLTPRETQNIIKCAHILGNVLTKAIERQSKEYE
YSQEKIQELYVEKPMPETEIKKKNLTLDLKETVLPLEVKEEKRWESVPTQTDISLPNTKS
APKIFESILRQLSRSSIDEAEKTIIECQEENTEENETGQWKTSTGISTFREDNSGEWGQF
WANYNNSLASVPSRYYDQCPTPYRTEDIDLADLEFSTEGSRKRSPENIKTINNIIRNEGL
HLTPRETQNIIKCAHILGNVLTKAIERQSKEYEYSQEKIQELYVEKPMPETEIKKKNLTL
DLKETVLPLEVKEEKRWESVPTQTDISLPNTKSAPKIFESILRQLSRSSIDEAEKTIIEC
QEENTEENESKETK