DPGLEAN01725 in OGS1.0

New model in OGS2.0  DPOGS206070
Genomic Position  scaffold48:- 160583-167329
CDS Length  4023
Paired RNAseq reads  4399
Single RNAseq reads  10844
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006851 (0.0)
Best Drosophila hit  CG2807 (0.0)
Best Human hit  splicing factor 3B subunit 1 isoform 1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to CG2807-PA isoform 1 [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to U2 small nuclear ribonucleoprotein [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0005681 spliceosomal complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005686 U2 snRNP
GO:0005488 binding
GO:0071011 precatalytic spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families
IPR011989 Armadillo-like helical
IPR015016 Splicing factor 3B subunit 1
IPR016024 Armadillo-type fold
Orthology group  MCL11726

Nucleotide sequence:

ATGGATAAAATACCGCGTACTCATGAAGCCATCGAGGCCCAAATAAAAGAAATACAATCG
AAGAAAAAAGAGCTCCCTGAAAATGGTTCAGGCAAGGGTGTTTCTCTGGGTGATGCCTTC
TATGACAGTGACATTTATGACAATTCTGGCCAAGGAGGGAAATCCAGATACGATGGCTAC
GTGACTTCTATTGCTGCGAATGATGAAGTGGAGGATGAGGATGTAGAGAATGTTCCAATA
TCACAGAAGAGACCTGGCTATACTGCTCCCGCTTCTTTATTAAATGATATAGCTCAGAGT
GATAAAGATTATGATCCTTTTGCTGATAAGAGAAGACCTACTATTGCAGACAGGGAAGAT
GAGTATCGCCAGAAAAGGCGTAGGATGATTATATCACCAGAACGTTCCGATCCTTTTGCT
GAAGGTGGAAAGACACCAGATGTTGGTTCTAGGACATACACAGAAATTATGAAGGAACAG
TATTTGCGGGCTGAAGAAACAGAGTTACGGAAAAAGCTCCTGGAAAGAGCACGAGAAGGC
ACACTCAAGGCAGTATCACAGTCAAATGGTGAGGCAACAAAACCAGCAGCGAAACGTAAA
GGTCGCTGGGACCAGAGCTCAGAGGATACACCGTCTGTGAAAAAACCTGTGGTTCAAGCT
ACACCAAGCTCTCAGGCTACACCATCCTGGGAAAATGAGCGAGGTGCATGGGAAGAAACA
CCAAGTGCAGGTGGCCGTGGTGGTGAGACACCTGGTGCGACGCCGTCTGCACGTGTGTGG
GACGCCACGCCGGCTCATCTTACACCGGGCCATGCTACACCCGGTCGTGAGACGCCCGCT
CACCATGCCTCCAGACGAAACCGGTGGGATGAGACACCCAAAACTGATAGAGAAACGCCT
GGACATGCCAGCGGTTGGGCCGAAACTCCTCGTACAGATCGCGGCGTAGGTGTCGATACT
ATACAAGAGACGCCCACGCCGGGCACTAAGAGACGATCGCGCTGGGATGAGACTCCTGGC
GCCACTCCCGCCGCAGCTACACCCACACCCTCACACGCGACACCTTCACACGCCACGCCC
TCACATGCTACACCCTCCATGGGCACACCGACACCTCATACACCAATGTTTACTCCAGGC
GGGTCAACACCGGTGGGTGTTAAGGCAATGGCCATGGCGACACCAACGCCGGGCCACATC
GCAGCTATGACACCAGAGCAGTTGCAAGCGTATCGCTGGGAGAAAGAAATCGACGAACGA
AATAGACCGTACACTGATGAAGAACTGGATGCCATGTTCCCACCTGGGTACAAGGTTTTG
CCTCCACCGGCCGGTTATGTTCCTATTCGGACCCCGGCTCGTAAGCTGACCGCGACGCCC
ACACCTTTGGCTGGTACCCCAATCGGCTTTTTCATGCAGACGGAGGAAGTAGGCGGGAGT
GCTGCAGCAGCGGCGCGGCTCCTCGACCCGCAGCCCAAAGGCAGTCAGCAGCTGCCGTTC
ATGAAGCCCGAGGACGCTCAGTACTTCGACAAACTTCTTATCGACGTCGACGAAGAAACA
CTGTCACCCGAAGAACTGAAGGAGAGAAAGATCATGAAGTTGCTGCTTAAGATTAAGAAT
GGAACACCGCCTATGTGCAAAGCAGCTCTCCGTCAAATCACAGACAAAGCTCGGGATTTC
GGCGCCGGACCGCTCTTTAATCAAATCCTACCGTTATTGATGAGTCCTACACTCGAAGAT
CAAGAACGTCATCTCTTAGTAAAAGTTATAGATCGAATTCTTTACAAATTAGATGATTTG
GTCCGCCCATATGTACACAAAATTTTGGTCGTCATAGAACCTCTGCTTATTGATGAAGAT
TACTACGCCCGTGTCGAGGGTCGAGAGATCATATCCAACTTGGCAAAAGCAGCTGGTTTA
GCCACAATGATCTCTACAATGAGACCAGATATTGATAATATCGATGAATATGTTCGAAAC
ACCACGGCCAGGGCCTTCGCTGTTGTTGCATCTGCTTTAGGTATACCGTCATTATTGCCG
TTTTTAAAGGCCGTGTGCAGATCGAAGAAGTCATGGCAGGCTCGTCACACCGGTATCAAA
ATCGTGCAACAAATCGCAATTCTAATGGGATGTGCCATTTTGCCCCATCTGAAGTCGCTC
GTGGAAATCATTGAGCATGGCTTGGTCGACGAACAACAAAAAGTTAGGACAATCACGGCG
TTGGCGAGCGCCGCTTTAGCCGAAGCAGCCACGCCGTACGGTATCGAGTCCTTTGACTCT
GTGCTAAAACCATTATGGAAGGGTATCAGAACCCATCGCGGTAAGGGTCTAGCGGCCTTC
CTTAAAGCTATCGGCTACCTCATACCTCTCATGGACGCCGAATATGCAAACTATTACACC
CGTGAGGTGATGCTTATATTGATCCGTGAGTTCCAGTCGCCCGACGAGGAAATGAAGAAG
ATTGTATTGAAGGTGGTGAAGCAGTGCTGCGGAACAGATGGTGTTGAACCTCAGTATATA
ATGGATGAAATCTTACCTCACTTCTTCAAACATTTCTGGAATCACAGAATGGCTTTGGAC
CGTCGCAACTATCGCCAACTTGTCGATACAACACAGCTTTATCGATTGTTCCATCAAGTT
GGGGCGTCCGAAATAATAAACAGAATCGTAGACGATCTCAAGGATGACAACGAACAGTAT
AGGAAAATGGTTATGGAGTCCATTGAAAAAATTCTAGCCAACTTGGGCGCAGCTGATATA
GATTCTAAGCTTGAGGAAGCCTTGATTGACGGCATTCTATACGCCTTCCAAGAACAGACC
ACTGAGGACGTGGTGATGTTGAATGGATTTGGTACAATAGTGAATCAACTCGGTAAGCGA
GTCAAGCCTTATTTACCACAAATCTGTGGTATAATTCTGTGGCGTATGAACAACAAGTCG
GCAAAGGTGAGGCAACAAGCTGCCGATCTTATTTCTCGTATCGCCGTAGTCATGAAAACT
TGTCAGGAGGAAAAACTTATGGGGCATCTCGGTGTAGTGCTATATGAATATCTCGGGGAG
GAATATCCTGAAGTACTCGGTTCTATTCTGGGTGCCTTAAAGGCTATAGTGAATGTGATC
GGTATGACCAAAATGACACCACCCATCAAGGATTTACTTCCTAGATTAACGCCAATTCTC
AAGAACAGACATGAAAAAGTGCAAGAAAATTGCATTGATCTGGTCGGACGTATTGCAGAC
AGGGGTCCCGAATTCGTGTCAGCGAGAGAGTGGATGAGGATTTGCTTTGAACTGCTGGAA
TTGCTCAAAGCACACAAGAAAGCTATCAGGAGAGCCACAGTCAATACATTTGGTTACATC
GCCAAAGCTATCGGTCCGCATGACGTACTTGCTACACTGCTCAATAATCTTAAAGTTCAA
GAGAGACAGAACAGAGTGTGCACAACAGTTGCAATTGCCATTGTAGCTGAGACATGTTCT
CCATTCACAGTCTTGCCAGCGCTGATGAATGAGTACAGAGTTCCAGAATTAAATGTTCAG
AATGGTGTTTTGAAATCGTTGTCATTTTTGTTTGAATACATCGGAGAAATGGGTAAAGAT
TACATATATGCTGTGTGCCCGTTACTAGAGGACGCACTTATGGACAGAGATTTAGTGCAT
CGACAAACTGCATGTGCCGCAATAAAACATATGGCATTGGGAGTGTATGGTTTCGGCTGT
GAGGATGCTCTAATACATTTGCTGAACCATGTTTGGCCGAATATATTTGAAACCTCGCCT
CATCTTGTACAAGCTTTTATGGACGCGGTTGAGGGCATGAGAGTTGCACTTGGCCCAATA
AAAATACTCCAGTACGCATTACAGGGCTTATTCCATCCAGCTCGAAAGGTCCGTGATGTT
TACTGGAAGATATATAACACATTATATATCGGAGGCCAAGACGCCCTGGTCGCTGGTTAC
CCACGGATACAAAATGATCCCAACAATCATTTTGTCAGATACGAGTTAGACTATTTGTTG
TAG

Protein sequence:

MDKIPRTHEAIEAQIKEIQSKKKELPENGSGKGVSLGDAFYDSDIYDNSGQGGKSRYDGY
VTSIAANDEVEDEDVENVPISQKRPGYTAPASLLNDIAQSDKDYDPFADKRRPTIADRED
EYRQKRRRMIISPERSDPFAEGGKTPDVGSRTYTEIMKEQYLRAEETELRKKLLERAREG
TLKAVSQSNGEATKPAAKRKGRWDQSSEDTPSVKKPVVQATPSSQATPSWENERGAWEET
PSAGGRGGETPGATPSARVWDATPAHLTPGHATPGRETPAHHASRRNRWDETPKTDRETP
GHASGWAETPRTDRGVGVDTIQETPTPGTKRRSRWDETPGATPAAATPTPSHATPSHATP
SHATPSMGTPTPHTPMFTPGGSTPVGVKAMAMATPTPGHIAAMTPEQLQAYRWEKEIDER
NRPYTDEELDAMFPPGYKVLPPPAGYVPIRTPARKLTATPTPLAGTPIGFFMQTEEVGGS
AAAAARLLDPQPKGSQQLPFMKPEDAQYFDKLLIDVDEETLSPEELKERKIMKLLLKIKN
GTPPMCKAALRQITDKARDFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDL
VRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRN
TTARAFAVVASALGIPSLLPFLKAVCRSKKSWQARHTGIKIVQQIAILMGCAILPHLKSL
VEIIEHGLVDEQQKVRTITALASAALAEAATPYGIESFDSVLKPLWKGIRTHRGKGLAAF
LKAIGYLIPLMDAEYANYYTREVMLILIREFQSPDEEMKKIVLKVVKQCCGTDGVEPQYI
MDEILPHFFKHFWNHRMALDRRNYRQLVDTTQLYRLFHQVGASEIINRIVDDLKDDNEQY
RKMVMESIEKILANLGAADIDSKLEEALIDGILYAFQEQTTEDVVMLNGFGTIVNQLGKR
VKPYLPQICGIILWRMNNKSAKVRQQAADLISRIAVVMKTCQEEKLMGHLGVVLYEYLGE
EYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIAD
RGPEFVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPHDVLATLLNNLKVQ
ERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKD
YIYAVCPLLEDALMDRDLVHRQTACAAIKHMALGVYGFGCEDALIHLLNHVWPNIFETSP
HLVQAFMDAVEGMRVALGPIKILQYALQGLFHPARKVRDVYWKIYNTLYIGGQDALVAGY
PRIQNDPNNHFVRYELDYLL