New model in OGS2.0 | DPOGS206070  |
---|---|
Genomic Position | scaffold48:- 160583-167329 |
See gene structure | |
CDS Length | 4023 |
Paired RNAseq reads   | 4399 |
Single RNAseq reads   | 10844 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006851 (0.0) |
Best Drosophila hit   | CG2807 (0.0) |
Best Human hit | splicing factor 3B subunit 1 isoform 1 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to CG2807-PA isoform 1 [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to U2 small nuclear ribonucleoprotein [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005681 spliceosomal complex GO:0000398 nuclear mRNA splicing, via spliceosome GO:0005686 U2 snRNP GO:0005488 binding GO:0071011 precatalytic spliceosome GO:0071013 catalytic step 2 spliceosome |
InterPro families    | IPR011989 Armadillo-like helical IPR015016 Splicing factor 3B subunit 1 IPR016024 Armadillo-type fold |
Orthology group | MCL11726 |
Nucleotide sequence:
ATGGATAAAATACCGCGTACTCATGAAGCCATCGAGGCCCAAATAAAAGAAATACAATCG
AAGAAAAAAGAGCTCCCTGAAAATGGTTCAGGCAAGGGTGTTTCTCTGGGTGATGCCTTC
TATGACAGTGACATTTATGACAATTCTGGCCAAGGAGGGAAATCCAGATACGATGGCTAC
GTGACTTCTATTGCTGCGAATGATGAAGTGGAGGATGAGGATGTAGAGAATGTTCCAATA
TCACAGAAGAGACCTGGCTATACTGCTCCCGCTTCTTTATTAAATGATATAGCTCAGAGT
GATAAAGATTATGATCCTTTTGCTGATAAGAGAAGACCTACTATTGCAGACAGGGAAGAT
GAGTATCGCCAGAAAAGGCGTAGGATGATTATATCACCAGAACGTTCCGATCCTTTTGCT
GAAGGTGGAAAGACACCAGATGTTGGTTCTAGGACATACACAGAAATTATGAAGGAACAG
TATTTGCGGGCTGAAGAAACAGAGTTACGGAAAAAGCTCCTGGAAAGAGCACGAGAAGGC
ACACTCAAGGCAGTATCACAGTCAAATGGTGAGGCAACAAAACCAGCAGCGAAACGTAAA
GGTCGCTGGGACCAGAGCTCAGAGGATACACCGTCTGTGAAAAAACCTGTGGTTCAAGCT
ACACCAAGCTCTCAGGCTACACCATCCTGGGAAAATGAGCGAGGTGCATGGGAAGAAACA
CCAAGTGCAGGTGGCCGTGGTGGTGAGACACCTGGTGCGACGCCGTCTGCACGTGTGTGG
GACGCCACGCCGGCTCATCTTACACCGGGCCATGCTACACCCGGTCGTGAGACGCCCGCT
CACCATGCCTCCAGACGAAACCGGTGGGATGAGACACCCAAAACTGATAGAGAAACGCCT
GGACATGCCAGCGGTTGGGCCGAAACTCCTCGTACAGATCGCGGCGTAGGTGTCGATACT
ATACAAGAGACGCCCACGCCGGGCACTAAGAGACGATCGCGCTGGGATGAGACTCCTGGC
GCCACTCCCGCCGCAGCTACACCCACACCCTCACACGCGACACCTTCACACGCCACGCCC
TCACATGCTACACCCTCCATGGGCACACCGACACCTCATACACCAATGTTTACTCCAGGC
GGGTCAACACCGGTGGGTGTTAAGGCAATGGCCATGGCGACACCAACGCCGGGCCACATC
GCAGCTATGACACCAGAGCAGTTGCAAGCGTATCGCTGGGAGAAAGAAATCGACGAACGA
AATAGACCGTACACTGATGAAGAACTGGATGCCATGTTCCCACCTGGGTACAAGGTTTTG
CCTCCACCGGCCGGTTATGTTCCTATTCGGACCCCGGCTCGTAAGCTGACCGCGACGCCC
ACACCTTTGGCTGGTACCCCAATCGGCTTTTTCATGCAGACGGAGGAAGTAGGCGGGAGT
GCTGCAGCAGCGGCGCGGCTCCTCGACCCGCAGCCCAAAGGCAGTCAGCAGCTGCCGTTC
ATGAAGCCCGAGGACGCTCAGTACTTCGACAAACTTCTTATCGACGTCGACGAAGAAACA
CTGTCACCCGAAGAACTGAAGGAGAGAAAGATCATGAAGTTGCTGCTTAAGATTAAGAAT
GGAACACCGCCTATGTGCAAAGCAGCTCTCCGTCAAATCACAGACAAAGCTCGGGATTTC
GGCGCCGGACCGCTCTTTAATCAAATCCTACCGTTATTGATGAGTCCTACACTCGAAGAT
CAAGAACGTCATCTCTTAGTAAAAGTTATAGATCGAATTCTTTACAAATTAGATGATTTG
GTCCGCCCATATGTACACAAAATTTTGGTCGTCATAGAACCTCTGCTTATTGATGAAGAT
TACTACGCCCGTGTCGAGGGTCGAGAGATCATATCCAACTTGGCAAAAGCAGCTGGTTTA
GCCACAATGATCTCTACAATGAGACCAGATATTGATAATATCGATGAATATGTTCGAAAC
ACCACGGCCAGGGCCTTCGCTGTTGTTGCATCTGCTTTAGGTATACCGTCATTATTGCCG
TTTTTAAAGGCCGTGTGCAGATCGAAGAAGTCATGGCAGGCTCGTCACACCGGTATCAAA
ATCGTGCAACAAATCGCAATTCTAATGGGATGTGCCATTTTGCCCCATCTGAAGTCGCTC
GTGGAAATCATTGAGCATGGCTTGGTCGACGAACAACAAAAAGTTAGGACAATCACGGCG
TTGGCGAGCGCCGCTTTAGCCGAAGCAGCCACGCCGTACGGTATCGAGTCCTTTGACTCT
GTGCTAAAACCATTATGGAAGGGTATCAGAACCCATCGCGGTAAGGGTCTAGCGGCCTTC
CTTAAAGCTATCGGCTACCTCATACCTCTCATGGACGCCGAATATGCAAACTATTACACC
CGTGAGGTGATGCTTATATTGATCCGTGAGTTCCAGTCGCCCGACGAGGAAATGAAGAAG
ATTGTATTGAAGGTGGTGAAGCAGTGCTGCGGAACAGATGGTGTTGAACCTCAGTATATA
ATGGATGAAATCTTACCTCACTTCTTCAAACATTTCTGGAATCACAGAATGGCTTTGGAC
CGTCGCAACTATCGCCAACTTGTCGATACAACACAGCTTTATCGATTGTTCCATCAAGTT
GGGGCGTCCGAAATAATAAACAGAATCGTAGACGATCTCAAGGATGACAACGAACAGTAT
AGGAAAATGGTTATGGAGTCCATTGAAAAAATTCTAGCCAACTTGGGCGCAGCTGATATA
GATTCTAAGCTTGAGGAAGCCTTGATTGACGGCATTCTATACGCCTTCCAAGAACAGACC
ACTGAGGACGTGGTGATGTTGAATGGATTTGGTACAATAGTGAATCAACTCGGTAAGCGA
GTCAAGCCTTATTTACCACAAATCTGTGGTATAATTCTGTGGCGTATGAACAACAAGTCG
GCAAAGGTGAGGCAACAAGCTGCCGATCTTATTTCTCGTATCGCCGTAGTCATGAAAACT
TGTCAGGAGGAAAAACTTATGGGGCATCTCGGTGTAGTGCTATATGAATATCTCGGGGAG
GAATATCCTGAAGTACTCGGTTCTATTCTGGGTGCCTTAAAGGCTATAGTGAATGTGATC
GGTATGACCAAAATGACACCACCCATCAAGGATTTACTTCCTAGATTAACGCCAATTCTC
AAGAACAGACATGAAAAAGTGCAAGAAAATTGCATTGATCTGGTCGGACGTATTGCAGAC
AGGGGTCCCGAATTCGTGTCAGCGAGAGAGTGGATGAGGATTTGCTTTGAACTGCTGGAA
TTGCTCAAAGCACACAAGAAAGCTATCAGGAGAGCCACAGTCAATACATTTGGTTACATC
GCCAAAGCTATCGGTCCGCATGACGTACTTGCTACACTGCTCAATAATCTTAAAGTTCAA
GAGAGACAGAACAGAGTGTGCACAACAGTTGCAATTGCCATTGTAGCTGAGACATGTTCT
CCATTCACAGTCTTGCCAGCGCTGATGAATGAGTACAGAGTTCCAGAATTAAATGTTCAG
AATGGTGTTTTGAAATCGTTGTCATTTTTGTTTGAATACATCGGAGAAATGGGTAAAGAT
TACATATATGCTGTGTGCCCGTTACTAGAGGACGCACTTATGGACAGAGATTTAGTGCAT
CGACAAACTGCATGTGCCGCAATAAAACATATGGCATTGGGAGTGTATGGTTTCGGCTGT
GAGGATGCTCTAATACATTTGCTGAACCATGTTTGGCCGAATATATTTGAAACCTCGCCT
CATCTTGTACAAGCTTTTATGGACGCGGTTGAGGGCATGAGAGTTGCACTTGGCCCAATA
AAAATACTCCAGTACGCATTACAGGGCTTATTCCATCCAGCTCGAAAGGTCCGTGATGTT
TACTGGAAGATATATAACACATTATATATCGGAGGCCAAGACGCCCTGGTCGCTGGTTAC
CCACGGATACAAAATGATCCCAACAATCATTTTGTCAGATACGAGTTAGACTATTTGTTG
TAG
Protein sequence:
MDKIPRTHEAIEAQIKEIQSKKKELPENGSGKGVSLGDAFYDSDIYDNSGQGGKSRYDGY
VTSIAANDEVEDEDVENVPISQKRPGYTAPASLLNDIAQSDKDYDPFADKRRPTIADRED
EYRQKRRRMIISPERSDPFAEGGKTPDVGSRTYTEIMKEQYLRAEETELRKKLLERAREG
TLKAVSQSNGEATKPAAKRKGRWDQSSEDTPSVKKPVVQATPSSQATPSWENERGAWEET
PSAGGRGGETPGATPSARVWDATPAHLTPGHATPGRETPAHHASRRNRWDETPKTDRETP
GHASGWAETPRTDRGVGVDTIQETPTPGTKRRSRWDETPGATPAAATPTPSHATPSHATP
SHATPSMGTPTPHTPMFTPGGSTPVGVKAMAMATPTPGHIAAMTPEQLQAYRWEKEIDER
NRPYTDEELDAMFPPGYKVLPPPAGYVPIRTPARKLTATPTPLAGTPIGFFMQTEEVGGS
AAAAARLLDPQPKGSQQLPFMKPEDAQYFDKLLIDVDEETLSPEELKERKIMKLLLKIKN
GTPPMCKAALRQITDKARDFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDL
VRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRN
TTARAFAVVASALGIPSLLPFLKAVCRSKKSWQARHTGIKIVQQIAILMGCAILPHLKSL
VEIIEHGLVDEQQKVRTITALASAALAEAATPYGIESFDSVLKPLWKGIRTHRGKGLAAF
LKAIGYLIPLMDAEYANYYTREVMLILIREFQSPDEEMKKIVLKVVKQCCGTDGVEPQYI
MDEILPHFFKHFWNHRMALDRRNYRQLVDTTQLYRLFHQVGASEIINRIVDDLKDDNEQY
RKMVMESIEKILANLGAADIDSKLEEALIDGILYAFQEQTTEDVVMLNGFGTIVNQLGKR
VKPYLPQICGIILWRMNNKSAKVRQQAADLISRIAVVMKTCQEEKLMGHLGVVLYEYLGE
EYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIAD
RGPEFVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPHDVLATLLNNLKVQ
ERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKD
YIYAVCPLLEDALMDRDLVHRQTACAAIKHMALGVYGFGCEDALIHLLNHVWPNIFETSP
HLVQAFMDAVEGMRVALGPIKILQYALQGLFHPARKVRDVYWKIYNTLYIGGQDALVAGY
PRIQNDPNNHFVRYELDYLL