DPGLEAN00713 in OGS1.0

New model in OGS2.0DPOGS214942 
Genomic Positionscaffold103:- 113346-120523
See gene structure
CDS Length2538
Paired RNAseq reads  2729
Single RNAseq reads  6531
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004823 (0.0)
Best Drosophila hit  CG3605 (9e-169)
Best Human hitsplicing factor 3B subunit 2 (1e-147)
Best NR hit (blastp)  PREDICTED: similar to CG3605 CG3605-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC008829 [Tribolium castaneum] (0.0)
GeneOntology terms



  
GO:0005681 spliceosomal complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005686 U2 snRNP
GO:0071013 catalytic step 2 spliceosome
GO:0071011 precatalytic spliceosome
InterPro families
  
IPR007180 Domain of unknown function DUF382
IPR006568 PSP, proline-rich
Orthology groupMCL12528

Nucleotide sequence:

ATGGGCCCACCGCCAGGAATGCCGAGTTTTCCTCCCATGCCTCCATCGTCAGGCCCCATG
GGTCCTGGAAGTATGCCGCCTCCTCCAGTGGGTCCACCTGGTACAATGCCTGCAGTCACA
ACATCTGGTGGTCCACCAAACATGCCGCCACCAGGAATGGGTCCACCTCCTAACATGATG
GGTATGGGTCCTCCTGGTATCGGTCCTCCACCCCCGCCCGGTTTGGGACCCCCAGGAATC
AACATGGGACCTCCTCCGATGGGACCGCCAGGCCTTCCGTCACGAATGCCTCCTAACATG
ATGCGGGGAACATCTAATATGAAGAGTAACTACAATCAAACTATAGATATGGGACCGCCT
GGTATGGTGCCACCCTCTAGTATGAATCCTTGGGACAATCAATGCCCTCCTGGTTGGGGG
CGACAAGGGAGAGGGGATGGCCCTCCAGGATGGGACGATCAGGACGATGATGAAGATGAT
AATGATGATGAAAGTGATCCTTCAGGACCTCCACTACCATCCTTGTTGACCATGAAAATA
GATACACCCGAGGAGTTCAGAAATAAACCCCCTTCTGCTGTGGGTGGTGTTGTGCTACCA
AAAGCCTTGGAGGAGGCACTCGCTTACAAAGATCAAAGACAAGCTGCCTTAGGAGATGAA
GCAGATAAAGTAACAGAGCAAACAAAGAAACCTGAACCTCCACCGGCACCTGTGATCAGT
ACAGAGTATGATGGTGAAGAAGAAGGAGACTCGGATGAAGATAACATACCAGAAGCTCCC
TTACCACCAATAATATCTAAGCAAGAGAATCAAACCAAAGCGAGTAAAACTAAACGGAAA
AAGAAGAAGAAGAAGGCGGCGAAACAGAAGAGAAAAGAAGCAAAGTCGGCCGACGAAAGT
AGCAAAGAAGCCCAGAAGACCAGCGACAAAGAAAACGAAAAGGAAGCTGAAATCGAATAC
GTCCAAGAGAACATACAGTTCCACGAACTGGAGCCCATGTACCGTCAGTTCCACCGCATC
CTGGAATCGTTTAAGATAACGGAGAGGAAGGAGGAGATCAAGGATGAACCCGGGAAAGAT
GCACCGAAACCGAGCAAGCCGCTGGAGAAAGTTACCGACCAATTTGCAGCTGACGAAGAG
GCTGTTGAGAAACATGCAGCCGATGAGAAGGAGCGGCTCTCAAAACGCAAGTTAAAGAAG
CTGTCTCGTCTGTCCGTGGCGGAGCTGAAGCAACTGGTGGCCCGGCCGGATGTAGTGGAG
ATGTACGACGTCACCGCCAGGGACCCCAAACTGCTGGTACAGCTGAAGGCTCACAGGAAC
ACTGTCCAAGTGCCGCGCCACTGGTGTTACAAACGGAAGTATCTGCAAGGCAAGCGCGGT
ATCGAGAAGCCGCCGTTCGACCTGCCGGACTTCATCAAGAAGACCGGCATCATGGAGATG
AGAGCCTCGCTCCAGGACAAGGAGGAAACTAAGACATTGAAGGCGAAGATGAGGGAGAGG
ACGCGACCCAAGCTCGGGAAGATTGACATCGACTACCAGAAGCTGCACGACGCGTTCTTC
AAGTGGCAGACGAAACCTCGCATGACCATCCACGGTGACCTCTACTACGAGGGTAAGGAG
TATGAAACTCGACTGAGAGAAAAGAAACCGGGAGATCTCTCAGAGGAACTGAGAACCGCA
CTGGGCATGCCGGTGGGACCTGGCTCTCATAAGGTGCCGCCGCCGTGGCTGATCGCCCAG
CAGCGTTACGGACCGCCTCCGTCTTACCCAAACCTCAAGATCCCGGGCCTGAACGCTCCT
ATACCCGAGGGTTGCGCCTTCGGGTACCACGCGGGCGGCTGGGGTAAGCCTCCCGTCGAT
GAAGCCGGCAAACCTCTCTACGGAGACGTGTTCGGACATCAGAGCAGCGGCCAAGATGAT
GCCGAGGATCAAGATATAGACAGGACCATGTGGGGTGAACTGGAGTCGGAGTCAGAGGAG
GAATCGGAAGAAGAGGAATCAGATGAGGGCGAGAAGGCCGGTGAGGGTGAGGCCGTGGCA
GCGGGCGTGGCGACTCCTGGTGAGGGACTCGTCACACCGCTGGGCACCAGCTCTGTACCG
CCCGGACTGGAGACACCTGACACCATCGAGCTCAGGAAGAAGAAGATGGAGGATCTAGAA
GGCGGTGAGACACCGGCCTTGTATCAAGTGGTCCCCGAGAGACGAGTTGGTCTCACGTCT
GGTATGATGGCGTCCACACATGTGTATGACATCAATGCCGCAAATCCTGGTAAACGAGCT
CCGACCGGTGCAACCAGTGAGGTTGGTCCCAGCGCTGCAGCTGGTGTAGAAGTGGCGCTG
GACCCCTCGGAGCTGGAGCTGGAGCCCGAGGCTGTGGCGGCCAGGTACGAGAGACACCTG
CGGGAACACAGGCCCAAGGGACGCGAGGACCTCTCAGATATGTTGGCCGACCACGTCGCC
AGACAGAAGAATAAACGAAAGCGTCAACAAAACACAGATTCCAAGCAAGCGAAGAAATAC
AAAGAATTCAAGTTCTAA

Protein sequence:

MGPPPGMPSFPPMPPSSGPMGPGSMPPPPVGPPGTMPAVTTSGGPPNMPPPGMGPPPNMM
GMGPPGIGPPPPPGLGPPGINMGPPPMGPPGLPSRMPPNMMRGTSNMKSNYNQTIDMGPP
GMVPPSSMNPWDNQCPPGWGRQGRGDGPPGWDDQDDDEDDNDDESDPSGPPLPSLLTMKI
DTPEEFRNKPPSAVGGVVLPKALEEALAYKDQRQAALGDEADKVTEQTKKPEPPPAPVIS
TEYDGEEEGDSDEDNIPEAPLPPIISKQENQTKASKTKRKKKKKKAAKQKRKEAKSADES
SKEAQKTSDKENEKEAEIEYVQENIQFHELEPMYRQFHRILESFKITERKEEIKDEPGKD
APKPSKPLEKVTDQFAADEEAVEKHAADEKERLSKRKLKKLSRLSVAELKQLVARPDVVE
MYDVTARDPKLLVQLKAHRNTVQVPRHWCYKRKYLQGKRGIEKPPFDLPDFIKKTGIMEM
RASLQDKEETKTLKAKMRERTRPKLGKIDIDYQKLHDAFFKWQTKPRMTIHGDLYYEGKE
YETRLREKKPGDLSEELRTALGMPVGPGSHKVPPPWLIAQQRYGPPPSYPNLKIPGLNAP
IPEGCAFGYHAGGWGKPPVDEAGKPLYGDVFGHQSSGQDDAEDQDIDRTMWGELESESEE
ESEEEESDEGEKAGEGEAVAAGVATPGEGLVTPLGTSSVPPGLETPDTIELRKKKMEDLE
GGETPALYQVVPERRVGLTSGMMASTHVYDINAANPGKRAPTGATSEVGPSAAAGVEVAL
DPSELELEPEAVAARYERHLREHRPKGREDLSDMLADHVARQKNKRKRQQNTDSKQAKKY
KEFKF