DPGLEAN09658 in OGS1.0

New model in OGS2.0DPOGS202047 
Genomic Positionscaffold795:+ 35055-42564
See gene structure
CDS Length3591
Paired RNAseq reads  3764
Single RNAseq reads  9094
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001281 (5e-35)
Best Drosophila hit  coro, isoform B (1e-06)
Best Human hitgem-associated protein 5 (5e-32)
Best NR hit (blastp)  PREDICTED: similar to gemin 5 [Tribolium castaneum] (3e-77)
Best NR hit (blastx)  PREDICTED: similar to gem (nuclear organelle) associated protein 5 [Nasonia vitripennis] (2e-69)
GeneOntology terms











  
GO:0005515 protein binding
GO:0005737 cytoplasm
GO:0008380 RNA splicing
GO:0017069 snRNA binding
GO:0005634 nucleus
GO:0005654 nucleoplasm
GO:0005829 cytosol
GO:0015030 Cajal body
GO:0016604 nuclear body
GO:0000387 spliceosomal snRNP assembly
GO:0005681 spliceosomal complex
GO:0006461 protein complex assembly
GO:0000398 nuclear mRNA splicing, via spliceosome
InterPro families





  
IPR019781 WD40 repeat, subgroup
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
Orthology groupMCL12851

Nucleotide sequence:

ATGGACGAGACTGTAATATTTCCGTCGCCAAATTGGTTCCAAGCATCTGTGATAGCGATT
TCTCATGATGGATGGCTTATTTATGGCGGGCCCATTAAAAGCCTTTGTATCTTAGAACCG
TTACATTCTGAACACGATGGAGTTTTCAAACACAATCAATCTTATAGAGCTCATGTGATG
AATAAAGCACATTTAGAAAAGATCACAAGTGTGGACATTTCAAAGGAATGGCCAGAAAAG
AAATTAGTTTTAACTGGAAGTGCTGATGGTTGTGTAAAGCAGTGGAACTTAGAACATTTT
AAGAACTCTATAAGACTTAAATCTACCCTCAGTCATGAAATTCATTATAATGATAAAGAG
GATGTAGCTGGTCTAGGTTATAGTACGGATGTGTTTGCCATCACAGTGGGAGGCTATGGT
AATATCGTGAAATGGGATTTGAAATCCAATGTTGTTAAGACTTATAATCAATTTTTAAAA
AGCTTCAAGCCAACTTGTGTTGCATGTTCTCAGCACACACCATTAAATGTTGCCGTGGGA
ACAAAACAGGGAGTTGTATTTGTTTTAGATTTAAATGGGAATGGCAAAATTGTCTATAAG
GTGAGGGGTCAGGATGATGAGATAATCAACTTGTCATGGTGTCCACAATATGAGGTGATT
CTTAAAAAGACCCTCAAAGAATCACAAAATCGTACACATCTAGAAAGTAAATTAGATAAA
TTAAAATTAAAGGATGCTGAAGAAGATTTAAATGATTCAGGAATATCAAAGAACCTTCCA
GAAGACAGCTTTGATGAATCCATTGCACAGGAAGATGATATGTTTGATATATACAAAGAC
CATGAAGCAGATGAATTTGGTCATAAAAAGTTTCAGCCGACCGACATCATAGTGAAGTTG
AAAAAAGAAACACCTTCCGGTGACTTCTTGGCTGAGTGTTTAAAACTAAAGGAGGCAATA
ATAAATAAAAAAAATGATAAGGAATCTTCAATAGCAACTCTAGTTGATGCATTGGATAAG
ACGCATGTAGACAGTAATGATTATAATGCAGGAGAGGTTACAGAAGAAACACAATCTAAA
AATATGGAAGAATGTAGTTCTCATTTGCACAAACATCTCTTGGCTACTATCGGAAAATAC
GGGGGCGTAAGGCTGTGGTCCAAATCGGGGAAACTTGTAGGTTCTTGTGTTGTACCAAAC
GCCGTGAACAAGAATCATAGGAGTAAAGGTCCCATAGCAACAACATTATTATGGTACAAG
CCTGATGTATTGCTCATCGCGGATGGAAAAAGTCAATTGCTTGAGTGCAATCCAATGAAG
ATAGACTGTAGGAACAAACTAGATTGGCAGATAGTCCACTCGTTGCACAAACGTGGTTTG
TACGCGATCGCAACTAACGCGCCTCGTGTCCAAACAGAAAATTCGAATGGTTCAGATGAT
TGGCTAGTTTGGACGATTGCGCAAGATCGTAACATTGTCTGTTATTCTATGGAAAGAAAG
GAAAAGATATCTGTGCACAACACCTGTGGAGGCTTTGTATACTCCATACAGCACTGTCCT
TATGATGCTAAGAAGATAGCAGTAAGTGTAGGTGATGGGGCTGTACGCATTTGGAACACA
GATACGCTTGTAGAAGATGACAGCAAATTGTCTATGGGTCATGTGACTTCATACTGGCAG
AATGTTCAAGGCAAGGTGTTGACGGTTGCATGGCACCCCACAAAAGAGAATTTACTTGCC
TTTGGTACTGCTGAAGCCAGGGTGGGTTTGATTGATACAAGTGGTAAGACAGAGCGGCCG
GCCAGAACATTACTTCCAGTACTACAAGGTGGAGTGTATTCTTTGTGTTGGGGACGGAAT
GATCAACTTTATGCCTGCGGTGGAGGGAAACTGGTCGTCTATAACACAGATGCTATCGAT
AAAGATCCAATGCCAATAAAAGTCCAATTTGAAGGAAAGCAATGGGAATTAAGTTCAGTG
TTGTTTCATAGCCGAGGTCTAGTGTGTGGTGGAGTTAATGGGGCTTTGGCTGTATTAGAT
CCTGACACCAATGAAATTTTAACTGCGTCTTTTATATTTGGCAAAATGATATACACTACA
GAATGGCACCCTCAGCAGACATCTACATCCAGCGAGGATTCTATATATAGAGACTTAATA
GCTGTCTCGTCTCTTGATAAAGCGTGCAGTATTATAATCGTGGAGTATAATGACAAAGGA
GACGGTCCTAAAATACACCCGTTTAAAACTTTGTCTGGTCACACGGCGACCGTGCTACAG
CTGTCATGGAATCCACACAACGATGTCCAGCTCTTGTCGACCTCACATGACACTACAGTT
CGAATCTGGGATATCTCATCTGGTGAATGTACTCATATTTTCGGAGGTCATTGCCACGCC
TCGCTCAGTGCATGTTGGAGCTCATTTCCATCACTGTCCAATGTAGTAATGTCAAGCGGG
TCCGACTGTTGTTTGAGATTGTGGCAAGTGGACAGACATACAACAGATGTTTATACTGAC
ATGTTTCGCAAAATGGCTCCGGGAGGCGCGAAAAAAACTAAGACTAAAAAGGCTGAAATT
CAAGAATTGGAAAAAGGTGAAGAACAAGTCGCTACCACCTTCGATACGAAAGCGTCCACC
AAAGCACCAAAGAAATTCCTCTTACCTATAATTAGTAAACAAATATCGCCATGCACTGTG
TACAGTGTGAGACAAATGCTAGTCAAATATTGGAGCGATCGAAATGGAACAAACGAGAAG
GTGGCTAACGGTCAACCTGATGTCGTGGAAGAAAACGTCGAAGGGAAGGAGGCAGAGGAA
AAAATCGTGGAATTCACAAAGATTTTCGGTACAACGAATGATCTGAACGAAGTCTTGGAC
ATGGAAATGGCTCGTCACTCCACATGTAACCGTTGGGAGTCGTGTGTTGTGCTGAACGTG
CTCCGCGGTCAGATGTCCGATATGGTGACGTCAGCGGCCGCCCGCGGGGAGCTCTGCCCG
TTCATTGTGAGCCTCGCCCCCACCGTCTCTCACAAATTTTGGAAAGATGCAACGCAAATG
TATTTGGCTCAAATCGATCGAATGATTGCTAAAGGAGAGGAAGAGAAGCTTAGCGAGAAC
AAACAGTACGGTGGCGCCATCTACCGTAAGGCGTGTCTTCAGCTGTGTTCACACGACGTA
CGAGCCGCCGTACATACACTCGTAGACGCGAGACTGTTCAAGGAGGCTTACATATTGGGG
AGGGTCAGGCATATGGACAGCATAGCGGAGGACACGTTAAAGAAATGGGCAACTGATTGT
TTACAAACTGGCAACATTTGTATGGCTGCGGTGTGTTATTTAGCCTTAGGCGATCCGTAC
CAAGCCGCTCTGGCCTTGTCAAAATCGGACGATCAAGAACTACTCGGCATAGCGTCGGAA
CTAGCGAAGGAATCTGGACAGGCGACATTCGCTAATCATATAGAAGATAAGAAAACGCAA
ATATTAAGCGAAACGTCGGAAAATGATGAACAACTAAAGAAACTTCCTACAAAAATCGAT
CTATTGATTGATAGTGTTGGCACTAGTGAAGTTACATCGGATGTGATATGA

Protein sequence:

MDETVIFPSPNWFQASVIAISHDGWLIYGGPIKSLCILEPLHSEHDGVFKHNQSYRAHVM
NKAHLEKITSVDISKEWPEKKLVLTGSADGCVKQWNLEHFKNSIRLKSTLSHEIHYNDKE
DVAGLGYSTDVFAITVGGYGNIVKWDLKSNVVKTYNQFLKSFKPTCVACSQHTPLNVAVG
TKQGVVFVLDLNGNGKIVYKVRGQDDEIINLSWCPQYEVILKKTLKESQNRTHLESKLDK
LKLKDAEEDLNDSGISKNLPEDSFDESIAQEDDMFDIYKDHEADEFGHKKFQPTDIIVKL
KKETPSGDFLAECLKLKEAIINKKNDKESSIATLVDALDKTHVDSNDYNAGEVTEETQSK
NMEECSSHLHKHLLATIGKYGGVRLWSKSGKLVGSCVVPNAVNKNHRSKGPIATTLLWYK
PDVLLIADGKSQLLECNPMKIDCRNKLDWQIVHSLHKRGLYAIATNAPRVQTENSNGSDD
WLVWTIAQDRNIVCYSMERKEKISVHNTCGGFVYSIQHCPYDAKKIAVSVGDGAVRIWNT
DTLVEDDSKLSMGHVTSYWQNVQGKVLTVAWHPTKENLLAFGTAEARVGLIDTSGKTERP
ARTLLPVLQGGVYSLCWGRNDQLYACGGGKLVVYNTDAIDKDPMPIKVQFEGKQWELSSV
LFHSRGLVCGGVNGALAVLDPDTNEILTASFIFGKMIYTTEWHPQQTSTSSEDSIYRDLI
AVSSLDKACSIIIVEYNDKGDGPKIHPFKTLSGHTATVLQLSWNPHNDVQLLSTSHDTTV
RIWDISSGECTHIFGGHCHASLSACWSSFPSLSNVVMSSGSDCCLRLWQVDRHTTDVYTD
MFRKMAPGGAKKTKTKKAEIQELEKGEEQVATTFDTKASTKAPKKFLLPIISKQISPCTV
YSVRQMLVKYWSDRNGTNEKVANGQPDVVEENVEGKEAEEKIVEFTKIFGTTNDLNEVLD
MEMARHSTCNRWESCVVLNVLRGQMSDMVTSAAARGELCPFIVSLAPTVSHKFWKDATQM
YLAQIDRMIAKGEEEKLSENKQYGGAIYRKACLQLCSHDVRAAVHTLVDARLFKEAYILG
RVRHMDSIAEDTLKKWATDCLQTGNICMAAVCYLALGDPYQAALALSKSDDQELLGIASE
LAKESGQATFANHIEDKKTQILSETSENDEQLKKLPTKIDLLIDSVGTSEVTSDVI