New model in OGS2.0 | DPOGS202047  |
---|---|
Genomic Position | scaffold795:+ 35055-42564 |
See gene structure | |
CDS Length | 3591 |
Paired RNAseq reads   | 3764 |
Single RNAseq reads   | 9094 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001281 (5e-35) |
Best Drosophila hit   | coro, isoform B (1e-06) |
Best Human hit | gem-associated protein 5 (5e-32) |
Best NR hit (blastp)   | PREDICTED: similar to gemin 5 [Tribolium castaneum] (3e-77) |
Best NR hit (blastx)   | PREDICTED: similar to gem (nuclear organelle) associated protein 5 [Nasonia vitripennis] (2e-69) |
GeneOntology terms    | GO:0005515 protein binding GO:0005737 cytoplasm GO:0008380 RNA splicing GO:0017069 snRNA binding GO:0005634 nucleus GO:0005654 nucleoplasm GO:0005829 cytosol GO:0015030 Cajal body GO:0016604 nuclear body GO:0000387 spliceosomal snRNP assembly GO:0005681 spliceosomal complex GO:0006461 protein complex assembly GO:0000398 nuclear mRNA splicing, via spliceosome |
InterPro families    | IPR019781 WD40 repeat, subgroup IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR015943 WD40/YVTN repeat-like-containing domain IPR019775 WD40 repeat, conserved site IPR011046 WD40 repeat-like-containing domain IPR001680 WD40 repeat |
Orthology group | MCL12851 |
Nucleotide sequence:
ATGGACGAGACTGTAATATTTCCGTCGCCAAATTGGTTCCAAGCATCTGTGATAGCGATT
TCTCATGATGGATGGCTTATTTATGGCGGGCCCATTAAAAGCCTTTGTATCTTAGAACCG
TTACATTCTGAACACGATGGAGTTTTCAAACACAATCAATCTTATAGAGCTCATGTGATG
AATAAAGCACATTTAGAAAAGATCACAAGTGTGGACATTTCAAAGGAATGGCCAGAAAAG
AAATTAGTTTTAACTGGAAGTGCTGATGGTTGTGTAAAGCAGTGGAACTTAGAACATTTT
AAGAACTCTATAAGACTTAAATCTACCCTCAGTCATGAAATTCATTATAATGATAAAGAG
GATGTAGCTGGTCTAGGTTATAGTACGGATGTGTTTGCCATCACAGTGGGAGGCTATGGT
AATATCGTGAAATGGGATTTGAAATCCAATGTTGTTAAGACTTATAATCAATTTTTAAAA
AGCTTCAAGCCAACTTGTGTTGCATGTTCTCAGCACACACCATTAAATGTTGCCGTGGGA
ACAAAACAGGGAGTTGTATTTGTTTTAGATTTAAATGGGAATGGCAAAATTGTCTATAAG
GTGAGGGGTCAGGATGATGAGATAATCAACTTGTCATGGTGTCCACAATATGAGGTGATT
CTTAAAAAGACCCTCAAAGAATCACAAAATCGTACACATCTAGAAAGTAAATTAGATAAA
TTAAAATTAAAGGATGCTGAAGAAGATTTAAATGATTCAGGAATATCAAAGAACCTTCCA
GAAGACAGCTTTGATGAATCCATTGCACAGGAAGATGATATGTTTGATATATACAAAGAC
CATGAAGCAGATGAATTTGGTCATAAAAAGTTTCAGCCGACCGACATCATAGTGAAGTTG
AAAAAAGAAACACCTTCCGGTGACTTCTTGGCTGAGTGTTTAAAACTAAAGGAGGCAATA
ATAAATAAAAAAAATGATAAGGAATCTTCAATAGCAACTCTAGTTGATGCATTGGATAAG
ACGCATGTAGACAGTAATGATTATAATGCAGGAGAGGTTACAGAAGAAACACAATCTAAA
AATATGGAAGAATGTAGTTCTCATTTGCACAAACATCTCTTGGCTACTATCGGAAAATAC
GGGGGCGTAAGGCTGTGGTCCAAATCGGGGAAACTTGTAGGTTCTTGTGTTGTACCAAAC
GCCGTGAACAAGAATCATAGGAGTAAAGGTCCCATAGCAACAACATTATTATGGTACAAG
CCTGATGTATTGCTCATCGCGGATGGAAAAAGTCAATTGCTTGAGTGCAATCCAATGAAG
ATAGACTGTAGGAACAAACTAGATTGGCAGATAGTCCACTCGTTGCACAAACGTGGTTTG
TACGCGATCGCAACTAACGCGCCTCGTGTCCAAACAGAAAATTCGAATGGTTCAGATGAT
TGGCTAGTTTGGACGATTGCGCAAGATCGTAACATTGTCTGTTATTCTATGGAAAGAAAG
GAAAAGATATCTGTGCACAACACCTGTGGAGGCTTTGTATACTCCATACAGCACTGTCCT
TATGATGCTAAGAAGATAGCAGTAAGTGTAGGTGATGGGGCTGTACGCATTTGGAACACA
GATACGCTTGTAGAAGATGACAGCAAATTGTCTATGGGTCATGTGACTTCATACTGGCAG
AATGTTCAAGGCAAGGTGTTGACGGTTGCATGGCACCCCACAAAAGAGAATTTACTTGCC
TTTGGTACTGCTGAAGCCAGGGTGGGTTTGATTGATACAAGTGGTAAGACAGAGCGGCCG
GCCAGAACATTACTTCCAGTACTACAAGGTGGAGTGTATTCTTTGTGTTGGGGACGGAAT
GATCAACTTTATGCCTGCGGTGGAGGGAAACTGGTCGTCTATAACACAGATGCTATCGAT
AAAGATCCAATGCCAATAAAAGTCCAATTTGAAGGAAAGCAATGGGAATTAAGTTCAGTG
TTGTTTCATAGCCGAGGTCTAGTGTGTGGTGGAGTTAATGGGGCTTTGGCTGTATTAGAT
CCTGACACCAATGAAATTTTAACTGCGTCTTTTATATTTGGCAAAATGATATACACTACA
GAATGGCACCCTCAGCAGACATCTACATCCAGCGAGGATTCTATATATAGAGACTTAATA
GCTGTCTCGTCTCTTGATAAAGCGTGCAGTATTATAATCGTGGAGTATAATGACAAAGGA
GACGGTCCTAAAATACACCCGTTTAAAACTTTGTCTGGTCACACGGCGACCGTGCTACAG
CTGTCATGGAATCCACACAACGATGTCCAGCTCTTGTCGACCTCACATGACACTACAGTT
CGAATCTGGGATATCTCATCTGGTGAATGTACTCATATTTTCGGAGGTCATTGCCACGCC
TCGCTCAGTGCATGTTGGAGCTCATTTCCATCACTGTCCAATGTAGTAATGTCAAGCGGG
TCCGACTGTTGTTTGAGATTGTGGCAAGTGGACAGACATACAACAGATGTTTATACTGAC
ATGTTTCGCAAAATGGCTCCGGGAGGCGCGAAAAAAACTAAGACTAAAAAGGCTGAAATT
CAAGAATTGGAAAAAGGTGAAGAACAAGTCGCTACCACCTTCGATACGAAAGCGTCCACC
AAAGCACCAAAGAAATTCCTCTTACCTATAATTAGTAAACAAATATCGCCATGCACTGTG
TACAGTGTGAGACAAATGCTAGTCAAATATTGGAGCGATCGAAATGGAACAAACGAGAAG
GTGGCTAACGGTCAACCTGATGTCGTGGAAGAAAACGTCGAAGGGAAGGAGGCAGAGGAA
AAAATCGTGGAATTCACAAAGATTTTCGGTACAACGAATGATCTGAACGAAGTCTTGGAC
ATGGAAATGGCTCGTCACTCCACATGTAACCGTTGGGAGTCGTGTGTTGTGCTGAACGTG
CTCCGCGGTCAGATGTCCGATATGGTGACGTCAGCGGCCGCCCGCGGGGAGCTCTGCCCG
TTCATTGTGAGCCTCGCCCCCACCGTCTCTCACAAATTTTGGAAAGATGCAACGCAAATG
TATTTGGCTCAAATCGATCGAATGATTGCTAAAGGAGAGGAAGAGAAGCTTAGCGAGAAC
AAACAGTACGGTGGCGCCATCTACCGTAAGGCGTGTCTTCAGCTGTGTTCACACGACGTA
CGAGCCGCCGTACATACACTCGTAGACGCGAGACTGTTCAAGGAGGCTTACATATTGGGG
AGGGTCAGGCATATGGACAGCATAGCGGAGGACACGTTAAAGAAATGGGCAACTGATTGT
TTACAAACTGGCAACATTTGTATGGCTGCGGTGTGTTATTTAGCCTTAGGCGATCCGTAC
CAAGCCGCTCTGGCCTTGTCAAAATCGGACGATCAAGAACTACTCGGCATAGCGTCGGAA
CTAGCGAAGGAATCTGGACAGGCGACATTCGCTAATCATATAGAAGATAAGAAAACGCAA
ATATTAAGCGAAACGTCGGAAAATGATGAACAACTAAAGAAACTTCCTACAAAAATCGAT
CTATTGATTGATAGTGTTGGCACTAGTGAAGTTACATCGGATGTGATATGA
Protein sequence:
MDETVIFPSPNWFQASVIAISHDGWLIYGGPIKSLCILEPLHSEHDGVFKHNQSYRAHVM
NKAHLEKITSVDISKEWPEKKLVLTGSADGCVKQWNLEHFKNSIRLKSTLSHEIHYNDKE
DVAGLGYSTDVFAITVGGYGNIVKWDLKSNVVKTYNQFLKSFKPTCVACSQHTPLNVAVG
TKQGVVFVLDLNGNGKIVYKVRGQDDEIINLSWCPQYEVILKKTLKESQNRTHLESKLDK
LKLKDAEEDLNDSGISKNLPEDSFDESIAQEDDMFDIYKDHEADEFGHKKFQPTDIIVKL
KKETPSGDFLAECLKLKEAIINKKNDKESSIATLVDALDKTHVDSNDYNAGEVTEETQSK
NMEECSSHLHKHLLATIGKYGGVRLWSKSGKLVGSCVVPNAVNKNHRSKGPIATTLLWYK
PDVLLIADGKSQLLECNPMKIDCRNKLDWQIVHSLHKRGLYAIATNAPRVQTENSNGSDD
WLVWTIAQDRNIVCYSMERKEKISVHNTCGGFVYSIQHCPYDAKKIAVSVGDGAVRIWNT
DTLVEDDSKLSMGHVTSYWQNVQGKVLTVAWHPTKENLLAFGTAEARVGLIDTSGKTERP
ARTLLPVLQGGVYSLCWGRNDQLYACGGGKLVVYNTDAIDKDPMPIKVQFEGKQWELSSV
LFHSRGLVCGGVNGALAVLDPDTNEILTASFIFGKMIYTTEWHPQQTSTSSEDSIYRDLI
AVSSLDKACSIIIVEYNDKGDGPKIHPFKTLSGHTATVLQLSWNPHNDVQLLSTSHDTTV
RIWDISSGECTHIFGGHCHASLSACWSSFPSLSNVVMSSGSDCCLRLWQVDRHTTDVYTD
MFRKMAPGGAKKTKTKKAEIQELEKGEEQVATTFDTKASTKAPKKFLLPIISKQISPCTV
YSVRQMLVKYWSDRNGTNEKVANGQPDVVEENVEGKEAEEKIVEFTKIFGTTNDLNEVLD
MEMARHSTCNRWESCVVLNVLRGQMSDMVTSAAARGELCPFIVSLAPTVSHKFWKDATQM
YLAQIDRMIAKGEEEKLSENKQYGGAIYRKACLQLCSHDVRAAVHTLVDARLFKEAYILG
RVRHMDSIAEDTLKKWATDCLQTGNICMAAVCYLALGDPYQAALALSKSDDQELLGIASE
LAKESGQATFANHIEDKKTQILSETSENDEQLKKLPTKIDLLIDSVGTSEVTSDVI