New model in OGS2.0 | DPOGS215570  |
---|---|
Genomic Position | scaffold392:+ 9917-12205 |
CDS Length | 1275 |
Paired RNAseq reads   | 377 |
Single RNAseq reads   | 1422 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010974 (5e-165) |
Best Drosophila hit   | transport and golgi organization 4 (6e-159) |
Best Human hit | pleiotropic regulator 1 (7e-150) |
Best NR hit (blastp)   | AGAP005631-PA [Anopheles gambiae str. PEST] (2e-178) |
Best NR hit (blastx)   | striatin, putative [Aedes aegypti] (4e-167) |
GeneOntology terms    | GO:0007030 Golgi organization; GO:0005829 cytosol; GO:0071013 catalytic step 2 spliceosome; GO:0071011 precatalytic spliceosome; GO:0000398 nuclear mRNA splicing, via spliceosome |
InterPro families    | IPR019775 WD40 repeat, conserved site; IPR019781 WD40 repeat, subgroup; IPR001680 WD40 repeat; IPR019782 WD40 repeat 2; IPR017986 WD40-repeat-containing domain; IPR015943 WD40/YVTN repeat-like-containing domain; IPR011046 WD40 repeat-like-containing domain; IPR020472 G-protein beta WD-40 repeat |
Orthology group | MCL14294 |
Nucleotide sequence:
ATGGCAAATGCCGCTTTGAAGGGAAACAGTAGTCTTTTAAACTGGAATGGCGAATCAGTT
AGAATTTCTGGGCTACAGGGAAACAAAACACTTTCAACATTTTTAGTAGAGGAGCCCTGT
TCGAGAAACGTCGTGAAGCTACACGACATAGTAGTAGCGAGACAAATCATCACATACATA
GCGGAGTCTGCGGCGTTGGGCTGCGACGGCGCCATGTTGCCGGCCTCAGGTCCTAGCGCG
GCGCCAGGTCCGGTGACGCCGACACCCCCGCGACGCGCCGCCGCCCCCCGCCCTCGCTGG
CACGCTCCCTGGAAACTGTTCCGCGTCATCTCCGGTCACCTGGGCTGGGTGCGCTGCGTC
GCCGTGGAGCCTGGGAACGAGTGGTTCGCGACCGGTGCCGCCGACAGAGTCATCAAGATA
TGGGACCTCGCCAGCGGGAAGCTCAAAGTGTCTCTCACCGGTCACGTGAGCACAGTCAGA
GGTCTGGAGGTGTCGGCGCGTCACCCCTACCTGTTCAGTTGCGGCGAAGACAGACAGGTG
AAATGTTGGGACCTGGAATATAACAAGGTGATTCGTCACTACCACGGCCACCTGTCGGCT
GTGTACTCTCTGGCGCTGCACCCCACCATCGACGTGCTGGTGTCGGCGGGCCGCGACGCC
ACGGCCCGGGTGTGGGACGCCCGCAGCAAGGCCAACGTACACACCCTGTCGGGCCACACG
GACACGGTGGCGTCCTTGGCGTGCCAGGCCGCCGAGCCGCAGGTCATCACGGGTTCGCAC
GACGCCACCGTGCGCCTGTGGGACTTGGCGGCCGGCAAGTCGCTGTGCACGCTCACCAAC
CACAAGAAGTCGGTGCGCGCGCTGGCTCTGCATCCCGCGCTGTACACGTTCGCGTCCGCG
TCGCCGGACAACATCAAGCAGTGGAAATGCCCGGACGGCCGCTTCATCCAGAACCTGTCG
GGCCACAACGCCATCGTCAACTGCCTGGCCGTGAACAACGAGGGCGTGCTGGCGAGCGGC
GGCGACAACGGCTCCATGTTTCTGTGGGACTGGCGCACGGGGTACAACTTCCAGCGCCTG
CAGTCGGCCGTGCAGCCCGGTTCCATGGACTCGGAGGCCGGCATCTTCGCGCTCGCCTTC
GACCGCTCGGGCTCGCGCCTGCTCACGGCCGAAGCCGACAAGACCATCAAGATATACCGT
GAGGACGACGCCGCCTCGGAGGAGACGCACCCCGTCAACTGGAGGCCGGAGATCATCAAG
AGGAGGAAGTACTAG
Protein sequence:
MANAALKGNSSLLNWNGESVRISGLQGNKTLSTFLVEEPCSRNVVKLHDIVVARQIITYI
AESAALGCDGAMLPASGPSAAPGPVTPTPPRRAAAPRPRWHAPWKLFRVISGHLGWVRCV
AVEPGNEWFATGAADRVIKIWDLASGKLKVSLTGHVSTVRGLEVSARHPYLFSCGEDRQV
KCWDLEYNKVIRHYHGHLSAVYSLALHPTIDVLVSAGRDATARVWDARSKANVHTLSGHT
DTVASLACQAAEPQVITGSHDATVRLWDLAAGKSLCTLTNHKKSVRALALHPALYTFASA
SPDNIKQWKCPDGRFIQNLSGHNAIVNCLAVNNEGVLASGGDNGSMFLWDWRTGYNFQRL
QSAVQPGSMDSEAGIFALAFDRSGSRLLTAEADKTIKIYREDDAASEETHPVNWRPEIIK
RRKY
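
The nucleotide and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database record. It translates only the first and last 15 nucleotides of the CDS as a spot check.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last 15 nucleotides of the CDS listed above.
print(translate("ATGGCAAATGCCGCT"))  # start of the protein
print(translate("AGGAGGAAGTACTAG"))  # end of the protein, including the stop
```

The first call yields `MANAA` and the second `RRKY*`, matching the start and end of the listed protein sequence. The stated CDS length of 1275 nucleotides corresponds to 425 codons, the last of which is the `TAG` stop codon.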