New model in OGS2.0 | DPOGS203950 |
---|---|
Genomic Position | scaffold65:+ 118233-136242 |
CDS Length | 3393 |
Paired RNAseq reads | 620 |
Single RNAseq reads | 1438 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000478 (0.0) |
Best Drosophila hit | Oseg1 (0.0) |
Best Human hit | intraflagellar transport protein 122 homolog isoform 3 (0.0) |
Best NR hit (blastp) | PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum] (0.0) |
Best NR hit (blastx) | PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum] (0.0) |
GeneOntology terms | GO:0019861 flagellum |
InterPro families | IPR015943 WD40/YVTN repeat-like-containing domain; IPR019782 WD40 repeat 2; IPR017986 WD40-repeat-containing domain; IPR019781 WD40 repeat, subgroup; IPR011046 WD40 repeat-like-containing domain; IPR001680 WD40 repeat |
Orthology group | MCL11472 |
Nucleotide sequence:
ATGAGAACTGTTCCGAAGTGGGTAAATAAAATTCATGAAGCTGATAAAATTGAGTTTTCG
AGTGTACACGCAATATGCTTCAGTCCAGATGGCACTCAATTAGTAGTAGGTGCTGGAGAA
AAGGTCATGGTTTATGACCCAAGAGATGGTTCACTGTTACAACTTCTGCAGGCCCATAAA
GGAATGGTTCATACTGTAGCTTATTGCAGTGATGGCAAAAAGTTTGCTAGTGGTAGTGCA
GACAAAAATGTAATCATATGGACATCTAAAATGGAAGGTGTTCTTAAATATTCACACAGT
GAAGCAATCCAATGTGTAGCATATAATCCGGTCACTTATCACTTAGCTTCATGCGCACTC
TCTGATTTTGCATTCTGGTCAGCCGATGTCAAAGCTGTTCAGAAATATCGAGTGGCAGGT
CGGATTACTAGTTGTGCTTGGGCAGCAACAGGTCAATACCTAGCTATTGGACTTGCTAGT
GGCATAGTTTCAATTCGCAATAAGGTTGGTGATGAAATTACCAGAATAACAAGAGATGCT
GCAGTGTGGGCTGTAGCATTCTATAAGAACACATTATTGGTTACAGACTGGAATGATACC
CTATCATTTTATGATATGATGGGACAACCTCTTTTGAAAGAAAGAAATATAGAAATTTCG
GCTGTTTCAATGACAATTTTGGGTGCCTTGATATTAGTTGGTGGCTTGGGGGGTTGGGCC
ATACTTACCTCAGAAGGAGTATCAATATTAAATACATCACTGGACTGGGTATGGTCTATT
GCACCATCACCAATTACAAACACTATGGCAGTTGCATGTCAAGACGGAACTCTGTGGTGT
TACCAAGTTGTCTTCAACACAGTTCATGGGCTATTTCGAGAGAGATATGCTTATAGAGAA
AATATGACAGATGTTATTATACAGCACCTAACAACGGGCAATAAGGTTCGGATTAAATGT
CATGATAGAGTGCAAAAGATTGCTATTTACAAACATCGTTTGGCGGTTCAACTCCCCGAA
AGGGTAGTGGTTTATGAACAAGGTGATCCTGAGGGCATGTTATATCGCGTCAAGGAGAAG
TTGGTTCAGAAATCTGAATGTTCGTTGTTGGTAGCCACCAGCGAATCCCTGTTACTATGT
CAGGACACAAAACTGGTAATGATCGGTTTGAAGATACCAAAATCATGGACCGTCCCATCA
CCGATACGTTACGTCAAAGTTACTAGTTTATACTTTGAAGAAGTATTACTCTTAGGATTG
CTTAATGGACAGATATGGCAAGTGGAGCCTAACAAAGGTACGGCTAGGATGGTTGTGCAG
ACTGCGGGTAGTGTCCGTTGTTTGGACGTGAGCGCCTCACGTGGCCGGCTAGCTGTCGTG
GATGAGAACTCAGTCTGCCGCGTGTACAGCCTCCCAGCTGGGGATCTCCAATATACGGAG
GAGAACGTGTCGTCAGCTTCGTGGAATTCATGGTGCGAGGAACTTCTCTGTCTCTCTGGT
AACGGCTTGTTGTCAATCAAGGCGGGACAATTTCCACCAGCAACACAACCACTGGCCGGC
TCCGTTGTCGGCTTCCAGGGTGGTCGTGTTTTTTGTCTGCAAGCCAATTTGATGCAGACT
ATCAATGTTCCTCTATCTCACGCTGTCCATCAGTACGTACAACAAAAGATGTTCAACGAA
GCGTATGCAGTCGCTTGTATGGGAGTGACAGCATTGGACTGGGAGCGTCTTGGAACCGCT
GCCTTAGAAGAGCTCTCCTTCGAAGTGGCTCGCAAGGCCTTCCAGAGGAGCGAGAATATT
GTTTTATTATCGCTCATTGATCACTTACAGGAACGCTTGGAGAGCGGTGAGAAGAGGCAG
GTTATAATAGGCGAGGTGTTGGCTTACCGCGGACGATACAACGAGGCGGCGAGGGCGTTC
CAAACAGCTGCGAGGAATGACAAGGCACTGGCGCTCTATCTAGACCTTCGCATGTTTAAT
AAGGCACAAGAATATGTAGGCGAGGGCGAGGGTGTGACCAAACTAGCACGTCAAAGGGCG
GAGTGGGCTCGAAGAGTCAATGAACCCAGAGCAGCGGCGGAGATGTACCTCGCGGCAGGA
GATGTGCGCAGTGCTGCCACCATACTAGCAGAGAGCGGCCGACGGGATATGCTAATAGAA
CTAGCTCGCAAAATGGACAAAGGTTCCAGTGAATCTTTACGTCATCTGGCAGAGGCGCTC
GTAACTGCTGGTGAATACCCCACAGCTGGAGATGTCTATCACCGACTAGGAGATTACAAG
AAAATGGCTCAGTTAGCTGTGACAGCTGGCGATTGGGTACGTGCGTTCTCTTTGGCACGC
GAGCACGAGGAATGCAGGCGTGATGTTTATCTGCCTCATGCGCACCGTATGGCCAGAGAG
AACAAATTTGTTGAAGCCCAAAAAGCATATCATATGGCGGGTGAAACAGAAACAGCTATG
CGTGTTTTCAGTATTCTTGTAAATAATGCTGTGGCAGAGGAAAGATTCAATGATGCCGGA
TACTTACATTACTTGCTAGCAACACAGTGCTTAGAATCAGCAACTGCAGCACAGAAGGAT
AGAGCTACATTGCTGCATCAGTACGCTCACAACGAGCGTCTGTCCCGTGTGTATAACGCA
TACGACTCGGTACATCATTGTGTTCACGAGCCCTTTTCGTTGTCTCAACCGGACGTCCTA
TTGAACGCCGCCAGGTACGTCCTCGCGCTCTTGGAGGAACCCCCGCCCGGACTCTCCATG
TTTTGTTTGTACTTATGCTTGGCGAAACAAGCTAAAGTGCTCAATGCAAACACGCTGGCT
CGTCAGATGCTCAATAAGATACACGGTCTCCAAGTGCCACCCAAGTTTCAGGAAGGAGTC
GAGTTACTAATGCTGAACAGCGGAGCTAGTAGCTCGTCTGAATCAGAAGACATTTTGCCA
TTATGTTGGCGGTGTCGTAGCCACGTGCCAGCGCTGGCTACCACTTGTCCAAGATGCAGA
CATGTGTTGGCCCATTCGCTGGCAACTCACGAGGTGCTGCCGCTAGTTCAGTTCGAGCCG
GCTGAAGGAATCACGTTTGAAGAGGCGATGGATCTTATAGAACGCACTCCGATACCGGAG
ATTGAAGGAGCTAATGAAGGCGCTGACATACTTAAGATAGATAACGACATAGACTACGCT
GATCCGTTCCTTGATAAGGTCGATGAGGAGGACAAAGGCGTAGTAGTTTGCAGTCGTTTA
GCTCTACTGAGATTGAATCCAGCCAGTCTAGTGATAGTGAACCGTCCCCCTCTTAAACCA
GTCTTTTACCGTAACATGTTGCCCGAACTGCCAGTCACCACCTGCCCAGCCTGCTATAAT
GAAGCCCAAAAATCGGTTAGTGATTTATCGTAA
Protein sequence:
MRTVPKWVNKIHEADKIEFSSVHAICFSPDGTQLVVGAGEKVMVYDPRDGSLLQLLQAHK
GMVHTVAYCSDGKKFASGSADKNVIIWTSKMEGVLKYSHSEAIQCVAYNPVTYHLASCAL
SDFAFWSADVKAVQKYRVAGRITSCAWAATGQYLAIGLASGIVSIRNKVGDEITRITRDA
AVWAVAFYKNTLLVTDWNDTLSFYDMMGQPLLKERNIEISAVSMTILGALILVGGLGGWA
ILTSEGVSILNTSLDWVWSIAPSPITNTMAVACQDGTLWCYQVVFNTVHGLFRERYAYRE
NMTDVIIQHLTTGNKVRIKCHDRVQKIAIYKHRLAVQLPERVVVYEQGDPEGMLYRVKEK
LVQKSECSLLVATSESLLLCQDTKLVMIGLKIPKSWTVPSPIRYVKVTSLYFEEVLLLGL
LNGQIWQVEPNKGTARMVVQTAGSVRCLDVSASRGRLAVVDENSVCRVYSLPAGDLQYTE
ENVSSASWNSWCEELLCLSGNGLLSIKAGQFPPATQPLAGSVVGFQGGRVFCLQANLMQT
INVPLSHAVHQYVQQKMFNEAYAVACMGVTALDWERLGTAALEELSFEVARKAFQRSENI
VLLSLIDHLQERLESGEKRQVIIGEVLAYRGRYNEAARAFQTAARNDKALALYLDLRMFN
KAQEYVGEGEGVTKLARQRAEWARRVNEPRAAAEMYLAAGDVRSAATILAESGRRDMLIE
LARKMDKGSSESLRHLAEALVTAGEYPTAGDVYHRLGDYKKMAQLAVTAGDWVRAFSLAR
EHEECRRDVYLPHAHRMARENKFVEAQKAYHMAGETETAMRVFSILVNNAVAEERFNDAG
YLHYLLATQCLESATAAQKDRATLLHQYAHNERLSRVYNAYDSVHHCVHEPFSLSQPDVL
LNAARYVLALLEEPPPGLSMFCLYLCLAKQAKVLNANTLARQMLNKIHGLQVPPKFQEGV
ELLMLNSGASSSSESEDILPLCWRCRSHVPALATTCPRCRHVLAHSLATHEVLPLVQFEP
AEGITFEEAMDLIERTPIPEIEGANEGADILKIDNDIDYADPFLDKVDEEDKGVVVCSRL
ALLRLNPASLVIVNRPPLKPVFYRNMLPELPVTTCPACYNEAQKSVSDLS
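
The reported CDS length (3393) can be cross-checked against the two sequences above: a complete coding sequence should be a multiple of three, start with ATG, end in a stop codon, and encode one residue per codon excluding the stop (3393 / 3 - 1 = 1130 residues). The following is a minimal sketch of such a check, not part of the original record; the function name `check_cds` and its return convention are illustrative assumptions, and it assumes the standard stop codons.

```python
# Hypothetical helper (not from the source record): sanity-check that a CDS,
# its protein translation, and the reported CDS length agree with each other.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumes the standard genetic code


def check_cds(nucleotide: str, protein: str, reported_length: int) -> list[str]:
    """Return a list of inconsistencies; an empty list means all checks pass."""
    # Sequences are often wrapped at 60 characters, so drop all whitespace.
    cds = "".join(nucleotide.split()).upper()
    aa = "".join(protein.split()).upper()

    problems = []
    if len(cds) != reported_length:
        problems.append(f"CDS is {len(cds)} nt, record says {reported_length}")
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    # One residue per codon, minus the terminal stop codon.
    if len(aa) != len(cds) // 3 - 1:
        problems.append(f"protein is {len(aa)} aa, expected {len(cds) // 3 - 1}")
    return problems


if __name__ == "__main__":
    # Toy input built from the first three codons of the record plus its stop codon.
    print(check_cds("ATGAGAACT\nTAA", "MRT", 12))
```

To apply it to this entry, pass the full nucleotide and protein sequences as strings together with the reported length of 3393. The check compares lengths and boundary codons only; it does not translate the sequence.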