DPGLEAN18373 in OGS1.0

New model in OGS2.0DPOGS203950 
Genomic Positionscaffold65:+ 118233-136242
See gene structure
CDS Length3393
Paired RNAseq reads  620
Single RNAseq reads  1438
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000478 (0.0)
Best Drosophila hit  Oseg1 (0.0)
Best Human hitintraflagellar transport protein 122 homolog isoform 3 (0.0)
Best NR hit (blastp)  PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum] (0.0)
GeneOntology terms  GO:0019861 flagellum
InterPro families




  
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019781 WD40 repeat, subgroup
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
Orthology groupMCL11472

Nucleotide sequence:

ATGAGAACTGTTCCGAAGTGGGTAAATAAAATTCATGAAGCTGATAAAATTGAGTTTTCG
AGTGTACACGCAATATGCTTCAGTCCAGATGGCACTCAATTAGTAGTAGGTGCTGGAGAA
AAGGTCATGGTTTATGACCCAAGAGATGGTTCACTGTTACAACTTCTGCAGGCCCATAAA
GGAATGGTTCATACTGTAGCTTATTGCAGTGATGGCAAAAAGTTTGCTAGTGGTAGTGCA
GACAAAAATGTAATCATATGGACATCTAAAATGGAAGGTGTTCTTAAATATTCACACAGT
GAAGCAATCCAATGTGTAGCATATAATCCGGTCACTTATCACTTAGCTTCATGCGCACTC
TCTGATTTTGCATTCTGGTCAGCCGATGTCAAAGCTGTTCAGAAATATCGAGTGGCAGGT
CGGATTACTAGTTGTGCTTGGGCAGCAACAGGTCAATACCTAGCTATTGGACTTGCTAGT
GGCATAGTTTCAATTCGCAATAAGGTTGGTGATGAAATTACCAGAATAACAAGAGATGCT
GCAGTGTGGGCTGTAGCATTCTATAAGAACACATTATTGGTTACAGACTGGAATGATACC
CTATCATTTTATGATATGATGGGACAACCTCTTTTGAAAGAAAGAAATATAGAAATTTCG
GCTGTTTCAATGACAATTTTGGGTGCCTTGATATTAGTTGGTGGCTTGGGGGGTTGGGCC
ATACTTACCTCAGAAGGAGTATCAATATTAAATACATCACTGGACTGGGTATGGTCTATT
GCACCATCACCAATTACAAACACTATGGCAGTTGCATGTCAAGACGGAACTCTGTGGTGT
TACCAAGTTGTCTTCAACACAGTTCATGGGCTATTTCGAGAGAGATATGCTTATAGAGAA
AATATGACAGATGTTATTATACAGCACCTAACAACGGGCAATAAGGTTCGGATTAAATGT
CATGATAGAGTGCAAAAGATTGCTATTTACAAACATCGTTTGGCGGTTCAACTCCCCGAA
AGGGTAGTGGTTTATGAACAAGGTGATCCTGAGGGCATGTTATATCGCGTCAAGGAGAAG
TTGGTTCAGAAATCTGAATGTTCGTTGTTGGTAGCCACCAGCGAATCCCTGTTACTATGT
CAGGACACAAAACTGGTAATGATCGGTTTGAAGATACCAAAATCATGGACCGTCCCATCA
CCGATACGTTACGTCAAAGTTACTAGTTTATACTTTGAAGAAGTATTACTCTTAGGATTG
CTTAATGGACAGATATGGCAAGTGGAGCCTAACAAAGGTACGGCTAGGATGGTTGTGCAG
ACTGCGGGTAGTGTCCGTTGTTTGGACGTGAGCGCCTCACGTGGCCGGCTAGCTGTCGTG
GATGAGAACTCAGTCTGCCGCGTGTACAGCCTCCCAGCTGGGGATCTCCAATATACGGAG
GAGAACGTGTCGTCAGCTTCGTGGAATTCATGGTGCGAGGAACTTCTCTGTCTCTCTGGT
AACGGCTTGTTGTCAATCAAGGCGGGACAATTTCCACCAGCAACACAACCACTGGCCGGC
TCCGTTGTCGGCTTCCAGGGTGGTCGTGTTTTTTGTCTGCAAGCCAATTTGATGCAGACT
ATCAATGTTCCTCTATCTCACGCTGTCCATCAGTACGTACAACAAAAGATGTTCAACGAA
GCGTATGCAGTCGCTTGTATGGGAGTGACAGCATTGGACTGGGAGCGTCTTGGAACCGCT
GCCTTAGAAGAGCTCTCCTTCGAAGTGGCTCGCAAGGCCTTCCAGAGGAGCGAGAATATT
GTTTTATTATCGCTCATTGATCACTTACAGGAACGCTTGGAGAGCGGTGAGAAGAGGCAG
GTTATAATAGGCGAGGTGTTGGCTTACCGCGGACGATACAACGAGGCGGCGAGGGCGTTC
CAAACAGCTGCGAGGAATGACAAGGCACTGGCGCTCTATCTAGACCTTCGCATGTTTAAT
AAGGCACAAGAATATGTAGGCGAGGGCGAGGGTGTGACCAAACTAGCACGTCAAAGGGCG
GAGTGGGCTCGAAGAGTCAATGAACCCAGAGCAGCGGCGGAGATGTACCTCGCGGCAGGA
GATGTGCGCAGTGCTGCCACCATACTAGCAGAGAGCGGCCGACGGGATATGCTAATAGAA
CTAGCTCGCAAAATGGACAAAGGTTCCAGTGAATCTTTACGTCATCTGGCAGAGGCGCTC
GTAACTGCTGGTGAATACCCCACAGCTGGAGATGTCTATCACCGACTAGGAGATTACAAG
AAAATGGCTCAGTTAGCTGTGACAGCTGGCGATTGGGTACGTGCGTTCTCTTTGGCACGC
GAGCACGAGGAATGCAGGCGTGATGTTTATCTGCCTCATGCGCACCGTATGGCCAGAGAG
AACAAATTTGTTGAAGCCCAAAAAGCATATCATATGGCGGGTGAAACAGAAACAGCTATG
CGTGTTTTCAGTATTCTTGTAAATAATGCTGTGGCAGAGGAAAGATTCAATGATGCCGGA
TACTTACATTACTTGCTAGCAACACAGTGCTTAGAATCAGCAACTGCAGCACAGAAGGAT
AGAGCTACATTGCTGCATCAGTACGCTCACAACGAGCGTCTGTCCCGTGTGTATAACGCA
TACGACTCGGTACATCATTGTGTTCACGAGCCCTTTTCGTTGTCTCAACCGGACGTCCTA
TTGAACGCCGCCAGGTACGTCCTCGCGCTCTTGGAGGAACCCCCGCCCGGACTCTCCATG
TTTTGTTTGTACTTATGCTTGGCGAAACAAGCTAAAGTGCTCAATGCAAACACGCTGGCT
CGTCAGATGCTCAATAAGATACACGGTCTCCAAGTGCCACCCAAGTTTCAGGAAGGAGTC
GAGTTACTAATGCTGAACAGCGGAGCTAGTAGCTCGTCTGAATCAGAAGACATTTTGCCA
TTATGTTGGCGGTGTCGTAGCCACGTGCCAGCGCTGGCTACCACTTGTCCAAGATGCAGA
CATGTGTTGGCCCATTCGCTGGCAACTCACGAGGTGCTGCCGCTAGTTCAGTTCGAGCCG
GCTGAAGGAATCACGTTTGAAGAGGCGATGGATCTTATAGAACGCACTCCGATACCGGAG
ATTGAAGGAGCTAATGAAGGCGCTGACATACTTAAGATAGATAACGACATAGACTACGCT
GATCCGTTCCTTGATAAGGTCGATGAGGAGGACAAAGGCGTAGTAGTTTGCAGTCGTTTA
GCTCTACTGAGATTGAATCCAGCCAGTCTAGTGATAGTGAACCGTCCCCCTCTTAAACCA
GTCTTTTACCGTAACATGTTGCCCGAACTGCCAGTCACCACCTGCCCAGCCTGCTATAAT
GAAGCCCAAAAATCGGTTAGTGATTTATCGTAA

Protein sequence:

MRTVPKWVNKIHEADKIEFSSVHAICFSPDGTQLVVGAGEKVMVYDPRDGSLLQLLQAHK
GMVHTVAYCSDGKKFASGSADKNVIIWTSKMEGVLKYSHSEAIQCVAYNPVTYHLASCAL
SDFAFWSADVKAVQKYRVAGRITSCAWAATGQYLAIGLASGIVSIRNKVGDEITRITRDA
AVWAVAFYKNTLLVTDWNDTLSFYDMMGQPLLKERNIEISAVSMTILGALILVGGLGGWA
ILTSEGVSILNTSLDWVWSIAPSPITNTMAVACQDGTLWCYQVVFNTVHGLFRERYAYRE
NMTDVIIQHLTTGNKVRIKCHDRVQKIAIYKHRLAVQLPERVVVYEQGDPEGMLYRVKEK
LVQKSECSLLVATSESLLLCQDTKLVMIGLKIPKSWTVPSPIRYVKVTSLYFEEVLLLGL
LNGQIWQVEPNKGTARMVVQTAGSVRCLDVSASRGRLAVVDENSVCRVYSLPAGDLQYTE
ENVSSASWNSWCEELLCLSGNGLLSIKAGQFPPATQPLAGSVVGFQGGRVFCLQANLMQT
INVPLSHAVHQYVQQKMFNEAYAVACMGVTALDWERLGTAALEELSFEVARKAFQRSENI
VLLSLIDHLQERLESGEKRQVIIGEVLAYRGRYNEAARAFQTAARNDKALALYLDLRMFN
KAQEYVGEGEGVTKLARQRAEWARRVNEPRAAAEMYLAAGDVRSAATILAESGRRDMLIE
LARKMDKGSSESLRHLAEALVTAGEYPTAGDVYHRLGDYKKMAQLAVTAGDWVRAFSLAR
EHEECRRDVYLPHAHRMARENKFVEAQKAYHMAGETETAMRVFSILVNNAVAEERFNDAG
YLHYLLATQCLESATAAQKDRATLLHQYAHNERLSRVYNAYDSVHHCVHEPFSLSQPDVL
LNAARYVLALLEEPPPGLSMFCLYLCLAKQAKVLNANTLARQMLNKIHGLQVPPKFQEGV
ELLMLNSGASSSSESEDILPLCWRCRSHVPALATTCPRCRHVLAHSLATHEVLPLVQFEP
AEGITFEEAMDLIERTPIPEIEGANEGADILKIDNDIDYADPFLDKVDEEDKGVVVCSRL
ALLRLNPASLVIVNRPPLKPVFYRNMLPELPVTTCPACYNEAQKSVSDLS