DPGLEAN20554 in OGS1.0

New model in OGS2.0DPOGS205797 
Genomic Positionscaffold143:- 97630-108341
See gene structure
CDS Length5481
Paired RNAseq reads  71
Single RNAseq reads  189
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010357 (5e-95)
Best Drosophila hit  CG34124 (9e-73)
Best Human hitWD repeat-containing protein 52 isoform 1 (2e-78)
Best NR hit (blastp)  AGAP006735-PA [Anopheles gambiae str. PEST] (8e-158)
Best NR hit (blastx)  wd-repeat protein [Aedes aegypti] (2e-148)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families


  
IPR019781 WD40 repeat, subgroup
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
IPR015943 WD40/YVTN repeat-like-containing domain
Orthology groupMCL13795

Nucleotide sequence:

ATGGATGTTAACACTAAACAAATTTGGTTTCGAAGATCTACAACAGGAGGTTCCGTGGGA
GCCCTCACGTCTTACAGGAAGGACCCTAATTATCGTATAGCCGTAGCGGAGGGCAGAGAA
GGTGATAATGATCCAATAATTCTACTTTACATCTGGCCACAAATGGACATTGATGCTGTA
CTGCGAGACGGCACAGCCAATGCTTATTCGATTTTGGATTTCAGTCCAGATGGCGAACTC
TTAGCTTCGGTAGGTAAGGCCCCAGACTACAATTTGACAATTTGGAACTGGAAGAAACAC
AGGATTCTTTTACGTGCAAGCGCCTTTACGTTCGATGTAAATACTGTAATGTTTTCGCCG
TACTGTCCGGGGCAACTAACTACAGCAGGAGCGGCGCATATAAAGAACTGGAAAATGGCA
GAAACATTTACTGGACTAAAGTTAAAAGGCGAACTGGGAAGATTCGGCAAAACAGAGATT
TGTGATGTTCTTGGAGTATATCCTATGCCAGACGAAAAAGTTCTATCTGGATGCGAATGG
GGTAATATTTTAGTATGGGAAGCTGGTCTGGTTAAGTTGGAAGTTACACAACGCAGCAGA
AAAACTTGCCACAAAGCACCAGTGGTGCAATTCATGCTTAGTCCAGCTGGTGATGAAGTT
ACAACTATAAGTCAAGACGGTTACGTTCGTGCCTGGTATTGGGATACAGTGGAACAAGCT
GATCCCCCAGAAGAAGACCCCACTGTGGAACTTAATCCAGTTGCAGAGACCTATGTTCCT
GGGTGTAAAATAATGTGTTTGAAACATCAGAAAGACATGTATTTTTATGCTCAAGACGGC
AACGGCGGTATTTGGACAACCGATTTAGAAATAGACAAGCTCGAATGTAATCACCGCAAG
ATTATGACTTTTCATGCAAGTGGGATAGTAGCCATGGCTGCATTACGTTCTTACCCTATC
CTTATAACTGCGGGGGAAGATGGAGCTCTTCATGCATATAATTCAGAGACGCATGATATT
TTGGCAAAATACCAATTCCAAACTGCTATAACCTGTATGCTATATCCACCATTAGATGTC
GACCCAACTTCTCGTATTCTACTAGTGGGATTTGCAGATGGCATTATGCGAACAATCCTA
GTCCACCCTGAACGTCTCCAGGCTCAATCGACTTTAATTGAAGTTCGAGTACATTCCGCT
CTAACAATCCACAGCGATGATTCAATTAATGCTGATGTTATAGATCTTATATCGCTGCTG
AAGCCCCATTCCAAGGCGATAACTCAAATAACAATAAATGACCCAAGAACCCTATTAGTA
ACTTGTGCAGAAGACTGCACTTTATTTATGTATAGTTTGAAAATGGGAACTCCATTCACT
CTTAAGAGGTTAGGTTTTATAGAAACACCGAACAACGTCGCTTTTATGGCTTGGAAACCA
AACGAAGAAAGGACAATACTGCTTTGTGGCCAAGCTGGTGTGATTACTGAAGCAGTACTT
CCGAAAATTCCTGACAGACTATATACCGAAATCACTACTTTCAAGCAAGAATTTGTTTCT
CATCAAGACATTCTAGTCAAAAAATATTACATGCAGCACAGACCATTTCCCAGAGAAGAA
GACTTGGCTAGTATAGACGAAGAAGCTCTTAAGAGACAGGAGGAAGCAGAAAATGAAAAA
GATGATGAAGAAGAAGAATGGATAGGAGAAATTCAATTAATTGAGAGTGACACCATGTCG
GGAACTACGATAACTTGGGCCAAATATTGTGAAGAAGGCATTTGGATTGTGCAAGAAGGG
ACTGGAGCTTTGTTGTTAGTTAAACCAGGGCACAATAAGATCCTTAAATATGGCCCGTTT
CCAGGAGCTTGGTGTGATAATATAACAACATTACAGTTTGTATGTGACGATCGCTATTTA
GTTATTGGAACAAATTCAGGATACATTCGTGTTGTGCGTATGCCCACAGAAGAAGAGGAT
TCTCCGGAACGTCATCGCATGGTTTGGCTCTTAGCACAACAAAAGTTGCTGAAGAAACTT
AAGGGACGACGCTTAGCTAAGGAAGAGACCCAACCTACTCCTCGAATAGATTTTGAAGAC
AACTATTATTTGCCTATGCATGATTTTTATACAGGCGCTATAACTTGTTTTGAGTTCAGC
AGTGACGGAAGATATTTTTACACTGGTGGAACAGATGGCAACATATTTTCATATAAAATT
TCATTCACGGAACCTCTGCTGCCAGTGTCTGAAGCGCCGGAAATAGAGGAAATGCCTAAA
GTGGAAAAAATTCGAGAACCGAGTACTCTTGACGGGGAATTAATGTCACATGAACAACTA
AAACAAAAAGAGGAATATGATAAAATGATATCTACAGCAAATGCCCATAAAAAACGTGTT
CGTGACCAACTCTCCGAGCTGGGTATAGAATACAACAAATTGATTAAAGCCAATCGAGCA
CTGCCATATTCTCAACAAATCGATGTTGTTCTGGACCCACGACCTTTGGTTGTACAAGAA
AAGGAACTAGATGATTTAAAAGCCTTAACTCGCCGCAAACTCGCTCATCAACTTGAAGCC
TCCGACCTGGGCTTACAGAAGATGTATTCAAGGAATATTATACAGCTAGATGTGTATCCA
TTTACTTTAAAAGCGATACGGGATCCAGAAATAAAGATAAGACCGTTGCGACAAAAAAAC
CTTTCCAAGGCCTTCTACGATCAGCTTCAAGAAGTGCACCAAAAAATGGCCGAGGCTTCA
TTGCGTGGAAGACGTGCGGAAGCATCTACAGCAAGAGCAGCGGCTAAGAGAGCGTCATGG
GGTCCACCTAGAATCGCGTCTTTTCTACTGGGACTTCCGCCTAAACCACCACACCCCCTG
AAAAAAGCTCTACGCAACTACCACCAAAGACTGAATCGGCATCACATTCAATTTATAGAA
TGGCAAGATCATCTATCTCATAAACCGGACCCGCATGCATTGCCGCCTGGTGCAGGTGAA
GCGCTCAAACAAGCAGAAGAGACCATTGGCAATCGCGTACTCAAGACACAAGGGGACTAT
GTTGCACCGCAGGGACATAATACTCAGTTGCGAATATGTCTTGCCAGGAAAGAGATCTAC
GATAATAAACGTGAGTTTAATGAAAAAGTGTTGGAATTACGTGAGCAAAAGGTTTGGCTA
GTTTCAAAAATGCAAGATATAGGTAGACGTTTAGCTGAAATCCGAGTTGAAATTCCAAAC
AAATTAGCAAAAGCACCTCCTCCAGTGCCTGTGATTGATGAAGACTTGGAATTTCCAGAA
AAAAAATTAGAAGAGAAAAATCAGTTGCTTGAACTACATCTGTATCAGTTACATAGAGAA
ATGACTGTGCTTAATCGTTTCGAAGCTCATGAAGACAGACTCGCTGAGAGAGTTTATGCT
AAGCTTATGCAGGTTCGTGGTGTTAATGATCAGATACAAGATTGTGAACAACGCATTGAA
GAACATAAGCAAGAAAAGGAGAGTCTTGATCTTGCTTGTCAAGACTTACAGCGTCAATTC
AAAAAACTTGTTCAAGATAACAAATTTGCTGATTTCTTACGCAGAATATTTAAGAAGAAA
TACCGTCCACCTCGCGAGCGCAACGAAGACGAATCATCGGAATCGGAATCTAGCTCTTCT
TCGAGCGAAGAAGAAGATGAGGGTAGTTTGGACAGCAGGGATATTGGACCCATACGACTT
GATCCCAATATTTGTCCTGAGGGATGTGATGTTGACATTTACAATAAAACATATGACCTT
AGAAATACAAGACATAAATTTGAACAAGAAATGATTGAAAAGGACCATTTAGTTGACTTG
TTACGTAAAGACATTGATGCCCATAATAAGATAAAAAGAAAGTTTTCGGTACAACTGGAA
AAGCGGAAAATGGAACTAAGGGAATTTATGATGGAAAAACAAAACTGCATGAATGAAGTA
GACCAAGTCGTGATACTGCGTTACGACCAAATCCGAGCATCTGCAATCAAAGACTGCACC
GGACCAGGCGGGTTGTCCCACACTGTGGTGTTCCCAGAAAAAATGCTGTCTAAACTTAGA
AATAGAGTTTTGGAATTGCAGGATGAGATAAAACAGCAGAAACAACGTCAGAAAATAAAC
CGAACCCACCTGTTCCGTATGAATGTAGATTTACGCGCTATGGAAAACAGAGCGGCAGAG
TTGAAGGCCCAAATGAGGGATGTTTTAACGCGGAAACTCGGCAAGCCCCGGAAAGTGGAC
AAGACACTTGATGAACTGCTACGCCAGCTGGCACGGAGGCACAAATTCTCTGCAGCATTA
TCAGCTATACCCCATTTACTAAACCAGCTTAATAAGTGGAGGAAGCGTCACAGCGAACTT
GAACAAAAATATTTAACCACTATAAACCGTTACTCTGACCGTCTCCGCTTAGCGGCAGCA
CTCCAGGCCGACATTTATCCTCAAAAGCCCCACAAAGATCCAACAGTAATGCCCGGCGCT
TACGAACCGGCGCAATACCACCGCGATGTTATCCGTCTGCGTATTATAAGCGCTCAGCAG
CAGGAGCAAATTAAGGTTTTGGAGGAAGAGATTCACAATTTGCGTCTGAAGCCGCTGTCT
CAAATCGTGGCTCCGTCGGAATGGTCACCATCGGAATCTGATCGGTCTCAGATATATCTG
CAGATGGTGCCGCCTACAGCACGACCGCCAACGACCAAATATTTCCCAATAACGCCTCTT
GGCTCTCGAGCAAAATTGGTGTACGAAGTGAACTTGATGAAGTTACTGTACGATTGCCTG
GACATGATGCGTGTATCCCGGGATGATGCGGAGGATTTACTCCGAGAGCTTACCACAGAA
CTACGAAAAGTCTTGTCTGGTTCAAAAACACGATTTGACGTCGTCGATACATTGGTACGT
AAGTGGCTCCTGAAGTATGGCGGTGATCCTAGCCTTTACAAAAAACAGACGCGAGCTTTT
GACGCTCTTGCAACATTGGCCGACCGGCTTCTTAGCCAGCATGTAGAAGCAATGGAAGGG
ATGACTCCGAAAACTCACGAAGTTATGAAGTCACTAGAGGTAGCGTTAGATAGCGTCTCT
GATAAAAAACAAAAACTCGAAGAGCGACTTGGACCAGCCCTTGCTACAATTTTGCATACA
ACATCGATAGAGCATATGGATAACGAGGAAACATTGACAATGGCGATGACATCCCTCGTG
AATGCCTTAACTGACGATGATAATCCGCTTAGCGCAGAATCTATAGACGCAATTGAAGTG
TCTGACATCGTTCAAGATATCAAAGATTGTGGTATATTTGCACCTACAAAACAATTACAA
AACTTGGTCAAAATCGCCATAAATAGTCTACGAGCACAAATCATTACTCCTGAAGAAAGA
CAACAGCTGGATAAAGATGTTGAACAGGCTATGACGGAATCGAAAATGAGTTTTCCTGAT
TCTTCTAGAACGGAAAAGTAG

Protein sequence:

MDVNTKQIWFRRSTTGGSVGALTSYRKDPNYRIAVAEGREGDNDPIILLYIWPQMDIDAV
LRDGTANAYSILDFSPDGELLASVGKAPDYNLTIWNWKKHRILLRASAFTFDVNTVMFSP
YCPGQLTTAGAAHIKNWKMAETFTGLKLKGELGRFGKTEICDVLGVYPMPDEKVLSGCEW
GNILVWEAGLVKLEVTQRSRKTCHKAPVVQFMLSPAGDEVTTISQDGYVRAWYWDTVEQA
DPPEEDPTVELNPVAETYVPGCKIMCLKHQKDMYFYAQDGNGGIWTTDLEIDKLECNHRK
IMTFHASGIVAMAALRSYPILITAGEDGALHAYNSETHDILAKYQFQTAITCMLYPPLDV
DPTSRILLVGFADGIMRTILVHPERLQAQSTLIEVRVHSALTIHSDDSINADVIDLISLL
KPHSKAITQITINDPRTLLVTCAEDCTLFMYSLKMGTPFTLKRLGFIETPNNVAFMAWKP
NEERTILLCGQAGVITEAVLPKIPDRLYTEITTFKQEFVSHQDILVKKYYMQHRPFPREE
DLASIDEEALKRQEEAENEKDDEEEEWIGEIQLIESDTMSGTTITWAKYCEEGIWIVQEG
TGALLLVKPGHNKILKYGPFPGAWCDNITTLQFVCDDRYLVIGTNSGYIRVVRMPTEEED
SPERHRMVWLLAQQKLLKKLKGRRLAKEETQPTPRIDFEDNYYLPMHDFYTGAITCFEFS
SDGRYFYTGGTDGNIFSYKISFTEPLLPVSEAPEIEEMPKVEKIREPSTLDGELMSHEQL
KQKEEYDKMISTANAHKKRVRDQLSELGIEYNKLIKANRALPYSQQIDVVLDPRPLVVQE
KELDDLKALTRRKLAHQLEASDLGLQKMYSRNIIQLDVYPFTLKAIRDPEIKIRPLRQKN
LSKAFYDQLQEVHQKMAEASLRGRRAEASTARAAAKRASWGPPRIASFLLGLPPKPPHPL
KKALRNYHQRLNRHHIQFIEWQDHLSHKPDPHALPPGAGEALKQAEETIGNRVLKTQGDY
VAPQGHNTQLRICLARKEIYDNKREFNEKVLELREQKVWLVSKMQDIGRRLAEIRVEIPN
KLAKAPPPVPVIDEDLEFPEKKLEEKNQLLELHLYQLHREMTVLNRFEAHEDRLAERVYA
KLMQVRGVNDQIQDCEQRIEEHKQEKESLDLACQDLQRQFKKLVQDNKFADFLRRIFKKK
YRPPRERNEDESSESESSSSSSEEEDEGSLDSRDIGPIRLDPNICPEGCDVDIYNKTYDL
RNTRHKFEQEMIEKDHLVDLLRKDIDAHNKIKRKFSVQLEKRKMELREFMMEKQNCMNEV
DQVVILRYDQIRASAIKDCTGPGGLSHTVVFPEKMLSKLRNRVLELQDEIKQQKQRQKIN
RTHLFRMNVDLRAMENRAAELKAQMRDVLTRKLGKPRKVDKTLDELLRQLARRHKFSAAL
SAIPHLLNQLNKWRKRHSELEQKYLTTINRYSDRLRLAAALQADIYPQKPHKDPTVMPGA
YEPAQYHRDVIRLRIISAQQQEQIKVLEEEIHNLRLKPLSQIVAPSEWSPSESDRSQIYL
QMVPPTARPPTTKYFPITPLGSRAKLVYEVNLMKLLYDCLDMMRVSRDDAEDLLRELTTE
LRKVLSGSKTRFDVVDTLVRKWLLKYGGDPSLYKKQTRAFDALATLADRLLSQHVEAMEG
MTPKTHEVMKSLEVALDSVSDKKQKLEERLGPALATILHTTSIEHMDNEETLTMAMTSLV
NALTDDDNPLSAESIDAIEVSDIVQDIKDCGIFAPTKQLQNLVKIAINSLRAQIITPEER
QQLDKDVEQAMTESKMSFPDSSRTEK