DPGLEAN01870 in OGS1.0

New model in OGS2.0  DPOGS203796
Genomic Position  scaffold21:+ 107279-114240
CDS Length  3723
Paired RNAseq reads  190
Single RNAseq reads  454
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003703 (0.0)
Best Drosophila hit  CG4329, isoform A (5e-128)
Best Human hit  WD repeat-containing protein 65 isoform a (1e-178)
Best NR hit (blastp)  PREDICTED: similar to CG4329 CG4329-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG4329 CG4329-PA [Tribolium castaneum] (0.0)
GeneOntology terms  ND
InterPro families:
IPR011046 WD40 repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR001680 WD40 repeat
IPR019781 WD40 repeat, subgroup
Orthology group  MCL13594

Nucleotide sequence:

ATGGCAACTAACACCCCACCACATATATCAGCACACATATTCTACGGCTTGAGAACAGAT
ATTCAATATAATGCCCATTACATGACTGATTCCGAAATTATTTATCCTGCCGGTGGAGTT
ATTGTGATCCATAACCATATACAAAAGAAACAGAAGTTTATAAGACTTCAGGACAAACAC
AAACCAATCAAATCGTTGGTGTTGGCACCTAATAGACGTTGGTTAGCGTTAAACGAGATT
GCCGAAGAGGGTCAAAAACCAATTATAACCATTTACGATCTAACTACATATAAAAGGCGT
AAAATTCTGACTGTCCCTTTCGAGAATTCAACTGCTCGTGAATTTGCTTGTGTACAATTT
ACGTATGATTCAAAATATTTGGTAGCGATAACTGGCGAGCCAGATTGGTATTTATATTAT
TATAATTGGGATAAAGGCAAGGTCGAAAGTCACGCTAAGGCACAAAATCCGAGCGGACAA
GGCACTGTTGAAAGCGTCCAATGTAATCCGTCGGATGCAACCCTTGTGGTTATAACAGGG
CCCTACACATTTCGTATCATGAATGTCTCTGAGACTGTTTGGAGACAGTGGGGTTGGTGC
AAGGCTGAAAATATCAACATAACCAGCTGTATGTGGTTGACATCCGATCGCATTATGTTC
GGGACTGATGCTGGGGTCATCATGGTAGTGGAGAATGGTGAACTGAGACAGAACTGCATA
TTCCGTGCCACTGAAGTAACTGAGATGTCATTGAAGAAAGTTGACATCGAGGCGACAGAA
AGCGAAAAAACCAGTACAGCCAGCGCTGAAGCCACCCCCACTGAATCAGGACTCGTGGAC
TCTGACAGCTGTCCAGTGATGTGCTTCATTAATTTCAGCAAAGGATTCGCTTATGCTTGT
GGCCAAGGATATGTTCATATGTTTGAGAAAGAGTCACCGCACCATTGGCGGAAAAGAAAT
TTGTTTAGAATATCAAAGAAATCTTATAAACATACCCGAGAGCATCCATTGTGGTCGCCA
CTTGACGCTATTCAACATATAACAATTGATCCGAATCAAGAAACTCTTCTCATCACAACT
CTGAGAAAGCAGCTCTATTACGTCAAATTGTTCGGACAACATATGCTTCAAAATCCAGAA
ATATCATTTACGGAACTCGGTCCGGCTATGCATTACGGAAGAATAAATTCTCTCTCTATG
TGCGCGTGGAAACCAATTTTCATGACATCCGGTGAACTGGACAAGAGTATACGAATTTGG
AACTACATGACTGATGACGTTGAATTGATTAAACTATATCAAGAGGAGATACATTGTCTG
TCTTTACATCCTTCTGGGCTATTTGCCATAGTAGGGTTTTCAGACAAGTTACGATTTATG
GTAGTACTAATTGATGACTTTGAAGTCATGCGAGAATTTCCTATACGTAACTGCCCTCGT
GCGAAATTTAGCACCAACGGACACCTTTTCGCAGCTGTCAATGGTCAAGTTGTGCAAGTT
TTTTCTTCGGTTTCATTTCAAAATGTTTATAATTTGAAGGGTCATAACGGAAAAATAACG
TGTCTGGCGTGGTCAGCCAATGATTTAACACTAGTGTCCTGTGGCACTGAAGGCGCTGTG
TACGAGTGGAACATGGCGTCAGGCCAACGAGTCGGCGAAGTTATATTAAAAACGAACCAA
TTTAAAGCCTGTGCCGTGAATAATAACGGTAAAACAACTTACGGCGTTGGAAGTGACGGC
GAAATAAAAGAAATCGGCTCAAATACGATTCGTAGAAATCTTGGTTTGATCGGATGCGGT
CTCGACACTATAGTCCTATCCCGTTCCGATTTAATGCTTTTCATTACCGGCGGCGAGGGT
GGTGTCACTGCTGTGCAGTTACCATTACTAGACAAGGCCATTTACAACGAATTTCATATG
CACAATAAAAATGTAACCGCCATTGCCCTGTCATACGATGATCAAACATTAGTTTCTGTG
GCTGAAGACTCGTCTATTTGCTTATGGAGATTAACTAATGCTGACGGAAGAGCGATAGCT
TTAGATAAAGACTTCGCATATTCCAAAGAGATTCTGATCAGTAAAAAAGATCTTCAAGAG
AAAATTAACAGTATTAATTTACTCAGTACTAGGATGAGTGAATTGGAAACTGAACATACA
TACCAGCTACGCCAGGCTGAAGCAGCTCAGGCTGAGAAATTAAAAGAGGTTCATGAGGGA
TATTGTGCCGCTATAGAGGAACTAAAAGAGAAGAACGAGCAAATGGAAAATGAACATACC
CACGAAATTGGCATGATACAACAAGATATCGCAAAGCTGCGTTCGGGTCATGAAAGAACC
TTACAAGCTTATGAAGCAGACTTTAATATTCGCCTAATTAGCGAATATGACAGATACCAG
AGTCTAGAAGACAAAACAGCTCGCATGAGGAAAGATTATGAACAACGATTAGATGATTTG
GCGGAGAGCAAGCGGCAGGCTTTAAGAGAATTGAATAAAGCTTTTGAAGCGAGATTAGAA
GAAAAAGATCTCATGCTTCAGGAGTTACAAGAACAAGCTGATATGGAGAAGAAGGAACAT
GAAACTATTAAGGCGTCTATTGAAGAAGATGCTGATCGCGAGATAATAGAGATAAGAACG
GCCTACGAAGTTCAACTGAAAGAAGAAAAAGACGCAAATGTCAGGCTGAAAGGTGAAACC
GGTCTGATGAAGAAAAAACTTATATCCGCTAATAAAGAGATCGATGAATTCAAACATCAA
GTTTCACAACTTAAAGCCGAACACAAACAGTTTCAAAAAGTAATATCGACTTTGGAACGA
GACGTCGCTGATCTTAAGAAAGAAATATCGGAGAGAGACGGCACAATACAAGATAAGGAA
AAACGGATATATGAATTGAAACGCAAGAAGCAGGAACTAGAAAAATACAAATTCGTTTTG
AATTTTAAGATAATTGAATTGAAAAATCAGATTGAACCCAAAGAAAAAGAAATTCGGGAG
TTAAAAGTTCAGATTGATGACATGGAAAACGAAGAGCTAAAATTATTGAATACTAAACAT
GATCTTGAACTTAAGATCAATCAGTTGAATGAAAAGTTAGCATCGGCCAAGAAAGATTTC
TTCAGTGAGGCGAATCGTAATTTGACTCTTAAGAACACTTTAAAGAAAATAAAAATTGAT
CTTCACAATATGACGGCCAATTTCCAAGATCCGACTCAGCTTAAACTGAGCGTTAAGGCG
CTATTTCAAAAATATGTAGAGGACATTGACTTTGTACGGAGTCGCATGGCTGAGGATGAG
GCGATAAGAGAATTCAATAGACAAAGAGATCACCTTGAAAAGCAGGTTGCAGGTCTTAAA
ATGCAACTATCGAAATCACTGGATGGGTCCAAGAGTGACATTGGAAAGATTATGGATGAG
AATTGCACTCTGTTAGGGGAAATTAATAATCTTCGAAGCGAGTTGAAAGCTACCCGTACA
AGGTGTTTTCAAATGGAGTCTATATTGGGTCTGTCAGCGCGTTACATCCCGCCCGCAACT
GCGCGCGCTAAACTCAAACACGTCACAGAGGAGCGGGAAAAGCTTGATGAGAAATTTAAA
CAGAAAATCGAAGAGAGAGAGGAAATCATTGTCGCTTTAAAGGAAGAAAATGATCGTCTC
CTTGGAAAAGTAAGATGTCCTGATGAAACAGAACCTTCTGAAAATGACACTGAAGAACAA
TAA

Protein sequence:

MATNTPPHISAHIFYGLRTDIQYNAHYMTDSEIIYPAGGVIVIHNHIQKKQKFIRLQDKH
KPIKSLVLAPNRRWLALNEIAEEGQKPIITIYDLTTYKRRKILTVPFENSTAREFACVQF
TYDSKYLVAITGEPDWYLYYYNWDKGKVESHAKAQNPSGQGTVESVQCNPSDATLVVITG
PYTFRIMNVSETVWRQWGWCKAENINITSCMWLTSDRIMFGTDAGVIMVVENGELRQNCI
FRATEVTEMSLKKVDIEATESEKTSTASAEATPTESGLVDSDSCPVMCFINFSKGFAYAC
GQGYVHMFEKESPHHWRKRNLFRISKKSYKHTREHPLWSPLDAIQHITIDPNQETLLITT
LRKQLYYVKLFGQHMLQNPEISFTELGPAMHYGRINSLSMCAWKPIFMTSGELDKSIRIW
NYMTDDVELIKLYQEEIHCLSLHPSGLFAIVGFSDKLRFMVVLIDDFEVMREFPIRNCPR
AKFSTNGHLFAAVNGQVVQVFSSVSFQNVYNLKGHNGKITCLAWSANDLTLVSCGTEGAV
YEWNMASGQRVGEVILKTNQFKACAVNNNGKTTYGVGSDGEIKEIGSNTIRRNLGLIGCG
LDTIVLSRSDLMLFITGGEGGVTAVQLPLLDKAIYNEFHMHNKNVTAIALSYDDQTLVSV
AEDSSICLWRLTNADGRAIALDKDFAYSKEILISKKDLQEKINSINLLSTRMSELETEHT
YQLRQAEAAQAEKLKEVHEGYCAAIEELKEKNEQMENEHTHEIGMIQQDIAKLRSGHERT
LQAYEADFNIRLISEYDRYQSLEDKTARMRKDYEQRLDDLAESKRQALRELNKAFEARLE
EKDLMLQELQEQADMEKKEHETIKASIEEDADREIIEIRTAYEVQLKEEKDANVRLKGET
GLMKKKLISANKEIDEFKHQVSQLKAEHKQFQKVISTLERDVADLKKEISERDGTIQDKE
KRIYELKRKKQELEKYKFVLNFKIIELKNQIEPKEKEIRELKVQIDDMENEELKLLNTKH
DLELKINQLNEKLASAKKDFFSEANRNLTLKNTLKKIKIDLHNMTANFQDPTQLKLSVKA
LFQKYVEDIDFVRSRMAEDEAIREFNRQRDHLEKQVAGLKMQLSKSLDGSKSDIGKIMDE
NCTLLGEINNLRSELKATRTRCFQMESILGLSARYIPPATARAKLKHVTEEREKLDEKFK
QKIEEREEIIVALKEENDRLLGKVRCPDETEPSENDTEEQ
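
Consistency check (illustrative, not part of the original record): the Genomic Position and CDS Length fields can be cross-checked with a little arithmetic. The sketch below is a minimal Python example under stated assumptions: the helper names parse_position and expected_protein_length are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS length is assumed to include the terminal stop codon (the nucleotide sequence above ends in TAA).

```python
import re


def parse_position(field: str):
    """Split a 'scaffold:strand start-end' string into its four parts.

    Hypothetical helper; the field layout is inferred from this record only.
    """
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", field.strip())
    if m is None:
        raise ValueError(f"unrecognized position field: {field!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1  # drop the stop codon


scaffold, strand, start, end = parse_position("scaffold21:+ 107279-114240")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
genomic_span = end - start + 1

protein_length = expected_protein_length(3723)

print(scaffold, strand, genomic_span, protein_length)
```

Under these assumptions the locus spans 6962 bp on scaffold21, longer than the 3723 bp CDS, which would be consistent with the gene model containing introns. A 3723 bp CDS corresponds to 1241 codons, or 1240 residues once the stop codon is excluded.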