DPGLEAN16096 in OGS1.0

New model in OGS2.0DPOGS209460 
Genomic Positionscaffold24:+ 93788-98291
See gene structure
CDS Length1674
Paired RNAseq reads  2327
Single RNAseq reads  6664
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005843 (3e-120)
Best Drosophila hit  CG7611, isoform I (7e-130)
Best Human hitWD repeat-containing protein 26 isoform b (2e-141)
Best NR hit (blastp)  PREDICTED: similar to WD repeat protein 26 [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to WD repeat protein 26 [Nasonia vitripennis] (9e-166)
GeneOntology terms  GO:0005737 cytoplasm
InterPro families







  
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR006594 LisH dimerisation motif
IPR006595 CTLH, C-terminal LisH motif
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
IPR020472 G-protein beta WD-40 repeat
Orthology groupMCL14527

Nucleotide sequence:

ATGCACCAGCCCTGCACCAACGGCGCTCACCTCAACGGTGACGCCGCCCGCAACGGAGAC
CTGCCGCCGGGGCTCCGCATGAGCCAAACAGACCAGGAGATTGTGCGCCTCATCGGACAG
CATCTGCTCTCGGTCGGACTAGAACGTAGCGCGACTCTGCTGATGGAGGAGTCAGGGTTA
CACCTGGAGCACCCCGCGGCGGCCACGTTCCGTACACACGTGTTGGCCGGAGACTGGGTG
AAGGCGGACCACGACCTGCGAGCGCTGCACGACCTGCTCAGGGACTCGCCTCAGGTGGAG
CCTCACAACCTCGCCGAGATGAAGTTCGTAGTGCTCGAGCAGAAGTACCTCGAGCACCTG
GAGGCGGGCCGTGTGCTGGACGCCCTGCACGTGCTGCGGAATGAGCTAACGCCGCTGCAG
TACGACACGGCTCGCGTGCACCGCCTGTCCGCGCTCATGATGTGCGCCGACGCCGCTGAG
TTGAGGCAGCGCGCTCGCTGGCCAGGGGGCCCGCGCTCGAGGGCACGTCTTCTTGCCACC
GTGCAGGCGGTGCTGCCGCCCGCGCTCATGATGTCTCCGGGCCGGCTGCGGGCGCTTCTG
GCGCAGGCCGCCGCCCAGCAGGCCGCCCGGTGCCGGTTCCACGCGGCGCCTCGCCCTTCG
CCTCCCGTCCCGTCCCCCGATCGCGACGACGAGCTCGCCGCCCCCGAGCACATCCCCTTC
TCTCTCCTGGCAGACCACCACTGCTCGGCCGACCAGTTCCCCATACACTCCTTGCAGGTG
TTAAACGGTCACTGTGACGAGGTGTGGTACTGCAAGTGGTCCCCGGATGGCTCCAAGCTG
GCCTCGGGCTCCAAAGACAACACCGTCATGATATGGGACTACAACCCTGTCACAAAACGA
TTGGCTTTCAGGAAGTCGCTGGAAGGTCACTCGTACGGCGTGTCCTTCCTAGCGTGGAGT
CCCGACGGCCGACACCTGCTGGCCGCCGGACCCGAGGACTGCCCCGACCTCTGGATCTGG
AACATGGAGACGGAGCAGCTGCACCTGAAGATGACTCACTCCCAGGAGGACTCGCTGACG
GCGGCCGCCTGGCACGCCAGCGGGAACGCCTTCGTCTGCGGCGGCGCCCGGGGACAGTTC
TATCACTGCGCACTCGACGGTACCCTCATCAACAACTGGGACGGTGTCCGTGTGAACGCG
CTGGCGTGCCGCTCCGAGGGCCGCGTGTTGGCCGCCGACACTCACCACCGCGTCCGGCTC
TATGACTTCAGCGACCTCACCGACAGGAACCTCATCCAGGAGGAGCACGCGGTGATGGCG
ATGACCCTGAACGCGGCGGACACGCTGCTGCTGCTCAACGTGGCCAACCAGGGAGTCCAC
CTCTGGGATATCCGAGCCCGAGCGCTCGTCCGTCGCTTCAGGGGCCTGTCTCAGGGACAC
TTCACCATCCACGCCTGCTTCGGAGGAGCTCATCAAGACTTCATAGCGTCCGGCAGCGAG
GACAATAAGGTGTACATCTGGCACATCGACGGCGAGGAGCCCATCGCGGTGGTGTCGGGA
CACACGAGGTGTGTGAACGCCGTGGCTTGGAACCCCGTGCATCATGACGTGCTGGTGTCC
GCCTCCGACGACTACTCCCTGAGGCTGTGGGGCCCGAGGACCCACCAGACCTAG

Protein sequence:

MHQPCTNGAHLNGDAARNGDLPPGLRMSQTDQEIVRLIGQHLLSVGLERSATLLMEESGL
HLEHPAAATFRTHVLAGDWVKADHDLRALHDLLRDSPQVEPHNLAEMKFVVLEQKYLEHL
EAGRVLDALHVLRNELTPLQYDTARVHRLSALMMCADAAELRQRARWPGGPRSRARLLAT
VQAVLPPALMMSPGRLRALLAQAAAQQAARCRFHAAPRPSPPVPSPDRDDELAAPEHIPF
SLLADHHCSADQFPIHSLQVLNGHCDEVWYCKWSPDGSKLASGSKDNTVMIWDYNPVTKR
LAFRKSLEGHSYGVSFLAWSPDGRHLLAAGPEDCPDLWIWNMETEQLHLKMTHSQEDSLT
AAAWHASGNAFVCGGARGQFYHCALDGTLINNWDGVRVNALACRSEGRVLAADTHHRVRL
YDFSDLTDRNLIQEEHAVMAMTLNAADTLLLLNVANQGVHLWDIRARALVRRFRGLSQGH
FTIHACFGGAHQDFIASGSEDNKVYIWHIDGEEPIAVVSGHTRCVNAVAWNPVHHDVLVS
ASDDYSLRLWGPRTHQT