DPGLEAN05119 in OGS1.0

New model in OGS2.0  DPOGS212647
Genomic Position  scaffold1301:- 2284-8606
CDS Length  1914
Paired RNAseq reads  6
Single RNAseq reads  12
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013939 (8e-93)
Best Drosophila hit  ND
Best Human hit  PREDICTED: WD repeat-containing protein on Y chromosome (1e-07)
Best NR hit (blastp)  hypothetical protein Phum_PHUM415020 [Pediculus humanus corporis] (6e-57)
Best NR hit (blastx)  hypothetical protein Phum_PHUM415020 [Pediculus humanus corporis] (1e-54)
Gene Ontology terms
GO:0005575 cellular_component
GO:0008150 biological_process
GO:0003674 molecular_function
InterPro families
IPR017986 WD40-repeat-containing domain
IPR001680 WD40 repeat
IPR011046 WD40 repeat-like-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
Orthology group  MCL24138

Nucleotide sequence:

ATGTCTATTAAGTCTCAAAAATCTCTAAACGACGAATGCTCGGAAATATTTGAAAAGAGT
TACTCAGTGAGAAGTGCTTCAGAGGAGAGTCTAACGTGCGACGAAGACGAGGATTCTTTA
TCAAGGGAGTTGCTTATGAAATTGACTCCAGCGGTGCTGAGTAAACTGAGAAGATGTTTT
AAGAAAGCCAAGGAAAAGAACGCTGGAGATGTTGATAAACGAGTTGAAGAGGTGATGCGA
GCAGCGGCCGCAGAGGAGGGCATAGAATTTGCTGCGACCGCGAACCCCACGCCGCAGGCG
ACTCTATGTTTAGATGAGAAGGGCTTTGTGACCGCCTTCGAACAAATATTCGGTCATCGC
AAGTATTCAGTTCACGCTCGCCAATTGTTTCGGTCGTTGGACATGTTCGGTGGTGGTAGG
GTGTGGTGGAAGCAGGTGGTGGGGAGGCTGGTAGCGGCCGGAGCTCGCACCACCAGCTCA
CGAGTCGAACGATGGGACGACCTCCTCCCTGGTGGTATCAAAAAATTAAAGCATTGCAAG
CGAGAGACGATAGTAAAAGTAGTAAGTATTGAAAGAGAGGACAGTTTCTGTTACGTGATA
ATAACACGAGGGGGACGAGTCGGAGTCTACAGCGGACAGTTGGAACTACTCAACACTTAC
GAGGATGTTCAACATCTAGTAATATCTTCATCAGACAGAAGTCTGACAATCTATGACGTG
GTTACCCTTAGCCATTCCCCCGTTTTTTGTATCACCGGACTGACACACATACCTACGTGC
CTCGCTTACAAACCGTTTCTGAACCCTGGAGACGAATCTGAACTGATATTTGGCAACGAG
AGAGGAGACCTCACCAGGATGCGGTTCCTTCAACCTCGAATATCACTTCTGCATTTAAAG
TCACCGGATAATATTAACTATTACTTTTGGATGGAGCTATCATCTGCTCCTCACACGACA
TACGTCTCTATATCAACCTGGCGAAAGGTCCACTCAAGGTCGGTGCGTCGTGTGATGTAT
GAAAGGGATGGAGACATCGTGATGTCCTGTTCCCTCGACAACACAGTCAGCGTCCGATCG
AGACACGCGCGGGGAAAGTTAGATGATTATGTCTTCAAAGTTCAAAGGGGTGTGTCATGT
TTCGTCGTGGTGTCATCTCTTCACCTCGTGGTGACAGGCAGCCCTGACGGCGTGGTCCGT
CTCTGGTCCAGCCCTCAGGGTTGTCAGTTCGCAAGTCTCTCGGCTCCAGGAATAGTTGCC
ATCCTGGATGTGGCCGTGGTTACGTCTTCCGAAATTGTGGTCGCTTATTGTAACAATTGC
AACATTCACATCTGGGACTTGTTTGAAGAATGTCTACTTCAAACAGTGAAGATAAGATTT
CCATTTCTTGGTGTGCTTGGCAAAAAAGTTGAATTCGGACCCTATTGCATTCATTTTGGT
CCTCGTCAATATTATCAAAGAGTGCAGCATTTGGAAGACGATGAAACTAATGAAACTAGT
GCCAACGAAGGACAGAAGAAAATCGCCAATAGTCCAAGGTCGTTGTTGTTTTCGTGTTGT
GACCACGTGTTTCTACTGTCTCTGGTACGTGCCCAGACCTCTGCTCCCCCACCACCGGCC
GGCGTGCTGCGATCGAGACGACCCTCAGTTTGGGAAATACCAGACTTTATAGAAGGATTG
TCACCGAGACCAGCGGGTCCCAAACCATCGCAATCCACCGACGTTCTGACTCCAGTCGCT
GCAGACTCCTCGTCAGACCAAGACCTCGAGGAACTCCTAGAGAAAGCGGGGCTTCAGGGA
ATATTAGAAAAAGATTTTGTCCTGATGAGAGGATTGAAGCACGACTTGAATGAAAAACTT
CACAGAATGGGAACCGTCATGAAGACTGTAAGGATTCAGCTAGTGACAGTGTGA

Protein sequence:

MSIKSQKSLNDECSEIFEKSYSVRSASEESLTCDEDEDSLSRELLMKLTPAVLSKLRRCF
KKAKEKNAGDVDKRVEEVMRAAAAEEGIEFAATANPTPQATLCLDEKGFVTAFEQIFGHR
KYSVHARQLFRSLDMFGGGRVWWKQVVGRLVAAGARTTSSRVERWDDLLPGGIKKLKHCK
RETIVKVVSIEREDSFCYVIITRGGRVGVYSGQLELLNTYEDVQHLVISSSDRSLTIYDV
VTLSHSPVFCITGLTHIPTCLAYKPFLNPGDESELIFGNERGDLTRMRFLQPRISLLHLK
SPDNINYYFWMELSSAPHTTYVSISTWRKVHSRSVRRVMYERDGDIVMSCSLDNTVSVRS
RHARGKLDDYVFKVQRGVSCFVVVSSLHLVVTGSPDGVVRLWSSPQGCQFASLSAPGIVA
ILDVAVVTSSEIVVAYCNNCNIHIWDLFEECLLQTVKIRFPFLGVLGKKVEFGPYCIHFG
PRQYYQRVQHLEDDETNETSANEGQKKIANSPRSLLFSCCDHVFLLSLVRAQTSAPPPPA
GVLRSRRPSVWEIPDFIEGLSPRPAGPKPSQSTDVLTPVAADSSSDQDLEELLEKAGLQG
ILEKDFVLMRGLKHDLNEKLHRMGTVMKTVRIQLVTV