DPGLEAN02553 in OGS1.0

New model in OGS2.0DPOGS200897 
Genomic Positionscaffold5:- 107003-110054
See gene structure
CDS Length1578
Paired RNAseq reads  1406
Single RNAseq reads  3515
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000548 (8e-39)
Best Drosophila hit  CG10338 (8e-77)
Best Human hithypothetical protein LOC64755 (3e-55)
Best NR hit (blastp)  PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium castaneum] (7e-88)
Best NR hit (blastx)  PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium castaneum] (4e-86)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families  IPR006968 Protein of unknown function DUF647
Orthology groupMCL13535

Nucleotide sequence:

ATGTCTTCAAACGAGGGAGAGATATTGCTACAAGAAAAATATGGAACATCTGCTAAGGAG
AGGTATTATGTTAAAGCCGCAGATCAATTGCCAATAGTATTAGTAGTCAATGAGAAGTCT
CGCGACGTCGCTGGATTATTCGCTAAAATATTTCTCCCCCAAGGATATCCTAACAGTGTC
AGCAAAGATTACATTTTTTACCAAATTTGGGACACTGCTCAAGCATTTTGCAGCACTATT
ACAGGTATACTGGCCACACAGGAAGTTTTCCGTGGGGTGGGAGTGGGAGATACAAATGCT
TCACCATTAGCAGCCACTGTTACTTGGGTGTTCAAAGATGGCTGTGGGCATATTGGGAAA
ATATTATTCGCTTATACCCATGGAACATATTTAGATGCCTATAGCAAAAAATGGCGTCTG
TATGCAGATACATTAAATGATGCTGCCATGTGTATTGAAATAGCACTACCATTGTTCAAG
AATTATATTACATTTGCTCTCTGCGTCAGCACTTGTATGAAGGCTATTGTCGGAGTTGCG
GGGGGTGCTACCAGAGTGGCAATGACCCAACACCATGCTCTCCGTGGTAATCTCGCTGAT
GTATCAGCTAAAGACTCCGCCCAAGAGACTGCTGTTAATCTTATAGCTTCTTTCGCTGCA
CTGTTCCTAATATCTTTGATAGGGAATTCGGTGACGATATTTATAATATTATTAATTATG
CATATTGTATTCAACTACATGGCAGTTCGGGCAGTTTGTTTACGAACACTGAATGAACCC
CGTTTCTTACAAGTAATTGACACATACTTGCGGAAGGAGGTAATTGCCAACCCATGTGAA
ATAAATCGTAACGAACCCATTATTTTCTATCAACTGGGACCCAATTTGTTAGATTTAAAA
ATATGCGGTTTTCATATCATAATTGGCGACTCGATATCGAAGATTTTAAACCCAAGAACT
AATGCAGTGTATATAAACAAAGTAAAAGATATTTATAACGATAAGAAATACATAATTCAT
CCTGATACCGGAAACAGAGTGATGTACGTTTTTCCAAAGGAAGATGCGTCGGTAGACGAC
ATGCTATGCGCTTACTTTCAGTCTGTTTTGCTTGCGATTATTACTTGTGCTATTAACGAC
CACCAATTGGCTATATTCAGCTCCAATAATAACACGAAGCCATTCGCTCAAGTGTGTGTG
ACACTACAATCAGCTGAGTGGAGCCGGGCTACCGGTTCCGGGGGAGACTTTCAATATGAA
CCGTCTTATGATCTGCATCGTTATGTTAAGAATATAGCTAGCGATGAATGGACAGCCATC
AGAGAAGGTCTTTTGCAGACGGGTTGGGATCTAAGCAAGCATTTATTGATAGTAGATGAA
TGGCGATTATGTAGTGAAAATGTCACTCCTGTAGCTATACTACCTGAAGAAGTGAAGTAC
AATCGCCCGATCGCTATACCAGAAACTCGCAAAGAATCTTTCACGATAGAACCGGACACA
TCGGATAGCACACTCAGCAATATACCAGAAGCCACAAAATCGAAAACCGATTTAAACTAT
CGTTTAAAAAAGGAATGA

Protein sequence:

MSSNEGEILLQEKYGTSAKERYYVKAADQLPIVLVVNEKSRDVAGLFAKIFLPQGYPNSV
SKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAATVTWVFKDGCGHIGK
ILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFALCVSTCMKAIVGVA
GGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISLIGNSVTIFIILLIM
HIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEINRNEPIIFYQLGPNLLDLK
ICGFHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPDTGNRVMYVFPKEDASVDD
MLCAYFQSVLLAIITCAINDHQLAIFSSNNNTKPFAQVCVTLQSAEWSRATGSGGDFQYE
PSYDLHRYVKNIASDEWTAIREGLLQTGWDLSKHLLIVDEWRLCSENVTPVAILPEEVKY
NRPIAIPETRKESFTIEPDTSDSTLSNIPEATKSKTDLNYRLKKE