DPGLEAN16378 in OGS1.0

New model in OGS2.0DPOGS215910 
Genomic Positionscaffold396:+ 18773-24491
See gene structure
CDS Length1698
Paired RNAseq reads  4
Single RNAseq reads  35
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000282 (6e-08)
Best Drosophila hit  cuticular protein 62Bc (8e-14)
Best Human hitND
Best NR hit (blastp)  cuticular protein RR-2 motif 82 [Bombyx mori] (9e-34)
Best NR hit (blastx)  PREDICTED: similar to Cuticular protein 62Bc CG1919-PA [Tribolium castaneum] (2e-30)
GeneOntology terms  GO:0005214 structural constituent of chitin-based cuticle
InterPro families  IPR000618 Insect cuticle protein
Orthology groupND

Nucleotide sequence:

ATGTATAAGTTATTAGTTTTGTGTTCATTAATGTCGGCCAGTTACGCGATATATCCACAT
CATCATCACCTGGCTGTTTCCCATCAGCAGGTGATCAAGCACGACGGACCACACCATCCG
ATACCAATTCATCATCACGTGCCACACCACCACGTTGGAATCATTCACAATCTTCCATTA
CCCGTACCCATCCACCACGTGCCTCACCATCAAATACACCACGATCATTATGCATTCCCG
GAGTACAAGTTTGCGTACTCAGTCCATGACCATCATACTGGTGATGTGAAATCACAGCAT
GAGTTCCGTCATGGAGACGTGGTGCAAGGCGGATACGAGCTCATCGAACCCGACGGCCGC
CAGAGAAAAGTTGAATACAAAGCTGACGATCATTCTGGATTATGCAAAGAAGTCATCTCG
TCTGCCCGTGATCACGGTTGCTGGAAAGTAACCAAAACGTCGGGAGTAGTAATATTAGTA
GGTACGGTGCATATTGCTCTAACCCTAGCCCGTCCACCTTATTCAGAACAAAAGGTGACG
ATAAACCATCACGGCAAGCTCTCGCACATCGGATCTAATGCTCCTGAGGAACACGAAAAA
TACGGCTGGTCACATCCGTCATACGAATTTACGTATGAAGTATCTGATCCACATACACAC
GATTTCAAAGGACACCATGAAGCTCGAGAGGGTGATAAAGTAAGAGGTTATTACTGGCTC
ATTCAACCTGATGGACTCAAACGAACTGTAAAGTATCACGTTGATAAACACAGCGGATTT
AACGCTAATGTTTTAATTTCAAAACCATGGGAAGAAGGAAGTAATAACGAAGAAAACGGA
GGAGAAAATTCAAATGAAAATAATGAAGGCCAAATGGAAAACGAAGAAAATGGAGGTGAA
AATCAAGAAAGAGAAGAAGAGCGTAATCAAGGAAATGAACGAGAGTCTAATCAAAATAAT
AGCTCAGAAAATGGTGGAAATGAAAATAATAACGAAGAAGAAGGAGAGACAATAAATATA
AACAGGAGTCAGGAGAACAACAATGGTAGAAATAATAATAGTGGTGGACGAAATTCAGCA
GAAAATTCGGGTGAAGGTCAGGGCCGCGGTGAAAGTGGTAGGGGTTGGCAAAATAACCGA
GGAATGGAATGGCAAAGAGGCAACAGTGGTTCTAATGAAAATAGCGAAGAAAGACGCGGC
GGAGAAAATAATGGCGGTAGAGCAGGAGGAAATTGGCGCGGACGTGTTAAGTGGAATGGA
TGGCAAGAGGGTGGCAATCGTGGCGAACGAAATCAGGAAAACAATGAGGGAAGACGAGAA
GAAGGTAATGAAAATGAAAACCGTAACGATAGAAGCAGGCAGAGTAGCGAGAGCTCAGAA
GTCCAAGAAAACGATCGTGGGCAATGGAGCAATGAGAGACAAGAAAACAACAATCAGGAT
GAAGAAAATCAGGAAGAAAATGGAGGCGAGCGAAATCGTAACAGTGGAGAAAATGGACAA
TGGAATGAAAACGAAGGAAACAATGAAAATGGGGGAAACAATGAAAATAGGGGTAGTAAT
GAAGGCCAGGAAAATAACGGCAAAAGCAATGGCCGTAAAGGCGAGAGGAACGAAAAAGGG
AAGCAAGAAGTCACAAAAACCCATTATCACATAATTATTCATCATCCTAAACATCATTAC
AAGTCAAAACAAAATTAA

Protein sequence:

MYKLLVLCSLMSASYAIYPHHHHLAVSHQQVIKHDGPHHPIPIHHHVPHHHVGIIHNLPL
PVPIHHVPHHQIHHDHYAFPEYKFAYSVHDHHTGDVKSQHEFRHGDVVQGGYELIEPDGR
QRKVEYKADDHSGLCKEVISSARDHGCWKVTKTSGVVILVGTVHIALTLARPPYSEQKVT
INHHGKLSHIGSNAPEEHEKYGWSHPSYEFTYEVSDPHTHDFKGHHEAREGDKVRGYYWL
IQPDGLKRTVKYHVDKHSGFNANVLISKPWEEGSNNEENGGENSNENNEGQMENEENGGE
NQEREEERNQGNERESNQNNSSENGGNENNNEEEGETININRSQENNNGRNNNSGGRNSA
ENSGEGQGRGESGRGWQNNRGMEWQRGNSGSNENSEERRGGENNGGRAGGNWRGRVKWNG
WQEGGNRGERNQENNEGRREEGNENENRNDRSRQSSESSEVQENDRGQWSNERQENNNQD
EENQEENGGERNRNSGENGQWNENEGNNENGGNNENRGSNEGQENNGKSNGRKGERNEKG
KQEVTKTHYHIIIHHPKHHYKSKQN