DPGLEAN02303 in OGS1.0

New model in OGS2.0  DPOGS215901
Genomic Position  scaffold2156:+ 517-2871
CDS Length  1482
Paired RNAseq reads  2596
Single RNAseq reads  17439
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000429 (1e-57)
Best Drosophila hit  cuticular protein 64Ab (8e-17)
Best Human hit  ND
Best NR hit (blastp)  TPA: putative cuticle protein [Bombyx mori] (4e-43)
Best NR hit (blastx)  TPA: putative cuticle protein [Bombyx mori] (5e-37)
GeneOntology terms
GO:0008010 structural constituent of chitin-based larval cuticle
GO:0005214 structural constituent of chitin-based cuticle
InterPro families  IPR000618 Insect cuticle protein
Orthology group  MCL19900

Nucleotide sequence:

ATGTTTACAAAAGTACTGTCATTAAGCGCTGTTTTGGCGGTTGCGGCAGCAGGCCTTATC
GCTGAACCTCATTACTCCTCTGCTGCTGCAGTTTCTTCCCAAAGTATCGTTCGCCACGAT
CAATCTCATGCTGTAGCTGCCGCTCCAGTTGCAATCCACTCTGCTCCAGTTGCCATTCAC
TCTGCTCCTGTTGCCTACCATGCAGCACCAGTACACTATTCATCTGCCGGAGCTGTATCT
TCTCAGTCGATCCAACGTCACGACCAACAGCGTGCTGCTATTGCTGTTGCACCCGTAGCC
CACTACGCCGCTGCTCCAGTTGCTGTTCACTCTGCTCCTGTTGCTTACCACGTAGCACCA
GTACACTATTCATCTGCCGGAGCTGTATCTTCTCAGTCGATCCAACGTCACGACCAACAG
CGTGCTGCTATTGCTGTTGCACCCGTAGCCCACTACGCCACTGCTCCAGTTGCTGTTCAC
TCTGCTCCTGTTGCCTATCACGCAGCACCAGTACACTATTCATCTGCCGGAGCTGTATCT
TCTCAGTCCATCCAACGTCATGACCAACCCCGTGCTGCTATTGCAGTGGCTCCCGTAGCT
CACTACTCAGCTGCTCCAGTTGCTCACTATGCAGCTGCCCCAGTAGCTCACTACTCTGCC
CCTATCGCCCATGCTGCATATGCTGCCCACGAAGAAATCGACTCTCACCCTCAATACGAC
TTCTCTTACTCCGTACATGACGGACACACCGGCGACAACAAGTCACAGCACGAGAGCCGC
GACGGTGACGCAGTGCATGGCGAGTACTCTCTAGTAGAGGCTGACGGATCTGTACGTACC
GTTCAATACAGCGCTGATGATCACTCTGGATTCAACGCCGTCGTCAGCCACTCCGCTCCA
TCAGCTCACGCTGTTCCAGTGCCAACGCACTCGATCCAACGTCACGACCAACAGCGTGCT
GCTATTGCTGTTGCACCCGTAGCCCACTACGCCGCTGCTCCGGTTGCTGTTCACTCTGCT
CCTGTTGCCTATCAAGCAGCACCAGTACACTATTCATCTGCTGGAGCTGTATCTTCTCAG
TCCATCCAACGTCATGACCAACCCCGTGCTGCTATTGCCGTGGCTCCCGTAGCTCACTAC
TCAGCTGCTCCAGTCGCTCACTACGCAGCTGCCCCAGTAGCTCACTACTCTGCCCCTATC
GCCCATGCTGCATATGCTGCCCACGAAGAAATCGACTCTCACCCTCAATACGACTTCTCT
TACTCCGTACATGACGGACACACCGGCGACAACAAGTCACAGCACGAGAGCCGCGACGGT
GACGCAGTGCACGGCGAGTACTCCCTGGTAGAGGCTGACGGATCTGTACGTACCGTTCAA
TACAGCGCTGATGATCACTCTGGTTTCAACGCCGTCGTCAGCCACTCCGCTCCATCAGCT
CACGCTGTTCCAGTGCCAACGCACGTACTCGCACATCATTAA

Protein sequence:

MFTKVLSLSAVLAVAAAGLIAEPHYSSAAAVSSQSIVRHDQSHAVAAAPVAIHSAPVAIH
SAPVAYHAAPVHYSSAGAVSSQSIQRHDQQRAAIAVAPVAHYAAAPVAVHSAPVAYHVAP
VHYSSAGAVSSQSIQRHDQQRAAIAVAPVAHYATAPVAVHSAPVAYHAAPVHYSSAGAVS
SQSIQRHDQPRAAIAVAPVAHYSAAPVAHYAAAPVAHYSAPIAHAAYAAHEEIDSHPQYD
FSYSVHDGHTGDNKSQHESRDGDAVHGEYSLVEADGSVRTVQYSADDHSGFNAVVSHSAP
SAHAVPVPTHSIQRHDQQRAAIAVAPVAHYAAAPVAVHSAPVAYQAAPVHYSSAGAVSSQ
SIQRHDQPRAAIAVAPVAHYSAAPVAHYAAAPVAHYSAPIAHAAYAAHEEIDSHPQYDFS
YSVHDGHTGDNKSQHESRDGDAVHGEYSLVEADGSVRTVQYSADDHSGFNAVVSHSAPSA
HAVPVPTHVLAHH
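
Consistency check (illustrative, not part of the original record): the CDS length of 1482 nt corresponds to 494 codons, i.e. the 493 residues listed above plus one stop codon. The sketch below shows one way to confirm that the nucleotide and protein sequences agree, assuming the standard genetic code applies. The names CODON_TABLE and translate are hypothetical helpers introduced here, and only the first and last lines of the nucleotide sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed to apply to this gene model), built in the
# conventional TCAG ordering: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
# The last line starts at position 1441, so it is in frame.
first_line = "ATGTTTACAAAAGTACTGTCATTAAGCGCTGTTTTGGCGGTTGCGGCAGCAGGCCTTATC"
last_line = "CACGCTGTTCCAGTGCCAACGCACGTACTCGCACATCATTAA"

print(translate(first_line))  # MFTKVLSLSAVLAVAAAGLI
print(translate(last_line))   # HAVPVPTHVLAHH*

# Length bookkeeping: 493 residues plus one stop codon.
print(3 * (493 + 1))          # 1482
```

To check the whole record, join all lines of the nucleotide sequence into one string before calling translate and compare the result, minus the trailing '*', with the joined protein sequence.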