DPGLEAN13678 in OGS1.0

New model in OGS2.0  DPOGS210120
Genomic Position  scaffold3783:+ 752-4131
CDS Length  1491
Paired RNAseq reads  200
Single RNAseq reads  466
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000226 (8e-146)
Best Drosophila hit  cuticular protein 73D (9e-60)
Best Human hit  ND
Best NR hit (blastp)  cuticular protein RR-1 motif 47 [Bombyx mori] (3e-160)
Best NR hit (blastx)  cuticular protein RR-1 motif 47 [Bombyx mori] (1e-157)
GeneOntology terms  GO:0005214 structural constituent of chitin-based cuticle
InterPro families  IPR000618 Insect cuticle protein
Orthology group  MCL18416

Nucleotide sequence:

ATGTTTTGGACATACGGTGCGGTCGTCCTGAGTGTGTGTGTGTCCGTCCGGACTCAAGTT
ATTCCGGGCAAGTCCAGGACTAACGATGGTGACTTTAATAATGTTAACGCTGATGGATCT
TTCGATTTCGGGTACGCGAACAAAGATCGTGGTGGCAGCTACCACTTGGCGCAAGGTAGT
TCGAAAGGACTAGTGGGAGGACGGTTCGGTGCCAGAGAGCCTGGTACTGACGAAGTCAAA
GAAACTATTTATACTGCTGGTCCTAGAGGATTTCGCGCTAAGGGCCCTAACGTTCACCGA
AAGATAGATCTGGATCAGCGGCCGCGAGGTCCCATTGGAAATAAAGACGATCCCTATTTC
GATCCTAACGAAGATCCCAGTTACGCGTACAAAATAGAGACGAGAACTTACTCCAAGAAT
GAGAATGCTGACAGCAGAGGCGATGTCAAAGGTCATTACTCCTTCGTGGATGATATCGGA
GAACGGCACGACGTGTCCTATATAGCTGGGCGCGATACAGGTTTTCATGTATCCTCAGCC
AACCCTGACGTGCCCAGTCTTATTGGGTCACCTTTTCACCGAGCGCCTCTGGTTAGAGGG
GAGAGCAAATCTCGAGGACGTACTGCGGTACAGAGAGGATTAGATGGTTCATATAGATTC
ATTTCTGCCGGACCTGACCAACGGCGAACGGAAAGTAGTGACTCACACGGTAACGTTAGA
GGATCCTACACATTTTTAGATGACAAAGGTGTACAAAGGACAGTACATTATATAGCGGGG
CCAGGTATTGGATATCGGATTGTGAAGAACAGCAACGATCCCTTCATTCCTTCCTATTTT
CCTACTATACCTAGTCCTTATGATCCGGCATTTAACGCAGGAGGTAGCGCTGGGGCACCG
GCTTTCGCTCCCAGCGACGAAGGTAGCGATGATGTCTTCAAAGGACCCGATGGCACTGCA
GCGTCAGGGCACGTTAAGCCACCTCCTTTCCCTCCCTCCGAATCAGAGAGACCTAGTAAC
ACTGGTAGCATATCGACAGTAATTGAAACTCCTGATGATTCAGATAACAACGGTTACGAC
ACAGGTCCAAGTTTTAATCAGGAACCTGATAATACAGATCTGGGTTACGTCGACGAGGAC
GCTTCTAGTTTTCAAAATCAAAAGCCAACTCAAGCACCAGGCAAACCGTGGCGCCCAGAG
AACAAACCGTATAGACCGTCGAAGAAACCGTTTAGACCGATAAAACCTTATCAAGAAACG
AATAATAATAACGACAATTTCGCTGGTGATAAAAGTAAACCAGAATTTGCTGTAGGATTC
AATATACATCACACGAAACCTGGCACAACGATCATTAGGAATATAGGTGAAGAATACTTC
GGCATACCTCCTGGTGTGTCCGTCCGCGCTCATGTACAGAGCATCGATCTTTATCCCTTC
GGTTCCAAACCAATTTCACCATCAGAGGCTCTGGAAAATGACCAAACATAG

Protein sequence:

MFWTYGAVVLSVCVSVRTQVIPGKSRTNDGDFNNVNADGSFDFGYANKDRGGSYHLAQGS
SKGLVGGRFGAREPGTDEVKETIYTAGPRGFRAKGPNVHRKIDLDQRPRGPIGNKDDPYF
DPNEDPSYAYKIETRTYSKNENADSRGDVKGHYSFVDDIGERHDVSYIAGRDTGFHVSSA
NPDVPSLIGSPFHRAPLVRGESKSRGRTAVQRGLDGSYRFISAGPDQRRTESSDSHGNVR
GSYTFLDDKGVQRTVHYIAGPGIGYRIVKNSNDPFIPSYFPTIPSPYDPAFNAGGSAGAP
AFAPSDEGSDDVFKGPDGTAASGHVKPPPFPPSESERPSNTGSISTVIETPDDSDNNGYD
TGPSFNQEPDNTDLGYVDEDASSFQNQKPTQAPGKPWRPENKPYRPSKKPFRPIKPYQET
NNNNDNFAGDKSKPEFAVGFNIHHTKPGTTIIRNIGEEYFGIPPGVSVRAHVQSIDLYPF
GSKPISPSEALENDQT
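
The listed CDS length can be checked against the two sequences: 496 residues plus one stop codon gives 3 x 497 = 1491 nucleotides. The sketch below is an illustration rather than part of the record. It assumes the standard genetic code applies, and the helper name `translate` is hypothetical. It translates only the first and last lines of the nucleotide sequence, copied from above, to show that they correspond to the start and end of the protein sequence.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption:
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence listed above.
first_line = "ATGTTTTGGACATACGGTGCGGTCGTCCTGAGTGTGTGTGTGTCCGTCCGGACTCAAGTT"
last_line = "GGTTCCAAACCAATTTCACCATCAGAGGCTCTGGAAAATGACCAAACATAG"

print(translate(first_line))  # first 20 residues of the protein
print(translate(last_line))   # last 16 residues followed by the stop
print(3 * (496 + 1))          # residues plus stop codon, in nucleotides
```

Because the full CDS is a multiple of 60 nucleotides up to its last line, each complete line stays in frame and can be translated on its own in the same way.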