DPGLEAN10398 in OGS1.0

New model in OGS2.0DPOGS202780 
Genomic Positionscaffold1363:+ 14432-16450
See gene structure
CDS Length2019
Paired RNAseq reads  245
Single RNAseq reads  709
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010501 (2e-144)
Best Drosophila hit  cuticular protein 97Ea (6e-09)
Best Human hitND
Best NR hit (blastp)  TPA: putative cuticle protein [Bombyx mori] (3e-135)
Best NR hit (blastx)  TPA: putative cuticle protein [Bombyx mori] (4e-107)
GeneOntology terms  GO:0005214 structural constituent of chitin-based cuticle
InterPro families  IPR000618 Insect cuticle protein
Orthology groupMCL23936

Nucleotide sequence:

ATGAACCTCTTCATGCTGCTCGCTTGTATCATAGCGACGACCACTGCTACCAATTCAACA
GAGACTACAACCAATCCTCCAAATAATACCACCCAGGAAATTCTCAAGGCCAGTGAAACT
GCGGACCCGACCAAAGCTGAAATTGTAAAACAAATTCGTCGTCTGAACGAAGATGGATCG
TATACGATTGGATACGAAGCCAACGATGGAACGTTTAAAATTGAAAGTAGAGATGTTTTG
GGTAACGTAAAAGGAACCTTTGGTTATGTGTCAGATGATGGAGAAATAAAACGTGTCACT
TACAGTTCCTCTGCTGATAGCACACCGGCATCGGTAACTACTAGTACCACGCCAACTACA
CCCACCATGGTCGTGAGAGTTAACAAAACTATATCGTCGACGACAAGAAGACCGTTAGCC
ACTGTAGTGTATCCGACCAGAGGAAGCACCACCACCAGAGGTACAGTTATACAAGCCATT
CCGAGACGACGAACCGGGTCAAGCTCCGTGCGCCCTCAAACGACTGATACAACTACGGAA
ACTCAAAAACAAATCACTGCAAGTAGCTCGAACGTTCATCGAAGAGAAGATCTTCTGAAA
TCACGGTCTCAATCAGCAAAAACTGCACCTGTAACCTCCAAAGACGATTTATTAACCAAA
CAGACTACAAGTACGGCGTCTCAGACTCTAAAACCTGTTTACGAGCATACGACAGAAAGA
GAGACGGATATTAATAAATCGACAGCTACGAGACGTGAATTATCAGGAGCGAGCGCTAAT
CATCATATGTTGAATCTTCAACAATCAATGGGAGATGATTCAACTGATGTTTATGGGAGT
CACTTATCACATGGAACTCTCAGACCTTTATTTACTACAACCACGATTCGTCCTAAATTA
GTTACCCTTCATTCAATAATTGCCGCAAGACAACAACAACAACAGCAGCAGCAGCAGCAT
CAACAGCAGCAACAATATGATGACGAACAAGAAGAGGAACAGGAGACGACATTGGAAGCC
ACCGCTGGCCGTGTTTATGAGCCAGATGAAAGTGTTACCTCCAATCCTGTACCAGTTGTG
CACATATCAGCACAGAGAGGTTCCGATAAAATATTTTATCAACCCCAATACCGTCGACCT
GCGGCTGTTCTTTTCAGGACTCAGGAATATTTGAGAGACAATCCCGGTGCCCCTATTCCC
ATTGGCAACCAGCGGCCCTTTCTTAACTATGAATATCCAGATAAAATATTGGACTCACAG
TATGTAAAAGAATCACAGCAAGTCAATAACAATCAAGAAGCGGAGTCTGGTCCTTATGAA
TACAGACATAATGATTACAGACCAGCCCCAAGAATTATCCACGTGCCAGTCGACGATAGA
GGCGTTCCAATTCAGGGGTACGAAGCGAGATATGTCAACCCATATCGACCACAGCCTTTA
ATTCAAAGATACGATCCTGTAAACGAAATGCATTCCATTTCAGCACCGGTGAGCACGAGA
GATTTCAAACGCTTGCTACATATTTTGATACTAAGACAGAACCGTCTGCAAGCTCTCATG
GAGCAGATAATGCCAGAGGCCTACCAGGCGGCTCATTACCGCTCTGAACCGTATCACGCA
CAATCGCGGCCCTATTCTCGCCACCGAGACGACGACCAATACGACTACAGATATCAACCG
CAGTACAGACAAGATTTTTATTCAACACAAGTATCAAACTACGACGATCGCGACTACGAG
TCCCATCGCTATTCGCCTCGAAGAAGATTATACTCGCGACCCTATGACGCTCAGGGTTCA
GCCTCGGAACATATAGAACAGACACCGGAGTATCTCCCGGTCGAAGTGAGAGAAGCTCTC
TTGTTGAAAATGCTGTTGTTGGCTATCAGCCCGGACTTCATGCCGACACCAGCGCCCGCC
ACCGAGCTGACCACCGCAGCACCAACAAGGAAACAGGTGAGAAACGTTCAAATACTTGGA
GAAGAAGGTTCCGACAAAAAAACAAGGCAGGGGCACTAG

Protein sequence:

MNLFMLLACIIATTTATNSTETTTNPPNNTTQEILKASETADPTKAEIVKQIRRLNEDGS
YTIGYEANDGTFKIESRDVLGNVKGTFGYVSDDGEIKRVTYSSSADSTPASVTTSTTPTT
PTMVVRVNKTISSTTRRPLATVVYPTRGSTTTRGTVIQAIPRRRTGSSSVRPQTTDTTTE
TQKQITASSSNVHRREDLLKSRSQSAKTAPVTSKDDLLTKQTTSTASQTLKPVYEHTTER
ETDINKSTATRRELSGASANHHMLNLQQSMGDDSTDVYGSHLSHGTLRPLFTTTTIRPKL
VTLHSIIAARQQQQQQQQQHQQQQQYDDEQEEEQETTLEATAGRVYEPDESVTSNPVPVV
HISAQRGSDKIFYQPQYRRPAAVLFRTQEYLRDNPGAPIPIGNQRPFLNYEYPDKILDSQ
YVKESQQVNNNQEAESGPYEYRHNDYRPAPRIIHVPVDDRGVPIQGYEARYVNPYRPQPL
IQRYDPVNEMHSISAPVSTRDFKRLLHILILRQNRLQALMEQIMPEAYQAAHYRSEPYHA
QSRPYSRHRDDDQYDYRYQPQYRQDFYSTQVSNYDDRDYESHRYSPRRRLYSRPYDAQGS
ASEHIEQTPEYLPVEVREALLLKMLLLAISPDFMPTPAPATELTTAAPTRKQVRNVQILG
EEGSDKKTRQGH