DPGLEAN12551 in OGS1.0

New model in OGS2.0DPOGS209415 
Genomic Positionscaffold5845:+ 2104-6328
See gene structure
CDS Length3246
Paired RNAseq reads  3024
Single RNAseq reads  9422
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012602 (4e-07)
Best Drosophila hit  cuticular protein 65Eb (1e-19)
Best Human hitND
Best NR hit (blastp)  cuticular protein RR-1 motif 12 [Bombyx mori] (7e-37)
Best NR hit (blastx)  cuticular protein RR-1 motif 12 [Bombyx mori] (2e-46)
GeneOntology terms  GO:0005214 structural constituent of chitin-based cuticle
InterPro families  IPR000618 Insect cuticle protein
Orthology groupMCL25348

Nucleotide sequence:

ATGCCAAAGGTTACTGACATTCTAGTTCTATCAATAATAGCTTATGGCTATGCGGATAAA
TTAGATAAGGGATACCTGCCTCCTGTAAATGCAGCATCATCAGGTGGCAGCCCAGCAGAA
TTAATTGCTCCAGCTGATCAATCTGAGGTTTTCGGGCAAGGGTTGCCCGTTCCAGAAAGT
CAACCAGGCTCTTATATCCAAGATATTGGACAGGAAGTTCTTCAAGCTTACAACCAGGAA
CGCCCTCAAGCAGCAGCTGATAGAAATGCGGAGATACTAAAGTTTAATAACGAAAATAAC
GGTGAATCCTATGCATATAATTATGAAACATCTAACGGGATTTCTGTGGAGGAATCAGGT
GTTGCATCTAATGGAGTTAATGCTCAAGGTGGCTATGCCTACACTGGGGATGACGGTAAA
TCCTATTCAGTCACTTATACGGCAGATATAAATGGCTATCAACCTCAGGGTGAACATTTA
CCTACACCTCACCCTATCCCAGAAGAAATATTAAAGTCCATAGAAGAAAATGCTAGGGCT
GCTGCTGCCGGTACACAAGAAGGAGCTTACAATCCTGAGGAGTATGATTCCAACGTTTAT
TATCAAACAAAGCCAGATCAAGAATCTGATGGTTCTTTAGATGTAATCGAAAGAAACAAA
AATCAAGAAATAAGCTCCATATATAACAATCCCCTCGATTCTGTGGGTCAACAATATCAA
AAAGCATCTAGTTTAGATGTAAATCCATCAGACCAAAAACGGAATAAGGAAAGTGGACAA
GATATTAACCAGATGTACACTCCAAATCCTTATCAATCCCACCAGACCTCTGGTATTGGT
ATTCAAGGAAACAGTGGTTTTGAATCTAGTAGTCAGTCACCTATTCAAGGAATAGCAGGA
CAATTTTTACAAGGAGACGGTTATCAATATAATCAACCTAAATTTTCATTGCAGCCAGTT
TTTCCAGGACAAGATCAATACAAACCGCAAGTAACTAGCGACAATGAAAATATATCATCA
TCGTTAAGACCAAGTTCATCAGGTCCCGCATTTAATAGAGATCAAAATTTGAAACCTTCA
TTTGGAAGCCTTCCCTCAGCTGAACAAACTCCGCAATATCAATCTGGCCAACAAATTCTT
CCAGAATTTAGGCCATCCTCTCATAGCGGACCAAATGCATCTCAAAGTATGAGGGGATCT
GTCCCAAACCAAGACCAACAAATCCAAATCAAGGAATCCTCTGGCGATTTGAAGGAGAGC
AAAGGAAATGGTTATCATTACAATCAACCAAAGCCCGTGTTCCAACCCGCTAATTCCGAA
CAAAGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGCGAAAAGATATCATCT
TCAAGACCAGGCTTATCAGGATCCATATACAATGGAGATCAGAATTTGAAACCTTCATTT
GGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCA
GAATTTAGGCCCTCCTCTCATAGCGGACCAAATGCATCTCAAAATATGAGGGGATCTCTC
CCAAACCAAGACCAACAAATCCAAATCAAGGAATCATCTGGCGATTTGAACGAAAGCAAA
GGAAATGGTTATCATTACAATCAACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAA
AGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCT
TCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTT
GGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCA
GAATTTAGGCCCTCATCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTC
CCAAATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTGAACGAAAACAAA
GGAAATGGTTATCATTACAATCAACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAA
AGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCT
TCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTT
GGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCA
GAATTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTC
CCAAATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTGAACGAAAACAAA
GGAAATGGTTATCATTACAATCAACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAA
AGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCT
TCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTT
GGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCA
GAATTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTC
CCAAATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTGAACGAAAACAAA
GGAAATGGTTATCATTACAATCAACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAA
AGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCT
TCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTT
GGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCA
GAATTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTC
CCAAATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTGAACGAAAACAAA
GGAAATGGTTATCATTACAATCAACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAA
AGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCT
TCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACACAAAATA
AAATGA

Protein sequence:

MPKVTDILVLSIIAYGYADKLDKGYLPPVNAASSGGSPAELIAPADQSEVFGQGLPVPES
QPGSYIQDIGQEVLQAYNQERPQAAADRNAEILKFNNENNGESYAYNYETSNGISVEESG
VASNGVNAQGGYAYTGDDGKSYSVTYTADINGYQPQGEHLPTPHPIPEEILKSIEENARA
AAAGTQEGAYNPEEYDSNVYYQTKPDQESDGSLDVIERNKNQEISSIYNNPLDSVGQQYQ
KASSLDVNPSDQKRNKESGQDINQMYTPNPYQSHQTSGIGIQGNSGFESSSQSPIQGIAG
QFLQGDGYQYNQPKFSLQPVFPGQDQYKPQVTSDNENISSSLRPSSSGPAFNRDQNLKPS
FGSLPSAEQTPQYQSGQQILPEFRPSSHSGPNASQSMRGSVPNQDQQIQIKESSGDLKES
KGNGYHYNQPKPVFQPANSEQSSLNQYRPELSGQSEKISSSRPGLSGSIYNGDQNLKPSF
GSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMRGSLPNQDQQIQIKESSGDLNESK
GNGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSF
GSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNENK
GNGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSF
GSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNENK
GNGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSF
GSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNENK
GNGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSF
GSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNENK
GNGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKHKI
K