New model in OGS2.0 | DPOGS209415  |
---|---|
Genomic Position | scaffold2372:313-4687 (- strand) |
CDS Length | 3270 |
Paired RNAseq reads   | 3379 |
Single RNAseq reads   | 9865 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012601 (6e-31) |
Best Drosophila hit   | cuticular protein 65Eb (8e-19) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-1 motif 12 [Bombyx mori] (6e-37) |
Best NR hit (blastx)   | cuticular protein RR-1 motif 12 [Bombyx mori] (3e-46) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL25348 |

Nucleotide sequence:
ATGGAATCCAGGATTCTAGTTCTATCAATAATAGCTTATGGCTATGCGGATAAATTAGAT
AAGGGATACCTGCCTCCTGTCAATGCAGCATCATCAGGTGGCAGCCCAGCAGAATTAATT
GCTCCAGCTGATCAATCTGAGGTTTTCGGGCAAGGGTTGCCCGTTCCAGAAAGTCAACCA
GGCTCTTATATCCAAGATATTGGACAGGAAGTTCTTCAAGCTTACAACCAGGAACGCCCT
CAAGCAGCAGCTGATAGAAATGCGGAGATACTAAAGTTTAATAACGAAAATAACGGTGAA
TCCTATGCATATAATTATGAAACATCTAACGGGATTTCTGTGGAGGAATCAGGTGTTGCA
TCTAATGGAGTTAATGCTCAAGGTGGCTATGCCTATACTGGGGATGACGGTAAATCCTAT
TCAGTCACTTATACGGCAGATATAAATGGCTATCAACCTCAGGGTGAACATTTACCTACA
CCTCACCCTATCCCAGAAGAAATATTAAAGTCCATAGAAGAAAATGCTAGGGCTGCTGCT
GCCGGTACACAAGAAGGAGCTTACAATCCTGAGGAGTATGATTCCAACGTTTATTATCAA
ACAAAGCCAGATCAAGAATCTGATGGTTCTTTAGATGTAATCGAAAGAAACAAAAATCAA
GAAATAAGCTCCATATATAACAATCCCCTCGATTCTGTGGGTCAACAATATCAAAAAGCA
TCTAGTTTAGATGTAAATCCATCAGACCAAAAACGGAATAAGGAAAGTGGACAAGATATT
AACCAGATGTACACTCCAAATCCTTATCAATCCCACCAGACCTCTGGTATTGGTATTCAA
GGAAACAGTGGTTTTGAATCTAGTAGTCAGTCACCTATTCAAGGAATAGCAGGACAATTT
TTACAAGGAGACGGTTATCAATATAATCAACCTAAATTTTCATTGCAGCCAGTTTTTCCA
GGACAAGATCAATACAAACCGCAAGTAACTAGCGACAATGAAAATATATCATCATCGTTA
AGACCAAGTTCATCAGGTCCCGCATTTAATAGAGATCAAAATTTGAAACCTTCATTTGGA
AGCCTTCCCTCAGCTGAACAAACTCCGCAATATCAATCTGGCCAACAAATTCTTCCAGAA
TTTAGGCCCTCCTCTCATAGCGGACCAAATGCATCTCAAAGTATGAGGGGATCTGTCCCA
AACCAAGACCAACAAATCCAAATCAAGGAATCATCTGGCGATTTGAACGAAAGCAAAGGA
AATGGTTATCATTACAATCAACCAAAGCCCGTGTTCCAACCCGCTAATTCCGAACAAAGC
TCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCTTCA
AGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTTGGA
AGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCAGAA
TTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTCCCA
AATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTGAACGAAAGCAAAGGA
AATGGTTATCATTACAATCAACCAAAGCCCGTGTTCCAACCCGCTAATTCCGAACAAAGC
TCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCTTCA
AGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTTGGA
AGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCAGAA
TTTAGGCCCTCCTCTCATAGCGGACCAAATGCATCTCAAAATATGAGGGGATCTCTCCCA
AACCAAGACCAACAAATTCAAATCAAGGAATCATCTGGCGATTTGAACGAAAGCAAAGGA
AATGGTTATCATTACAATCAACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAAAGC
TCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCTTCA
AGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTTGGA
AGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCAGAA
TTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTCCCA
AATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTAAACGAAAGCAAAGGA
AATGGTTATCATTACAATCAACCAAAGCCCGCGTTCCAACCCGCTAATTCCGGACAAAGC
TCCTTTAACCAGTATCGTCCGGAATTATCTGGCCAAAGCGAAAAGATATCATCTTCAGCA
AGACCAAGTATGTCAATTCCCATATTTAATAGAGATCAGAATTTGAAACCCTCATTTGGA
AGTCGTCCCTCAGCTGATCAATCTCAAAAGTATCAATTCGGCAAACAAATTCCTTCAGTT
TTTAAGCCTTCTTCTTATAGAACATCAAACGTTTCTCAAAATAAAGAAAGTCCTTTCCTC
AATCGAGACCAACGAGTCCAAATTAAACGACCATCAAGTGGTTTAATACAAAACAAAGTA
AACGGTTATCAATATAATCGACCAAAACCTGCCTTTCAGCCAACTATTTCGGGACAAAAT
AGACCTCGAGTTTCTAACGAAGGAAATAAGAAGCCACTATTAAGTCAAAACACTTTCACT
TCAGTAGTTAGTGGAAATAACGGAAACATTAATCCTTCACCACAAAACGGACCAAATGGT
TCTCAAAATAAAGGATCTTACCCTATTAAAGTTCTAAAGAACCCAGGCAGTCAGGCAGCT
GGCTCTTCACAAGGCAACGGTTCACCTCGCTTCAGTATTTTGAATAAAAATAAACCTTAT
TCTGCTTTGCAAAAACCCGGACAAGGTTTCCAATCGTCTCCTAGTGAAGGACTTGGAAAC
AATAAATTTGGAAAAGGACCTATTTCTGCTTTAAAACAAGTCGAAGCGCCTTACCACTAC
AAAAGACCAAGTGTAAGTTTTACCACACAACGTCCAAACTCTTTTTCGCAAACAACACAG
ATAAGCAGAGGCAATCAGGATAAAAGTGAACAGTTTGCGGGGTCTCGTCCACCGCCGAGT
TTCAGCGAGGAAGAAGGTTACAAATATTAG

Protein sequence:
MESRILVLSIIAYGYADKLDKGYLPPVNAASSGGSPAELIAPADQSEVFGQGLPVPESQP
GSYIQDIGQEVLQAYNQERPQAAADRNAEILKFNNENNGESYAYNYETSNGISVEESGVA
SNGVNAQGGYAYTGDDGKSYSVTYTADINGYQPQGEHLPTPHPIPEEILKSIEENARAAA
AGTQEGAYNPEEYDSNVYYQTKPDQESDGSLDVIERNKNQEISSIYNNPLDSVGQQYQKA
SSLDVNPSDQKRNKESGQDINQMYTPNPYQSHQTSGIGIQGNSGFESSSQSPIQGIAGQF
LQGDGYQYNQPKFSLQPVFPGQDQYKPQVTSDNENISSSLRPSSSGPAFNRDQNLKPSFG
SLPSAEQTPQYQSGQQILPEFRPSSHSGPNASQSMRGSVPNQDQQIQIKESSGDLNESKG
NGYHYNQPKPVFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSFG
SLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNESKG
NGYHYNQPKPVFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSFG
SLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMRGSLPNQDQQIQIKESSGDLNESKG
NGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSFG
SLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNESKG
NGYHYNQPKPAFQPANSGQSSFNQYRPELSGQSEKISSSARPSMSIPIFNRDQNLKPSFG
SRPSADQSQKYQFGKQIPSVFKPSSYRTSNVSQNKESPFLNRDQRVQIKRPSSGLIQNKV
NGYQYNRPKPAFQPTISGQNRPRVSNEGNKKPLLSQNTFTSVVSGNNGNINPSPQNGPNG
SQNKGSYPIKVLKNPGSQAAGSSQGNGSPRFSILNKNKPYSALQKPGQGFQSSPSEGLGN
NKFGKGPISALKQVEAPYHYKRPSVSFTTQRPNSFSQTTQISRGNQDKSEQFAGSRPPPS
FSEEEGYKY
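
The record lists a CDS length of 3270 nt, which corresponds to 1090 codons, that is, 1089 residues plus one stop codon. A minimal sketch of how that consistency could be checked is shown below. The function name `check_cds` and the set of checks are illustrative assumptions, not part of the original annotation pipeline, and the stop codons assumed are those of the standard genetic code.

```python
# Hypothetical helper: sanity-check a CDS against its listed protein sequence.
# Assumes the standard genetic code stop codons (TAA, TAG, TGA).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> dict:
    """Return basic consistency checks for a CDS and its translation."""
    # Remove line breaks and other whitespace from the pasted sequences.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]

    return {
        "length_multiple_of_3": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": bool(codons) and codons[-1] in STOP_CODONS,
        # One codon per residue, plus the terminal stop codon.
        "protein_length_matches": len(cds) // 3 - 1 == len(protein),
    }


if __name__ == "__main__":
    # Toy input only; paste the full sequences from the record to check them.
    print(check_cds("ATGGAATCCTAG", "MES"))
```

Passing the full nucleotide and protein blocks from this record as the two arguments applies the same checks to DPOGS209415.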