New model in OGS2.0 | DPOGS215902  |
---|---|
Genomic Position | scaffold2156:+ 18115-22362 |
CDS Length | 1419 |
Paired RNAseq reads   | 153 |
Single RNAseq reads   | 832 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000270 (3e-16) |
Best Drosophila hit   | cuticular protein 66Cb (2e-20) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-2 motif 93 [Bombyx mori] (7e-51) |
Best NR hit (blastx)   | cuticular protein RR-2 motif 94 [Bombyx mori] (3e-43) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL39421 |
Nucleotide sequence:
TATTTTCAGGTGGCATTATTTTGTTGCATCATTGAAATATGTTCAGCAATCCGAATAGAC
CCGAATGAAGGAGCAGGAAGAGAATTAATGAATGAAAGAAGATGGCAAATGGTGACCCTA
TATCCGTCGCATAATCAACAATTATTCAAATATAATACCAATACATATCCTAAATATAAG
TTCGAATATTCTGTGTCTGATAAAAAAACTGGTGACCACAAACATCACCATGAATTCCGA
GATGGTCACAGAGTGCAAGGAGGCTATAGTCTTATTGAACCTGACGGTTCTTTGAGGACA
GTTGAGTATAATGCTGACGACTACAATGGTTTCAATGCTGTTGTAAGCAGAGGTTTCCAT
CGCCATGGTGGACATGCATTCTCTACGTTTGATCATACAAGACATTTTCATCCCATAAGA
AGTGAAATGAAAATAAAACACTTCCTCCCAAGTAGCAATTATCTTTTTGATAAACCACAA
AACAAAGGTAATATTGAAGTAAAACCAGTAAGTGAGAAAAATCCGAACCAAAATTTACCG
GAAAAAAAAGCTTTAGATAAGGAAGTTGAGGAGGAATTAGAAGATCTAAGTAAAATGACG
ACTTTTGATCCAATTGAAAACGAGAATTCAATTACCACCGCAGCCTCTGCAATGGAAATA
CCAACAACGGAATCTGACAAAACAATTGAATCTACTTCGACGGTCAAGAATGACGATATT
GATTCTGTTAATCATATGATCACAGAGATGCCGGTTGTTGAAGTTTCATCCACTTCTAAT
GAAAAAGCCGCAGTAGAAACAGAAAAATTAAAAACTCAAGAAGATTCCGAAGCAGCATCG
TCACGTTTTTATTCTAAGTTCTACTACATCATCGTCATCTACGCTATGGTGGCAGCAAGC
AATGCAGGTCTCCTGCATCATGCACCAGCAGTGTCTCATAATTATATTAACCATGAAGTC
CAGGGACATTACGCTGTCCATGCAGCTCCCTCATATCAAGGGTCCCAGATTTCTCCAATT
CACACATCAATTGTCCACCCAGCTCCAGCTGTTCATGCAGCGCCCGTTGCTTATACAGCC
TCTGTACACGCTGCCCCAGCAGCTTATGCCTCACCTGCTCACTCAGCCCCTGTCTCCCAT
GCTGCTCCAGTTGCTCATGGGGCATCCAGTGCTCACGAAGATGACCACTACGTAGGAGAA
TTCGCTCATCCCAAATACGGCTACTCTTACTCCGTCGAGGATCCCCATACTGGTGACCAC
AAGTCCCAGCACGAGACTCGTGATGGCGATGTCGTAAAGGGCGAGTACTCTCTTCTTCAA
CCTGACGGTTCCTTCCGAAAAGTCACCTACACCGCTGACCACCACAATGGATTCAACGCT
GTGGTTCACAACACTCCACCCGTCATCCATCATCATTGA
Protein sequence:
YFQVALFCCIIEICSAIRIDPNEGAGRELMNERRWQMVTLYPSHNQQLFKYNTNTYPKYK
FEYSVSDKKTGDHKHHHEFRDGHRVQGGYSLIEPDGSLRTVEYNADDYNGFNAVVSRGFH
RHGGHAFSTFDHTRHFHPIRSEMKIKHFLPSSNYLFDKPQNKGNIEVKPVSEKNPNQNLP
EKKALDKEVEEELEDLSKMTTFDPIENENSITTAASAMEIPTTESDKTIESTSTVKNDDI
DSVNHMITEMPVVEVSSTSNEKAAVETEKLKTQEDSEAASSRFYSKFYYIIVIYAMVAAS
NAGLLHHAPAVSHNYINHEVQGHYAVHAAPSYQGSQISPIHTSIVHPAPAVHAAPVAYTA
SVHAAPAAYASPAHSAPVSHAAPVAHGASSAHEDDHYVGEFAHPKYGYSYSVEDPHTGDH
KSQHETRDGDVVKGEYSLLQPDGSFRKVTYTADHHNGFNAVVHNTPPVIHHH
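
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The sketch below is a minimal illustration, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used. The function name `translate` and the variable names are hypothetical and not part of the database. Only the first and last sequence lines are used here; the full sequences can be substituted in the same way.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 60 nt and last 39 nt of the nucleotide sequence in this record.
first_line = "TATTTTCAGGTGGCATTATTTTGTTGCATCATTGAAATATGTTCAGCAATCCGAATAGAC"
last_line = "GTGGTTCACAACACTCCACCCGTCATCCATCATCATTGA"

print(translate(first_line))  # start of the protein sequence
print(translate(last_line))   # end of the protein sequence, then '*'

# Length bookkeeping: 472 residues plus one stop codon, three nt each.
cds_length = 1419
protein_length = 472
print(cds_length == 3 * (protein_length + 1))
```

The translated fragments should match the start and end of the protein sequence listed above, with the final `TGA` codon read as the stop.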