New model in OGS2.0 | DPOGS215905  |
---|---|
Genomic Position | scaffold2156:+ 41254-45342 |
See gene structure | |
CDS Length | 1512 |
Paired RNAseq reads   | 833 |
Single RNAseq reads   | 3725 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000277 (3e-12) |
Best Drosophila hit   | CG34461 (4e-22) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-2 motif 87 [Bombyx mori] (2e-60) |
Best NR hit (blastx)   | cuticular protein RR-2 motif 85 [Bombyx mori] (3e-31) |
GeneOntology terms    | GO:0005575 cellular_component GO:0008150 biological_process GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL19401 |
Nucleotide sequence:
ATGACATCTTCAAGTTCTCTACTTTTCCTGTTGGTGACGATCTTGTATCAAGATATCACT
TCTGGGGCATCATTTTCAAATGTTATAGTGCAAAGAGGATCTGAAGTACTTCCCATAGAA
TACATCAACTATAATCCATCTGCATTAACATACACGTCTCAAGTGGCTCGTTTATCTCAA
GCACCATTACAATCAGTAATTTCTGAACCGCTAGTATCAAGTGTTTTATCTCCATGTGTG
TCGCCTTGCGTCATTCCTAATGCAGCTCCCGTCCAATCCACGGTTAATGCTAAAATTGGT
GCTTATAAGACATCAGCGGCGAATGTACCTGTCATATTGACAAATAAAGACGAATCAAGG
GGATACCAATATGCTTACGCAGTTTTTGACGTAGACACCGGTGACAAGAAAACTCAAAGC
GAACGCAGTGACGGGTCAGTAGTTCAAGGGCACTATTCGTTCATTCAACCTGACGGCTTC
TTGCGCGAAGTCGTTTATACGGCGGACGATTTGAAAGGATTCAACGCTATAGTACGTAAC
ATATCTCCTGAACCGGAACACAAACATACTTCTGAAACAGAAAAATCAGAAGAGAAACAC
ATCATACCGCCATGCAAAGACAATAAAAATGAACATTTAACACATGCACATGAGAGAAAT
GAAGAAAGTCATCCAGCTCATGAACAAGAAAATCACGAAGCAAGTTCATCAGAGGAAATG
CATGTAAAACAGGAACATTTAGAGGTAGAAAATTTGAATGAAAAAAAAAGTGAAGAAGCA
AGTGCAGAAAAAAATGAAGAAAATGTTGATAAAAGCCATGAGCATAGTGCTGAAAAAAGC
GAAAAAGATAGTGAGGAAAACTCGGGTGAAAAATCTGTCGAAAAATCAAACGAAGAAGTC
CATCCAGTAATCCTATCTGAAGGAGGTCAAGAATCAAACTTGTTAATTACTTATAGTGAC
ATAGTTAAATGTCTGCAATCTAAACTACAAGGGCCGAATGTGGTTTCTCCGCTCACCTAT
TTGTTCAACCAAACACACAAAATGTTCAGCAAAATTGTAGCATTTGGAGCTATGTTGGCT
GCTGCCAACGCTGGTCTTCTTCATGGTCATGGACACGCTGTGTCCTCCCAAAGCATTGTC
CGTCATGATGAAGGACACTACGCGGCGCCCATCGCATATGCTGCCCCAATCGCTCATGCT
GCTCCAGTTGCCTACGCCCCAGTCGCTCACTACGCTGCACCCGCTGCCCATTACGATGGA
CATGATGAATATGCCCACCCCAAATACGACTTCGCATACTCAGTAGCAGACCCTCACACC
GGTGATCACAAGTCACAGCATGAGAGCCGCGACGGTGACGCCGTCCATGGCTACTACTCC
CTGGTACAGCCTGACGGCTCCGTACGTAAAGTGGAATACTCTGCTGATGACCACAATGGA
TTCAATGCCATCGTACACAACTCAGCTCCCTCTGTGCATGCCGCACCAGTGCCAGCCTAC
CATCACTACTAA
Protein sequence:
MTSSSSLLFLLVTILYQDITSGASFSNVIVQRGSEVLPIEYINYNPSALTYTSQVARLSQ
APLQSVISEPLVSSVLSPCVSPCVIPNAAPVQSTVNAKIGAYKTSAANVPVILTNKDESR
GYQYAYAVFDVDTGDKKTQSERSDGSVVQGHYSFIQPDGFLREVVYTADDLKGFNAIVRN
ISPEPEHKHTSETEKSEEKHIIPPCKDNKNEHLTHAHERNEESHPAHEQENHEASSSEEM
HVKQEHLEVENLNEKKSEEASAEKNEENVDKSHEHSAEKSEKDSEENSGEKSVEKSNEEV
HPVILSEGGQESNLLITYSDIVKCLQSKLQGPNVVSPLTYLFNQTHKMFSKIVAFGAMLA
AANAGLLHGHGHAVSSQSIVRHDEGHYAAPIAYAAPIAHAAPVAYAPVAHYAAPAAHYDG
HDEYAHPKYDFAYSVADPHTGDHKSQHESRDGDAVHGYYSLVQPDGSVRKVEYSADDHNG
FNAIVHNSAPSVHAAPVPAYHHY