New model in OGS2.0 | DPOGS215910  |
---|---|
Genomic Position | scaffold396:+ 18773-24491 |
See gene structure | |
CDS Length | 1698 |
Paired RNAseq reads   | 4 |
Single RNAseq reads   | 35 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000282 (6e-08) |
Best Drosophila hit   | cuticular protein 62Bc (8e-14) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-2 motif 82 [Bombyx mori] (9e-34) |
Best NR hit (blastx)   | PREDICTED: similar to Cuticular protein 62Bc CG1919-PA [Tribolium castaneum] (2e-30) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | ND |
Nucleotide sequence:
ATGTATAAGTTATTAGTTTTGTGTTCATTAATGTCGGCCAGTTACGCGATATATCCACAT
CATCATCACCTGGCTGTTTCCCATCAGCAGGTGATCAAGCACGACGGACCACACCATCCG
ATACCAATTCATCATCACGTGCCACACCACCACGTTGGAATCATTCACAATCTTCCATTA
CCCGTACCCATCCACCACGTGCCTCACCATCAAATACACCACGATCATTATGCATTCCCG
GAGTACAAGTTTGCGTACTCAGTCCATGACCATCATACTGGTGATGTGAAATCACAGCAT
GAGTTCCGTCATGGAGACGTGGTGCAAGGCGGATACGAGCTCATCGAACCCGACGGCCGC
CAGAGAAAAGTTGAATACAAAGCTGACGATCATTCTGGATTATGCAAAGAAGTCATCTCG
TCTGCCCGTGATCACGGTTGCTGGAAAGTAACCAAAACGTCGGGAGTAGTAATATTAGTA
GGTACGGTGCATATTGCTCTAACCCTAGCCCGTCCACCTTATTCAGAACAAAAGGTGACG
ATAAACCATCACGGCAAGCTCTCGCACATCGGATCTAATGCTCCTGAGGAACACGAAAAA
TACGGCTGGTCACATCCGTCATACGAATTTACGTATGAAGTATCTGATCCACATACACAC
GATTTCAAAGGACACCATGAAGCTCGAGAGGGTGATAAAGTAAGAGGTTATTACTGGCTC
ATTCAACCTGATGGACTCAAACGAACTGTAAAGTATCACGTTGATAAACACAGCGGATTT
AACGCTAATGTTTTAATTTCAAAACCATGGGAAGAAGGAAGTAATAACGAAGAAAACGGA
GGAGAAAATTCAAATGAAAATAATGAAGGCCAAATGGAAAACGAAGAAAATGGAGGTGAA
AATCAAGAAAGAGAAGAAGAGCGTAATCAAGGAAATGAACGAGAGTCTAATCAAAATAAT
AGCTCAGAAAATGGTGGAAATGAAAATAATAACGAAGAAGAAGGAGAGACAATAAATATA
AACAGGAGTCAGGAGAACAACAATGGTAGAAATAATAATAGTGGTGGACGAAATTCAGCA
GAAAATTCGGGTGAAGGTCAGGGCCGCGGTGAAAGTGGTAGGGGTTGGCAAAATAACCGA
GGAATGGAATGGCAAAGAGGCAACAGTGGTTCTAATGAAAATAGCGAAGAAAGACGCGGC
GGAGAAAATAATGGCGGTAGAGCAGGAGGAAATTGGCGCGGACGTGTTAAGTGGAATGGA
TGGCAAGAGGGTGGCAATCGTGGCGAACGAAATCAGGAAAACAATGAGGGAAGACGAGAA
GAAGGTAATGAAAATGAAAACCGTAACGATAGAAGCAGGCAGAGTAGCGAGAGCTCAGAA
GTCCAAGAAAACGATCGTGGGCAATGGAGCAATGAGAGACAAGAAAACAACAATCAGGAT
GAAGAAAATCAGGAAGAAAATGGAGGCGAGCGAAATCGTAACAGTGGAGAAAATGGACAA
TGGAATGAAAACGAAGGAAACAATGAAAATGGGGGAAACAATGAAAATAGGGGTAGTAAT
GAAGGCCAGGAAAATAACGGCAAAAGCAATGGCCGTAAAGGCGAGAGGAACGAAAAAGGG
AAGCAAGAAGTCACAAAAACCCATTATCACATAATTATTCATCATCCTAAACATCATTAC
AAGTCAAAACAAAATTAA
Protein sequence:
MYKLLVLCSLMSASYAIYPHHHHLAVSHQQVIKHDGPHHPIPIHHHVPHHHVGIIHNLPL
PVPIHHVPHHQIHHDHYAFPEYKFAYSVHDHHTGDVKSQHEFRHGDVVQGGYELIEPDGR
QRKVEYKADDHSGLCKEVISSARDHGCWKVTKTSGVVILVGTVHIALTLARPPYSEQKVT
INHHGKLSHIGSNAPEEHEKYGWSHPSYEFTYEVSDPHTHDFKGHHEAREGDKVRGYYWL
IQPDGLKRTVKYHVDKHSGFNANVLISKPWEEGSNNEENGGENSNENNEGQMENEENGGE
NQEREEERNQGNERESNQNNSSENGGNENNNEEEGETININRSQENNNGRNNNSGGRNSA
ENSGEGQGRGESGRGWQNNRGMEWQRGNSGSNENSEERRGGENNGGRAGGNWRGRVKWNG
WQEGGNRGERNQENNEGRREEGNENENRNDRSRQSSESSEVQENDRGQWSNERQENNNQD
EENQEENGGERNRNSGENGQWNENEGNNENGGNNENRGSNEGQENNGKSNGRKGERNEKG
KQEVTKTHYHIIIHHPKHHYKSKQN