New model in OGS2.0 | DPOGS202780  |
---|---|
Genomic Position | scaffold1363:+ 14432-16450 |
CDS Length | 2019 |
Paired RNAseq reads   | 245 |
Single RNAseq reads   | 709 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010501 (2e-144) |
Best Drosophila hit   | cuticular protein 97Ea (6e-09) |
Best Human hit | ND |
Best NR hit (blastp)   | TPA: putative cuticle protein [Bombyx mori] (3e-135) |
Best NR hit (blastx)   | TPA: putative cuticle protein [Bombyx mori] (4e-107) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL23936 |
Nucleotide sequence:
ATGAACCTCTTCATGCTGCTCGCTTGTATCATAGCGACGACCACTGCTACCAATTCAACA
GAGACTACAACCAATCCTCCAAATAATACCACCCAGGAAATTCTCAAGGCCAGTGAAACT
GCGGACCCGACCAAAGCTGAAATTGTAAAACAAATTCGTCGTCTGAACGAAGATGGATCG
TATACGATTGGATACGAAGCCAACGATGGAACGTTTAAAATTGAAAGTAGAGATGTTTTG
GGTAACGTAAAAGGAACCTTTGGTTATGTGTCAGATGATGGAGAAATAAAACGTGTCACT
TACAGTTCCTCTGCTGATAGCACACCGGCATCGGTAACTACTAGTACCACGCCAACTACA
CCCACCATGGTCGTGAGAGTTAACAAAACTATATCGTCGACGACAAGAAGACCGTTAGCC
ACTGTAGTGTATCCGACCAGAGGAAGCACCACCACCAGAGGTACAGTTATACAAGCCATT
CCGAGACGACGAACCGGGTCAAGCTCCGTGCGCCCTCAAACGACTGATACAACTACGGAA
ACTCAAAAACAAATCACTGCAAGTAGCTCGAACGTTCATCGAAGAGAAGATCTTCTGAAA
TCACGGTCTCAATCAGCAAAAACTGCACCTGTAACCTCCAAAGACGATTTATTAACCAAA
CAGACTACAAGTACGGCGTCTCAGACTCTAAAACCTGTTTACGAGCATACGACAGAAAGA
GAGACGGATATTAATAAATCGACAGCTACGAGACGTGAATTATCAGGAGCGAGCGCTAAT
CATCATATGTTGAATCTTCAACAATCAATGGGAGATGATTCAACTGATGTTTATGGGAGT
CACTTATCACATGGAACTCTCAGACCTTTATTTACTACAACCACGATTCGTCCTAAATTA
GTTACCCTTCATTCAATAATTGCCGCAAGACAACAACAACAACAGCAGCAGCAGCAGCAT
CAACAGCAGCAACAATATGATGACGAACAAGAAGAGGAACAGGAGACGACATTGGAAGCC
ACCGCTGGCCGTGTTTATGAGCCAGATGAAAGTGTTACCTCCAATCCTGTACCAGTTGTG
CACATATCAGCACAGAGAGGTTCCGATAAAATATTTTATCAACCCCAATACCGTCGACCT
GCGGCTGTTCTTTTCAGGACTCAGGAATATTTGAGAGACAATCCCGGTGCCCCTATTCCC
ATTGGCAACCAGCGGCCCTTTCTTAACTATGAATATCCAGATAAAATATTGGACTCACAG
TATGTAAAAGAATCACAGCAAGTCAATAACAATCAAGAAGCGGAGTCTGGTCCTTATGAA
TACAGACATAATGATTACAGACCAGCCCCAAGAATTATCCACGTGCCAGTCGACGATAGA
GGCGTTCCAATTCAGGGGTACGAAGCGAGATATGTCAACCCATATCGACCACAGCCTTTA
ATTCAAAGATACGATCCTGTAAACGAAATGCATTCCATTTCAGCACCGGTGAGCACGAGA
GATTTCAAACGCTTGCTACATATTTTGATACTAAGACAGAACCGTCTGCAAGCTCTCATG
GAGCAGATAATGCCAGAGGCCTACCAGGCGGCTCATTACCGCTCTGAACCGTATCACGCA
CAATCGCGGCCCTATTCTCGCCACCGAGACGACGACCAATACGACTACAGATATCAACCG
CAGTACAGACAAGATTTTTATTCAACACAAGTATCAAACTACGACGATCGCGACTACGAG
TCCCATCGCTATTCGCCTCGAAGAAGATTATACTCGCGACCCTATGACGCTCAGGGTTCA
GCCTCGGAACATATAGAACAGACACCGGAGTATCTCCCGGTCGAAGTGAGAGAAGCTCTC
TTGTTGAAAATGCTGTTGTTGGCTATCAGCCCGGACTTCATGCCGACACCAGCGCCCGCC
ACCGAGCTGACCACCGCAGCACCAACAAGGAAACAGGTGAGAAACGTTCAAATACTTGGA
GAAGAAGGTTCCGACAAAAAAACAAGGCAGGGGCACTAG
Protein sequence:
MNLFMLLACIIATTTATNSTETTTNPPNNTTQEILKASETADPTKAEIVKQIRRLNEDGS
YTIGYEANDGTFKIESRDVLGNVKGTFGYVSDDGEIKRVTYSSSADSTPASVTTSTTPTT
PTMVVRVNKTISSTTRRPLATVVYPTRGSTTTRGTVIQAIPRRRTGSSSVRPQTTDTTTE
TQKQITASSSNVHRREDLLKSRSQSAKTAPVTSKDDLLTKQTTSTASQTLKPVYEHTTER
ETDINKSTATRRELSGASANHHMLNLQQSMGDDSTDVYGSHLSHGTLRPLFTTTTIRPKL
VTLHSIIAARQQQQQQQQQHQQQQQYDDEQEEEQETTLEATAGRVYEPDESVTSNPVPVV
HISAQRGSDKIFYQPQYRRPAAVLFRTQEYLRDNPGAPIPIGNQRPFLNYEYPDKILDSQ
YVKESQQVNNNQEAESGPYEYRHNDYRPAPRIIHVPVDDRGVPIQGYEARYVNPYRPQPL
IQRYDPVNEMHSISAPVSTRDFKRLLHILILRQNRLQALMEQIMPEAYQAAHYRSEPYHA
QSRPYSRHRDDDQYDYRYQPQYRQDFYSTQVSNYDDRDYESHRYSPRRRLYSRPYDAQGS
ASEHIEQTPEYLPVEVREALLLKMLLLAISPDFMPTPAPATELTTAAPTRKQVRNVQILG
EEGSDKKTRQGH
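
The fields in this record can be cross-checked against one another: the genomic span should equal the CDS length for a single-exon model, and translating the nucleotide sequence should reproduce the protein sequence. Below is a minimal sketch of such a check, assuming the standard genetic code and 1-based inclusive coordinates. The helper names `span_length` and `translate` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start, end):
    """Length of a 1-based, inclusive genomic interval (assumed convention)."""
    return end - start + 1


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: scaffold1363:+ 14432-16450
print(span_length(14432, 16450))           # 2019, matching the CDS Length field
# First seven codons of the nucleotide sequence above
print(translate("ATGAACCTCTTCATGCTGCTC"))  # MNLFMLL
```

Applied to the full record, a 2019-nt CDS corresponds to 673 codons, which is 672 residues plus the terminal TAG stop codon, consistent with the length of the protein sequence listed above.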