New model in OGS2.0 | DPOGS213501  |
---|---|
Genomic Position | scaffold252:+ 14985-18800 |
See gene structure | |
CDS Length | 2040 |
Paired RNAseq reads   | 49 |
Single RNAseq reads   | 251 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008136 (2e-35) |
Best Drosophila hit   | cuticular protein 50Ca (1e-13) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-2 motif 143 [Bombyx mori] (1e-65) |
Best NR hit (blastx)   | cuticular protein RR-2 motif 143 [Bombyx mori] (8e-106) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL40230 |
Nucleotide sequence:
ATGGATCATCCAAAGCACTTGACATTAACGGTCTCCATATTGTTAGTTGCTTTTCAACTG
GCATCATCCAATAAAAACGCCAGGCCGGTTTACGAGGTTATTAACGAACAGTCAAGAATT
GATGATAGAGTCAGAATATCCAGCAGTCCAGGGTTACTTCCATATGGCAACCCTCTTGCG
GGTAACTCCATACAAAATGCAAGATTACACGCCAGCAACCAGCAAGCCATGTTCAACGCT
GAGAGGATACGCCAACATAAAGCCCAATTACAACGACAGGCCAATCTGCCAAGAGTCGTC
GATAGCTATATGCAAGCGTACCACGAATCACAGGAGAGTCATCATCTAGCTCTCGAACGA
CAACAGGCGACAATTAATGATAATCCCACAACTAAGGCACCCGTGAGACCACAGCACAGA
CAAAAACAAAGAAATAGGAATAGAAATTATAGATCATATGGTCCAGTCGATTATCAACAA
TACTTGGAACCGCAGAAGTACAGAACTGTTTACGTGACTCCAACGACGGATAACGACCAG
GCAGTGTCCATAAAAGATAACCTTAACGTACTAGGACATTTGAATAGCAAAACACCGACC
AAACTTTACACGGAAGCGATCTCAAGCGATTCTAAATATGGGTATCCAAAACACTATACA
CCAACACAAAACTTGAAATCGATCCAAGATATTCAAGTTCTTAATTCATTACTGACCAAA
AATCCAGTGGATCAATTGACAGAATTTAATGCACTCATAACATCTAGCGCTGAAAACGAC
AAAAAAGTACCAATCGATCTTTATTTTTATGTTAAAGATCCAAATAGCCAGTCATTATCT
CAAGAACAATCAAAATACGCACCAAATTCTAATTCGTACATAATTCCATCAGATTTGAAT
ATAAAAGATCACACGCCGATAACAGAAGATGTAGACGATATTGGTAATCCGATCGAAGAC
CAACAATATTACAGCCTTCAAACAATCCCAACGACTAAAGCAAGCGAAGCGACAACTACA
AAAAACAATTACTACAAAGTAGAAGTGGCCAGTCAAACAATAACCGTACCGAAGCAAGAA
GAATACAATCTGAATGATCCCACTTACCAGATATCGCATTACGGTCAAACTTATGGTAAA
AACGACGCGAAAAGTGAACAATATTTGAACCTGCATTCACAACCGACCGGAGTCCAACAT
CTATCTGGCGACGGCACAGAAGTATCGGCTTACGGCGACGATGATAAACGCACCGGCGAT
TTCAACGTTTTTCTATCCCCTAACTTTCCCGTCGGTGAGGCAGCCGACTATAATGATGAT
TACGATTACCAACAATTAAACTCGAAGCGATCCAAGCAAAAACTGGAATCGTTTTACGAT
GACGATTATTACGACGACTTTAGTTCAGACTATGATAGCCCTAGACAATATTTTCCCCCT
CAATCCGAACCTAAGGGCTATCCTTCAAGAACGCGTAACTTTCGTCGCTCTCCAATTTCT
CAGCTCTATGGGAATCCTCCCGCTACTTCTTATGGTGTTCCTATATACGGTACATCAGTG
TTCAGCACAGCCCAAACTTTCATTCCTCCTAAATTAGAAGTACCAACATTAACAAACCAA
TTCCCGAATGTCATCGAACCCGTTTACATGCTCACGCAAACACAATTAAAAGAATTAGTA
GGACATCACAATTTGAATATTGAACACCTCGATGTGTACCAACTTCTAAAAGAAAACAGA
GCCAAGAAAACACCGTACTACCCAAGAAAATATCGTAAAAGAAATCCCTTTAGGCACCTC
AGAAGTAATTTACATAAATTAAACAAATTTCACCTCAAATACGCTGCGAATTATGAATTT
GGTTACCGTGTGAGAGACGCCAACACAGCTAATTACTACGGACATAGAGAGTCTAGGAAC
GGCTTGAAAACCAAAGGACAGTACCATGTGTTGTTGCCAGATGGTAGGATGCAGCAAGTA
AACTACGTCGCTGGACCAGAAGGTTACCACGCTGATATAACGTACGACCAACCTCATTAA
Protein sequence:
MDHPKHLTLTVSILLVAFQLASSNKNARPVYEVINEQSRIDDRVRISSSPGLLPYGNPLA
GNSIQNARLHASNQQAMFNAERIRQHKAQLQRQANLPRVVDSYMQAYHESQESHHLALER
QQATINDNPTTKAPVRPQHRQKQRNRNRNYRSYGPVDYQQYLEPQKYRTVYVTPTTDNDQ
AVSIKDNLNVLGHLNSKTPTKLYTEAISSDSKYGYPKHYTPTQNLKSIQDIQVLNSLLTK
NPVDQLTEFNALITSSAENDKKVPIDLYFYVKDPNSQSLSQEQSKYAPNSNSYIIPSDLN
IKDHTPITEDVDDIGNPIEDQQYYSLQTIPTTKASEATTTKNNYYKVEVASQTITVPKQE
EYNLNDPTYQISHYGQTYGKNDAKSEQYLNLHSQPTGVQHLSGDGTEVSAYGDDDKRTGD
FNVFLSPNFPVGEAADYNDDYDYQQLNSKRSKQKLESFYDDDYYDDFSSDYDSPRQYFPP
QSEPKGYPSRTRNFRRSPISQLYGNPPATSYGVPIYGTSVFSTAQTFIPPKLEVPTLTNQ
FPNVIEPVYMLTQTQLKELVGHHNLNIEHLDVYQLLKENRAKKTPYYPRKYRKRNPFRHL
RSNLHKLNKFHLKYAANYEFGYRVRDANTANYYGHRESRNGLKTKGQYHVLLPDGRMQQV
NYVAGPEGYHADITYDQPH