New model in OGS2.0 | DPOGS209413  |
---|---|
Genomic Position | scaffold2372:- 14247-17783 |
See gene structure | |
CDS Length | 1470 |
Paired RNAseq reads   | 41 |
Single RNAseq reads   | 114 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012601 (5e-53) |
Best Drosophila hit   | cuticular protein 67Fb (4e-16) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-1 motif 11 [Bombyx mori] (4e-61) |
Best NR hit (blastx)   | cuticular protein RR-1 motif 11 [Bombyx mori] (9e-53) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL40765 |
Nucleotide sequence:
ATGAACTGGGTATTGTTTGCCGCTCTTACAGTGGCATGTAATGCTGCGAAATTAGACCGT
ACATATTTACCTCCTGCATCAGCTTCAACAGCTGGTGGAAGTCCAGGTTCAATACAAACT
CCGATTGCAAAGTCTGAAATAGAGAATTTGCCACTTGGTAGCTTTGTTAACGAACATGAT
GGGGTTGTCATTGATGTTGGCGCTGCTGGCATAAAAGCAGTGCAATCCGGCTTAGGAGCT
TTAAGAATATCATACGGATCAATGGATTCTAAAGTAGGAGAGGCAGCATTCAGAGGTACA
AACCAGGCTCACGTAGCAAACCCGATTTTGGATTACAACCCCGACCTGTCAGCTTTCAAT
ACGCCGAAGAAACCGATGAGATCTGAAATTCAGGTTACACGTGACCGTGGAGCTAGTATT
GTTAAGTACCACAATGATAACAACGGTGAGCGTTACAGCTATGGCTATGAAACTGATAAT
GGAATAAAAGCTGAGGAGAACGGAGTAGCCATCAATGGTGTTCAAGCTGAGGGTGGATTT
TCCTATGTGGGTGATGATGGGAAAGTGTACAGTGTCGTCTACACCGCCGATGAGGGTGGT
TATCGGCCGATGGGTAATCACCTCCCGACTCCACCACCTATACCGGTGGAGATATTGAGG
GCCTTGGAGCAGAACATGAGAGATGAAGCAGCCGGTATTTTTGAGGATGGCTCATACGAT
GCACGAAAATACAACAATAATGATTATAAGCAAGCTGGGATCAATAATAACTATGATGAC
AGACAAACAAACATGTTTAATGTTCAATCTGGGCTCATCGCTAATATGAATGGAGGATTC
ATGAATAACAACCCAGCAGCTGCTGTCAGACCAAACCCAATAGAATTGTTTGGTACACAA
ACTGAATCTTCGAAATTTGGTGCTGTGAATAAGTTCGGGACAGATTTTAACAAAGGAAGT
GGTTTAGGACAAAACCAGATGAACATAAATGCTGAACTTGGCTCAAGAAATGAATATTCA
CAGAAGCAGAATGCTCAAAATACAATTTCTTCTAATTTTGAGTCCTCGTCTACGAACATG
AACGGCAGTAACAGTTTTACGCCCGCGAAGCCTCTTTCACAACAAGCACTTCAGGAAACA
ATGAATTTAGGACAAGGCCAACTTGATAGCCCAAGGCCAAGTTCATACGATACCTCGATA
TCATCAGTGACTTCTGAGAAGGTATCTTCAGAAGATAAACAAAAGAATAATCAAGATTCG
GGAATAAATTTCTCGATAGAAAGTAATCAACACAAACCAAGCAACAATGGTCAATCATTA
ATGGCACCGATGCAGAGATTACCAACATATAATATTAACAAAAATACAAGTCCACAATAT
ACGCAATCAACGCAACAAAGTAACGAACAGAATCTTTTCGTGGTCCTCAAGGTTCTCTAC
AATCTTCTATTTCTAGTCCTTCTCGTTTAG
Protein sequence:
MNWVLFAALTVACNAAKLDRTYLPPASASTAGGSPGSIQTPIAKSEIENLPLGSFVNEHD
GVVIDVGAAGIKAVQSGLGALRISYGSMDSKVGEAAFRGTNQAHVANPILDYNPDLSAFN
TPKKPMRSEIQVTRDRGASIVKYHNDNNGERYSYGYETDNGIKAEENGVAINGVQAEGGF
SYVGDDGKVYSVVYTADEGGYRPMGNHLPTPPPIPVEILRALEQNMRDEAAGIFEDGSYD
ARKYNNNDYKQAGINNNYDDRQTNMFNVQSGLIANMNGGFMNNNPAAAVRPNPIELFGTQ
TESSKFGAVNKFGTDFNKGSGLGQNQMNINAELGSRNEYSQKQNAQNTISSNFESSSTNM
NGSNSFTPAKPLSQQALQETMNLGQGQLDSPRPSSYDTSISSVTSEKVSSEDKQKNNQDS
GINFSIESNQHKPSNNGQSLMAPMQRLPTYNINKNTSPQYTQSTQQSNEQNLFVVLKVLY
NLLFLVLLV