New model in OGS2.0 | DPOGS209417  |
---|---|
Genomic Position | scaffold1008:- 75161-77628 |
See gene structure | |
CDS Length | 2160 |
Paired RNAseq reads   | 6 |
Single RNAseq reads   | 18 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012604 (1e-29) |
Best Drosophila hit   | cuticular protein 67Fb (5e-18) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-1 motif 14 [Bombyx mori] (8e-29) |
Best NR hit (blastx)   | cuticular protein RR-1 motif 14 [Bombyx mori] (5e-29) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | ND |
Nucleotide sequence:
ATGAAAATAATATGGGTACTTTTGGGGCTCATAGGCTCAGCCCTAGCGGAGAAATTAGAT
CGTTCATATCTTCCACCACCTGGATCAAAATTTTCTGGAGGCAGTCCAGGGGCAATCGAT
GTTCCACTAGAGTTCCCTAAGGAAACGGTTCTTCCTAACCCTGGCAGCAATAATTTGGGA
AAACCAGAAATAGCAATTGGTATCAACCGCATTAGCCCAATAGCCTTCAATCAACAATAT
GGCAGTACATCAGAACCATACCAAAAAAATGAATATAACAGTCCTGCAACTGAATCGTAT
GCATTATCAGAAATTTCAACAAAAACACCTGATGTGGTTTCTAACGATTTCGTTCAAACT
CCTAACTTAGGTGATTTGAAAATCAATTATGAAAACTTGTACCAGACTTCATCAACCACA
AAGCCTGGCAACCTTTTTAGTGTGATAGAAGAAACAAAATATATTAATAATGGTGGTGAT
CTTACAGATGACAACAATTACGAAACAGATAATAAAATAAGCGATGAAAGTAAAAAAATA
TACGGTGGTGATATTGACAGTATTCCAGAAATTAATTCATACTTTGCGAAACCTTCGTCA
CCTGCTGCATTTCATTTTAGTTTGTCAAAATTATATGAAACTGGTGCCAATGTTTCGTCT
ACACCGACTTACGACAGTTCATCGAATTTATCTGAAAGTTTATATAGCACAAAACCATCG
CAATACGGTCTGAAACCTGATTCAAAGACAACATTCAACATTCCTTACACTTATTCACCA
CGTACAGAAAGAATACAGGCGCAGAGAGATAGGGAAGCAATCATCTTAAATTATGACAGT
GAAATTACTCCAGATGGTTATGCTTATAGCTTTGATACGTCAAATGGAATCCATGTAGAT
GAAAAGGCAACTGCACTAAATGGAGTCCGGGCCACTGGATCATATTCATACATTGGAGAT
GATGGCAAACTCTATAATGTCAGTTATACAGCTGATGAAAATGGATTCAGGCCTATTGGG
GATCATTTGCCATCTCCTCCGCCAATTCCTGATGCCATTATGAAAGTCATAGAACAGGCT
ACAAAAGATAGAGATTTAGGAATTTATGATGATGGTGATCCCAAATACATCAAAAAGCAA
ACAGACCAGAAAGAAATCAAAAACAAGAAAATTCTTTCAAATAAAAAAGAAATGGGGGAC
AAATTGATTACAAAAGATTCGATATTTACACCTCCGGTCACGACTTTATTACCTTACGAG
GAGGATAGATCAAAGAGTAATTATGATGAATTTAAAAATGAAAATATTAATGATTCAGAA
AATGAAGGTAATATATTGAGTAATGAAAGTGATGGAACTGGGTACGAATACTCAAAACCT
CTCGACGACTTCTCTTTGGTAACAGACAAAGAGTTTGTCAATAGAATTTCAGAGACTCCA
AATCTGAATACCGTTCGAATTATTGAAGAGAAGACAAACGGTAATATTCGAGGAAAACCG
TTCATGACACCGTACGTTTATGAAAATGATAACACATTTCTCGACTATGACAACAGTGAA
TCAGTATTGGAAACACTAGGACAATACCAAGTTCAAAACAAAAATATACCGAGTGTTCAG
AATACAAACACTGTCCTTCCAACTTTTTCAATTTCAGAATTAAATTCTACCAATAATATA
AGTGGGGATACTGGGATAACTAACATACTACCCCAAGATCATCAAGGATACTTCTATCCC
ACCACAGGATCTAATTTCAACAGTGATGCTTATAATCCCATTGCAATTTCTTCCGAGAAA
AACATATCTGAAAATCAACCAACAATTCCAAGAGAATTTCCTTCAAGGTTAAATTTGCAA
GCTACAAAAGTAGAAACTGTTTCCAGCAATCCATCCCCATATAGAGACTTCGGTTTATTC
AACGATGAATCTAAAATTAGAAATGATTCTACAGATGTGGGAGATAAACGAATGTTTATA
CAACAAGAAACAACCCAACCCAGTGATCAAAACAGTTATGGTGAATACATTTCAGTACCT
ACGAATGAATTAAATGACACAGAATTCACAATTAACAGAGAAAATGCCATCAAAGGCGAG
GATTTCAGTGGTCCGAAACAGAGACAAAAATATGACCCTCTTACTGGATACTACTATTGA
Protein sequence:
MKIIWVLLGLIGSALAEKLDRSYLPPPGSKFSGGSPGAIDVPLEFPKETVLPNPGSNNLG
KPEIAIGINRISPIAFNQQYGSTSEPYQKNEYNSPATESYALSEISTKTPDVVSNDFVQT
PNLGDLKINYENLYQTSSTTKPGNLFSVIEETKYINNGGDLTDDNNYETDNKISDESKKI
YGGDIDSIPEINSYFAKPSSPAAFHFSLSKLYETGANVSSTPTYDSSSNLSESLYSTKPS
QYGLKPDSKTTFNIPYTYSPRTERIQAQRDREAIILNYDSEITPDGYAYSFDTSNGIHVD
EKATALNGVRATGSYSYIGDDGKLYNVSYTADENGFRPIGDHLPSPPPIPDAIMKVIEQA
TKDRDLGIYDDGDPKYIKKQTDQKEIKNKKILSNKKEMGDKLITKDSIFTPPVTTLLPYE
EDRSKSNYDEFKNENINDSENEGNILSNESDGTGYEYSKPLDDFSLVTDKEFVNRISETP
NLNTVRIIEEKTNGNIRGKPFMTPYVYENDNTFLDYDNSESVLETLGQYQVQNKNIPSVQ
NTNTVLPTFSISELNSTNNISGDTGITNILPQDHQGYFYPTTGSNFNSDAYNPIAISSEK
NISENQPTIPREFPSRLNLQATKVETVSSNPSPYRDFGLFNDESKIRNDSTDVGDKRMFI
QQETTQPSDQNSYGEYISVPTNELNDTEFTINRENAIKGEDFSGPKQRQKYDPLTGYYY