New model in OGS2.0 | DPOGS210120  |
---|---|
Genomic Position | scaffold3783:+ 752-4131 |
CDS Length | 1491 |
Paired RNAseq reads   | 200 |
Single RNAseq reads   | 466 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000226 (8e-146) |
Best Drosophila hit   | cuticular protein 73D (9e-60) |
Best Human hit | ND |
Best NR hit (blastp)   | cuticular protein RR-1 motif 47 [Bombyx mori] (3e-160) |
Best NR hit (blastx)   | cuticular protein RR-1 motif 47 [Bombyx mori] (1e-157) |
GeneOntology terms   | GO:0005214 structural constituent of chitin-based cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL18416 |
Nucleotide sequence:
ATGTTTTGGACATACGGTGCGGTCGTCCTGAGTGTGTGTGTGTCCGTCCGGACTCAAGTT
ATTCCGGGCAAGTCCAGGACTAACGATGGTGACTTTAATAATGTTAACGCTGATGGATCT
TTCGATTTCGGGTACGCGAACAAAGATCGTGGTGGCAGCTACCACTTGGCGCAAGGTAGT
TCGAAAGGACTAGTGGGAGGACGGTTCGGTGCCAGAGAGCCTGGTACTGACGAAGTCAAA
GAAACTATTTATACTGCTGGTCCTAGAGGATTTCGCGCTAAGGGCCCTAACGTTCACCGA
AAGATAGATCTGGATCAGCGGCCGCGAGGTCCCATTGGAAATAAAGACGATCCCTATTTC
GATCCTAACGAAGATCCCAGTTACGCGTACAAAATAGAGACGAGAACTTACTCCAAGAAT
GAGAATGCTGACAGCAGAGGCGATGTCAAAGGTCATTACTCCTTCGTGGATGATATCGGA
GAACGGCACGACGTGTCCTATATAGCTGGGCGCGATACAGGTTTTCATGTATCCTCAGCC
AACCCTGACGTGCCCAGTCTTATTGGGTCACCTTTTCACCGAGCGCCTCTGGTTAGAGGG
GAGAGCAAATCTCGAGGACGTACTGCGGTACAGAGAGGATTAGATGGTTCATATAGATTC
ATTTCTGCCGGACCTGACCAACGGCGAACGGAAAGTAGTGACTCACACGGTAACGTTAGA
GGATCCTACACATTTTTAGATGACAAAGGTGTACAAAGGACAGTACATTATATAGCGGGG
CCAGGTATTGGATATCGGATTGTGAAGAACAGCAACGATCCCTTCATTCCTTCCTATTTT
CCTACTATACCTAGTCCTTATGATCCGGCATTTAACGCAGGAGGTAGCGCTGGGGCACCG
GCTTTCGCTCCCAGCGACGAAGGTAGCGATGATGTCTTCAAAGGACCCGATGGCACTGCA
GCGTCAGGGCACGTTAAGCCACCTCCTTTCCCTCCCTCCGAATCAGAGAGACCTAGTAAC
ACTGGTAGCATATCGACAGTAATTGAAACTCCTGATGATTCAGATAACAACGGTTACGAC
ACAGGTCCAAGTTTTAATCAGGAACCTGATAATACAGATCTGGGTTACGTCGACGAGGAC
GCTTCTAGTTTTCAAAATCAAAAGCCAACTCAAGCACCAGGCAAACCGTGGCGCCCAGAG
AACAAACCGTATAGACCGTCGAAGAAACCGTTTAGACCGATAAAACCTTATCAAGAAACG
AATAATAATAACGACAATTTCGCTGGTGATAAAAGTAAACCAGAATTTGCTGTAGGATTC
AATATACATCACACGAAACCTGGCACAACGATCATTAGGAATATAGGTGAAGAATACTTC
GGCATACCTCCTGGTGTGTCCGTCCGCGCTCATGTACAGAGCATCGATCTTTATCCCTTC
GGTTCCAAACCAATTTCACCATCAGAGGCTCTGGAAAATGACCAAACATAG
Protein sequence:
MFWTYGAVVLSVCVSVRTQVIPGKSRTNDGDFNNVNADGSFDFGYANKDRGGSYHLAQGS
SKGLVGGRFGAREPGTDEVKETIYTAGPRGFRAKGPNVHRKIDLDQRPRGPIGNKDDPYF
DPNEDPSYAYKIETRTYSKNENADSRGDVKGHYSFVDDIGERHDVSYIAGRDTGFHVSSA
NPDVPSLIGSPFHRAPLVRGESKSRGRTAVQRGLDGSYRFISAGPDQRRTESSDSHGNVR
GSYTFLDDKGVQRTVHYIAGPGIGYRIVKNSNDPFIPSYFPTIPSPYDPAFNAGGSAGAP
AFAPSDEGSDDVFKGPDGTAASGHVKPPPFPPSESERPSNTGSISTVIETPDDSDNNGYD
TGPSFNQEPDNTDLGYVDEDASSFQNQKPTQAPGKPWRPENKPYRPSKKPFRPIKPYQET
NNNNDNFAGDKSKPEFAVGFNIHHTKPGTTIIRNIGEEYFGIPPGVSVRAHVQSIDLYPF
GSKPISPSEALENDQT
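The CDS length and the two sequences in this record can be cross-checked against each other: 1491 nt is 3 × 497 codons, so a complete CDS of that length should encode 496 residues plus one stop codon. The following is a minimal sketch of such a consistency check, not part of the annotation pipeline; the function name `check_cds` and the returned keys are hypothetical, and it assumes the standard stop codons (TAA, TAG, TGA).

```python
# Hypothetical helper: sanity-check a CDS against its listed protein sequence.
# Assumes the standard genetic code's stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> dict:
    """Return basic consistency checks for a CDS / protein pair."""
    # Remove line breaks and spaces so wrapped sequence text can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return {
        "cds_length": len(cds),
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # One codon per residue, plus the terminal stop codon.
        "protein_length_matches": len(cds) // 3 - 1 == len(protein),
    }


# Toy usage with a made-up three-codon CDS (Met, Phe, stop).
toy_report = check_cds("ATGTTT\nTAG", "MF")
```

Pasting the nucleotide and protein sequences of this record into `check_cds` in place of the toy strings would run the same checks on DPOGS210120.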