New model in OGS2.0 | DPOGS207948  |
---|---|
Genomic Position | scaffold340:- 134588-152786 |
See gene structure | |
CDS Length | 5085 |
Paired RNAseq reads   | 7003 |
Single RNAseq reads   | 20214 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000388 (7e-25) |
Best Drosophila hit   | CG10625, isoform H (1e-25) |
Best Human hit | ND |
Best NR hit (blastp)   | collagen [Bombyx mori] (6e-62) |
Best NR hit (blastx)   | collagen [Bombyx mori] (4e-48) |
GeneOntology terms   | GO:0042302 structural constituent of cuticle |
InterPro families   | IPR000618 Insect cuticle protein |
Orthology group | MCL17977 |
Nucleotide sequence:
ATGGCTCCGGGACTCCTCTTTTATTTTCTGGTGTCTCTGGTAGTAGCGCATGCTACCGAA
GATGCTAATGAAACGAGAGCATTGATTCAAGAAGACGCCCACGAGGCTCGAGATTACGGA
ACGTATGGCGATAACAGTAAGGAGACCGTAGTTAATATTGAGGACGATGAAAAAACACAG
TACTATGAAACCAACTACGACACTAGTGCATACGGATTCGGTTACGATGTAGGCCCCAAC
GGTCAATTTCACCATGAAAATAAAGGCCCCGATGGTGTGACTTACGGTTGCTACGGCTAC
GTTGACCCCGACGGTTACCTTCGCGTCACACACTACGTCGCTGATAGCCACGGCTACAGA
ATTATAGAACCCGAAAAACCTGTGGAAGTTTTCCCAGAGGAAAACCACGAATACGATGAA
AATTTGGTGACTCCGAGTCCTCTTCCTGGTCAGATAGTACCATGGAAGAAGCTATACATG
CCACGAGGATGTGGTAAAACTCCTGGTGGAATTCCTCCTCGCCCTCTACCAAAACCGAAA
CCGACAAGCCCACCTCGTCCACCACCAGATAGCGCTGGACAAAACAGCAACCCAAAACCT
GGTGTTGTGTATCCAGGAGGACAAGGTGGTTACTATCCTGGCACGCCTGGTACCTCTGGT
TCCCCTGGTACACCCGGCAGCCCGGGTATACCTGGCAGACCCGGTAGCCCTGGAACCCCA
GGCAGTCCTGGCGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAATTCA
GGCTATTACCCCGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCAGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCAGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCCGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCCGGTCAAGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGT
AGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCA
GCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCC
GGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGT
CCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGT
GGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCG
TATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGA
CCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCT
GGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGA
CCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCAAAT
GGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGT
GGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGT
GGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCT
CCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCT
AGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGC
CCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGT
GGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCA
AATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTC
AGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCT
GGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAATAGA
CCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCTTTA
GGACCTGGTGGACAAACTGGACCTGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCA
CCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGA
GGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATAT
TATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCG
GGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCA
CCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAAT
GGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAA
ATTGGACCCGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGA
AGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCA
GGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGC
CCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATT
GGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCA
CCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAAT
GGACCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCA
AATGGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGTAAA
CATTTTGCAACGTTGTCTCAGAAAACTTTTACAAAATGTGGAAAATATATCTTTAACTAC
CCAATCCGTCTGCTTACAGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGT
AGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCA
GCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCC
GGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGT
CCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGT
GGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCG
TATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGA
CCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCT
GGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGA
CCCGGCGGACCCAATGGACCGAACATACCCAATGGACCCAATGGACCGAGTGGGCCAAAT
GGACCAAGTGGACCTAATCAACCGTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGT
GGTTCTTACCCGGTTAGCCCAGGAGCACCCGGTAGCCCAGGAGCACCAGGTGGTCCCGGT
GGACCCGGCGGTCCCGGTGGTCCCGGAGGACCTGGTGGACCAAATGGACCCGGTCAAGTT
CCCGATTCCGTCGGTTCACCTTCCGGAACAGGTGTTCAGCCACCAGTTTACCCCCCGCCC
AGTCAGCAGCCGCCATTCCCTATATATGTTATACCATATCCATTGCCGATCGTGCCAAGC
CCCGCATCATGTCCCTGTTATCTCTTGAATCCGGGCCAAAATAATCAACAATCTTCTCCA
CAAATGCAGTATAACCAATACCCTTACCAAGGGTACCAACCTTATGGCATTATAGGGTTT
ATACCAGTCGTATTCGTTCCCAACTGTCCTGGAAATAATACTGGTATGCAAACTGCGCAA
CAAAACTTCCCTAATGCTGTATCTGTTCCCTATAATTGTGGCCAATGTCAAGCGTCGAAT
GACATTTACCGGTACTTCGGAAGATTAAATGGAGGACGTAGCATTGAAATGAACGACTTA
AAAGAAATCAAATCTCTACCAGAACTGGAGAATCTCTTGAAGAATCAAATTAAACCTCCA
AGAAAGAGTTTAAGGAGGATAGCCGTGAATGCCAGAGTTCTGGACGACATGACGAACGAC
AAGAAAAATAAAAAGAATTTGATAATTAAGGCGAAAGAAGATTAA
Protein sequence:
MAPGLLFYFLVSLVVAHATEDANETRALIQEDAHEARDYGTYGDNSKETVVNIEDDEKTQ
YYETNYDTSAYGFGYDVGPNGQFHHENKGPDGVTYGCYGYVDPDGYLRVTHYVADSHGYR
IIEPEKPVEVFPEENHEYDENLVTPSPLPGQIVPWKKLYMPRGCGKTPGGIPPRPLPKPK
PTSPPRPPPDSAGQNSNPKPGVVYPGGQGGYYPGTPGTSGSPGTPGSPGIPGRPGSPGTP
GSPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGP
AGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPG
GPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAP
GGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQG
GSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYP
SGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGP
NQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNR
PNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPG
GPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGS
PGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPGQGGYYPSGPGSPGQPG
SPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQI
GPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGP
NGPSGPNQPLGPGGQTGPGKHFATLSQKTFTKCGKYIFNYPIRLLTGQGGSYPVRPGAPG
SPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGS
PGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIG
PEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNIPNGPNGPSGPN
GPSGPNQPLGPGGQTGPGQGGSYPVSPGAPGSPGAPGGPGGPGGPGGPGGPGGPNGPGQV
PDSVGSPSGTGVQPPVYPPPSQQPPFPIYVIPYPLPIVPSPASCPCYLLNPGQNNQQSSP
QMQYNQYPYQGYQPYGIIGFIPVVFVPNCPGNNTGMQTAQQNFPNAVSVPYNCGQCQASN
DIYRYFGRLNGGRSIEMNDLKEIKSLPELENLLKNQIKPPRKSLRRIAVNARVLDDMTND
KKNKKNLIIKAKED