DPGLEAN17469 in OGS1.0

New model in OGS2.0DPOGS207948 
Genomic Positionscaffold340:- 134588-152786
See gene structure
CDS Length5085
Paired RNAseq reads  7003
Single RNAseq reads  20214
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000388 (7e-25)
Best Drosophila hit  CG10625, isoform H (1e-25)
Best Human hitND
Best NR hit (blastp)  collagen [Bombyx mori] (6e-62)
Best NR hit (blastx)  collagen [Bombyx mori] (4e-48)
GeneOntology terms  GO:0042302 structural constituent of cuticle
InterPro families  IPR000618 Insect cuticle protein
Orthology groupMCL17977

Nucleotide sequence:

ATGGCTCCGGGACTCCTCTTTTATTTTCTGGTGTCTCTGGTAGTAGCGCATGCTACCGAA
GATGCTAATGAAACGAGAGCATTGATTCAAGAAGACGCCCACGAGGCTCGAGATTACGGA
ACGTATGGCGATAACAGTAAGGAGACCGTAGTTAATATTGAGGACGATGAAAAAACACAG
TACTATGAAACCAACTACGACACTAGTGCATACGGATTCGGTTACGATGTAGGCCCCAAC
GGTCAATTTCACCATGAAAATAAAGGCCCCGATGGTGTGACTTACGGTTGCTACGGCTAC
GTTGACCCCGACGGTTACCTTCGCGTCACACACTACGTCGCTGATAGCCACGGCTACAGA
ATTATAGAACCCGAAAAACCTGTGGAAGTTTTCCCAGAGGAAAACCACGAATACGATGAA
AATTTGGTGACTCCGAGTCCTCTTCCTGGTCAGATAGTACCATGGAAGAAGCTATACATG
CCACGAGGATGTGGTAAAACTCCTGGTGGAATTCCTCCTCGCCCTCTACCAAAACCGAAA
CCGACAAGCCCACCTCGTCCACCACCAGATAGCGCTGGACAAAACAGCAACCCAAAACCT
GGTGTTGTGTATCCAGGAGGACAAGGTGGTTACTATCCTGGCACGCCTGGTACCTCTGGT
TCCCCTGGTACACCCGGCAGCCCGGGTATACCTGGCAGACCCGGTAGCCCTGGAACCCCA
GGCAGTCCTGGCGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAATTCA
GGCTATTACCCCGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCAGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCAGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCCGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGA
AGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCT
GGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCA
GGCTATTACCCCGGTCAAGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGT
AGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCA
GCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCC
GGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGT
CCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGT
GGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCG
TATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGA
CCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCT
GGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGA
CCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCAAAT
GGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGT
GGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGT
GGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCT
CCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCT
AGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGC
CCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGT
GGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCA
AATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTC
AGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCT
GGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAATAGA
CCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCTTTA
GGACCTGGTGGACAAACTGGACCTGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCA
CCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGA
GGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATAT
TATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCG
GGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCA
CCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAAT
GGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAA
ATTGGACCCGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGA
AGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCA
GGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGC
CCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATT
GGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCA
CCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAAT
GGACCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCA
AATGGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGTAAA
CATTTTGCAACGTTGTCTCAGAAAACTTTTACAAAATGTGGAAAATATATCTTTAACTAC
CCAATCCGTCTGCTTACAGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGT
AGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCA
GCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCC
GGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGT
CCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGT
GGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCG
TATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGA
CCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCT
GGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGA
CCCGGCGGACCCAATGGACCGAACATACCCAATGGACCCAATGGACCGAGTGGGCCAAAT
GGACCAAGTGGACCTAATCAACCGTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGT
GGTTCTTACCCGGTTAGCCCAGGAGCACCCGGTAGCCCAGGAGCACCAGGTGGTCCCGGT
GGACCCGGCGGTCCCGGTGGTCCCGGAGGACCTGGTGGACCAAATGGACCCGGTCAAGTT
CCCGATTCCGTCGGTTCACCTTCCGGAACAGGTGTTCAGCCACCAGTTTACCCCCCGCCC
AGTCAGCAGCCGCCATTCCCTATATATGTTATACCATATCCATTGCCGATCGTGCCAAGC
CCCGCATCATGTCCCTGTTATCTCTTGAATCCGGGCCAAAATAATCAACAATCTTCTCCA
CAAATGCAGTATAACCAATACCCTTACCAAGGGTACCAACCTTATGGCATTATAGGGTTT
ATACCAGTCGTATTCGTTCCCAACTGTCCTGGAAATAATACTGGTATGCAAACTGCGCAA
CAAAACTTCCCTAATGCTGTATCTGTTCCCTATAATTGTGGCCAATGTCAAGCGTCGAAT
GACATTTACCGGTACTTCGGAAGATTAAATGGAGGACGTAGCATTGAAATGAACGACTTA
AAAGAAATCAAATCTCTACCAGAACTGGAGAATCTCTTGAAGAATCAAATTAAACCTCCA
AGAAAGAGTTTAAGGAGGATAGCCGTGAATGCCAGAGTTCTGGACGACATGACGAACGAC
AAGAAAAATAAAAAGAATTTGATAATTAAGGCGAAAGAAGATTAA

Protein sequence:

MAPGLLFYFLVSLVVAHATEDANETRALIQEDAHEARDYGTYGDNSKETVVNIEDDEKTQ
YYETNYDTSAYGFGYDVGPNGQFHHENKGPDGVTYGCYGYVDPDGYLRVTHYVADSHGYR
IIEPEKPVEVFPEENHEYDENLVTPSPLPGQIVPWKKLYMPRGCGKTPGGIPPRPLPKPK
PTSPPRPPPDSAGQNSNPKPGVVYPGGQGGYYPGTPGTSGSPGTPGSPGIPGRPGSPGTP
GSPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGP
GGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGP
AGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPG
GPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAP
GGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQG
GSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYP
SGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGP
NQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNR
PNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPG
GPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGS
PGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPGQGGYYPSGPGSPGQPG
SPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQI
GPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGP
NGPSGPNQPLGPGGQTGPGKHFATLSQKTFTKCGKYIFNYPIRLLTGQGGSYPVRPGAPG
SPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGS
PGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIG
PEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNIPNGPNGPSGPN
GPSGPNQPLGPGGQTGPGQGGSYPVSPGAPGSPGAPGGPGGPGGPGGPGGPGGPNGPGQV
PDSVGSPSGTGVQPPVYPPPSQQPPFPIYVIPYPLPIVPSPASCPCYLLNPGQNNQQSSP
QMQYNQYPYQGYQPYGIIGFIPVVFVPNCPGNNTGMQTAQQNFPNAVSVPYNCGQCQASN
DIYRYFGRLNGGRSIEMNDLKEIKSLPELENLLKNQIKPPRKSLRRIAVNARVLDDMTND
KKNKKNLIIKAKED