DPGLEAN04907 in OGS1.0

New model in OGS2.0DPOGS209867 
Genomic Positionscaffold1934:- 31714-54180
See gene structure
CDS Length1281
Paired RNAseq reads  1488
Single RNAseq reads  4376
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004421 (2e-70)
Best Drosophila hit  CG3074, isoform A (4e-93)
Best Human hittubulointerstitial nephritis antigen-like (2e-74)
Best NR hit (blastp)  tubulointerstitial nephritis antigen [Bombyx mori] (6e-114)
Best NR hit (blastx)  tubulointerstitial nephritis antigen [Bombyx mori] (3e-121)
GeneOntology terms




  
GO:0004197 cysteine-type endopeptidase activity
GO:0005044 scavenger receptor activity
GO:0006508 proteolysis
GO:0006955 immune response
GO:0030247 polysaccharide binding
GO:0042600 chorion
InterPro families


  
IPR000668 Peptidase C1A, papain C-terminal
IPR001212 Somatomedin B domain
IPR000169 Peptidase, cysteine peptidase active site
IPR013128 Peptidase C1A, papain
Orthology groupMCL13890

Nucleotide sequence:

ATGATGAATGTATTATATTCGCTGCTGTTCTGTGGGCTGGTGAGCGTAACCACGGCGTAC
TGGCGCCCGGGACTGCCGCCGGGGCCGTACTGCGGCATCAACAACCAGTGCTGCACCGAT
CGCAAGGATGACTGCTCACATCGGATACGAGATACTCTATGTTACTGCGATCAATTCTGT
AACCGCACTCACGACGATTGCTGTCCTGATTACGAGGAAGTTTGCCTCGGGAAACCCTCG
AACATTCTGGAGCCGTGCAAGCATAACGGCAAGTTGTATTTCAAGGGGGACAAGCGAATG
GATAACTGTAATACATGTGAATGCGTCCAAGACCCCTACACCAACCAGCCTCAGTGGAGC
TGTGAACGCGACGCGTGCATCATCAGTGATGACGTCATCTATGGTGTCAACAGAGGGAAC
AGCTGGAGGGCCTACAACTATACTCAGTTCTACGGAAAGAAGCTGAGAGACGGGATCATA
TATAAGCTAGGTACAATGCCATTGAGCCACGAAACAAGACGCATGGGTCCGATCAGATAC
GACAAGGATATACCGTATCCAAGGGATTTCGACGCTCGCCGTCGCTGGCCAAACTTCATC
TCGCCGGTGTTAGATCAAGGATGGTGTGGCTCGGACTGGGCGGTCACCATAGCTACCGTC
GCCTCTGATAGGTTCGCGATCCAGTCGAACGGCGCTGAGAGGATGGTGCTGTCCCCTCAG
GTGCTTCTCTCTTGTAACATCAGACGTCAGCAGGGCTGTCGCGGCGGCCATATCGACGTA
GCCTGGAACTTCGCCAGAGGCCACGGTCTCGTCGACGAGGAATGCTTTCCTTACAAAGCC
GCGACTACCAGCTGTCCCTTCAGACCGAAAGCTAATCTCATAGAGGACGGTTGCCGGCCT
CCGGTCCGCCAAAGAACCTCCCGCTACAAGGTGGGTCCTCCCGGGAAACTCGCCACAGAA
AACGACATCATGTACGACATCATGGAGTCCGGGCCAGTCCACGCCGTAATGACGGTACAC
CAGGACTTTTTCCACTACCACGATGGTATCTACCGCCGTTCTCCGTACGGTGACAACACC
CTTCAGGGCTTGCATAGCGTCAGGATCGTGGGTTGGGGAGAAGACAGAGGAGATAAATAC
TGGGTGGTTGCCAACAGCTGGGGCTGTGACTGGGGTGAGAACGGCTACTTCCGTATAGCG
CGTGGCAGCAACGAGTCCGGCATCGAGTCGTTCGTGGTCACCGTCCTCAGTGACGTCACT
GAGGCCTACCAAAAGAAATAA

Protein sequence:

MMNVLYSLLFCGLVSVTTAYWRPGLPPGPYCGINNQCCTDRKDDCSHRIRDTLCYCDQFC
NRTHDDCCPDYEEVCLGKPSNILEPCKHNGKLYFKGDKRMDNCNTCECVQDPYTNQPQWS
CERDACIISDDVIYGVNRGNSWRAYNYTQFYGKKLRDGIIYKLGTMPLSHETRRMGPIRY
DKDIPYPRDFDARRRWPNFISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQ
VLLSCNIRRQQGCRGGHIDVAWNFARGHGLVDEECFPYKAATTSCPFRPKANLIEDGCRP
PVRQRTSRYKVGPPGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNT
LQGLHSVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVLSDVT
EAYQKK