New model in OGS2.0 | DPOGS209867  |
---|---|
Genomic Position | scaffold1934:31714-54180 (- strand) |
CDS Length | 1281 |
Paired RNAseq reads   | 1488 |
Single RNAseq reads   | 4376 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004421 (2e-70) |
Best Drosophila hit   | CG3074, isoform A (4e-93) |
Best Human hit | tubulointerstitial nephritis antigen-like (2e-74) |
Best NR hit (blastp)   | tubulointerstitial nephritis antigen [Bombyx mori] (6e-114) |
Best NR hit (blastx)   | tubulointerstitial nephritis antigen [Bombyx mori] (3e-121) |
GeneOntology terms    | GO:0004197 cysteine-type endopeptidase activity; GO:0005044 scavenger receptor activity; GO:0006508 proteolysis; GO:0006955 immune response; GO:0030247 polysaccharide binding; GO:0042600 chorion |
InterPro families    | IPR000668 Peptidase C1A, papain C-terminal; IPR001212 Somatomedin B domain; IPR000169 Peptidase, cysteine peptidase active site; IPR013128 Peptidase C1A, papain |
Orthology group | MCL13890 |
Nucleotide sequence:
ATGATGAATGTATTATATTCGCTGCTGTTCTGTGGGCTGGTGAGCGTAACCACGGCGTAC
TGGCGCCCGGGACTGCCGCCGGGGCCGTACTGCGGCATCAACAACCAGTGCTGCACCGAT
CGCAAGGATGACTGCTCACATCGGATACGAGATACTCTATGTTACTGCGATCAATTCTGT
AACCGCACTCACGACGATTGCTGTCCTGATTACGAGGAAGTTTGCCTCGGGAAACCCTCG
AACATTCTGGAGCCGTGCAAGCATAACGGCAAGTTGTATTTCAAGGGGGACAAGCGAATG
GATAACTGTAATACATGTGAATGCGTCCAAGACCCCTACACCAACCAGCCTCAGTGGAGC
TGTGAACGCGACGCGTGCATCATCAGTGATGACGTCATCTATGGTGTCAACAGAGGGAAC
AGCTGGAGGGCCTACAACTATACTCAGTTCTACGGAAAGAAGCTGAGAGACGGGATCATA
TATAAGCTAGGTACAATGCCATTGAGCCACGAAACAAGACGCATGGGTCCGATCAGATAC
GACAAGGATATACCGTATCCAAGGGATTTCGACGCTCGCCGTCGCTGGCCAAACTTCATC
TCGCCGGTGTTAGATCAAGGATGGTGTGGCTCGGACTGGGCGGTCACCATAGCTACCGTC
GCCTCTGATAGGTTCGCGATCCAGTCGAACGGCGCTGAGAGGATGGTGCTGTCCCCTCAG
GTGCTTCTCTCTTGTAACATCAGACGTCAGCAGGGCTGTCGCGGCGGCCATATCGACGTA
GCCTGGAACTTCGCCAGAGGCCACGGTCTCGTCGACGAGGAATGCTTTCCTTACAAAGCC
GCGACTACCAGCTGTCCCTTCAGACCGAAAGCTAATCTCATAGAGGACGGTTGCCGGCCT
CCGGTCCGCCAAAGAACCTCCCGCTACAAGGTGGGTCCTCCCGGGAAACTCGCCACAGAA
AACGACATCATGTACGACATCATGGAGTCCGGGCCAGTCCACGCCGTAATGACGGTACAC
CAGGACTTTTTCCACTACCACGATGGTATCTACCGCCGTTCTCCGTACGGTGACAACACC
CTTCAGGGCTTGCATAGCGTCAGGATCGTGGGTTGGGGAGAAGACAGAGGAGATAAATAC
TGGGTGGTTGCCAACAGCTGGGGCTGTGACTGGGGTGAGAACGGCTACTTCCGTATAGCG
CGTGGCAGCAACGAGTCCGGCATCGAGTCGTTCGTGGTCACCGTCCTCAGTGACGTCACT
GAGGCCTACCAAAAGAAATAA
Protein sequence:
MMNVLYSLLFCGLVSVTTAYWRPGLPPGPYCGINNQCCTDRKDDCSHRIRDTLCYCDQFC
NRTHDDCCPDYEEVCLGKPSNILEPCKHNGKLYFKGDKRMDNCNTCECVQDPYTNQPQWS
CERDACIISDDVIYGVNRGNSWRAYNYTQFYGKKLRDGIIYKLGTMPLSHETRRMGPIRY
DKDIPYPRDFDARRRWPNFISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQ
VLLSCNIRRQQGCRGGHIDVAWNFARGHGLVDEECFPYKAATTSCPFRPKANLIEDGCRP
PVRQRTSRYKVGPPGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNT
LQGLHSVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVLSDVT
EAYQKK
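
The record's CDS length (1281 nt), nucleotide sequence, and protein sequence can be cross-checked against each other. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` and the variable names are illustrative and not part of the source record. Only the first and last lines of the nucleotide sequence are translated here, which is valid because every full line holds 60 nt and so starts in frame.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in the conventional T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence, copied from the record.
first_line = "ATGATGAATGTATTATATTCGCTGCTGTTCTGTGGGCTGGTGAGCGTAACCACGGCGTAC"
last_line = "GAGGCCTACCAAAAGAAATAA"

n_terminus = translate(first_line)  # should match the start of the protein
c_terminus = translate(last_line)   # should match the end, plus a stop

# 1281 nt = 427 codons: 426 residues plus one stop codon.
cds_length = 1281
expected_residues = cds_length // 3 - 1
```

Under these assumptions, `n_terminus` corresponds to the first 20 residues of the listed protein and `c_terminus` to its last six residues followed by the stop, while `expected_residues` agrees with the 426 residues in the protein sequence.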