New model in OGS2.0 | DPOGS202945  |
---|---|
Genomic Position | scaffold222:+ 88718-114215 |
CDS Length (bp) | 2955 |
Paired RNAseq reads   | 282 |
Single RNAseq reads   | 710 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005824 (1e-22) |
Best Drosophila hit   | corin (0.0) |
Best Human hit | atrial natriuretic peptide-converting enzyme (1e-51) |
Best NR hit (blastp)   | PREDICTED: similar to transmembrane protease, serine [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to transmembrane protease, serine [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis; GO:0016020 membrane; GO:0005044 scavenger receptor activity |
InterPro families    | IPR020067 Frizzled domain; IPR002172 Low-density lipoprotein (LDL) receptor class A repeat; IPR017448 Speract/scavenger receptor-related; IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001190 Speract/scavenger receptor; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site; IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL12868 |
Nucleotide sequence:
ATGCACCTTCAACCCGACGCGCCCGGAGAAGATTCAGACTCCGTGCGAGGACGCTATGGC
CGGGCCGGCAGCCTCGCGGTGACTCCGGGAGAGAAACTGATGGAACTACTTCGTCCCAAC
GGTGGGACACCACGACGACATTCGACAGCTGCATGCGCGCCTCCACAACCAAAACCCTCA
GGTTTTGTGTACTGCCCTTCAGACGCCCTACCGTACTGTCCACCTTCTAGGTTTGCACCA
TCTCATCAAGTCAACAAGCCCAGAGCCGGTTCCGTGTATTGTCCGTCTGATGCGCTGCCC
TATTGCCCGCCTTCGGGGTTCGGTTCTCCGCATCCAGTCAAAAAGCTACCACCACCACAA
GTTCAACAGCCGCCTCAACCACCACCAGTACCTCAGAGGACCGCGCCGGTGGTAACACCC
TTGCCGGTTCCACCACCAGCACCACCGCGACGATCATCACCACAACCCCCACGCAGAGTA
CCACCTCCGACTCCCCCCCGGCCAAATGCAACGACAACAGATCCTAACGGCAACAAGCGA
CCACCGCCCGCACCACAGAAGGAACAAGCACCACCTAAACTCGACTGTAACAGAAACCCT
CAACCACCAGCGATAAGAAGGTCTCCACAAAAAATGCCGGACACGCCATACACACCGGCA
CCTGTACCGCAAAATAATCAAATGAAACAATCACCGAGTTCCTCATCTAATATTAGCGTT
AGACAAGATTCAAATGTATCCTCTGACTCATTTAGTCAGACTTCCTCTCCCTCTTATACT
ACAAAAACTATGGAAGCCCCGTTGCTACCGCACCAGCATGTCAATAAGAGCCTGAACGCC
AAGATAGCTCGGGGACTATTACTCAAGGAGCAACAGGAAAAAGAAGCTGGGAACTCGTCC
ATTACAAAGAGCATGTCCACTCCGGCTTCACTGCAAACCATAGTGAGATTTCAAAACGGA
AGCAACATGTCTCTACATCACAGAATGCTTCGTGATATGCGCGGCACCAATACCGACACC
TCCGCTCACAAGTTTCGTCTGATGCAGCTCGCTCTCAACGCGGTCATGCTGCTAGCTATC
ACCGGAGCTTTGTTCGCATACTTTAAAGCGAACCCCGCTGTTCAGTATGTGTCTCAAGCT
GTGAATTTGTCGGCGGCGGTGACGTGGCCGACGCCAACGGAGCCCCCGGGTGCAAGGAAC
CCAGCGCCAGGAGTCTGTCTGCCTGTCATCGTTACCTTCTGTCACCAGCACCGCATCTCC
TACAATTTCACTGTTTTTCCAAATTACATTGGACACTTCGGACAGAGGGATGCACAACAG
GACTTGGAAATTTACGATGCAGTTGTGGACGTTCGCTGCTATGAACTGACGGCGCTTTTT
CTGTGTTCTTTATTTGTCCCCAAGTGCGGTCCACTGGGTCACATGGTCCGACCCTGTCGG
AGTCTGTGTCAAGAAACAATGCGTCGTTGTGGTTTCTTTCTGGAAGTATTCGGTCTTTCG
ATGCCGGACTACCTTCAATGTGAAATTTTTCCGGAGTCTACTGACAGAGACGTGTGCCTT
GGGAATAGAGAAGTAAAGGAGGCGCGATTTAGAGCTGCAAAACCAGTGTGTCCAACTGGT
TTCCAATGCGACATGAACCGCTGCATCCCCCACGACTGGCGCTGTGATGGACACGTGGAC
TGCGCTGATCGCTCCGACGAACTGAACTGTCGCGTCTGTAAAAGAAGCGGAGACGTCCAC
TGCGGAAACCAGAGATGTATCTCACAGGCACATCTGTGTGACGGGAAGATAGACTGCCCC
TGGGGACAGGATGAAAGGAACTGCTTACGTCTAAGTAAGGCAAACGGTGACGTTGGTCGT
GGGGAGCTCCAAGTATATCGCGCCGCCAACCAGTCCTGGTTCCCAGCTTGCATCACCACC
TTGGACGATCCAACTGCTTCCAAACTGTGCTCAATGCTCGGATACTCTTGGGTTAACAAG
AGTACGGTGGTGGGTGGTGCGGGCGCACGAGCCGGAAGCGGGGTGCAGGCTCATGGCGTG
GCGCAGTCCTACCGAGCCTTCCAACGGAGCGAGGGAGGTCTATTGCGGGAGCTCAAAGAC
TGTCGCCACGACTCAGCCAGGGTGCACCTCGTCTGTGATCATTACGAATGTGGTAGACGG
CGAACTATAGGTGGAGGTACCAAGCGAATAGTCGGTGGTGTGGAGGCATCTCCGGGTGAC
TGGCCTTTCCTCGCTGCGATTCTAGGAGGTCCTGAAGAGGTGTTCTACTGCGCTGGAGTA
CTGGTCGCCGATCAGTGGGTCCTGACCGCTTCCCATTGTGTTGGCAATCATTCAGATGTG
AATGGGTGGACCATACAGTTAGGTATAACACGGCGTCGCTCGCACGCATACTACGGTCAA
AAAGTTAAGGTCCGAAGAGTTGTTCCTCATCCGCTATACAACGTGGGCGTCGCTCATGAC
AATGATATAGCACTGTTCCAGCTGGCAGTCCGCGTCCGTTACCACGAACAGTTATCACCT
GTTTGTCTTCCACCACCGCGGCCAGCTTTAGCTCCCGGTACACTCTGCACGGTCATCGGA
TGGGGAAAGCGAGATGACAAAGATATGTCCGAGTATGAGCCAGCTGTGAATGAGGTGGAG
GTTCCTGTACTACAGCGTGAGTTGTGTAACCAGTGGCTGGAGCACCGAGACCTGAATGTC
ACTGAGGGAATGATCTGTGCTGGGTACCCCGAGGGCGGCAAGGACGCCTGCCAGGGAGAC
TCCGGCGGACCCCTGTTATGTCGTGATCCTAGTGAGCCGTCTCGCTGGGTGGTGGGTGGA
ATCGTGTCGTGGGGTATCAAATGCGCCCATCCTCGCCTACCCGGGGTTTATGCTTACGTC
CCGCGATACGTGCCCTGGATACTAGCCCAGATAAGACTTTACAACGATGACTCCGCCGTC
TCCGAGGACATCTAA
Protein sequence:
MHLQPDAPGEDSDSVRGRYGRAGSLAVTPGEKLMELLRPNGGTPRRHSTAACAPPQPKPS
GFVYCPSDALPYCPPSRFAPSHQVNKPRAGSVYCPSDALPYCPPSGFGSPHPVKKLPPPQ
VQQPPQPPPVPQRTAPVVTPLPVPPPAPPRRSSPQPPRRVPPPTPPRPNATTTDPNGNKR
PPPAPQKEQAPPKLDCNRNPQPPAIRRSPQKMPDTPYTPAPVPQNNQMKQSPSSSSNISV
RQDSNVSSDSFSQTSSPSYTTKTMEAPLLPHQHVNKSLNAKIARGLLLKEQQEKEAGNSS
ITKSMSTPASLQTIVRFQNGSNMSLHHRMLRDMRGTNTDTSAHKFRLMQLALNAVMLLAI
TGALFAYFKANPAVQYVSQAVNLSAAVTWPTPTEPPGARNPAPGVCLPVIVTFCHQHRIS
YNFTVFPNYIGHFGQRDAQQDLEIYDAVVDVRCYELTALFLCSLFVPKCGPLGHMVRPCR
SLCQETMRRCGFFLEVFGLSMPDYLQCEIFPESTDRDVCLGNREVKEARFRAAKPVCPTG
FQCDMNRCIPHDWRCDGHVDCADRSDELNCRVCKRSGDVHCGNQRCISQAHLCDGKIDCP
WGQDERNCLRLSKANGDVGRGELQVYRAANQSWFPACITTLDDPTASKLCSMLGYSWVNK
STVVGGAGARAGSGVQAHGVAQSYRAFQRSEGGLLRELKDCRHDSARVHLVCDHYECGRR
RTIGGGTKRIVGGVEASPGDWPFLAAILGGPEEVFYCAGVLVADQWVLTASHCVGNHSDV
NGWTIQLGITRRRSHAYYGQKVKVRRVVPHPLYNVGVAHDNDIALFQLAVRVRYHEQLSP
VCLPPPRPALAPGTLCTVIGWGKRDDKDMSEYEPAVNEVEVPVLQRELCNQWLEHRDLNV
TEGMICAGYPEGGKDACQGDSGGPLLCRDPSEPSRWVVGGIVSWGIKCAHPRLPGVYAYV
PRYVPWILAQIRLYNDDSAVSEDI
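
The nucleotide and protein listings above can be cross-checked against each other. The following is a minimal sketch, assuming the CDS is translated with the standard genetic code (NCBI translation table 1); the names `CODON_TABLE`, `translate`, and `expected_protein_length` are illustrative and not part of the record. Only the first and last lines of the nucleotide listing are used as excerpts here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: this record's CDS uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS that ends in a single stop codon."""
    return cds_length // 3 - 1


# Excerpts copied from the nucleotide listing above (first and last lines).
first_line = "ATGCACCTTCAACCCGACGCGCCCGGAGAAGATTCAGACTCCGTGCGAGGACGCTATGGC"
last_line = "TCCGAGGACATCTAA"

print(translate(first_line))          # MHLQPDAPGEDSDSVRGRYG
print(translate(last_line))           # SEDI*
print(expected_protein_length(2955))  # 984
```

The translated excerpts agree with the start (`MHLQPDAPGEDSDSVRGRYG`) and end (`SEDI`) of the protein sequence, and a 2955-nucleotide CDS with one stop codon implies 984 residues, which appears to match the protein listing. Passing the full nucleotide listing to `translate` would allow a residue-by-residue comparison.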