DPGLEAN07000 in OGS1.0

New model in OGS2.0DPOGS202945 
Genomic Positionscaffold222:+ 88718-114215
See gene structure
CDS Length2955
Paired RNAseq reads  282
Single RNAseq reads  710
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005824 (1e-22)
Best Drosophila hit  corin (0.0)
Best Human hitatrial natriuretic peptide-converting enzyme (1e-51)
Best NR hit (blastp)  PREDICTED: similar to transmembrane protease, serine [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to transmembrane protease, serine [Tribolium castaneum] (0.0)
GeneOntology terms


  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0016020 membrane
GO:0005044 scavenger receptor activity
InterPro families






  
IPR020067 Frizzled domain
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR017448 Speract/scavenger receptor-related
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001190 Speract/scavenger receptor
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL12868

Nucleotide sequence:

ATGCACCTTCAACCCGACGCGCCCGGAGAAGATTCAGACTCCGTGCGAGGACGCTATGGC
CGGGCCGGCAGCCTCGCGGTGACTCCGGGAGAGAAACTGATGGAACTACTTCGTCCCAAC
GGTGGGACACCACGACGACATTCGACAGCTGCATGCGCGCCTCCACAACCAAAACCCTCA
GGTTTTGTGTACTGCCCTTCAGACGCCCTACCGTACTGTCCACCTTCTAGGTTTGCACCA
TCTCATCAAGTCAACAAGCCCAGAGCCGGTTCCGTGTATTGTCCGTCTGATGCGCTGCCC
TATTGCCCGCCTTCGGGGTTCGGTTCTCCGCATCCAGTCAAAAAGCTACCACCACCACAA
GTTCAACAGCCGCCTCAACCACCACCAGTACCTCAGAGGACCGCGCCGGTGGTAACACCC
TTGCCGGTTCCACCACCAGCACCACCGCGACGATCATCACCACAACCCCCACGCAGAGTA
CCACCTCCGACTCCCCCCCGGCCAAATGCAACGACAACAGATCCTAACGGCAACAAGCGA
CCACCGCCCGCACCACAGAAGGAACAAGCACCACCTAAACTCGACTGTAACAGAAACCCT
CAACCACCAGCGATAAGAAGGTCTCCACAAAAAATGCCGGACACGCCATACACACCGGCA
CCTGTACCGCAAAATAATCAAATGAAACAATCACCGAGTTCCTCATCTAATATTAGCGTT
AGACAAGATTCAAATGTATCCTCTGACTCATTTAGTCAGACTTCCTCTCCCTCTTATACT
ACAAAAACTATGGAAGCCCCGTTGCTACCGCACCAGCATGTCAATAAGAGCCTGAACGCC
AAGATAGCTCGGGGACTATTACTCAAGGAGCAACAGGAAAAAGAAGCTGGGAACTCGTCC
ATTACAAAGAGCATGTCCACTCCGGCTTCACTGCAAACCATAGTGAGATTTCAAAACGGA
AGCAACATGTCTCTACATCACAGAATGCTTCGTGATATGCGCGGCACCAATACCGACACC
TCCGCTCACAAGTTTCGTCTGATGCAGCTCGCTCTCAACGCGGTCATGCTGCTAGCTATC
ACCGGAGCTTTGTTCGCATACTTTAAAGCGAACCCCGCTGTTCAGTATGTGTCTCAAGCT
GTGAATTTGTCGGCGGCGGTGACGTGGCCGACGCCAACGGAGCCCCCGGGTGCAAGGAAC
CCAGCGCCAGGAGTCTGTCTGCCTGTCATCGTTACCTTCTGTCACCAGCACCGCATCTCC
TACAATTTCACTGTTTTTCCAAATTACATTGGACACTTCGGACAGAGGGATGCACAACAG
GACTTGGAAATTTACGATGCAGTTGTGGACGTTCGCTGCTATGAACTGACGGCGCTTTTT
CTGTGTTCTTTATTTGTCCCCAAGTGCGGTCCACTGGGTCACATGGTCCGACCCTGTCGG
AGTCTGTGTCAAGAAACAATGCGTCGTTGTGGTTTCTTTCTGGAAGTATTCGGTCTTTCG
ATGCCGGACTACCTTCAATGTGAAATTTTTCCGGAGTCTACTGACAGAGACGTGTGCCTT
GGGAATAGAGAAGTAAAGGAGGCGCGATTTAGAGCTGCAAAACCAGTGTGTCCAACTGGT
TTCCAATGCGACATGAACCGCTGCATCCCCCACGACTGGCGCTGTGATGGACACGTGGAC
TGCGCTGATCGCTCCGACGAACTGAACTGTCGCGTCTGTAAAAGAAGCGGAGACGTCCAC
TGCGGAAACCAGAGATGTATCTCACAGGCACATCTGTGTGACGGGAAGATAGACTGCCCC
TGGGGACAGGATGAAAGGAACTGCTTACGTCTAAGTAAGGCAAACGGTGACGTTGGTCGT
GGGGAGCTCCAAGTATATCGCGCCGCCAACCAGTCCTGGTTCCCAGCTTGCATCACCACC
TTGGACGATCCAACTGCTTCCAAACTGTGCTCAATGCTCGGATACTCTTGGGTTAACAAG
AGTACGGTGGTGGGTGGTGCGGGCGCACGAGCCGGAAGCGGGGTGCAGGCTCATGGCGTG
GCGCAGTCCTACCGAGCCTTCCAACGGAGCGAGGGAGGTCTATTGCGGGAGCTCAAAGAC
TGTCGCCACGACTCAGCCAGGGTGCACCTCGTCTGTGATCATTACGAATGTGGTAGACGG
CGAACTATAGGTGGAGGTACCAAGCGAATAGTCGGTGGTGTGGAGGCATCTCCGGGTGAC
TGGCCTTTCCTCGCTGCGATTCTAGGAGGTCCTGAAGAGGTGTTCTACTGCGCTGGAGTA
CTGGTCGCCGATCAGTGGGTCCTGACCGCTTCCCATTGTGTTGGCAATCATTCAGATGTG
AATGGGTGGACCATACAGTTAGGTATAACACGGCGTCGCTCGCACGCATACTACGGTCAA
AAAGTTAAGGTCCGAAGAGTTGTTCCTCATCCGCTATACAACGTGGGCGTCGCTCATGAC
AATGATATAGCACTGTTCCAGCTGGCAGTCCGCGTCCGTTACCACGAACAGTTATCACCT
GTTTGTCTTCCACCACCGCGGCCAGCTTTAGCTCCCGGTACACTCTGCACGGTCATCGGA
TGGGGAAAGCGAGATGACAAAGATATGTCCGAGTATGAGCCAGCTGTGAATGAGGTGGAG
GTTCCTGTACTACAGCGTGAGTTGTGTAACCAGTGGCTGGAGCACCGAGACCTGAATGTC
ACTGAGGGAATGATCTGTGCTGGGTACCCCGAGGGCGGCAAGGACGCCTGCCAGGGAGAC
TCCGGCGGACCCCTGTTATGTCGTGATCCTAGTGAGCCGTCTCGCTGGGTGGTGGGTGGA
ATCGTGTCGTGGGGTATCAAATGCGCCCATCCTCGCCTACCCGGGGTTTATGCTTACGTC
CCGCGATACGTGCCCTGGATACTAGCCCAGATAAGACTTTACAACGATGACTCCGCCGTC
TCCGAGGACATCTAA

Protein sequence:

MHLQPDAPGEDSDSVRGRYGRAGSLAVTPGEKLMELLRPNGGTPRRHSTAACAPPQPKPS
GFVYCPSDALPYCPPSRFAPSHQVNKPRAGSVYCPSDALPYCPPSGFGSPHPVKKLPPPQ
VQQPPQPPPVPQRTAPVVTPLPVPPPAPPRRSSPQPPRRVPPPTPPRPNATTTDPNGNKR
PPPAPQKEQAPPKLDCNRNPQPPAIRRSPQKMPDTPYTPAPVPQNNQMKQSPSSSSNISV
RQDSNVSSDSFSQTSSPSYTTKTMEAPLLPHQHVNKSLNAKIARGLLLKEQQEKEAGNSS
ITKSMSTPASLQTIVRFQNGSNMSLHHRMLRDMRGTNTDTSAHKFRLMQLALNAVMLLAI
TGALFAYFKANPAVQYVSQAVNLSAAVTWPTPTEPPGARNPAPGVCLPVIVTFCHQHRIS
YNFTVFPNYIGHFGQRDAQQDLEIYDAVVDVRCYELTALFLCSLFVPKCGPLGHMVRPCR
SLCQETMRRCGFFLEVFGLSMPDYLQCEIFPESTDRDVCLGNREVKEARFRAAKPVCPTG
FQCDMNRCIPHDWRCDGHVDCADRSDELNCRVCKRSGDVHCGNQRCISQAHLCDGKIDCP
WGQDERNCLRLSKANGDVGRGELQVYRAANQSWFPACITTLDDPTASKLCSMLGYSWVNK
STVVGGAGARAGSGVQAHGVAQSYRAFQRSEGGLLRELKDCRHDSARVHLVCDHYECGRR
RTIGGGTKRIVGGVEASPGDWPFLAAILGGPEEVFYCAGVLVADQWVLTASHCVGNHSDV
NGWTIQLGITRRRSHAYYGQKVKVRRVVPHPLYNVGVAHDNDIALFQLAVRVRYHEQLSP
VCLPPPRPALAPGTLCTVIGWGKRDDKDMSEYEPAVNEVEVPVLQRELCNQWLEHRDLNV
TEGMICAGYPEGGKDACQGDSGGPLLCRDPSEPSRWVVGGIVSWGIKCAHPRLPGVYAYV
PRYVPWILAQIRLYNDDSAVSEDI