DPGLEAN08126 in OGS1.0

New model in OGS2.0DPOGS215619 
Genomic Positionscaffold300:- 146624-153707
See gene structure
CDS Length4686
Paired RNAseq reads  871
Single RNAseq reads  2213
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003679 (2e-70)
Best Drosophila hit  nudel (9e-38)
Best Human hitenteropeptidase precursor (1e-40)
Best NR hit (blastp)  serine protease P54 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  serine protease P54 [Tribolium castaneum] (5e-179)
GeneOntology terms





  
GO:0016020 membrane
GO:0016021 integral to membrane
GO:0005903 brush border
GO:0008233 peptidase activity
GO:0004252 serine-type endopeptidase activity
GO:0005044 scavenger receptor activity
GO:0006508 proteolysis
InterPro families



  
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
Orthology groupMCL18512

Nucleotide sequence:

ATGGGTGCCTTAAGGCCTGATAATGGAGAACCAATAATGACATTTGAGGGAACATTTCGC
GTTACTCGCGGGGACGTTTACGGTGGTGTCCCAGGAAGTCCGTCCTGGCGAGAGCGTGCT
CGCAGATACAGCGCGTCCCTTAAACAGGTGTACGCAGCTCCATCGCCCTTAAGACAGGCC
TTCGCCGGAGCAATAGTAACAGGGTTCGGTGACAGACGCCTTGACGTTCACTTCAAACTA
TACTTAGATAGAAGAAAAATACCAAGTTCTCTTACAAATATAGAGGAATCTTTGAAAAAA
ATATTAATACAAGATTTGATATCCAAACATAGTGCATTTGGACAAATAAAAATAGATGCA
TCTAGTATAATTATAAAAAGGGACTTAGAACACACATACCACTCGGAACAGTATGTCAAG
GAAGCCATGAATGAAACTGTAACTACACCTAATCCCAAGGTGTTATCACCTCAAAACGCG
AAAGACAAAACTCTCCAAAGCCGAATTGGTGTTGTTCGAAAAACGACAGTTAAACCTAAA
CAAACTTTGAGAAAAGATGATCCAGATGAACCCGATATAGATACAGAGAACATTCCAGTA
GTACAAGGCTCTTTCCAAATAACAAAGACAGAGGCGGATATAACGGAGAATAAACATAAT
CCAAGCAAAACTAATCCCAGTAGAGGCGATGAAAAAAACAACCATAAAACACCATCTACA
CCAAAAACAGCGACATCTACAAACACGAAATCGCCTACAACTTATACAACAACAACCACC
AGCACAAGAAAGTTCCCACCAAGTACACTGAAACCAAAACCAATCACAGTAAAGTCTTCT
TTGAATATGAAGCCAAAAGTGGATATAAATAACAGTTTCAGAGAAGTAAGTACTGCCAAG
CCTTCAACAACAACGAGTACAATGAAAACAACAACTACATCTAAAAGAACAACAACTACT
TCTATGGCAACAACTACTCAAAATGTGTCTCAAATCTTATATGATCTTCTTACAAATGAA
AATCACGATAAAGATTTGCCAAAAATTGATACTTTATTCACAGTACCTCATGTCATTGAT
AATGAACCGTGGAGGCCTATTACAAGACCTTATTATGAAACAACCAGTAAACAATCTACT
TTACCTATAATTGAACAAAATGCAGAAGATCGAATAGGTGTTGCAGAAGTGGTTGAAGAT
ATTTCTCTTTTAGAATCTATGTTAACTCCCAGTCCACCGGTGAAACACAAAGATATAACA
ACACGTAGACCATCAGGGCTGTACAACGTAGATCCCCATCTAGCGGCAGATGTATACATC
CCCAATCCAGTTTATACTAGTTTTACTATTCCAGCATTCATTCCACCGCTCAAAGATATG
GAAACATTGGGATCAAGCTATCCAAAACCACATCCTTTACCAGTTGACAAAATAAGTGGT
GCAATAGAAGTGGTACCTGAATCAAATTTAAACATGGATGACAATGACGGTAGACCTATA
ATAAGGCCTCCCAAAGAAAAAACTTCTTCCGTTAGTATAAATGTATTTCAATTAGATAGT
AATACAGAAAAAGTTTCAATTGAGGGTGCATCAATTGTAAAGAAACAAAATATTACTTCC
ACAAGCACAACAATGAAAACAACTACAATTGTTGATCGAAATACAAGAATTTCTACAACA
GATATTCCTTTGAGTACACCTTCTAGCACTACTACAATAGGACATAAAAAGGAAAGTACT
ACGAAGAGACCAAATAATAAAGTATCAATAATACCATCGACCGGAACTCCTCACCATACA
TGGGAATTAGTCAACACCTCAACAAATAATAACGACACTTCTAATAAAGTATCTCCCCAA
AAGTATTACAATGATACATTGCAAGCTATAATTGTTAAAAACGATGCTTCTCTCAATACG
ACAACAAGATTCCCGAGCAAATTTTCTATTTTAAGAAATTTAACTGACCTAATTAAACGA
TATTCTCAAAATAGTACGTTAAAGCCGTCTGAAATCAAAATTGAAACTACTACCGTTCAT
TCCAAAACAACAACTTCAGTAAAATTGGAGGACATTGGAGAAATTGTTCGACACACACCA
GTAGAAATGACTGGATCAGTGGAAGTTATTTCCGAGGAAGACCTAGAGACAACAACAGCT
AGAATCATAACATTGATGCCTGCTAAATCCAATCTAGGAGTGAATCGACCTCTACGACCC
CGTCCAAAAATTGATCCTCAAGTAACAGAGGATAGTGAACGTAGTTTCAATGATCCATCA
GATGTAAATAATTCTGCAGATGATCTTGAATTGAGTTCTTATAATTACACAGAATTATTG
TCAGAAGCGTCTGAAATGATATCGAGCTCAGCAAGTATGTACAATGATACAAATATTGAT
GTAGTTGAAACTAATGAAGATCCCAAAGCTCTTCGATCATCGGGTATACCGGCCAATCCT
GTGAGTGGAAATAGACTGCCTAAGTCAAACGATTTAAAAAATCCAAATGTTGAAAACTTT
AAAGAGAGTAATATTCCTGAAGGGACGTACAAGGTATCTTATCATGTAACGGGCAGCGTT
AGCAGTAAACAGGCTAATAAAACTAAACACCTTCCTGCCTATGAGCTAGCTCTAGAACCG
GATGTAGTGCTAGAAATACCATCAAATCAAAGTAGTACATTAACCCTAGATAAATTAAAG
CAACTAGCTAGCCTTGCTACAATAACAAATTTTAACAACAGCACATTTTTCCGTGCTCCT
GGTGGTGTAATTTCGACCAAAGCGATCCCGTCAAGTTATACATTAAATCAAGCTGGATTT
AAAATACTCACAAAAACATTTAACAAAGCAACGCCCGCGAAACAAGAGGAAAACAGCTTT
AATCAACCAGAAAAACCTATTAGTAAACCAATTCTTACAAAAAAACAAAATAAACCAGAA
TTTGAAAAAGAAATAAAAGTTGAAGAATTCTGTGATAACTCCACGTCGTTTTCATGCTCC
AGCGGAGCTTGCATTCCCCTGACGTCTCGGTGTAATAGATTGATAGATTGTCCCTCCGGG
GAGGACGAGAAGGCATGCTCGTGTGCAGATTACTTACGAGCTGATTTTTCACAATCTAAG
ATTTGTGACGGTTTCGTTGATTGTTGGGATTACTCGGATGAAAATAAATGTGACTGGTGT
AAAGAAGGCCAATTTGTATGTGCGAACGCTCGTCAATGTATAGAAATGAATAAAGTTTGC
GATGGAAATCCTGACTGTCCACTCGGCGACGACGAGAAAAGCTGTGTAGCTTTAGGTGAC
GACATTGACAGCAACGAAGTCATTCCGTATAACGAGGAAGGTTTTGTTATGGTTCGTAAG
CGCGGTGTTTGGGGCAGACTCTGCGTGGAGAGTTTCAATGATGTGGTCACTCAAGCACAT
AGTTCACTTAAGTTACCAGACCTTGGTAGGGCCGTCTGTCGTGCAATGACCTTTCAAGAT
TCGCCGTGGGTTCGCGAGGCACGTGAGGGTAGAAAAGTGAGCACGATAGGTTACTGGGAA
GTTTGGCACAATGTACACGCTCGAGCCGCGGACACGCGGTTGACTTTCAAACGATCTAGT
TGTACGAGACATCGCGCTCTGCGTGTCAGATGCGAGGACTTGGACTGCGGAATACGACCT
CACGCTGATGCACAGCAACCCAGGGGTGTAACTTACGAGCGAGTGCGGTGGGGCAGGGTG
GTAGGTGGTGGAGGAGCGGCGGCAGGCGCCTGGCCCTGGCAGGCAGCTTTATACCGCGAT
GGAGACTTCCAGTGTGGCGCTACCCTTATCTCAACGCAGTGGCTTCTATCAGCAAGTCAT
TGTTTCTATCAAGCTACTGAAGCCCATTGGGTTGCACGACTCGGAGCGTTGCGGAGAGGA
GCCTGGCCTCGTGGTCCTTGGGAGCGAGTGACACGCGTTCGTCAAGTAGTGTTACATCCG
AAGTATGCACCACGTGGATTTAAAAATGACATAGCGTTATTGCGAGTTGACCCTCTGCCT
CTGCACGCTCGTCTGCGGCCGGCCTGTCTGCCACCGTCGCGTTCACAACCGCCAGCCGGA
CACCATTGTACCGTGGTTGGTTGGGGACAATTGTATGAACATGAACGGGTATTCCCGGAC
ACGCTCCAGGAGGTGGAGTTGCCGGTGATATCCACAGCAGAGTGTCGTCGCCGCACTCGT
CTGCTGCCCCTCTACAGGATCACTGAAGATATGTTCTGTGCCGGCTATGAACGCGGCGGA
CGCGACGCTTGTCTTGGAGACTCGGGAGGGCCGCTTATGTGCCAAGAGGACGATAGATGG
TATATTTACGGTGTAACCAGCAATGGCTATGGATGTGCCAGGGCGAACCGACCTGGCGTT
TACACGAAGGTCTCCAACTACATCGAGTGGATTGACAGCGTCATGACGACTCACACGACG
ACAACGAACAAAACTATATCGAACAGCGAAGAAAACTCCAAAGATTTCTACGCGGATTTA
GAAACGGCAGAGAACAAGAGAGTTCTTCATAGGACCTACGATACTTGTAGAGGTTTCCGA
TGCCCTCTTGGGGAATGCCTACCACAGTCTAGCGTCTGCAATGGCTTCCTTGAATGTTCG
GACGGCAGTGACGAATGGCAATGCGATAATTTTATGACGAATTCAAGCTGGTACAGTCCC
GTCTAA

Protein sequence:

MGALRPDNGEPIMTFEGTFRVTRGDVYGGVPGSPSWRERARRYSASLKQVYAAPSPLRQA
FAGAIVTGFGDRRLDVHFKLYLDRRKIPSSLTNIEESLKKILIQDLISKHSAFGQIKIDA
SSIIIKRDLEHTYHSEQYVKEAMNETVTTPNPKVLSPQNAKDKTLQSRIGVVRKTTVKPK
QTLRKDDPDEPDIDTENIPVVQGSFQITKTEADITENKHNPSKTNPSRGDEKNNHKTPST
PKTATSTNTKSPTTYTTTTTSTRKFPPSTLKPKPITVKSSLNMKPKVDINNSFREVSTAK
PSTTTSTMKTTTTSKRTTTTSMATTTQNVSQILYDLLTNENHDKDLPKIDTLFTVPHVID
NEPWRPITRPYYETTSKQSTLPIIEQNAEDRIGVAEVVEDISLLESMLTPSPPVKHKDIT
TRRPSGLYNVDPHLAADVYIPNPVYTSFTIPAFIPPLKDMETLGSSYPKPHPLPVDKISG
AIEVVPESNLNMDDNDGRPIIRPPKEKTSSVSINVFQLDSNTEKVSIEGASIVKKQNITS
TSTTMKTTTIVDRNTRISTTDIPLSTPSSTTTIGHKKESTTKRPNNKVSIIPSTGTPHHT
WELVNTSTNNNDTSNKVSPQKYYNDTLQAIIVKNDASLNTTTRFPSKFSILRNLTDLIKR
YSQNSTLKPSEIKIETTTVHSKTTTSVKLEDIGEIVRHTPVEMTGSVEVISEEDLETTTA
RIITLMPAKSNLGVNRPLRPRPKIDPQVTEDSERSFNDPSDVNNSADDLELSSYNYTELL
SEASEMISSSASMYNDTNIDVVETNEDPKALRSSGIPANPVSGNRLPKSNDLKNPNVENF
KESNIPEGTYKVSYHVTGSVSSKQANKTKHLPAYELALEPDVVLEIPSNQSSTLTLDKLK
QLASLATITNFNNSTFFRAPGGVISTKAIPSSYTLNQAGFKILTKTFNKATPAKQEENSF
NQPEKPISKPILTKKQNKPEFEKEIKVEEFCDNSTSFSCSSGACIPLTSRCNRLIDCPSG
EDEKACSCADYLRADFSQSKICDGFVDCWDYSDENKCDWCKEGQFVCANARQCIEMNKVC
DGNPDCPLGDDEKSCVALGDDIDSNEVIPYNEEGFVMVRKRGVWGRLCVESFNDVVTQAH
SSLKLPDLGRAVCRAMTFQDSPWVREAREGRKVSTIGYWEVWHNVHARAADTRLTFKRSS
CTRHRALRVRCEDLDCGIRPHADAQQPRGVTYERVRWGRVVGGGGAAAGAWPWQAALYRD
GDFQCGATLISTQWLLSASHCFYQATEAHWVARLGALRRGAWPRGPWERVTRVRQVVLHP
KYAPRGFKNDIALLRVDPLPLHARLRPACLPPSRSQPPAGHHCTVVGWGQLYEHERVFPD
TLQEVELPVISTAECRRRTRLLPLYRITEDMFCAGYERGGRDACLGDSGGPLMCQEDDRW
YIYGVTSNGYGCARANRPGVYTKVSNYIEWIDSVMTTHTTTTNKTISNSEENSKDFYADL
ETAENKRVLHRTYDTCRGFRCPLGECLPQSSVCNGFLECSDGSDEWQCDNFMTNSSWYSP
V