New model in OGS2.0 | DPOGS215619  |
---|---|
Genomic Position | scaffold300:146624-153707 (- strand) |
CDS Length | 4686 |
Paired RNAseq reads   | 871 |
Single RNAseq reads   | 2213 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003679 (2e-70) |
Best Drosophila hit   | nudel (9e-38) |
Best Human hit | enteropeptidase precursor (1e-40) |
Best NR hit (blastp)   | serine protease P54 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | serine protease P54 [Tribolium castaneum] (5e-179) |
GeneOntology terms    | GO:0016020 membrane; GO:0016021 integral to membrane; GO:0005903 brush border; GO:0008233 peptidase activity; GO:0004252 serine-type endopeptidase activity; GO:0005044 scavenger receptor activity; GO:0006508 proteolysis |
InterPro families    | IPR002172 Low-density lipoprotein (LDL) receptor class A repeat; IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site; IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site |
Orthology group | MCL18512 |
Nucleotide sequence:
ATGGGTGCCTTAAGGCCTGATAATGGAGAACCAATAATGACATTTGAGGGAACATTTCGC
GTTACTCGCGGGGACGTTTACGGTGGTGTCCCAGGAAGTCCGTCCTGGCGAGAGCGTGCT
CGCAGATACAGCGCGTCCCTTAAACAGGTGTACGCAGCTCCATCGCCCTTAAGACAGGCC
TTCGCCGGAGCAATAGTAACAGGGTTCGGTGACAGACGCCTTGACGTTCACTTCAAACTA
TACTTAGATAGAAGAAAAATACCAAGTTCTCTTACAAATATAGAGGAATCTTTGAAAAAA
ATATTAATACAAGATTTGATATCCAAACATAGTGCATTTGGACAAATAAAAATAGATGCA
TCTAGTATAATTATAAAAAGGGACTTAGAACACACATACCACTCGGAACAGTATGTCAAG
GAAGCCATGAATGAAACTGTAACTACACCTAATCCCAAGGTGTTATCACCTCAAAACGCG
AAAGACAAAACTCTCCAAAGCCGAATTGGTGTTGTTCGAAAAACGACAGTTAAACCTAAA
CAAACTTTGAGAAAAGATGATCCAGATGAACCCGATATAGATACAGAGAACATTCCAGTA
GTACAAGGCTCTTTCCAAATAACAAAGACAGAGGCGGATATAACGGAGAATAAACATAAT
CCAAGCAAAACTAATCCCAGTAGAGGCGATGAAAAAAACAACCATAAAACACCATCTACA
CCAAAAACAGCGACATCTACAAACACGAAATCGCCTACAACTTATACAACAACAACCACC
AGCACAAGAAAGTTCCCACCAAGTACACTGAAACCAAAACCAATCACAGTAAAGTCTTCT
TTGAATATGAAGCCAAAAGTGGATATAAATAACAGTTTCAGAGAAGTAAGTACTGCCAAG
CCTTCAACAACAACGAGTACAATGAAAACAACAACTACATCTAAAAGAACAACAACTACT
TCTATGGCAACAACTACTCAAAATGTGTCTCAAATCTTATATGATCTTCTTACAAATGAA
AATCACGATAAAGATTTGCCAAAAATTGATACTTTATTCACAGTACCTCATGTCATTGAT
AATGAACCGTGGAGGCCTATTACAAGACCTTATTATGAAACAACCAGTAAACAATCTACT
TTACCTATAATTGAACAAAATGCAGAAGATCGAATAGGTGTTGCAGAAGTGGTTGAAGAT
ATTTCTCTTTTAGAATCTATGTTAACTCCCAGTCCACCGGTGAAACACAAAGATATAACA
ACACGTAGACCATCAGGGCTGTACAACGTAGATCCCCATCTAGCGGCAGATGTATACATC
CCCAATCCAGTTTATACTAGTTTTACTATTCCAGCATTCATTCCACCGCTCAAAGATATG
GAAACATTGGGATCAAGCTATCCAAAACCACATCCTTTACCAGTTGACAAAATAAGTGGT
GCAATAGAAGTGGTACCTGAATCAAATTTAAACATGGATGACAATGACGGTAGACCTATA
ATAAGGCCTCCCAAAGAAAAAACTTCTTCCGTTAGTATAAATGTATTTCAATTAGATAGT
AATACAGAAAAAGTTTCAATTGAGGGTGCATCAATTGTAAAGAAACAAAATATTACTTCC
ACAAGCACAACAATGAAAACAACTACAATTGTTGATCGAAATACAAGAATTTCTACAACA
GATATTCCTTTGAGTACACCTTCTAGCACTACTACAATAGGACATAAAAAGGAAAGTACT
ACGAAGAGACCAAATAATAAAGTATCAATAATACCATCGACCGGAACTCCTCACCATACA
TGGGAATTAGTCAACACCTCAACAAATAATAACGACACTTCTAATAAAGTATCTCCCCAA
AAGTATTACAATGATACATTGCAAGCTATAATTGTTAAAAACGATGCTTCTCTCAATACG
ACAACAAGATTCCCGAGCAAATTTTCTATTTTAAGAAATTTAACTGACCTAATTAAACGA
TATTCTCAAAATAGTACGTTAAAGCCGTCTGAAATCAAAATTGAAACTACTACCGTTCAT
TCCAAAACAACAACTTCAGTAAAATTGGAGGACATTGGAGAAATTGTTCGACACACACCA
GTAGAAATGACTGGATCAGTGGAAGTTATTTCCGAGGAAGACCTAGAGACAACAACAGCT
AGAATCATAACATTGATGCCTGCTAAATCCAATCTAGGAGTGAATCGACCTCTACGACCC
CGTCCAAAAATTGATCCTCAAGTAACAGAGGATAGTGAACGTAGTTTCAATGATCCATCA
GATGTAAATAATTCTGCAGATGATCTTGAATTGAGTTCTTATAATTACACAGAATTATTG
TCAGAAGCGTCTGAAATGATATCGAGCTCAGCAAGTATGTACAATGATACAAATATTGAT
GTAGTTGAAACTAATGAAGATCCCAAAGCTCTTCGATCATCGGGTATACCGGCCAATCCT
GTGAGTGGAAATAGACTGCCTAAGTCAAACGATTTAAAAAATCCAAATGTTGAAAACTTT
AAAGAGAGTAATATTCCTGAAGGGACGTACAAGGTATCTTATCATGTAACGGGCAGCGTT
AGCAGTAAACAGGCTAATAAAACTAAACACCTTCCTGCCTATGAGCTAGCTCTAGAACCG
GATGTAGTGCTAGAAATACCATCAAATCAAAGTAGTACATTAACCCTAGATAAATTAAAG
CAACTAGCTAGCCTTGCTACAATAACAAATTTTAACAACAGCACATTTTTCCGTGCTCCT
GGTGGTGTAATTTCGACCAAAGCGATCCCGTCAAGTTATACATTAAATCAAGCTGGATTT
AAAATACTCACAAAAACATTTAACAAAGCAACGCCCGCGAAACAAGAGGAAAACAGCTTT
AATCAACCAGAAAAACCTATTAGTAAACCAATTCTTACAAAAAAACAAAATAAACCAGAA
TTTGAAAAAGAAATAAAAGTTGAAGAATTCTGTGATAACTCCACGTCGTTTTCATGCTCC
AGCGGAGCTTGCATTCCCCTGACGTCTCGGTGTAATAGATTGATAGATTGTCCCTCCGGG
GAGGACGAGAAGGCATGCTCGTGTGCAGATTACTTACGAGCTGATTTTTCACAATCTAAG
ATTTGTGACGGTTTCGTTGATTGTTGGGATTACTCGGATGAAAATAAATGTGACTGGTGT
AAAGAAGGCCAATTTGTATGTGCGAACGCTCGTCAATGTATAGAAATGAATAAAGTTTGC
GATGGAAATCCTGACTGTCCACTCGGCGACGACGAGAAAAGCTGTGTAGCTTTAGGTGAC
GACATTGACAGCAACGAAGTCATTCCGTATAACGAGGAAGGTTTTGTTATGGTTCGTAAG
CGCGGTGTTTGGGGCAGACTCTGCGTGGAGAGTTTCAATGATGTGGTCACTCAAGCACAT
AGTTCACTTAAGTTACCAGACCTTGGTAGGGCCGTCTGTCGTGCAATGACCTTTCAAGAT
TCGCCGTGGGTTCGCGAGGCACGTGAGGGTAGAAAAGTGAGCACGATAGGTTACTGGGAA
GTTTGGCACAATGTACACGCTCGAGCCGCGGACACGCGGTTGACTTTCAAACGATCTAGT
TGTACGAGACATCGCGCTCTGCGTGTCAGATGCGAGGACTTGGACTGCGGAATACGACCT
CACGCTGATGCACAGCAACCCAGGGGTGTAACTTACGAGCGAGTGCGGTGGGGCAGGGTG
GTAGGTGGTGGAGGAGCGGCGGCAGGCGCCTGGCCCTGGCAGGCAGCTTTATACCGCGAT
GGAGACTTCCAGTGTGGCGCTACCCTTATCTCAACGCAGTGGCTTCTATCAGCAAGTCAT
TGTTTCTATCAAGCTACTGAAGCCCATTGGGTTGCACGACTCGGAGCGTTGCGGAGAGGA
GCCTGGCCTCGTGGTCCTTGGGAGCGAGTGACACGCGTTCGTCAAGTAGTGTTACATCCG
AAGTATGCACCACGTGGATTTAAAAATGACATAGCGTTATTGCGAGTTGACCCTCTGCCT
CTGCACGCTCGTCTGCGGCCGGCCTGTCTGCCACCGTCGCGTTCACAACCGCCAGCCGGA
CACCATTGTACCGTGGTTGGTTGGGGACAATTGTATGAACATGAACGGGTATTCCCGGAC
ACGCTCCAGGAGGTGGAGTTGCCGGTGATATCCACAGCAGAGTGTCGTCGCCGCACTCGT
CTGCTGCCCCTCTACAGGATCACTGAAGATATGTTCTGTGCCGGCTATGAACGCGGCGGA
CGCGACGCTTGTCTTGGAGACTCGGGAGGGCCGCTTATGTGCCAAGAGGACGATAGATGG
TATATTTACGGTGTAACCAGCAATGGCTATGGATGTGCCAGGGCGAACCGACCTGGCGTT
TACACGAAGGTCTCCAACTACATCGAGTGGATTGACAGCGTCATGACGACTCACACGACG
ACAACGAACAAAACTATATCGAACAGCGAAGAAAACTCCAAAGATTTCTACGCGGATTTA
GAAACGGCAGAGAACAAGAGAGTTCTTCATAGGACCTACGATACTTGTAGAGGTTTCCGA
TGCCCTCTTGGGGAATGCCTACCACAGTCTAGCGTCTGCAATGGCTTCCTTGAATGTTCG
GACGGCAGTGACGAATGGCAATGCGATAATTTTATGACGAATTCAAGCTGGTACAGTCCC
GTCTAA
Protein sequence:
MGALRPDNGEPIMTFEGTFRVTRGDVYGGVPGSPSWRERARRYSASLKQVYAAPSPLRQA
FAGAIVTGFGDRRLDVHFKLYLDRRKIPSSLTNIEESLKKILIQDLISKHSAFGQIKIDA
SSIIIKRDLEHTYHSEQYVKEAMNETVTTPNPKVLSPQNAKDKTLQSRIGVVRKTTVKPK
QTLRKDDPDEPDIDTENIPVVQGSFQITKTEADITENKHNPSKTNPSRGDEKNNHKTPST
PKTATSTNTKSPTTYTTTTTSTRKFPPSTLKPKPITVKSSLNMKPKVDINNSFREVSTAK
PSTTTSTMKTTTTSKRTTTTSMATTTQNVSQILYDLLTNENHDKDLPKIDTLFTVPHVID
NEPWRPITRPYYETTSKQSTLPIIEQNAEDRIGVAEVVEDISLLESMLTPSPPVKHKDIT
TRRPSGLYNVDPHLAADVYIPNPVYTSFTIPAFIPPLKDMETLGSSYPKPHPLPVDKISG
AIEVVPESNLNMDDNDGRPIIRPPKEKTSSVSINVFQLDSNTEKVSIEGASIVKKQNITS
TSTTMKTTTIVDRNTRISTTDIPLSTPSSTTTIGHKKESTTKRPNNKVSIIPSTGTPHHT
WELVNTSTNNNDTSNKVSPQKYYNDTLQAIIVKNDASLNTTTRFPSKFSILRNLTDLIKR
YSQNSTLKPSEIKIETTTVHSKTTTSVKLEDIGEIVRHTPVEMTGSVEVISEEDLETTTA
RIITLMPAKSNLGVNRPLRPRPKIDPQVTEDSERSFNDPSDVNNSADDLELSSYNYTELL
SEASEMISSSASMYNDTNIDVVETNEDPKALRSSGIPANPVSGNRLPKSNDLKNPNVENF
KESNIPEGTYKVSYHVTGSVSSKQANKTKHLPAYELALEPDVVLEIPSNQSSTLTLDKLK
QLASLATITNFNNSTFFRAPGGVISTKAIPSSYTLNQAGFKILTKTFNKATPAKQEENSF
NQPEKPISKPILTKKQNKPEFEKEIKVEEFCDNSTSFSCSSGACIPLTSRCNRLIDCPSG
EDEKACSCADYLRADFSQSKICDGFVDCWDYSDENKCDWCKEGQFVCANARQCIEMNKVC
DGNPDCPLGDDEKSCVALGDDIDSNEVIPYNEEGFVMVRKRGVWGRLCVESFNDVVTQAH
SSLKLPDLGRAVCRAMTFQDSPWVREAREGRKVSTIGYWEVWHNVHARAADTRLTFKRSS
CTRHRALRVRCEDLDCGIRPHADAQQPRGVTYERVRWGRVVGGGGAAAGAWPWQAALYRD
GDFQCGATLISTQWLLSASHCFYQATEAHWVARLGALRRGAWPRGPWERVTRVRQVVLHP
KYAPRGFKNDIALLRVDPLPLHARLRPACLPPSRSQPPAGHHCTVVGWGQLYEHERVFPD
TLQEVELPVISTAECRRRTRLLPLYRITEDMFCAGYERGGRDACLGDSGGPLMCQEDDRW
YIYGVTSNGYGCARANRPGVYTKVSNYIEWIDSVMTTHTTTTNKTISNSEENSKDFYADL
ETAENKRVLHRTYDTCRGFRCPLGECLPQSSVCNGFLECSDGSDEWQCDNFMTNSSWYSP
V
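
Consistency check (illustrative): the record lists a CDS length of 4686 nt, and the nucleotide sequence ends in the stop codon TAA, so the translated protein should be 4686 / 3 - 1 = 1561 residues, which is consistent with the protein sequence shown above. The sketch below shows one way such a check could be scripted. The function name `expected_protein_length` is hypothetical and not part of any database tooling, and the check assumes a complete CDS that is in frame and terminated by a single standard stop codon.

```python
# Hypothetical helper: residue count implied by a complete, stop-terminated CDS.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def expected_protein_length(cds: str) -> int:
    """Return the number of residues encoded by a CDS, excluding the stop codon."""
    # Remove line breaks and other whitespace left over from FASTA-style wrapping.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("CDS does not end in a stop codon")
    # One codon per residue, minus the terminal stop codon.
    return len(cds) // 3 - 1


if __name__ == "__main__":
    # Toy input: the first three codons of the sequence above plus a stop codon.
    print(expected_protein_length("ATGGGTGCC\nTAA"))
```

Passing the full nucleotide sequence from this record to the function should give 1561 if the sequence was copied without loss.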