New model in OGS2.0 | DPOGS215549  |
---|---|
Genomic Position | scaffold10531:- 11090-15197 |
See gene structure | |
CDS Length | 1554 |
Paired RNAseq reads   | 1539 |
Single RNAseq reads   | 3604 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002284 (3e-150) |
Best Drosophila hit   | CG4914 (5e-08) |
Best Human hit | ND |
Best NR hit (blastp)   | seminal fluid protein HACP010 [Heliconius melpomene] (0.0) |
Best NR hit (blastx)   | seminal fluid protein HACP010 [Heliconius melpomene] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR009003 Peptidase cysteine/serine, trypsin-like |
Orthology group | MCL39651 |
Nucleotide sequence:
ATGATAGCAGACGGTTTACATCGCAAGCTGATGGTGGATTCTGTGAGACCAGCCAAGAAT
GAGCTCGGACCTGGTCCAGTGAAAGCACAAAAAATAGTTCAGAGAGCGAACGAGAATCCT
GAGGGGGACAAGGTCTTCGCTGGTAGTGGGTTCATATTACGACCAGAAAATGACGGGAAA
CCAGCTTCAGTCTATCTTCAACTAACGAAAGAATCAGCGAAAGCTCATAAATATCGCGGC
TGGTTATGTGGTGGGGTCATATTACATCAGTACTATGTGCTAACATCAGCGGCGTGTGTT
GAAGATGCTGACCATTTCTATATTGTCTCTGGAACAACTAAATACGTTGACAGCTTCGAT
TATAAAAATGACGATTGCGTTTGCAAACACAGACGGAAAGTTGTGTGGAAATGTATTCCT
AAAAATTATAAATTCGATTTCCAAGATAGCATAAAATGGTCATCTAACGATATTGCGATC
GTAAAAGTAGATAGGCCATTTAAGCTCGGAATCACAGAGAAGGATTGCGAGTTTGCCACT
GATCTTGTTTGTTATAATAATATAAGTAGGGAACTTGAAAAGGCTGGAACTAAAGGCTAT
ATTGCTGGCTGGGGTAGCGGCAACAACTTCAGGGAGGGTGTGTATCGTCGTCAAAATGGC
CACATACCAACAAATTCGAAATACCTTCAAGAGGCTAAAGTTTGCGTCATGGACAATGAG
CAGTGCGCTAAAAAGTGGGCCCAGCGTTTTAGGAGCATCATCACACAATACATGATCTGC
ACCAAGGATGTGATGAAACGTCTCAGCGAGATTTGCGATAAAAAATACGCGAATTGTACC
GACGTAGAGTCAAGACGCATAGGCATGGACGACGATTTGAACGATGATCAAACAAACTAC
CGTCTAGACCTTGGACGCCACCTCCGAGATCCCGATGACTATACAGCAAGGCGTACTACC
AAACAGGGAGGGTTTTGTGAGAACGACCATGGAGGTCCTCTAGTAGTGAAATATCAAGGA
AAAGAAAGAGTCATCGGTGTGATATCAGCCTGCAAGATAGATCCAAAAACCCACAGCTGC
CACGGACCCTTCCTGTACACCAGCGTCTTCATGAACAGACAGTTTATATCTTGTGCTATC
AACAAGGATGTGGAAGAAAATTGCCGAAGAGTGTTCCGTACTGGTATAACACACGAAGAA
TGGTCTGTCAATTGGGATGACAAAGCCGACGACGATGAGATTCAGGATAGAGCAAGCGAA
GAGAAACGAAGCAAGCCTGATAATAATAGTGATGAAGAAAATGCCGAAGTTTTAAGCAAA
AAAGACAAAGGGGAGAAAAAAGGTTCTGATTCCTCATCGAGTAGTGAGAGTAAAGAAGAT
GTAGCAAAATCAAGTACGAAAAAAACCGAAAAGGCACACAAGAAAAAACATAACAAGCAT
TCGAAGTCAAAAAGAAATACGAGGGAGTCGAAACGGGAGGGAGTTTTAAGGGGACACGAG
GAAGTTTTCTATGAGTCCAGCAGAGATGACAACGTCAACGAACCTGATGAATGA
Protein sequence:
MIADGLHRKLMVDSVRPAKNELGPGPVKAQKIVQRANENPEGDKVFAGSGFILRPENDGK
PASVYLQLTKESAKAHKYRGWLCGGVILHQYYVLTSAACVEDADHFYIVSGTTKYVDSFD
YKNDDCVCKHRRKVVWKCIPKNYKFDFQDSIKWSSNDIAIVKVDRPFKLGITEKDCEFAT
DLVCYNNISRELEKAGTKGYIAGWGSGNNFREGVYRRQNGHIPTNSKYLQEAKVCVMDNE
QCAKKWAQRFRSIITQYMICTKDVMKRLSEICDKKYANCTDVESRRIGMDDDLNDDQTNY
RLDLGRHLRDPDDYTARRTTKQGGFCENDHGGPLVVKYQGKERVIGVISACKIDPKTHSC
HGPFLYTSVFMNRQFISCAINKDVEENCRRVFRTGITHEEWSVNWDDKADDDEIQDRASE
EKRSKPDNNSDEENAEVLSKKDKGEKKGSDSSSSSESKEDVAKSSTKKTEKAHKKKHNKH
SKSKRNTRESKREGVLRGHEEVFYESSRDDNVNEPDE