New model in OGS2.0 | DPOGS205586  |
---|---|
Genomic Position | scaffold1797:+ 23502-37479 |
CDS Length | 2439 |
Paired RNAseq reads   | 210 |
Single RNAseq reads   | 670 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009750 (4e-18) |
Best Drosophila hit   | CG34350 (2e-122) |
Best Human hit | suppressor of tumorigenicity 14 protein (4e-38) |
Best NR hit (blastp)   | PREDICTED: similar to CG11824-PA [Apis mellifera] (5e-152) |
Best NR hit (blastx)   | PREDICTED: similar to CG11824-PA [Apis mellifera] (2e-123) |
GeneOntology terms | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families | IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL11109 |
Nucleotide sequence:
ATGGGTCCTCCCCTCAACACCTGGCCCTACAGCCACCATCGAGAACTTAAAAGCGCTAGT
GATAAAATGTTTATAAATCTGAGTGCGGTTAGCGCGGGACCCGTGATACTCAGCTTGGAC
CATTTGCGCGGCTCGGTGCCAATAGCTAGAAACATAAGGCATTTACCATGCATATCAAGG
AAAACTGCTCAAGAGGGACTCTGCATGTTCGCGATAGACTGTCTTAAAGCAAACGGAACT
CATTTGGGAACATGTATAGATAGGTTTTACTTCGGTTCCTGCTGCCAGCTGACAGATAAA
TCTGCTATACCAAATATAGCTGCGAACAATATTGAAGATAACGCCATAGACGGCGCTAAT
TTCGTACATCCGCTCATAGATCACAAAATCCATAGTCAAACGAGTAAAAAACCCGATAAA
ATAAACGTGAACACAGATAAAGAGAATGCAAAACCAATACAAGACGATATCGCTACTAAG
AAACCGTCAACAGTTCAGGATGTGACTAGCTCTGACAAAATGCAGTCGAAGTCTGATGAA
ACGGTATCTATTAACCATATAACAAGTGATATTAGGACTACAGAAGCTGTCACAGCTGTC
AAAGAAGCTACTACACAATCTATGAAAGTATCCGACAATGTAGTGACCGAGATTCCTGTT
AAACTGTCCACATTCCAAACAGTATCCGCTGCTGGTGACACAGCAACTGCAGCTCCGGAA
GCCAAACCCACAAAACCACAAACACCAGAAGAACCAGTCAAACCAACAAGGAAACCGGTG
AAACCAACGTATAAACCTAGACCATATAGACCCACGAATTTCACGAGACCTCCAATAAGT
CCTAAACCGAAACCGACGAAGCCTGTCGCATTATTCAATACAACAAGGAAGCCGCCTTAC
CGACCGCCCCCGAAACGTAACTCAACCAAAAAACCCCTGCCATCTCCACCGAGACTTAAT
ATAACCATCATACCTCAATCCACGAAACCATCAACGGAGAAAGCAACAGAGATACCAACA
GAAAAAAATACCGAGCGAATTACTACAGAACTCTTACCAGAACCAACAGAAGAGAAGGAG
AAAGAAACTGTCACTGAAAACGTCACGACAGTTGTCACTGAAAAGGTGACATTACAGGAT
GTCATTGAAAAAGATACCGTAAAACCAGCCACTGAAGGTGATAATCCTGTGACAACGAAG
CCTACCACTGACTACCCTCCCTTTGTAACTTGGACCAACGAGGCAAGTTCAAAAGCACCG
GCTACTGTCAGCGACGACTGGTCACCAATCACACCTCCTGACGGCTGGGTCTTAATATCT
ACCATGTCTCCCAAACCGGAAACAACAGTGAAACCACAAACAACAGAAACTGAAACAACA
CTAAAACCAACTTCGGTTCTAACTGAAGCGACTTCAATTTTAACATCAACTTCAACCACG
GCCTCGCCAACTTCAGAAATTGAGTTTGTTGTGAACGTGACATTGTCTCCTACAACACCC
ACTCCCACCTCGAGCATGGCGCCAACAACAAATGTCACCTCGGACGAAACACAAACAACA
ACAACAACAACACTAGCGGCTCTGACTACTATCGCGAACGTGACAACCACAGAGGCGACA
ACCACTATAACAACGACCACAGAATCTTACAATATGTCGAATTACAAAGAAGTATGCGGT
AGGCGCATGTGGCCTCAGGCGAGGATCGTTGGTGGGGCGAAGTCCGGCTTCGGGCAGTGG
CCCTGGCAGATATCGCTCCGACAGTACAGGACTTCGACCTACCTTCATAAGTGTGGGGCC
GCTTTATTGAACGAGAACTGGGCGATCACTGCCGCTCATTGTGTTGACAGGGTTCCTCCA
TCGGAGTTGTTGGTGCGTCTCGGTGAATATGATCTCGCGAACGAGGACGAGCCCTACGGC
TTCGCTGAGAGACGAGTGCAGATAGTAGCCAGCCATCCTCACTTCGATCCGGCTACCTTT
GAATATGATCTAGCTTTACTGAGGTTCTACGAGCCGGTTACATTCCAGCCGAACATTCTT
CCTGTGTGTGTCCCTGATGATGACGATTCTTACGTCGGACGAACAGCCTACGTCACGGGC
TGGGGACGTCTCTATGATGAGGGTCCCCTCCCGAGTGTGTTGCAGGAGGTGGAGGTGCCT
GTGATCAATAACACAGCCTGTGAGAGCATGTACCTCGCGGCTGGTTACAACGAGCACATA
CCGAACATATTCATTTGTGCCGGATGGAAGAAGGGAGGCTCGGACAGCTGTGAAGGCGAC
AGTGGTGGACCGATGGTGGTTCAGAGAGCGAAAGACGATCGCTTCGTACTGAGCGGAGTT
ATCTCGTGGGGTATCGGATGTGCGGAACCCAACCAGCCCGGGGTCTACACAAGGATATCC
GAGTTCAGGGATTGGATCAACCAGATACTACGCTTCTAA
Protein sequence:
MGPPLNTWPYSHHRELKSASDKMFINLSAVSAGPVILSLDHLRGSVPIARNIRHLPCISR
KTAQEGLCMFAIDCLKANGTHLGTCIDRFYFGSCCQLTDKSAIPNIAANNIEDNAIDGAN
FVHPLIDHKIHSQTSKKPDKINVNTDKENAKPIQDDIATKKPSTVQDVTSSDKMQSKSDE
TVSINHITSDIRTTEAVTAVKEATTQSMKVSDNVVTEIPVKLSTFQTVSAAGDTATAAPE
AKPTKPQTPEEPVKPTRKPVKPTYKPRPYRPTNFTRPPISPKPKPTKPVALFNTTRKPPY
RPPPKRNSTKKPLPSPPRLNITIIPQSTKPSTEKATEIPTEKNTERITTELLPEPTEEKE
KETVTENVTTVVTEKVTLQDVIEKDTVKPATEGDNPVTTKPTTDYPPFVTWTNEASSKAP
ATVSDDWSPITPPDGWVLISTMSPKPETTVKPQTTETETTLKPTSVLTEATSILTSTSTT
ASPTSEIEFVVNVTLSPTTPTPTSSMAPTTNVTSDETQTTTTTTLAALTTIANVTTTEAT
TTITTTTESYNMSNYKEVCGRRMWPQARIVGGAKSGFGQWPWQISLRQYRTSTYLHKCGA
ALLNENWAITAAHCVDRVPPSELLVRLGEYDLANEDEPYGFAERRVQIVASHPHFDPATF
EYDLALLRFYEPVTFQPNILPVCVPDDDDSYVGRTAYVTGWGRLYDEGPLPSVLQEVEVP
VINNTACESMYLAAGYNEHIPNIFICAGWKKGGSDSCEGDSGGPMVVQRAKDDRFVLSGV
ISWGIGCAEPNQPGVYTRISEFRDWINQILRF
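
Consistency check (illustrative sketch, not part of the original record): the nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code and comparing the result with the listed protein. The helper names `translate` and `check_cds` are hypothetical, and the sketch assumes the standard code applies to this nuclear gene model. A CDS length of 2439 corresponds to 813 codons, that is 812 residues plus a stop codon, which matches the protein length shown.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X', stops become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def check_cds(cds: str, protein: str) -> bool:
    """Return True if the CDS starts with M, ends with a stop codon,
    and translates exactly to the given protein sequence."""
    translated = translate(cds)
    expected = "".join(protein.split()).upper()
    return (
        translated.startswith("M")
        and translated.endswith("*")
        and translated[:-1] == expected
    )


# First and last codons of the record above, used as small spot checks.
print(translate("ATGGGTCCTCCCCTCAAC"))  # MGPPLN
print(translate("CGCTTCTAA"))           # RF*
```

To check the full record, paste the complete nucleotide and protein blocks into two strings and call `check_cds(nucleotide, protein)`; line breaks inside the strings are ignored.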