New model in OGS2.0 | DPOGS208871  |
---|---|
Genomic Position | scaffold977:- 36417-38763 |
See gene structure | |
CDS Length | 1302 |
Paired RNAseq reads   | 108 |
Single RNAseq reads   | 438 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008101 (1e-12) |
Best Drosophila hit   | gastrulation-defective (1e-20) |
Best Human hit | enteropeptidase precursor (8e-19) |
Best NR hit (blastp)   | serine proteinase [Samia cynthia ricini] (4e-118) |
Best NR hit (blastx)   | serine proteinase [Samia cynthia ricini] (3e-117) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR009003 Peptidase cysteine/serine, trypsin-like |
Orthology group | MCL40223 |
Nucleotide sequence:
ATGAAAGCGATATATTTAGTTTTAGCGTTTTTTGTTTTGGAGGTGCATTCAATGTCCCTC
GTGGGGCCGGTTGCGACATGGTACCGGCCGTGCGGACTTGGAGCTGTGTTTTTTAAAAAC
CTTACTTCAAACTTTTGGCTCGCAAAAATAAATGTCAGTTTATATAATTTAGACAAAGCT
AATATATCGATTCAATTCGAACAAGAAGTGCAGATTGTGGCGGTCCCAATAAAGTCTTTA
ATCAAGTTTTATAAAAAGTCGAATACTTACACATTTAATTCGCTGGAAACGATTCCTAAT
GAATATAGTATTTATATGAAAATTGTCAACGGCTCCGGTACGGACATACCGAAAGTTTCA
AGTATTAAATTGAATAATATGGTGCTGTGTAACGAAACTGTAAAGAATACGGTAAATGTT
AAGTCCTACAATGTTACTAAAGAAAATGACAACACCTTTAAATATATGTGCGGCCACCGA
TCGTTAAAAAGCTCAGAAGTTAATCAAGTCATGGGAGATGCCAAGGCTGGTGACTGGCCC
TGGCATGTAGCTATACTGATAAGGAGAGGAACAAAAAATCTCGCCAACTACCAATGTGGA
GGAACTATTATTTCTAGTACCGCAATTCTCACTGCCGGTCATTGTGTTTTCATAAATGGG
ACACGTATTGAAAGTGAAAAACTTGTAATCGAAGCTGGTGTGGTCGATCTCAGGGCAAAA
GACCAAAAAGGAAAACAAACACTAAACGTTGACAAAGTGATTTTGCACCCCGAGTACAGT
ATAGAACACGCAAGTTCAGATCTCGCTATTCTTGTGGTCAATAAACTACGGTACACTGAA
TATGTCCAACCAATTTGTATTTGGGGGCCAGTGTATGACAAAATAACGCTCTTTGGGCGG
ACAGCTATGATTACTGGGTTTGGAACAACAGAAAACGACGTACTTTCAAACACTCTCAGA
TCTGCGTACACTACTATACAAAATGACACCACTTGCATTGCGTTCAACCAAAATTTATAT
TCAAAATTGCTAAATGAATTCACATTTTGTGCTGGTTTAGGACCCGAAGTTGGAGTGAAT
CCTCGCAACGGGGATAGTGGGGGAGGCTTAACGGTACCAGTGGTGCGAGCTGACAACAAA
GTGACCTGGTTTCTACGAGGTGTTCTGTCCAAATGTGGCTTACCTACCGGTCACAAATTA
TGCTCTCCTAATTTTTACGTAGTTTACACAGATGTTGCTCCTCATTACGGCTGGATATAC
CATAACGCGGGATTATATTTTTCAAGTAACATCATTTATTGA
Protein sequence:
MKAIYLVLAFFVLEVHSMSLVGPVATWYRPCGLGAVFFKNLTSNFWLAKINVSLYNLDKA
NISIQFEQEVQIVAVPIKSLIKFYKKSNTYTFNSLETIPNEYSIYMKIVNGSGTDIPKVS
SIKLNNMVLCNETVKNTVNVKSYNVTKENDNTFKYMCGHRSLKSSEVNQVMGDAKAGDWP
WHVAILIRRGTKNLANYQCGGTIISSTAILTAGHCVFINGTRIESEKLVIEAGVVDLRAK
DQKGKQTLNVDKVILHPEYSIEHASSDLAILVVNKLRYTEYVQPICIWGPVYDKITLFGR
TAMITGFGTTENDVLSNTLRSAYTTIQNDTTCIAFNQNLYSKLLNEFTFCAGLGPEVGVN
PRNGDSGGGLTVPVVRADNKVTWFLRGVLSKCGLPTGHKLCSPNFYVVYTDVAPHYGWIY
HNAGLYFSSNIIY