New model in OGS2.0 | DPOGS202454  |
---|---|
Genomic Position | scaffold1682:+ 40964-46335 |
See gene structure | |
CDS Length | 1884 |
Paired RNAseq reads   | 104 |
Single RNAseq reads   | 391 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009953 (4e-66) |
Best Drosophila hit   | CG9460 (4e-29) |
Best Human hit | leukocyte elastase inhibitor (2e-24) |
Best NR hit (blastp)   | serpin 1 [Manduca sexta] (4e-81) |
Best NR hit (blastx)   | serpin 1 [Manduca sexta] (1e-78) |
GeneOntology terms   | GO:0005515 protein binding |
InterPro families   | IPR000215 Protease inhibitor I4, serpin |
Orthology group | MCL10067 |
Nucleotide sequence:
ATGTGCCGAAAAAGAAGAAGACATAGCCACTGTGCGGAACACCACAAGAAGATGGACTGC
CCGAAGTGGACCGCGGGAGCAATACCGGTATGCATTAACTGTCAAGCTGGAAAACTGGAC
AAAAGCGATCACAATGCTTTTGAAGCAGAATGTCCAGTTAGGCAGAAGTGTGACGCGCTA
GCAAGGGCTACCGGAAAAGGAAACAGACTGCCACGTTGTGTGGTGGACGTTACCGTCAGC
TCACCTGACTTGCTAGGCAAGGTGGGAGAGAAGAGGGTGACCGGAAAAGCCACTTTAAGC
AAAGGAGGGAAGGATGAACCAGACAACATCAAGAAGGAATACAGTGCGATTGTGGTTGAG
GTATGTGAGGAGGTTCTCCTAAAACAAGGTGTAGGAACCAACCTGCAATTGCTTGGTTAC
CAAAGAACTGGAGGAAATAAAAGGAGAAACTATTCACCATCGAATCACCGAATCTACTGT
GCGGCAGACTGTAGAAAAGTTATGGAAGATAACTTAAACAAAAAGCACATATTTTTCTTT
GCTATCACGGCTATGGCTAGCGAAAAGACTTTAAATGAAATGCTTTTTAATAGCAATACC
CAATTCACAACAAAAATGTTTAAAGAAGTAGTAAAAGCCAAACCAGGACAAAGTGTAGTG
CTATCAGCTTTTTCGGTTCTGCCACCTCTTGCTCACCTTGCTTTAGCATCTGTTGGGGAA
TCACACGATGAACTTCTTGATGTAATTGAAATGCCAAATGACAACGTTACTAAAGCAGTA
TTTTCAAAAGCAAACACCGTTTTGAGATTAGTAAAAGGAGTGACTCTTAAAATGGCAAGC
AAAGTCTACGTGGCTGAGAATTATGCATTAAACAGGGACTTTGCCGCCCTTAGTCAAGAT
GTTTTTGGATCTGAAGTTGAAAATATCGATTTTTCTGAAAACGAAAATGCCTCTAAGAAA
ATCAATCAATGGGTTGAAGATGAAACAAATAATCGAATTAAAGATCTAGTAGACCCCACA
TCCCTAGATGCTGATACCAAAGCTGTATTAGTAAACGCAATTTACTTTAAGGGTGCATGG
AAAACTCCTTTTGACAAGAAAAGCACAACCGACAGAGATTTCCATGTGAGCAAAGAAAAT
GTTGTCAAAGTACCCACCATGTACAATTCAGACACCTTTTATTACATCGATAGCGAGGAA
CTCGACGCACAGGTATTGGAACTTAAATATGAAGGAGAAGATTCTGCTCTGGTACTCCTA
GATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTTTAGAAATCATACTCGTACGTGAA
GTGGTTAGTGTATCGGTAGCAAGAGTGCAACAAGAAATAACCACGAACCAACGATCAACA
GGTTTTGTGAGAGAAGGCACTAGGACTACAAAATCGAACGGCAAGCCTTACTCTATTATT
AAAGTGACTTCGGATTGGACTTTAATGATGAACACGTATGAGAAGCAATATAAGATCCCA
GAGGATATTTCTGCGTCGGCCTATAGACCGGACATATTTTTATATTTGCGGATTTTAAAG
CGCGTTATACTTCTAGAGCTCACGGTTCCTTGGAAAACCAACATCCCCAAAGACCATGCG
ATCAAGGTCAACAAGTATTACCAGCTCACTAACGAACTCATTAAGAATATGTTCGTTGTA
AATTTGTACGTGGTAGAAGTAGGAGCGAGAGGTATAACAACCAAATCTCTCTACAACCTG
CTAAAAGACTTAGGCCTCCCAAGAACTAATATCAGTTCATTCTTGGAACATGCGTCGAAA
TCAGCCCTAGCAGGTTCGGTTCATATTTGGTTAGGTAGAGAGAAGCATGGCCAGTGGAGG
TCAGCGTTATCGCGCATTAGATAG
Protein sequence:
MCRKRRRHSHCAEHHKKMDCPKWTAGAIPVCINCQAGKLDKSDHNAFEAECPVRQKCDAL
ARATGKGNRLPRCVVDVTVSSPDLLGKVGEKRVTGKATLSKGGKDEPDNIKKEYSAIVVE
VCEEVLLKQGVGTNLQLLGYQRTGGNKRRNYSPSNHRIYCAADCRKVMEDNLNKKHIFFF
AITAMASEKTLNEMLFNSNTQFTTKMFKEVVKAKPGQSVVLSAFSVLPPLAHLALASVGE
SHDELLDVIEMPNDNVTKAVFSKANTVLRLVKGVTLKMASKVYVAENYALNRDFAALSQD
VFGSEVENIDFSENENASKKINQWVEDETNNRIKDLVDPTSLDADTKAVLVNAIYFKGAW
KTPFDKKSTTDRDFHVSKENVVKVPTMYNSDTFYYIDSEELDAQVLELKYEGEDSALVLL
DSGQYSRRHDRVLEIILVREVVSVSVARVQQEITTNQRSTGFVREGTRTTKSNGKPYSII
KVTSDWTLMMNTYEKQYKIPEDISASAYRPDIFLYLRILKRVILLELTVPWKTNIPKDHA
IKVNKYYQLTNELIKNMFVVNLYVVEVGARGITTKSLYNLLKDLGLPRTNISSFLEHASK
SALAGSVHIWLGREKHGQWRSALSRIR