New model in OGS2.0 | DPOGS214606  |
---|---|
Genomic Position | scaffold34:- 20850-30847 |
See gene structure | |
CDS Length | 6864 |
Paired RNAseq reads   | 8448 |
Single RNAseq reads   | 24487 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005131 (1e-13) |
Best Drosophila hit   | CG31635, isoform B (1e-14) |
Best Human hit | PREDICTED: leucine-rich repeat-containing protein 68 isoform 2 (8e-18) |
Best NR hit (blastp)   | cysteine proteinase inhibitor precursor [Manduca sexta] (0.0) |
Best NR hit (blastx)   | cysteine proteinase inhibitor precursor [Manduca sexta] (0.0) |
GeneOntology terms    | GO:0005575 cellular_component GO:0003674 molecular_function GO:0008150 biological_process |
InterPro families    | IPR000010 Proteinase inhibitor I25, cystatin IPR020381 Proteinase inhibitor I25, cystatin, conserved region IPR003590 Leucine-rich repeat, ribonuclease inhibitor subtype |
Orthology group | ND |
Nucleotide sequence:
ATGAAGAAACTTGAGTGTGTGTTATTTTGTTTATTAATAAGTTACTGTTTGGGAGAATTA
AGTATTGAAAGGAAAGAAAAGTTTCTTAATGGGTTTGTAGAGTATTTAAACAACCTACCC
AACCAGATTTATTCATACGAAGAAGGTTATATATTAAATGCGCAGAATACTGATGATACA
AATTGTTATAATGTTGAGACCTTGCTACTCTCAAGACCTATTAATGATTTGGATAATGCA
AAGTATCAGAAATGCTTTGCCACAATAGTGGACCTTGATGATAATGGGATAAGCATACAG
AATAATCAACATCATTGTGAGGATTCAGATCTGATATCTCAAGTAAATTCTTTGGATGAT
ACCAACCTTGAAGAAGTTGTTGTGGATGAGACAAGTACTACAACTGAGGCTGTAATCGGT
CATAAGCCTGTGGAACTTGACAATGAAGTTCAAATTAACACTGGGGTAACTTCTGGTGAG
CAATTTATTGCTGTGCCGAGAAGGCAACCAGGTGGAGCTTGTGTGGGTTGTTCAAGTCAT
GTTAATCCGCAAGCTCCAGGTGTTACAGAATTAGCAACACTCGGCGTCAAACATTTAAAT
ATTCATGAGCAAAATGTGAAACATTCCCTTGAAGCGGTTTTAGATGTCGAAAGACAAGTT
CAAGTCGTTAACGGTGTTAGATACATTTTGATACTACAAGTAGGATATGAACCATGTATT
ACTACAAGTGAGGGATGTGTTGATAGAAAAGTCTGTAAAATATCTATATTGGAGAAGAGT
TGGATAAAATTACCAGATGGTTCTAAATATAGAGCGGTTTTATCTAATAATTGCACAGAA
GAATGGCAATTTGGTGATGAAGGCGAAATAATAACGAATGATGAAAATAATCCGCACACT
AATAATCCCATCGATAACAACTCAAATCCTACAACTGATGATGACAAGGGATCAAATACT
GGTTCTGTTGATGAAATTGTGAAATCCGAAAAAAATTTTGATGTTCAATCCCAGCCTAAT
CAGGGACTCAGTGATGAACAGATAAAAAAAATAGAAGAACAAATCATACCATATGACCAT
ATTTACGAAACTACAACAAGTGTAAACATTTTAGAACAATCTCTTCAACCTTTAAAAGAG
AAAGTTCATAGCGTTGACAATGCTGAAACAATTACACATCAAGTAGATAATACAATTCCG
TTCAAAGAAACTTTATATCACGAAAAGGAGAGTTTCTTAGGTGAGGATAGAAAGAAGGCT
ATCGACGATTTAATAAATTTTTTTGACTCTGCTGGTCTCGACATTAACCAAGCTAGAGTC
CCGAGAGCGAGGAGGAGTTATAACCATGATCTGAAAATTATGGCACTAACTGAAAAAATT
CATAAAATAAAAAACAATATTAAGAACGCTAAATACCTTTACGCTCTGGCACAAGAAGTG
GTTGACTATCTTAATGAAATTGACTTCGAAATAAAAACACGAACACTTGTGGAAGTTACA
AATGCTGAGGAGGAATTCGAAAACCATCAACACTTTTTCTATATTCAGGCAAGAGTAATC
ATACCATGTGATAAAGCAGATTGCGAAGACAAAGAAGGAGAAATGAAGATCTGCAATGGT
GTTATTGAAGCAATAGAAAAAGAAAGGCCTCAAATTCTTAATGCTTTTTGCTATGATGAT
AACCAAAAAAAACATGTATTCTCAAAGACTGAGCCTATAGACCTGGATGATCCTGTTCTG
ATGAAATTAACAAAAGAAGCCCTTAAAAAAATTGAAAAAGAATCCCTTCATCACAATGCT
TTGAAAATAGAAAAAGTTATACAACCAACCATAAAAAAATCTTCCGGTACCCTTACAAAA
TTTTCTCTTAGCCTGTCGTTAACAAATTGTAACAAAACAGTTCCATATGTAAATAGGGAA
AATTGCACTATTATGCAAGGCAATGATTCTCTTATTTGCGAAGTTACGATACTTGAAAGA
CATTGGTTAAAAGAGAAAAAACTTACTTATTCTTGTATGTCTAGACCGTTTGATGAAAGA
TTTTCTGCCAAGAAACAAGTGGAAACCAAACCCGTCGTGACTCAGGATCCTAAGATTTTA
GAAATGGTTCTACAAGCCTTGCAATATCTAGATTCAAATTCCAATAGAAACAATAAGCAA
AAAGTCGTTGAAATTAATTCAGTATCTACTCAGCTTATTGGTGGATTGATAACTCAAGTG
GAATTCGTTGCTGGTTACACCGAATGCCCCAATGAATTTGATGTTGATTTAAAAAAATGT
AATTTGCTTGAAAATGAAGCATTACGAAAATGTAAAGCAGAAGTTTGGGATCGACCATGG
TTGAATGACGGTAGACAAATAAAAGTTAAATGTGATGATTCTTTTAATGGACAATCTAAA
GTATATAGGAAAAAAAGGGATGTTTCTGATAGCCAACATCATATGGAAACTTATCAGCTT
ATTGGCGGTCCAAAGGTATCGAATGAAAAAGACACTAAATACTTAAACTTGGCTAGAAAG
TCGTTAAATCAATTCCTGCAAAATAATGGAGTTAGTGAGAAATTTGAAGTCCTGAAAGTT
AATAAAGTAACCGAACAAGTAGTCGCAGGAACCTTAACCGAAATCAAATTTACAATAACT
TCTCAGAGTAATGGCGACAACATAGATTGTCATTCGAAAGTTTGGGAAAAGCCTTGGATG
AATTTCGAAGAAATAACCGTGACTTGTGAAGAATCACTTAAGAATAGACAATTACGACAG
AAAAGAGGAGTAAATGATGGCCCACTCGTCGGTGCTCCACAGAAAGTCGATTCGAACGAC
GTTGTACATTATTCTGTGTTTCGTCAGGACTCTGGCGTGGAGTACATTTCGTCAGCTCTT
GTCGAACAAGCGGAGCACTCGCCACCGTCCGCGGTAGCGTCCCCGCTCTCACCGCACTCC
GCCTGCGGCTACGAGTCCACCGGTCTGGCGTTTCTGGTGCTATGGAACAACCAGCTAACG
AGGAACTGCGCACAGAATCTAGCTAAAGTACTGCGCACATCAAAGTCCCTGTGCGTTTTG
AACGTCGGCCGTAACCCCCTGGGCTCGGAGGCCGTGAGGTCTCTAGTGGGGCGAGGGTTG
GTGTCCCTGGGCCTACAAGCCGCCAGGCTCGGACCGGACGCGGCCAGGGGACTGGCTGAT
ATTATACGAGGGGGGGAGAGACTACAGAGGTTGGATCTCCGGGACAACAAGCTGGGCGTG
CCAGGACTACAGGCGATACTAGCAGCTGTTAAAGAACACGCCTCCATCACACAGATAGAC
CTGGATGATCCAGCGGAGTCACAGACGGGCGTCCAGAGCACGGAGGCTGCGACAGTCGCG
CGACTGTTGCGCGAAATTCGTGTCGTGTGTCGCGGGAACGAACCCGCGGCCCCCGACAGG
TTAATGAGGAAGATCAGCCTCACTTGTCATACAGTCCCTATGATTAAGACTCCTGCCGCT
GATGATGATCGTCGCGTCCGCCTCCGTTCCCCAGCGCCGTCTCCGGCCCCCTCGCCGGCT
GGTAGCCCCGTACCAACACCTACTGGATCACGATTTTCGGTGACTCGAGTGACCCCCGAG
CGGGAGATGTCAGATTCAACCCCCACAACCCCCACGACCCCCACCAGATGTACTTCATCA
AGGTTTAAAGTTGTACAGGTGGTGGAACCGCAAGTAGTGATGCCCAGGAAGTCTGTCTCG
AGATTCTCAGTTACGAGGAACTATGACAGTACTTACAATCCCACGTTACCACCGACCACG
CCATCGCCATCACCATCGCCGTCTCCGACACCGTCACCAGTACCGGACAGGAGCGAGAAG
ATTGGCCAAACTCCTTTGAAAAAGGTTGGCCCAACCATCGACGGTAGTGGCGCAAAGGTT
GGCCATATCGACAAGCAAGCAGCAAATGAACGATCGAATATTGAACACACACAGAGTGTT
GACTTTAAAAAGACTAACACAGATAGTAAAGAAACCACGAATAAGAAAGAGACTGAGATG
CTAGCGGACTTCAGTTACGACGAGGTCCGCATAAAAGATGTTATATTAAAAGATAAAGAT
AAAGATAAAGATGTAAAAGATATAGAAGGTAGTTTGATAATTATTGACGATGTCAGAGAC
GAGGACGAAGAACCGTCGAGTGAAGCGGCTAAGGATTTAGATGTGTGTGATGTGCTTGTG
ACAAGTGACTTCGGTGTCAAACGCCAAGTTAGTGACGATAGTGTACACGACTCGGACAAC
GACGTGTTTAGTGATAGCGGGGACTTTAAAAATTTAGATTTAGTGTATAGTGATACCTAT
AACGGTGATAGGAACGAACAAATGTCTCATAGTGAGACGGTTGGCGGCGGGATAGAGACG
GGTGTAGCGAGGTGTATAGGTGTGAGCGAGACGGATAGAGATATGACAATTGAAACAGAT
AATAGTCTAACCAAAGTACATCGCGACCAAAATGTAAATAATATATATAAGGATGATAAA
GTAGCCATAGAGGATGATGGGAAAGGTGTAGATGTTGTAGATTTAAACGTAACCGTTGTA
CCTGCTCGTGATAGTGTAGTGTTAAAGAAGAATAAGAGTGAATCGAGCCTCGACAGCCCG
GACCTGGAGGTGTCGAGGCTGATGCGGAGACCTGTGAGTGCCTTCTGTGATAGCAGCTCG
TCGCTGGAGATATCCGGCAGCTCGATGGAGAGTCTGAACACGGACAGACCCAGGCTGATA
ATCGACAAGCATTTGTCCAAAGACAACAGCGTCGAGTCCACCAGTGAGGTGACACCAGTG
AATCTTAACGTATCGATAAGTTCCAACGAGAGCGTCTCGCCTATTATATTCGCTAAGAAG
ATCCACGGATCGCTATCCAGCCTGGAGGCGAGCGTCAGTTCGGTTGAATCCGCTAAAGAA
AAGATAATGGTGACGTCGGCGGATTCAGGGATAGAATATTCGTTACAAAACCCATCCGAA
ATGAAAGACGACAGTTCGTCCAATGAAGGCACTCTGACGAACTGCAGTTCGAGTTTGAAG
GAAACTATGAGGAAGGATTCGCAGGATACCGTGACGCCGAAGCGAACGTCCAGCTTGTTG
GATGTACCGGCTCTGAAGTCCAAGGGCTTAGAACGGATGAGGAAGATATCCTGGGTAGCA
CCGTCAGCAAGCTTCCATCTCCCCAAAGCTGAGGAGAAAGTGGAATACAAGCTGCCGGGG
AATCTGGAGAAATTGCTCAGCCTCTTCCAACATCCGAGCAGTCTGTTCTCCAGGAGTAGC
AGTGACGATGAGAGAAAATCTAACTCCGGGACACCCCCGAGGAAGGATTCGTCTTTAACC
AGCTCGTTCTGGTCCTGGGGGAGTGTCGCCGAGAAGAACGACGATGACAGTATATCGGAT
GCAACAGACTCGACGCTGTCCGAGCGCGTGCAGGTGTCCTTCGTCGACGAATCGTTCTCA
AAGAAACTCGACAGCAAAACGCCTTCCACGGACACTGATAACACTCTAAGTGAATTTCAG
TTTCCTAATACGGAGAAAGTTACCGTAACCACCGATAAATTAGTACAAAGTTTAGATGTA
AGCGACCCGTGCTCGGCCAACCCGAGCGATGATCTCATTGTGCCCAACGATTTTGTATAC
GATGATAACTTAAAATCTGAAGATAAAGTTGATGTTAAACGAACCTTCGCCTCTGTATTG
AAGTCGTCTGGTTCGGAGAATTCATTGGAGAGGCCGAATCCTGACGTGGGACAGACGGTC
GAGAAGCTTCCCAGCAAGGTGATCAAAGGCATCAAGGAAAATATAAGCCCGGAGAATACT
TTGACGTCCAGTATAGCGACGAAAGCCATGGCTATGGAAGTAGCGGAGAGACAGGCCAAA
AATAAACAAATCGTCAACACAGTATGGGAGGTCACGAATCCGTTGACGGAGAAAAGTGAT
ACTAAAGTGACTGAGAAGAAGACTCAAAGTGACTTGGCGCCGATAGCGAATATCGATGAA
ACTTGTGACGTATCAGCTGATGACGTCATCCAGCTGGCATACATCGATGATAAAGAAGAT
GGAGCCGATAAGGTTGAGAAGGTCCTGGAGAATATCGACCTGGGGAAGGACGCCTTATCG
TATCTGATATATGAAAACCAAGATTACGAGGCGGATACAGAAACTGTGTTGGCGAATAGA
TCTCAGGAAGGATCTCTGGCTCAAGAACTGAGGGATGCTGAGATAAAGGAGATGCTAGAT
CTATCACCTGAGTTGGTGTTAGACGAGGCCCTGGAAATACCGGAGATATTCACTGTTGAA
ATCAAGGGACGAAAAAGCTCTCCAGTCATACCGGAGAGGGCGAAGATGAAGAAGTCCAAC
TCGCTGGAGGATTTGACGAAGAGACAGAATCTAGAAGAGAAAGAGAGTCCTAAGATGAAG
ACGATAGCGTTCAAAGTCCCCGAGAGCACCACTCCCAGAGACATACCAGAGAGACGAACG
AAATTAAGATCTAGGAGCGGATCCAGTCCTAAATCATTACCGGAGAGCCTGAACAAACCT
TGTCCCTTGACGAAGATGGATTCCATATTGAGCAAGAAGAAGAAAAAAGTGTCCTCGCTG
GGGAAAATGGCGAAAGACTCGCTGCTAGCGTTGAACATGAGCGAGGAGGAAATCGCCGAG
TTCAGACGCTCCTATAAACTGACGTCGGTTGAGAGTCTAAGGTCTTTGGAGTCCGTGTCC
GAAGATGCGAACTCACACAGCGGGACCTCATACGATTCGAGATGCCGAGCCTGTCTCCGG
ACTTCACAAGAGAGTCTCATGTCGCTGGACTCCATCAACGAGGACTGCAGGTGTGCCGAT
GACGAGAAACGTCACCATAGATAA
Protein sequence:
MKKLECVLFCLLISYCLGELSIERKEKFLNGFVEYLNNLPNQIYSYEEGYILNAQNTDDT
NCYNVETLLLSRPINDLDNAKYQKCFATIVDLDDNGISIQNNQHHCEDSDLISQVNSLDD
TNLEEVVVDETSTTTEAVIGHKPVELDNEVQINTGVTSGEQFIAVPRRQPGGACVGCSSH
VNPQAPGVTELATLGVKHLNIHEQNVKHSLEAVLDVERQVQVVNGVRYILILQVGYEPCI
TTSEGCVDRKVCKISILEKSWIKLPDGSKYRAVLSNNCTEEWQFGDEGEIITNDENNPHT
NNPIDNNSNPTTDDDKGSNTGSVDEIVKSEKNFDVQSQPNQGLSDEQIKKIEEQIIPYDH
IYETTTSVNILEQSLQPLKEKVHSVDNAETITHQVDNTIPFKETLYHEKESFLGEDRKKA
IDDLINFFDSAGLDINQARVPRARRSYNHDLKIMALTEKIHKIKNNIKNAKYLYALAQEV
VDYLNEIDFEIKTRTLVEVTNAEEEFENHQHFFYIQARVIIPCDKADCEDKEGEMKICNG
VIEAIEKERPQILNAFCYDDNQKKHVFSKTEPIDLDDPVLMKLTKEALKKIEKESLHHNA
LKIEKVIQPTIKKSSGTLTKFSLSLSLTNCNKTVPYVNRENCTIMQGNDSLICEVTILER
HWLKEKKLTYSCMSRPFDERFSAKKQVETKPVVTQDPKILEMVLQALQYLDSNSNRNNKQ
KVVEINSVSTQLIGGLITQVEFVAGYTECPNEFDVDLKKCNLLENEALRKCKAEVWDRPW
LNDGRQIKVKCDDSFNGQSKVYRKKRDVSDSQHHMETYQLIGGPKVSNEKDTKYLNLARK
SLNQFLQNNGVSEKFEVLKVNKVTEQVVAGTLTEIKFTITSQSNGDNIDCHSKVWEKPWM
NFEEITVTCEESLKNRQLRQKRGVNDGPLVGAPQKVDSNDVVHYSVFRQDSGVEYISSAL
VEQAEHSPPSAVASPLSPHSACGYESTGLAFLVLWNNQLTRNCAQNLAKVLRTSKSLCVL
NVGRNPLGSEAVRSLVGRGLVSLGLQAARLGPDAARGLADIIRGGERLQRLDLRDNKLGV
PGLQAILAAVKEHASITQIDLDDPAESQTGVQSTEAATVARLLREIRVVCRGNEPAAPDR
LMRKISLTCHTVPMIKTPAADDDRRVRLRSPAPSPAPSPAGSPVPTPTGSRFSVTRVTPE
REMSDSTPTTPTTPTRCTSSRFKVVQVVEPQVVMPRKSVSRFSVTRNYDSTYNPTLPPTT
PSPSPSPSPTPSPVPDRSEKIGQTPLKKVGPTIDGSGAKVGHIDKQAANERSNIEHTQSV
DFKKTNTDSKETTNKKETEMLADFSYDEVRIKDVILKDKDKDKDVKDIEGSLIIIDDVRD
EDEEPSSEAAKDLDVCDVLVTSDFGVKRQVSDDSVHDSDNDVFSDSGDFKNLDLVYSDTY
NGDRNEQMSHSETVGGGIETGVARCIGVSETDRDMTIETDNSLTKVHRDQNVNNIYKDDK
VAIEDDGKGVDVVDLNVTVVPARDSVVLKKNKSESSLDSPDLEVSRLMRRPVSAFCDSSS
SLEISGSSMESLNTDRPRLIIDKHLSKDNSVESTSEVTPVNLNVSISSNESVSPIIFAKK
IHGSLSSLEASVSSVESAKEKIMVTSADSGIEYSLQNPSEMKDDSSSNEGTLTNCSSSLK
ETMRKDSQDTVTPKRTSSLLDVPALKSKGLERMRKISWVAPSASFHLPKAEEKVEYKLPG
NLEKLLSLFQHPSSLFSRSSSDDERKSNSGTPPRKDSSLTSSFWSWGSVAEKNDDDSISD
ATDSTLSERVQVSFVDESFSKKLDSKTPSTDTDNTLSEFQFPNTEKVTVTTDKLVQSLDV
SDPCSANPSDDLIVPNDFVYDDNLKSEDKVDVKRTFASVLKSSGSENSLERPNPDVGQTV
EKLPSKVIKGIKENISPENTLTSSIATKAMAMEVAERQAKNKQIVNTVWEVTNPLTEKSD
TKVTEKKTQSDLAPIANIDETCDVSADDVIQLAYIDDKEDGADKVEKVLENIDLGKDALS
YLIYENQDYEADTETVLANRSQEGSLAQELRDAEIKEMLDLSPELVLDEALEIPEIFTVE
IKGRKSSPVIPERAKMKKSNSLEDLTKRQNLEEKESPKMKTIAFKVPESTTPRDIPERRT
KLRSRSGSSPKSLPESLNKPCPLTKMDSILSKKKKKVSSLGKMAKDSLLALNMSEEEIAE
FRRSYKLTSVESLRSLESVSEDANSHSGTSYDSRCRACLRTSQESLMSLDSINEDCRCAD
DEKRHHR