New model in OGS2.0 | DPOGS204282  |
---|---|
Genomic Position | scaffold1712:173-10237 (- strand) |
CDS Length | 3174 |
Paired RNAseq reads   | 27608 |
Single RNAseq reads   | 97288 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007558 (2e-07) |
Best Drosophila hit   | ND |
Best Human hit | inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor (1e-60) |
Best NR hit (blastp)   | unnamed protein product [Tetraodon nigroviridis] (2e-100) |
Best NR hit (blastx)   | unnamed protein product [Tetraodon nigroviridis] (2e-101) |
GeneOntology terms    | GO:0030414 peptidase inhibitor activity; GO:0005737 cytoplasm; GO:0006953 acute-phase response; GO:0005576 extracellular region; GO:0005886 plasma membrane; GO:0004867 serine-type endopeptidase inhibitor activity; GO:0030212 hyaluronan metabolic process |
InterPro families    | IPR002035 von Willebrand factor, type A; IPR013694 Vault protein inter-alpha-trypsin; IPR006587 Vault protein inter-alpha-trypsin, metazoa |
Orthology group | MCL11159 |
Nucleotide sequence:
ATGAAGAAATCCTGGATATACCTGTTTTACATAGTTTTTATCGTTGCAAAAGCACAAACC
GCCTCAATTTCTAGCACCGAAACTTTGGTTGTTGCCAAGACAGATGATGAGGCGTCAACG
GCTGCTCCGTCTGAACCAGTAACCGACGAACCAAACGCTCCTATCAAAGTGACAGAAATG
AGAGTTAATTCGGAGGTGACGATGCGGTACGCACATACAGCTGTTGTCACACACGTCAGA
AACCCAGCTTCCAAAGCACAGGAGGCAACCTTCCATGTGCTGTTGCCAGAGACAGCCTTC
ATCAGCGGCTTCATAATGACGTTGGGCGGGAAATCGTATAAGGCTTACGTAAAAGAAAAA
AATGAAGCGAAACAAATTTTCAACGAAGCTGTCTCTCACGGGACTGGGGCGGCCCACATC
GCGGCCAAAGCTCGTGATTCAAACCATTTCACAGTATCAGTGAATGTGGAGCCGAAGAGT
GTTGCTATATTCAATCTGACCTATGAAGAGTTATTGGTGCGTCGCAACGGCGTTTACAAC
CACGCAATCAACCTTCACCCGGGAACCTTAGTACCCAAGCTGGAGGTGGTGGTACACATC
AAGGAGTCCCAGAAGATCACGACGCTCCGAGTGCCTGAGGTCAGGACTGGCAATGAAATC
GATGCTACAGAAAACGACGCACAAAATTCAAAGGCTGTCCAAACTAGAAATGGCGACAAG
GAAGCTACCATTACATTCACGCCCGACTTGGACGAACAGATGAACCTTATTAAGATATAT
AAGGACAAAACAAAAGATACCGTGGCACATCATTATTGGGACAACAATGAGGAAGAAGAC
AACAGAGACGGAGTTTTGGGACAATTTGTTGTTCAATACGACGTGGAACGTTCGAACGAT
GGAGAAGTCTTGGTGAATGATGGATATTTTGTGCACTTCCTGGCACCCAGCTCGTTGCCA
CCACTCAACAAGTACGTGGTATTTGTGCTGGACACTTCCAGCTCTATGATCGGTCGCAAG
GTGGAACAATTGATTGCAGCTATGGACGCCATACTGTCCGACCTCAACCCGAAAAATTCG
AAGGCTGTCCAAACTAGAAATGGCGACAAGGAAGCTACCATTACATTCACGCCCGACTTG
GACGAACAGATGAACCTTATTAAGATATATAAGGAAAAAACAAAAGATACCGTGACACAT
CATTATTGGGACAACAATGAGGAAGAAGACAACAGAGACGGAGTTTTGGGACAATTTGTT
GTTCAATACGACGTGGAACGTTCTAACGATGGAGAAGTCTTGGTGAATGATGGATATTTT
GTGCACTTCCTGGCACCCAGCTCGTTGCCACCACTCAACAAGTACGTGGTATTTGTGCTG
GACACATCCAGCTCTATGATCGGTCGCAAGGTGGAACAATTGATCGCAGCTATGGACGCC
ATACTGTCGGACCTCAACCCGAGTGATTACTTCAGCATTGTTGAATTTAACTCCGACTAC
TCGGTCCATGAGCTGAAAGAAGCGGATGAGCCTCAACCTGAACCTCAAAAGTTTTCTTGG
TATGGATCAACGTCATCATCAAACAAGGAACTTGTCTCACCATCACTTGCTTCACCTGAG
AACATCGCTAAGGCCAAGGTTATCATTTCCAGATTACGGGCTAATGGAGGAACCAATATC
CACAGCGCTTTGAGCGTAGCTATGGATCTTATTCATAAGTTCTCTGGAAAGCACGATATT
TCTTCTGAAAAATCGAATTCAAGTGACGCTGCAAACGAAAAAGCGATAGCAAATGCTAAC
GACTTGAAAACCAAACCAGTCCATGAATTGGAGCCCATCATTATTTTCCTGACGGACGGC
GACCCGACCGTCGGAGAGACCAGCACCTCGCGTATCATCTCACACGTCACCGAGAAGAAC
TCCGGAGAAATGAGGGCTTCCTTGTTCTCACTTGCTTTCGGTGAGGATGCGGATCGCAAC
TTCTTGAGAAAGCTATCACTGCGTAACGAAGGCTTCATGCGGCACATCTACGAGGCGGCG
GATGCGGCGCTTCAGCTGAGAGACTTCTACAAACAGGTCTCCTCTCCACTGCTGGCTCAC
GTCAAGTTCACATACCCACGGGAACAGATAAAAGAGGGTTCAGTTAGTAAGAACAAGTTC
CGCACCGTGTACGCGGGTTCAGAGGTAGTAGTGGCTGGGGAGCTCTCTGACGACGACGTT
GATTTGAGACCTGTCGTTAGTGGCTTCTGCGGGAACCAAAATGGAAAATTGATTCCATAT
GAAAATGATCAGTCCAAGATCAAAGTCACTCGCGTGAAGGAGTTCTTACCTCTGGAGCGC
CTGTGGGCGTACCTGAGTATCCATCAGCTATTGGACCAACGTGACGCCTCCGAAGATACA
GCCGCCAAAGAGCATGAGAAGAAAGCACTCAATTTAGCGCTGAAGTACTCGTTCGTGACT
CCCCTAACGTCGTTGGTGGTGGTAAAGCCGAACGAAACGAACGCCGTGGACGCTGAATCT
GTAGACAAAAATAACAACAATCAATTCTCAGGTATTACACCACTGTCGTTTAATGCAATG
CCTCAAGCGCCTTTAAGTCATCATTTATTGATAGCACCACCAGCGTACAGACCCATGGTT
ATGGGTGGGAATGGAGACGCACTCGCGTTGGTAGGAGGTTTCCATGCTCAAGTAGAAGAC
GAGGAGGTCGACGAAAAATATGACGACATTGGCCAGATCAGTCTCAACAGAGCTGGTTAC
AGATTCGATTCAGACGAGGACGATTATGATGGCTTTATAGGTTCAAGTTCATTTATTACA
ACACCAGCACCAGTGCAGGACTTTTTTGAAACTGTCGCTACCGAAGTCCCAGATCAGGAC
AAATACCATTTAGAGAACTACATGTGGGCTTTAGCTTTAGTGAACAACACCGCTGACGCC
CTCGTGTTTATGGATAATGGAACCGAAATCGTTTTACAGCTCTCTAAAGATAGTAATGCT
CCTCGTGGTAGCTCTGAGGAGTCCTGCACGAACGTGCCCGTTGACGCGGCGAGCCCTGCT
TCGGGCCCTGAACCCGTGAAGGCCTCCTGTGTCTATATCACTCGCTGTTCCGCAGCCAGG
AACATCACCGAAGATGACTATCGCAGATCATACTGTCGCGTTGACAACAAGTGA
Protein sequence:
MKKSWIYLFYIVFIVAKAQTASISSTETLVVAKTDDEASTAAPSEPVTDEPNAPIKVTEM
RVNSEVTMRYAHTAVVTHVRNPASKAQEATFHVLLPETAFISGFIMTLGGKSYKAYVKEK
NEAKQIFNEAVSHGTGAAHIAAKARDSNHFTVSVNVEPKSVAIFNLTYEELLVRRNGVYN
HAINLHPGTLVPKLEVVVHIKESQKITTLRVPEVRTGNEIDATENDAQNSKAVQTRNGDK
EATITFTPDLDEQMNLIKIYKDKTKDTVAHHYWDNNEEEDNRDGVLGQFVVQYDVERSND
GEVLVNDGYFVHFLAPSSLPPLNKYVVFVLDTSSSMIGRKVEQLIAAMDAILSDLNPKNS
KAVQTRNGDKEATITFTPDLDEQMNLIKIYKEKTKDTVTHHYWDNNEEEDNRDGVLGQFV
VQYDVERSNDGEVLVNDGYFVHFLAPSSLPPLNKYVVFVLDTSSSMIGRKVEQLIAAMDA
ILSDLNPSDYFSIVEFNSDYSVHELKEADEPQPEPQKFSWYGSTSSSNKELVSPSLASPE
NIAKAKVIISRLRANGGTNIHSALSVAMDLIHKFSGKHDISSEKSNSSDAANEKAIANAN
DLKTKPVHELEPIIIFLTDGDPTVGETSTSRIISHVTEKNSGEMRASLFSLAFGEDADRN
FLRKLSLRNEGFMRHIYEAADAALQLRDFYKQVSSPLLAHVKFTYPREQIKEGSVSKNKF
RTVYAGSEVVVAGELSDDDVDLRPVVSGFCGNQNGKLIPYENDQSKIKVTRVKEFLPLER
LWAYLSIHQLLDQRDASEDTAAKEHEKKALNLALKYSFVTPLTSLVVVKPNETNAVDAES
VDKNNNNQFSGITPLSFNAMPQAPLSHHLLIAPPAYRPMVMGGNGDALALVGGFHAQVED
EEVDEKYDDIGQISLNRAGYRFDSDEDDYDGFIGSSSFITTPAPVQDFFETVATEVPDQD
KYHLENYMWALALVNNTADALVFMDNGTEIVLQLSKDSNAPRGSSEESCTNVPVDAASPA
SGPEPVKASCVYITRCSAARNITEDDYRRSYCRVDNK
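
The protein sequence above should correspond to a translation of the nucleotide sequence (3174 nt, i.e. 1058 codons including the stop). A minimal sketch for checking that correspondence is given below. It assumes the standard genetic code and a complete, in-frame CDS. The function name `translate` and the variable names are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a single trailing stop codon is dropped.

    Whitespace and line breaks are ignored. Codons containing characters
    other than A, C, G, T (for example N) raise KeyError in this sketch.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAAGAAATCCTGGATATACCTGTTTTACATAGTTTTTATCGTTGCAAAAGCACAAACC"
last_line = "AACATCACCGAAGATGACTATCGCAGATCATACTGTCGCGTTGACAACAAGTGA"

print(translate(first_line))  # MKKSWIYLFYIVFIVAKAQT
print(translate(last_line))   # NITEDDYRRSYCRVDNK
```

Passing the full nucleotide block (line breaks included) to `translate` and comparing the result with the protein block, joined into one string, gives a quick consistency check for the record.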