DPGLEAN21424 in OGS1.0

New model in OGS2.0DPOGS204282 
Genomic Positionscaffold1712:- 173-10237
See gene structure
CDS Length3174
Paired RNAseq reads  27608
Single RNAseq reads  97288
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007558 (2e-07)
Best Drosophila hit  ND
Best Human hitinter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor (1e-60)
Best NR hit (blastp)  unnamed protein product [Tetraodon nigroviridis] (2e-100)
Best NR hit (blastx)  unnamed protein product [Tetraodon nigroviridis] (2e-101)
GeneOntology terms





  
GO:0030414 peptidase inhibitor activity
GO:0005737 cytoplasm
GO:0006953 acute-phase response
GO:0005576 extracellular region
GO:0005886 plasma membrane
GO:0004867 serine-type endopeptidase inhibitor activity
GO:0030212 hyaluronan metabolic process
InterPro families

  
IPR002035 von Willebrand factor, type A
IPR013694 Vault protein inter-alpha-trypsin
IPR006587 Vault protein inter-alpha-trypsin, metazoa
Orthology groupMCL11159

Nucleotide sequence:

ATGAAGAAATCCTGGATATACCTGTTTTACATAGTTTTTATCGTTGCAAAAGCACAAACC
GCCTCAATTTCTAGCACCGAAACTTTGGTTGTTGCCAAGACAGATGATGAGGCGTCAACG
GCTGCTCCGTCTGAACCAGTAACCGACGAACCAAACGCTCCTATCAAAGTGACAGAAATG
AGAGTTAATTCGGAGGTGACGATGCGGTACGCACATACAGCTGTTGTCACACACGTCAGA
AACCCAGCTTCCAAAGCACAGGAGGCAACCTTCCATGTGCTGTTGCCAGAGACAGCCTTC
ATCAGCGGCTTCATAATGACGTTGGGCGGGAAATCGTATAAGGCTTACGTAAAAGAAAAA
AATGAAGCGAAACAAATTTTCAACGAAGCTGTCTCTCACGGGACTGGGGCGGCCCACATC
GCGGCCAAAGCTCGTGATTCAAACCATTTCACAGTATCAGTGAATGTGGAGCCGAAGAGT
GTTGCTATATTCAATCTGACCTATGAAGAGTTATTGGTGCGTCGCAACGGCGTTTACAAC
CACGCAATCAACCTTCACCCGGGAACCTTAGTACCCAAGCTGGAGGTGGTGGTACACATC
AAGGAGTCCCAGAAGATCACGACGCTCCGAGTGCCTGAGGTCAGGACTGGCAATGAAATC
GATGCTACAGAAAACGACGCACAAAATTCAAAGGCTGTCCAAACTAGAAATGGCGACAAG
GAAGCTACCATTACATTCACGCCCGACTTGGACGAACAGATGAACCTTATTAAGATATAT
AAGGACAAAACAAAAGATACCGTGGCACATCATTATTGGGACAACAATGAGGAAGAAGAC
AACAGAGACGGAGTTTTGGGACAATTTGTTGTTCAATACGACGTGGAACGTTCGAACGAT
GGAGAAGTCTTGGTGAATGATGGATATTTTGTGCACTTCCTGGCACCCAGCTCGTTGCCA
CCACTCAACAAGTACGTGGTATTTGTGCTGGACACTTCCAGCTCTATGATCGGTCGCAAG
GTGGAACAATTGATTGCAGCTATGGACGCCATACTGTCCGACCTCAACCCGAAAAATTCG
AAGGCTGTCCAAACTAGAAATGGCGACAAGGAAGCTACCATTACATTCACGCCCGACTTG
GACGAACAGATGAACCTTATTAAGATATATAAGGAAAAAACAAAAGATACCGTGACACAT
CATTATTGGGACAACAATGAGGAAGAAGACAACAGAGACGGAGTTTTGGGACAATTTGTT
GTTCAATACGACGTGGAACGTTCTAACGATGGAGAAGTCTTGGTGAATGATGGATATTTT
GTGCACTTCCTGGCACCCAGCTCGTTGCCACCACTCAACAAGTACGTGGTATTTGTGCTG
GACACATCCAGCTCTATGATCGGTCGCAAGGTGGAACAATTGATCGCAGCTATGGACGCC
ATACTGTCGGACCTCAACCCGAGTGATTACTTCAGCATTGTTGAATTTAACTCCGACTAC
TCGGTCCATGAGCTGAAAGAAGCGGATGAGCCTCAACCTGAACCTCAAAAGTTTTCTTGG
TATGGATCAACGTCATCATCAAACAAGGAACTTGTCTCACCATCACTTGCTTCACCTGAG
AACATCGCTAAGGCCAAGGTTATCATTTCCAGATTACGGGCTAATGGAGGAACCAATATC
CACAGCGCTTTGAGCGTAGCTATGGATCTTATTCATAAGTTCTCTGGAAAGCACGATATT
TCTTCTGAAAAATCGAATTCAAGTGACGCTGCAAACGAAAAAGCGATAGCAAATGCTAAC
GACTTGAAAACCAAACCAGTCCATGAATTGGAGCCCATCATTATTTTCCTGACGGACGGC
GACCCGACCGTCGGAGAGACCAGCACCTCGCGTATCATCTCACACGTCACCGAGAAGAAC
TCCGGAGAAATGAGGGCTTCCTTGTTCTCACTTGCTTTCGGTGAGGATGCGGATCGCAAC
TTCTTGAGAAAGCTATCACTGCGTAACGAAGGCTTCATGCGGCACATCTACGAGGCGGCG
GATGCGGCGCTTCAGCTGAGAGACTTCTACAAACAGGTCTCCTCTCCACTGCTGGCTCAC
GTCAAGTTCACATACCCACGGGAACAGATAAAAGAGGGTTCAGTTAGTAAGAACAAGTTC
CGCACCGTGTACGCGGGTTCAGAGGTAGTAGTGGCTGGGGAGCTCTCTGACGACGACGTT
GATTTGAGACCTGTCGTTAGTGGCTTCTGCGGGAACCAAAATGGAAAATTGATTCCATAT
GAAAATGATCAGTCCAAGATCAAAGTCACTCGCGTGAAGGAGTTCTTACCTCTGGAGCGC
CTGTGGGCGTACCTGAGTATCCATCAGCTATTGGACCAACGTGACGCCTCCGAAGATACA
GCCGCCAAAGAGCATGAGAAGAAAGCACTCAATTTAGCGCTGAAGTACTCGTTCGTGACT
CCCCTAACGTCGTTGGTGGTGGTAAAGCCGAACGAAACGAACGCCGTGGACGCTGAATCT
GTAGACAAAAATAACAACAATCAATTCTCAGGTATTACACCACTGTCGTTTAATGCAATG
CCTCAAGCGCCTTTAAGTCATCATTTATTGATAGCACCACCAGCGTACAGACCCATGGTT
ATGGGTGGGAATGGAGACGCACTCGCGTTGGTAGGAGGTTTCCATGCTCAAGTAGAAGAC
GAGGAGGTCGACGAAAAATATGACGACATTGGCCAGATCAGTCTCAACAGAGCTGGTTAC
AGATTCGATTCAGACGAGGACGATTATGATGGCTTTATAGGTTCAAGTTCATTTATTACA
ACACCAGCACCAGTGCAGGACTTTTTTGAAACTGTCGCTACCGAAGTCCCAGATCAGGAC
AAATACCATTTAGAGAACTACATGTGGGCTTTAGCTTTAGTGAACAACACCGCTGACGCC
CTCGTGTTTATGGATAATGGAACCGAAATCGTTTTACAGCTCTCTAAAGATAGTAATGCT
CCTCGTGGTAGCTCTGAGGAGTCCTGCACGAACGTGCCCGTTGACGCGGCGAGCCCTGCT
TCGGGCCCTGAACCCGTGAAGGCCTCCTGTGTCTATATCACTCGCTGTTCCGCAGCCAGG
AACATCACCGAAGATGACTATCGCAGATCATACTGTCGCGTTGACAACAAGTGA

Protein sequence:

MKKSWIYLFYIVFIVAKAQTASISSTETLVVAKTDDEASTAAPSEPVTDEPNAPIKVTEM
RVNSEVTMRYAHTAVVTHVRNPASKAQEATFHVLLPETAFISGFIMTLGGKSYKAYVKEK
NEAKQIFNEAVSHGTGAAHIAAKARDSNHFTVSVNVEPKSVAIFNLTYEELLVRRNGVYN
HAINLHPGTLVPKLEVVVHIKESQKITTLRVPEVRTGNEIDATENDAQNSKAVQTRNGDK
EATITFTPDLDEQMNLIKIYKDKTKDTVAHHYWDNNEEEDNRDGVLGQFVVQYDVERSND
GEVLVNDGYFVHFLAPSSLPPLNKYVVFVLDTSSSMIGRKVEQLIAAMDAILSDLNPKNS
KAVQTRNGDKEATITFTPDLDEQMNLIKIYKEKTKDTVTHHYWDNNEEEDNRDGVLGQFV
VQYDVERSNDGEVLVNDGYFVHFLAPSSLPPLNKYVVFVLDTSSSMIGRKVEQLIAAMDA
ILSDLNPSDYFSIVEFNSDYSVHELKEADEPQPEPQKFSWYGSTSSSNKELVSPSLASPE
NIAKAKVIISRLRANGGTNIHSALSVAMDLIHKFSGKHDISSEKSNSSDAANEKAIANAN
DLKTKPVHELEPIIIFLTDGDPTVGETSTSRIISHVTEKNSGEMRASLFSLAFGEDADRN
FLRKLSLRNEGFMRHIYEAADAALQLRDFYKQVSSPLLAHVKFTYPREQIKEGSVSKNKF
RTVYAGSEVVVAGELSDDDVDLRPVVSGFCGNQNGKLIPYENDQSKIKVTRVKEFLPLER
LWAYLSIHQLLDQRDASEDTAAKEHEKKALNLALKYSFVTPLTSLVVVKPNETNAVDAES
VDKNNNNQFSGITPLSFNAMPQAPLSHHLLIAPPAYRPMVMGGNGDALALVGGFHAQVED
EEVDEKYDDIGQISLNRAGYRFDSDEDDYDGFIGSSSFITTPAPVQDFFETVATEVPDQD
KYHLENYMWALALVNNTADALVFMDNGTEIVLQLSKDSNAPRGSSEESCTNVPVDAASPA
SGPEPVKASCVYITRCSAARNITEDDYRRSYCRVDNK