New model in OGS2.0 | DPOGS215836  |
---|---|
Genomic Position | scaffold815:+ 133102-145917 |
CDS Length | 4299 |
Paired RNAseq reads   | 4075 |
Single RNAseq reads   | 10730 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013565 (0.0) |
Best Drosophila hit   | thiolester containing protein III (2e-98) |
Best Human hit | CD109 antigen isoform 1 precursor (7e-48) |
Best NR hit (blastp)   | PREDICTED: similar to tep3 [Tribolium castaneum] (1e-165) |
Best NR hit (blastx)   | PREDICTED: similar to tep3 [Tribolium castaneum] (1e-142) |
GeneOntology terms    | GO:0004866 endopeptidase inhibitor activity; GO:0005515 protein binding; GO:0005615 extracellular space; GO:0050830 defense response to Gram-positive bacterium; GO:0006911 phagocytosis, engulfment |
InterPro families    | IPR011626 A-macroglobulin complement component; IPR002890 Alpha-2-macroglobulin, N-terminal; IPR011625 Alpha-2-macroglobulin, N-terminal 2; IPR001599 Alpha-2-macroglobulin; IPR009048 Alpha-macroglobulin, receptor-binding; IPR008930 Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid |
Orthology group | MCL10147 |
Nucleotide sequence:
ATGGAACAAATAAAACATGAACGAGTACCGTTCATAATAATAGCCGCTGATGGTTACCTT
TTGTACTGCATCGTGCAGAACTTGTTCTCAGTTACCAAAATGAGGTACAACGCAGCTGTT
TTTACCGTCCTGGCAGTTTTAGTTGCTAAAAACAATATACAATGCGTATCAGTATTAGGA
CCGAAGGTCTTAAGGCCGTATGGGAATTACAAAGTATCTATCGCTGGAGGTGACAAAGCG
CATAATCTATATGTGGCTATAGAGGGCAGGAAGACAACTGGGGAACAGTTCTCACAGGGG
CGAGTAGTGCAAGTGGCACCTGCTTCTTCTAGACTTATAGAACTCGATACCGATAAAGGT
GTATACCAACCAGGTGACACCATTAACTTCAGAGTAATCGCTTTGGACAAGTATCTGTTG
CCTCTCTCTGGGACGGTGGATGTGAGTGTGTTGGATACCAAGGGCTCACCAGTGAGGCAA
TGGGCTTCCGTCAACCTCGATAAAGGATTGTTTTCTAACGAGCTTCTGTTAGCTGATGAA
CCCGCTTTAGGACAGTGGACTATACAAGCAGAGGTCAAGGGGCAGAAATATTCGAAACAT
CTGATGGTGGCAGATTATGTGCTACCTAAGTTCCAGATGCATATGAAAGTACCAAAAGAG
GTTCTGTTTAGCGAAGGAAGATTTAATATTAATGTTACAGCCAGACATTTTAATGGCCTA
CCTGTAAAAGGTGAATTAACAATATCCGCATACGCTGTGTTCTTCTCGGGACTACTTCAA
CCGGTATTTTCATCTCCCGCCCGTAAAGTCATTGAGTTTAACGGCCAAGCGGAAGTTTTG
TATGACCTTAAAACAGACTTAGATCTGGCTGAAGATGCAGCCAGACCGTTAGTAGTTGAA
GCTGTGATAGAAGAAAAAAATACACTGATACGACAGAATATTACCACTAGAATACTTCTT
TTGCGAAGACCCTATAGACTTCAAGTTACTGCTCCTGAGAGGTTTAAACCTAGATTACCT
TATATTGTCCAGATACAATTAGTTAATTCTACTGGTGATACGTTACCTGTATCTGATGAT
GTAGTCGTTGAAAGACTTTGGGATGATGGTGCACCTGTTAACAAAACAACTATTAAACTT
AACAAAGGTTTTGGAATTTACACCTACACTCCAGATGTTGCGCACACAAATTCTACTCTT
AATTTAGTGATCAAATACAAGGAAGTATCAGAAAGAATAGTTAACGTCCAGAAGAGCTTG
GAGACTGGTGATCAATACATGACTCTGGAACTGTTAACACGAAATATGTCTATCGGTGAT
GAGATGCGTGGGAGAGCCACCTCCACGGAACCTATGGATCTGGTGCATTATGCGGTCATC
GGAAGAGGGGACATTCTTGTTGCTAAGACATTAGAATTAAGCCCCCCTCGTACCAGCGTG
GATATCTCAGTACCGGTAACAAGTGGTATGTCTCCGGGCTGCTCGCTAATAGCTTGGAGC
CCCCGATTAACAGGATCTATACTGGCTGCAGCTTTACTGGTTCCACAAAAAGACTTAATG
CAACATAAGGTGTCAGTAACATCAGTATCGCCAGGAACATCACTACGTCCTAATGGCCTG
GTGGAGTTTCGAGTGCTCGGTGAGGCGGGAGCTCAGGCTGGTCTACTTGGAGGAGATCAA
CACGCCATTACTAACGGACTCGCTGGAACCAATGGCCTGGGTAGCGGACTGGATTTACAC
ACGATCGAACGAGAAGTTGAAAGCTTCATTGGCATAAAAAGATCATATTTCAAAAATGAT
GACGGAATTCCAATTTTGGGAATAGACTTAGGTGGACGTAACTCTACCGATGTGTTTAGT
AATGCTGGAATGGTTCTTCTGACAGATGGTGTTGTAGTATCAAACAGTATGAAGGACGAA
ACAGAGAAACATGAGACAGGCACCCGCCCACCAACAGCAGGTCCTTACGCGTTCAGTAGA
GTGCCAACGCCGCCATCGCCAAGACAATACTTGACTGAGACACTTTCACCACTTTCCACT
TGGATGTTTACTAATATAACTATTGGTTCCGACGGCGTTGGTACACGACAGCGTTGGTCC
CCAATAACTCCTGGTGAATGGTCGGTCGGAGCATTTGCGATTCATCCAACACTGGGTCTT
GGTCTTGCGGCACCTCGCAAATTTAACACTGCCCTTCCTCTATCCCTCACAGCCGAACTT
CCCGCAAGTCTTCAAAGAGGAGAAACAATAGCTGTGATTGTGACCTTAAAAAGTTCTCTT
ACAGTTGATACACCAGTAGAAGTCACATTCCACAACTCCGATCAGTACTACGAATTCGAA
CCTCTAGAAAATAATATTGACTCGACAAAAAAGATTGAATTGTTCCGTCGAGTAAGCGTA
ACCGTGCCAGCTCGCGGGTCCGTCAGTACGGCGTTCCTCGTGAGCGCTCGTCGCGTCGGT
GACTCACCCATCATTGTGGAAGCCAACGGCAATGGAGTCTCCGCTTCACTCTTCCGCACC
ATTGACGTTCAGGACGGATACATTGAAGATGTCTGGTCTTGGGCAATATTAGACGGTCGT
CGAGGCGTTGCTCGCGCTAATATCACTCTTGAACCAGCAGCCGGGACTAAGCTCGGAGCA
GTTTCTTTGGAAGCTACTGGGGACTTATTGGCAAATGCATTTAGGGCCATTAAAGCGCCG
CCTATATCAGCCGCTGACCCTAATTATGCGCTAAGACCATTGGCGAGAGCTTGCGTATTG
TTGGACTATTTGCAAGCCACAGATCAAGACGATGAAATCACTATAGTAAAAGAGGCTCGA
TCACAAGCAGCTACCGGCTACCAACGACTTATGGCATTCAGACGACCAGACGGGTCGTTC
GTTCAGGAAATTGGTGAAGAATCTGAACCAGATGTCTGGATGACAGCATTATCAGCTCGA
TGGCTAAGCCGTTCCTCGCGCTATGTTGAAGTGTCTCCTGAAGCTGCAACATCCGCGGCA
CGCTGGCTGGTGGCAGCTCAAAGAAGTGACGGTAGCTGGCAACCTTCGGCATCACCTGAC
GACCCGCTGGGTCGGGAAGCCTTGCCACTCACGGCCCAAGCTTTACTAGCACTATTAGAG
ACTAAGGCCAGCGACCCGTTGTACAAAAACGCTATGAATAAAGCTTTGGATTACCTAGCC
GATAAAGTCTCTGAGTCACTCGAGGCACCGACACTGGCGTTAGTGGGAGCCGCTCTGGCC
GCCGCAAGACATCCTCGTGCTGCGCTAGCTCTGAAAGCCCTGGAAACACATGCACACAGT
GACAGAGGTACCAATCTCTACTGGCCTCGAAAATTATCAAAATCGGAGTTACGGAACCCC
TGGCTGAAGGGTAATTCTCTTGAGGCTTCGACTGCAGCTTGGGGTCTACGCGCTATGTTG
GCTTCCAGTCTGATAGATGAATCTGTACCTGTTGCGCGATACCTTATACAAGCACTAGGA
CCTAGAGACCACGACCCGGATGTGTTAGACGCTTTGGCCTTGTTTGCGCACATGATTAGA
ACGACGACCAAACTGAGGGTATCTGTAAATGTCACCGGTTTCGAGGAACCGCGCCAGTTC
AACATCGACAGCGACAATTCACTGATCTTACAAACACAACTGGTACGCAATGCTCGTAAT
GCGAGTGCAGTGACCGAGGGTCGGGGTATGGCCGTGGTGGGTCTAGCGGCTCGTGGCAGT
ACTAACGTGACGGGTGCCTGGCCTCGTTACACGCTCGACCCACGCGTGGATCAGGTCTCT
ACCAGAGACCGACTTCAGCTGTCTGTATGCATCGGATTTGTTCCTGCTGGCAATGAAACA
GAAAGCGGACTGGCTCTTCTAATTGTGCAATTACCGTCGGGATATTTGGCTGACATAAAT
ACTATAACAGAGCTAACGTCGGCGCGTCATGTTGTGGGTGCTCGAGTGGTGCACGGTGGA
TCCCGCGTGGTATCATGGGTGCGACCCTCAGTACACGAGCGCTGCGCCACCCTCGGAGCT
CCACGCGCTCTACCCGTCGCAAGACAGAGGCCTGGATATGTCACCATAGTGGATCTTTAT
GACTCTAGTCACCGAGCGCGTGTCTTTTACCAAGCTGTCCCAAGTACCGCGTGCGACATT
TGTCGCTCGTGGCCCTCATGTGAGCGCGCTTGTGGTTCCGCAGCGGAACAGCGTGCTTCC
CCCACCACCCCCGCCGCCACACGTAACCCCAACAGTGCATCTGTCCCGCTCGCACAAACT
GTGCTCTGTCTCGCTTTGGCATTGTTAGTCAGTATATAA
Protein sequence:
MEQIKHERVPFIIIAADGYLLYCIVQNLFSVTKMRYNAAVFTVLAVLVAKNNIQCVSVLG
PKVLRPYGNYKVSIAGGDKAHNLYVAIEGRKTTGEQFSQGRVVQVAPASSRLIELDTDKG
VYQPGDTINFRVIALDKYLLPLSGTVDVSVLDTKGSPVRQWASVNLDKGLFSNELLLADE
PALGQWTIQAEVKGQKYSKHLMVADYVLPKFQMHMKVPKEVLFSEGRFNINVTARHFNGL
PVKGELTISAYAVFFSGLLQPVFSSPARKVIEFNGQAEVLYDLKTDLDLAEDAARPLVVE
AVIEEKNTLIRQNITTRILLLRRPYRLQVTAPERFKPRLPYIVQIQLVNSTGDTLPVSDD
VVVERLWDDGAPVNKTTIKLNKGFGIYTYTPDVAHTNSTLNLVIKYKEVSERIVNVQKSL
ETGDQYMTLELLTRNMSIGDEMRGRATSTEPMDLVHYAVIGRGDILVAKTLELSPPRTSV
DISVPVTSGMSPGCSLIAWSPRLTGSILAAALLVPQKDLMQHKVSVTSVSPGTSLRPNGL
VEFRVLGEAGAQAGLLGGDQHAITNGLAGTNGLGSGLDLHTIEREVESFIGIKRSYFKND
DGIPILGIDLGGRNSTDVFSNAGMVLLTDGVVVSNSMKDETEKHETGTRPPTAGPYAFSR
VPTPPSPRQYLTETLSPLSTWMFTNITIGSDGVGTRQRWSPITPGEWSVGAFAIHPTLGL
GLAAPRKFNTALPLSLTAELPASLQRGETIAVIVTLKSSLTVDTPVEVTFHNSDQYYEFE
PLENNIDSTKKIELFRRVSVTVPARGSVSTAFLVSARRVGDSPIIVEANGNGVSASLFRT
IDVQDGYIEDVWSWAILDGRRGVARANITLEPAAGTKLGAVSLEATGDLLANAFRAIKAP
PISAADPNYALRPLARACVLLDYLQATDQDDEITIVKEARSQAATGYQRLMAFRRPDGSF
VQEIGEESEPDVWMTALSARWLSRSSRYVEVSPEAATSAARWLVAAQRSDGSWQPSASPD
DPLGREALPLTAQALLALLETKASDPLYKNAMNKALDYLADKVSESLEAPTLALVGAALA
AARHPRAALALKALETHAHSDRGTNLYWPRKLSKSELRNPWLKGNSLEASTAAWGLRAML
ASSLIDESVPVARYLIQALGPRDHDPDVLDALALFAHMIRTTTKLRVSVNVTGFEEPRQF
NIDSDNSLILQTQLVRNARNASAVTEGRGMAVVGLAARGSTNVTGAWPRYTLDPRVDQVS
TRDRLQLSVCIGFVPAGNETESGLALLIVQLPSGYLADINTITELTSARHVVGARVVHGG
SRVVSWVRPSVHERCATLGAPRALPVARQRPGYVTIVDLYDSSHRARVFYQAVPSTACDI
CRSWPSCERACGSAAEQRASPTTPAATRNPNSASVPLAQTVLCLALALLVSI
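
The figures in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record: the helper names `parse_position` and `codons_in_cds` are hypothetical, and the genomic coordinates are assumed to be 1-based and inclusive. It parses the Genomic Position field and relates the CDS Length to the expected number of protein residues.

```python
def parse_position(position: str):
    """Split a field like 'scaffold815:+ 133102-145917' into its parts.

    Assumes the layout '<scaffold>:<strand> <start>-<end>' as it appears
    in this record.
    """
    location, coords = position.split()
    scaffold, strand = location.split(":")
    start, end = (int(value) for value in coords.split("-"))
    return scaffold, strand, start, end


def codons_in_cds(cds_length: int) -> int:
    """Return the number of codons in a CDS, rejecting partial codons."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3


scaffold, strand, start, end = parse_position("scaffold815:+ 133102-145917")

# Assumption: coordinates are inclusive, so the span is end - start + 1.
genomic_span = end - start + 1

codons = codons_in_cds(4299)   # CDS Length from the table
residues = codons - 1          # the final codon is the stop codon (TAA)

print(scaffold, strand, genomic_span, codons, residues)
```

Under these assumptions the CDS of 4299 nt corresponds to 1433 codons, that is, 1432 residues plus the terminal TAA stop codon, which agrees with the length of the protein listing above. The genomic span is considerably longer than the CDS, which is consistent with a model containing introns.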