DPGLEAN03069 in OGS1.0

New model in OGS2.0  DPOGS215836
Genomic Position  scaffold815:+ 133102-145917
CDS Length  4299
Paired RNAseq reads  4075
Single RNAseq reads  10730
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013565 (0.0)
Best Drosophila hit  thiolester containing protein III (2e-98)
Best Human hit  CD109 antigen isoform 1 precursor (7e-48)
Best NR hit (blastp)  PREDICTED: similar to tep3 [Tribolium castaneum] (1e-165)
Best NR hit (blastx)  PREDICTED: similar to tep3 [Tribolium castaneum] (1e-142)
GeneOntology terms
GO:0004866 endopeptidase inhibitor activity
GO:0005515 protein binding
GO:0005615 extracellular space
GO:0050830 defense response to Gram-positive bacterium
GO:0006911 phagocytosis, engulfment
InterPro families
IPR011626 A-macroglobulin complement component
IPR002890 Alpha-2-macroglobulin, N-terminal
IPR011625 Alpha-2-macroglobulin, N-terminal 2
IPR001599 Alpha-2-macroglobulin
IPR009048 Alpha-macroglobulin, receptor-binding
IPR008930 Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid
Orthology group  MCL10147

Nucleotide sequence:

ATGGAACAAATAAAACATGAACGAGTACCGTTCATAATAATAGCCGCTGATGGTTACCTT
TTGTACTGCATCGTGCAGAACTTGTTCTCAGTTACCAAAATGAGGTACAACGCAGCTGTT
TTTACCGTCCTGGCAGTTTTAGTTGCTAAAAACAATATACAATGCGTATCAGTATTAGGA
CCGAAGGTCTTAAGGCCGTATGGGAATTACAAAGTATCTATCGCTGGAGGTGACAAAGCG
CATAATCTATATGTGGCTATAGAGGGCAGGAAGACAACTGGGGAACAGTTCTCACAGGGG
CGAGTAGTGCAAGTGGCACCTGCTTCTTCTAGACTTATAGAACTCGATACCGATAAAGGT
GTATACCAACCAGGTGACACCATTAACTTCAGAGTAATCGCTTTGGACAAGTATCTGTTG
CCTCTCTCTGGGACGGTGGATGTGAGTGTGTTGGATACCAAGGGCTCACCAGTGAGGCAA
TGGGCTTCCGTCAACCTCGATAAAGGATTGTTTTCTAACGAGCTTCTGTTAGCTGATGAA
CCCGCTTTAGGACAGTGGACTATACAAGCAGAGGTCAAGGGGCAGAAATATTCGAAACAT
CTGATGGTGGCAGATTATGTGCTACCTAAGTTCCAGATGCATATGAAAGTACCAAAAGAG
GTTCTGTTTAGCGAAGGAAGATTTAATATTAATGTTACAGCCAGACATTTTAATGGCCTA
CCTGTAAAAGGTGAATTAACAATATCCGCATACGCTGTGTTCTTCTCGGGACTACTTCAA
CCGGTATTTTCATCTCCCGCCCGTAAAGTCATTGAGTTTAACGGCCAAGCGGAAGTTTTG
TATGACCTTAAAACAGACTTAGATCTGGCTGAAGATGCAGCCAGACCGTTAGTAGTTGAA
GCTGTGATAGAAGAAAAAAATACACTGATACGACAGAATATTACCACTAGAATACTTCTT
TTGCGAAGACCCTATAGACTTCAAGTTACTGCTCCTGAGAGGTTTAAACCTAGATTACCT
TATATTGTCCAGATACAATTAGTTAATTCTACTGGTGATACGTTACCTGTATCTGATGAT
GTAGTCGTTGAAAGACTTTGGGATGATGGTGCACCTGTTAACAAAACAACTATTAAACTT
AACAAAGGTTTTGGAATTTACACCTACACTCCAGATGTTGCGCACACAAATTCTACTCTT
AATTTAGTGATCAAATACAAGGAAGTATCAGAAAGAATAGTTAACGTCCAGAAGAGCTTG
GAGACTGGTGATCAATACATGACTCTGGAACTGTTAACACGAAATATGTCTATCGGTGAT
GAGATGCGTGGGAGAGCCACCTCCACGGAACCTATGGATCTGGTGCATTATGCGGTCATC
GGAAGAGGGGACATTCTTGTTGCTAAGACATTAGAATTAAGCCCCCCTCGTACCAGCGTG
GATATCTCAGTACCGGTAACAAGTGGTATGTCTCCGGGCTGCTCGCTAATAGCTTGGAGC
CCCCGATTAACAGGATCTATACTGGCTGCAGCTTTACTGGTTCCACAAAAAGACTTAATG
CAACATAAGGTGTCAGTAACATCAGTATCGCCAGGAACATCACTACGTCCTAATGGCCTG
GTGGAGTTTCGAGTGCTCGGTGAGGCGGGAGCTCAGGCTGGTCTACTTGGAGGAGATCAA
CACGCCATTACTAACGGACTCGCTGGAACCAATGGCCTGGGTAGCGGACTGGATTTACAC
ACGATCGAACGAGAAGTTGAAAGCTTCATTGGCATAAAAAGATCATATTTCAAAAATGAT
GACGGAATTCCAATTTTGGGAATAGACTTAGGTGGACGTAACTCTACCGATGTGTTTAGT
AATGCTGGAATGGTTCTTCTGACAGATGGTGTTGTAGTATCAAACAGTATGAAGGACGAA
ACAGAGAAACATGAGACAGGCACCCGCCCACCAACAGCAGGTCCTTACGCGTTCAGTAGA
GTGCCAACGCCGCCATCGCCAAGACAATACTTGACTGAGACACTTTCACCACTTTCCACT
TGGATGTTTACTAATATAACTATTGGTTCCGACGGCGTTGGTACACGACAGCGTTGGTCC
CCAATAACTCCTGGTGAATGGTCGGTCGGAGCATTTGCGATTCATCCAACACTGGGTCTT
GGTCTTGCGGCACCTCGCAAATTTAACACTGCCCTTCCTCTATCCCTCACAGCCGAACTT
CCCGCAAGTCTTCAAAGAGGAGAAACAATAGCTGTGATTGTGACCTTAAAAAGTTCTCTT
ACAGTTGATACACCAGTAGAAGTCACATTCCACAACTCCGATCAGTACTACGAATTCGAA
CCTCTAGAAAATAATATTGACTCGACAAAAAAGATTGAATTGTTCCGTCGAGTAAGCGTA
ACCGTGCCAGCTCGCGGGTCCGTCAGTACGGCGTTCCTCGTGAGCGCTCGTCGCGTCGGT
GACTCACCCATCATTGTGGAAGCCAACGGCAATGGAGTCTCCGCTTCACTCTTCCGCACC
ATTGACGTTCAGGACGGATACATTGAAGATGTCTGGTCTTGGGCAATATTAGACGGTCGT
CGAGGCGTTGCTCGCGCTAATATCACTCTTGAACCAGCAGCCGGGACTAAGCTCGGAGCA
GTTTCTTTGGAAGCTACTGGGGACTTATTGGCAAATGCATTTAGGGCCATTAAAGCGCCG
CCTATATCAGCCGCTGACCCTAATTATGCGCTAAGACCATTGGCGAGAGCTTGCGTATTG
TTGGACTATTTGCAAGCCACAGATCAAGACGATGAAATCACTATAGTAAAAGAGGCTCGA
TCACAAGCAGCTACCGGCTACCAACGACTTATGGCATTCAGACGACCAGACGGGTCGTTC
GTTCAGGAAATTGGTGAAGAATCTGAACCAGATGTCTGGATGACAGCATTATCAGCTCGA
TGGCTAAGCCGTTCCTCGCGCTATGTTGAAGTGTCTCCTGAAGCTGCAACATCCGCGGCA
CGCTGGCTGGTGGCAGCTCAAAGAAGTGACGGTAGCTGGCAACCTTCGGCATCACCTGAC
GACCCGCTGGGTCGGGAAGCCTTGCCACTCACGGCCCAAGCTTTACTAGCACTATTAGAG
ACTAAGGCCAGCGACCCGTTGTACAAAAACGCTATGAATAAAGCTTTGGATTACCTAGCC
GATAAAGTCTCTGAGTCACTCGAGGCACCGACACTGGCGTTAGTGGGAGCCGCTCTGGCC
GCCGCAAGACATCCTCGTGCTGCGCTAGCTCTGAAAGCCCTGGAAACACATGCACACAGT
GACAGAGGTACCAATCTCTACTGGCCTCGAAAATTATCAAAATCGGAGTTACGGAACCCC
TGGCTGAAGGGTAATTCTCTTGAGGCTTCGACTGCAGCTTGGGGTCTACGCGCTATGTTG
GCTTCCAGTCTGATAGATGAATCTGTACCTGTTGCGCGATACCTTATACAAGCACTAGGA
CCTAGAGACCACGACCCGGATGTGTTAGACGCTTTGGCCTTGTTTGCGCACATGATTAGA
ACGACGACCAAACTGAGGGTATCTGTAAATGTCACCGGTTTCGAGGAACCGCGCCAGTTC
AACATCGACAGCGACAATTCACTGATCTTACAAACACAACTGGTACGCAATGCTCGTAAT
GCGAGTGCAGTGACCGAGGGTCGGGGTATGGCCGTGGTGGGTCTAGCGGCTCGTGGCAGT
ACTAACGTGACGGGTGCCTGGCCTCGTTACACGCTCGACCCACGCGTGGATCAGGTCTCT
ACCAGAGACCGACTTCAGCTGTCTGTATGCATCGGATTTGTTCCTGCTGGCAATGAAACA
GAAAGCGGACTGGCTCTTCTAATTGTGCAATTACCGTCGGGATATTTGGCTGACATAAAT
ACTATAACAGAGCTAACGTCGGCGCGTCATGTTGTGGGTGCTCGAGTGGTGCACGGTGGA
TCCCGCGTGGTATCATGGGTGCGACCCTCAGTACACGAGCGCTGCGCCACCCTCGGAGCT
CCACGCGCTCTACCCGTCGCAAGACAGAGGCCTGGATATGTCACCATAGTGGATCTTTAT
GACTCTAGTCACCGAGCGCGTGTCTTTTACCAAGCTGTCCCAAGTACCGCGTGCGACATT
TGTCGCTCGTGGCCCTCATGTGAGCGCGCTTGTGGTTCCGCAGCGGAACAGCGTGCTTCC
CCCACCACCCCCGCCGCCACACGTAACCCCAACAGTGCATCTGTCCCGCTCGCACAAACT
GTGCTCTGTCTCGCTTTGGCATTGTTAGTCAGTATATAA

Protein sequence:

MEQIKHERVPFIIIAADGYLLYCIVQNLFSVTKMRYNAAVFTVLAVLVAKNNIQCVSVLG
PKVLRPYGNYKVSIAGGDKAHNLYVAIEGRKTTGEQFSQGRVVQVAPASSRLIELDTDKG
VYQPGDTINFRVIALDKYLLPLSGTVDVSVLDTKGSPVRQWASVNLDKGLFSNELLLADE
PALGQWTIQAEVKGQKYSKHLMVADYVLPKFQMHMKVPKEVLFSEGRFNINVTARHFNGL
PVKGELTISAYAVFFSGLLQPVFSSPARKVIEFNGQAEVLYDLKTDLDLAEDAARPLVVE
AVIEEKNTLIRQNITTRILLLRRPYRLQVTAPERFKPRLPYIVQIQLVNSTGDTLPVSDD
VVVERLWDDGAPVNKTTIKLNKGFGIYTYTPDVAHTNSTLNLVIKYKEVSERIVNVQKSL
ETGDQYMTLELLTRNMSIGDEMRGRATSTEPMDLVHYAVIGRGDILVAKTLELSPPRTSV
DISVPVTSGMSPGCSLIAWSPRLTGSILAAALLVPQKDLMQHKVSVTSVSPGTSLRPNGL
VEFRVLGEAGAQAGLLGGDQHAITNGLAGTNGLGSGLDLHTIEREVESFIGIKRSYFKND
DGIPILGIDLGGRNSTDVFSNAGMVLLTDGVVVSNSMKDETEKHETGTRPPTAGPYAFSR
VPTPPSPRQYLTETLSPLSTWMFTNITIGSDGVGTRQRWSPITPGEWSVGAFAIHPTLGL
GLAAPRKFNTALPLSLTAELPASLQRGETIAVIVTLKSSLTVDTPVEVTFHNSDQYYEFE
PLENNIDSTKKIELFRRVSVTVPARGSVSTAFLVSARRVGDSPIIVEANGNGVSASLFRT
IDVQDGYIEDVWSWAILDGRRGVARANITLEPAAGTKLGAVSLEATGDLLANAFRAIKAP
PISAADPNYALRPLARACVLLDYLQATDQDDEITIVKEARSQAATGYQRLMAFRRPDGSF
VQEIGEESEPDVWMTALSARWLSRSSRYVEVSPEAATSAARWLVAAQRSDGSWQPSASPD
DPLGREALPLTAQALLALLETKASDPLYKNAMNKALDYLADKVSESLEAPTLALVGAALA
AARHPRAALALKALETHAHSDRGTNLYWPRKLSKSELRNPWLKGNSLEASTAAWGLRAML
ASSLIDESVPVARYLIQALGPRDHDPDVLDALALFAHMIRTTTKLRVSVNVTGFEEPRQF
NIDSDNSLILQTQLVRNARNASAVTEGRGMAVVGLAARGSTNVTGAWPRYTLDPRVDQVS
TRDRLQLSVCIGFVPAGNETESGLALLIVQLPSGYLADINTITELTSARHVVGARVVHGG
SRVVSWVRPSVHERCATLGAPRALPVARQRPGYVTIVDLYDSSHRARVFYQAVPSTACDI
CRSWPSCERACGSAAEQRASPTTPAATRNPNSASVPLAQTVLCLALALLVSI
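
Consistency check (sketch):

The fields above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration, not part of the database record. It assumes the CDS ends in a single stop codon, that the genomic coordinates are 1-based and inclusive, and that the position string always follows the "scaffold:strand start-end" layout seen here. The function names are hypothetical, and the protein length of 1432 is a count of the residues in the protein sequence block, not a field of the record.

```python
# Values copied from the record above.
CDS_LENGTH = 4299                      # "CDS Length" field
PROTEIN_LENGTH = 1432                  # residues counted in the protein sequence block
GENOMIC_POSITION = "scaffold815:+ 133102-145917"


def parse_position(text):
    """Split 'scaffold:strand start-end' into its parts.

    The layout is assumed from this one record; other records may differ.
    """
    scaffold, rest = text.split(":", 1)
    strand, span = rest.split()
    start, end = (int(x) for x in span.split("-"))
    return scaffold, strand, start, end


def expected_protein_length(cds_length):
    """Residue count for a complete CDS that ends in one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


scaffold, strand, start, end = parse_position(GENOMIC_POSITION)

# Assuming 1-based inclusive coordinates, the gene spans this many bases.
genomic_span = end - start + 1

print(scaffold, strand, genomic_span)
print(expected_protein_length(CDS_LENGTH) == PROTEIN_LENGTH)
```

Under these assumptions the genomic span is larger than the CDS length, which is what one would expect for a gene model that contains introns.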