DPGLEAN06125 in OGS1.0

New model in OGS2.0  DPOGS206012
Genomic Position  scaffold632:- 52-18777
CDS Length  4134
Paired RNAseq reads  378
Single RNAseq reads  939
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012631 (0.0)
Best Drosophila hit  thiolester containing protein II, isoform C (5e-40)
Best Human hit  alpha-2-macroglobulin-like protein 1 precursor (1e-47)
Best NR hit (blastp)  alpha2 macroglobulin isoform 2 [Fenneropenaeus chinensis] (1e-83)
Best NR hit (blastx)  alpha2 macroglobulin isoform 2 [Fenneropenaeus chinensis] (1e-73)
GeneOntology terms
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
IPR008930 Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid
IPR011626 A-macroglobulin complement component
IPR001599 Alpha-2-macroglobulin
IPR002890 Alpha-2-macroglobulin, N-terminal
IPR011625 Alpha-2-macroglobulin, N-terminal 2
IPR019565 Alpha-2-macroglobulin, thiol-ester bond-forming
IPR019742 Alpha-2-macroglobulin, conserved site
Orthology group  MCL11614

Nucleotide sequence:

ATGAAGTGGTTGTTGTTACTGTTCGTGTTACCGGCACAATTGACTGGGACGCCGCAGAAT
TTGACAAGCTCGCCATGCACCGACAGAAACCACATATTCCTGATGCCTGGCGCACTCACG
GCTGGTGGTTCAAGTCGGGCCTGTGTCTCCCGGTTCTACACAGAGGGTTCTGCTCAAATG
ACACTCACATTGAATGTTGATGGAGAAACTGTGACTTCGAGAGACGGTGGTTGTCTGGAT
ATTTCAGTTCCACAGCGACCAAATTCCAAAGCCGATGTCTATATTAATATAAGATATCCT
CAATGCGTGTGGGAGCGACATATGAAAGTTCGTGTTTCAACCGGCCGCGTGGTGATAGTA
CACACGGAGCGAGCTCGGTATAGACCTGGAGACTTGGTGAGGGTTAGAGCTCTCGTCGTT
AAAGCTGATCTGACGCCTGCACATACTGCTTATTTCGTTGTATTCGTCAGATATCCTCAA
TGCGTGTGGGAGCGACATATGAGAGTTCGTGTCTCAACCGGCCGCGTGGTGATAGTACAT
ACGGAGCGAGCTCGGTATAGACCTGGAGACTTGGTGAGGGTTAGAGCTCTCGTCGTTAAA
GCTGATCTGACGCCCGCACATACTGCTATTGACGAAATATGGCTGGAGGGACCTGGGGGG
TGGGGGGTTAAGACAGCACAGTGGCTCAAACTACGACCTCGACTGGGACTGGTCCAGGTT
CAACATCAGTTGGACGACAGCGCTCCGCCAGGGAAATGGAGAGTGAGGACTCGCCTTGCT
GATGGAGCTCAGGGTTCCTCTTCCTTTTTGGTGGGTAACTACGAGCTACCACCATTTCAA
CTTACAGTTCGCCACTCTCCAAGAATATTAAGGACCAGCGAGAGACTTGTGTGGACCGTG
TGTGTGAGGTATCCCTGGTCTGAAGCTGTGGAAGGCATGTTGGTGATACGTCTCCGCGGT
GCGGGTGGTGGAGATGGTGCTGGAATCAGAACTGCTGTCCGTCTGAAGGCGCCTCGAGCA
TGCCACAGACACGCCGCAGCTGCCAAACGAATAGGACTGAACGGTGACAACCCTCCTGAT
GTGGTTGTCGCCGACTTTTCCTTTGAAGAGGAAGGCACCGGTGTGTGGCAGAATACTACA
GTGGTGTCCCAAGTCGTTGATGAAGCGGTTACTTTGGAGTTCCTCACTAAACATCGCGTC
ATCATATCACCCGGGCTGCCTCATAAGATTAAAATAAAGGCTACTCGTTGGGACAACAAG
CCTGCTGCTGGCGAGCGAGTGAACGTATGCCGGTCTGCATCTTCCATTATTGCATCCGAA
TTTAATTCCACTGAAGCTATTTGTGTGAACGCTACCACAGACGAGAAGGGAATAGCAAGA
GTCATGTTCAATGCCGACAATAGTCCATATTACAACTTACAGGCCAGTATTAACAACACT
CGAACGACCACTCAAGTCCTCGTAGTTCGATCTGGCCGAGCAGCCCTCGGAGCCCTTCGC
TCAGAACAGCACGGTGCTGCCTTATTGCCGCTGTACATAGACCTGAAAGTCGTCGCACCG
CTTACGGTGCATTTCGTGGTCATCACTCGCGGCGGCATCATCTTTCGATGGGGAGCGACA
ACTCAATGTCCCATAACAAGTCCAACAGACAAAATAATAACGTCCCCGAGAAATAGCATC
TGCCCTAACACAAACCCATATTCAATTGATAAGATCCTTAATAACAACCTGGAGTCCAAC
TCCACCGAACTGGAGACGCTTCTAGATAACTATTTATCTAAAGTAATGTTACCAATAAAA
GTCAGCCCTCAAATGTGTCCGGAAAGTCATCTCCTAGCGTACTTCTACCACAATGATGAA
CTTATCACGGCCAGCAAACATTTTGAAATGGAAGACTGCTTCGTCAGAAAGGTGGATGCT
TCGTGGTCACCACGTTTGGTTGCACCTGGTTCTCTGGCAACGCTGCAACTCACCACTCCT
GGACCAGCATTATGTGCTCTTACAGTTTTGGATACGGCTTCTAAATGGATTCAATACGAA
AATATCAGGGAGCTGGTGATGAACGGCCTGCGAACGCTCATGGACAGTCACAGGAATTTG
ACAGAACATGATGCTGCGTGGGAGTGCTTTCTAACATCGGAGAGCCCGGTTTTGTCTACA
AGTCGCGACCTCTTGAGCTGGTGGCTGGCCTCGGCAGGAGTGAGACTCCTGGGAGACCAT
CCATCATCCTGCGAAGCCCCAGAGCTGATGATAGATGACGTATTACCTAGAAGTGATTTC
TCAGAGTCGTGGTTATGGAAGCTGGCTCCAGTCAGCTCGCGAGGTTCCTTATCTGTGACG
TCCCGCGCACCTCATACAGTAACGAGATATGAAGCGACTGCTCTCTGTGTGTCTCGCGCT
GGTCTAGCCATTTCTTCACCCGCCGTGCTTCAGGTGTTCCGCGAGGTGTTTATTCACGCG
AGCGGTCCTCGCCGCGTGCGTCGCGGTGACGCCATTCTTGTTCCATACAGAGTTTTTAAT
TATCTGTACACACCACACGCAGTGGAAATCATCATAACGACGAATCACGTTGTGGATGGA
TTAACTCGTGAGGTTGTATGTCTGTCTGCTCGCACGTCCACCGCGCGCCGTATGATGGTC
ACGTGTCAAGACTCGGACCTCCTCAGCATCAGAGCGACAGGAGTGAAAGATGCCAACTGC
AGTACGGACTATAGAGAGTTCAGCGATGAAGTTGTAATCCACATCCAGGTTGATCCCGAG
GGTGTTCCAGTGCGGGAGATGAAGTCTGCACTGTTATGCGGAGTTGATAGTGTGAACTTC
ACGAGTTCGTCGGAAGTGACGTGGGACTGGTCATCGGAGAGAGCGCTGCCAGGCACCGAG
TCCCTGACCGTATGGACAACCACAGACCTCATGGGACCTCTACTGGCTCATGCAGATGGA
TTAGTGGATCTCCCGCGAGGCTGCGGCGAACAGAATATGGCCCGCCTCGCTACCGCTCTG
CTCGCATTGCGGCTTTTGGAACCTCACTCACCCGCCGCAGACGACGCGAAGGATCAAGTA
GCTAGAGGGTTCACCCGTCAGCTACAGTACGCTCACGTGGGTGGGGGGTTCAGCGCCTTT
GGTAAGAACGACCCTACTCCCTCCACCTGGCTCACAGCATTCAGCTTGAGATACATGAGG
AAGGCGTATGAGGTCATATCAGGTTCCGGTCCGCTGCCTCCAGTGTTAGAATTGTCTCGA
GACTGGTTGCTCAACCAGCAACTCGAAAATGGTTGTTTCAGTAACACTGGACACGTCTTC
CATCATCTACTCAAGGGCGGTCTAGACGAGGATGGAGAAATAGCCAATGTGGCCCTCACA
GCCTACGTCATCGCCTCACTCACAGAAACCTCCCTCCCTTACAAGATCCTCAACAATTCC
CTCCCATGTCTTCGCGCCCTGGTTCCCATGAGGACCAAAACTAATTCAAGAGTATACGCA
CAAGCCTTGATAACTTACGCGTTCATGAAGTTGAGGAAATATGAGGAATTAGGTAATGAT
ACCTTGATGGGGAGTTTGGAGGAAGATTACTTGAGAGAACTGATCGAGTTACTAAGGATC
GCTAAGAGGAGTGGAGATTTTGTATGGTGGGAAACTGGTAACCTGGCTACATCCATCGAG
TCCACGGGATACGCTCTCCTGTCTCTATCTGAGTGTCCGCCGAGGAGAGGCTGCGAGGTA
GCCGCGGCCGGTTCGCTGAGATGGCTGGCGGCCCACAGAGGCACTTCAGGGGGATTCCTC
TCAACTCAGGACACGTTAGTGGCCCTGGAGGGTATGTCTCGTCTGTCGCCGCTACCTGCT
GGAGGGCTGGTGACACTACAGTCTGGAGACGACACGAGGATCGTGACTCCTACAGCCGTC
CCCGAGCTGGTGACGATGAAGGTGGACCAGCTGAGAGTCACCGTTGAAGGCCCCGGATGT
GCTCTAGTTCAGGCCACTCACAGTTACAACACGCTTGAGCCACACGAGTACCAACTCGAG
CCGAGCTCTCTCTCCGTACACACGAACGTTCAAACCGACGGTCCCTTCGATTGTGTCAAC
GACGTCTGCTTCTGTGCCGCCATAGTTAAGGTACTTTCTATTTTATATTTGTAA

Protein sequence:

MKWLLLLFVLPAQLTGTPQNLTSSPCTDRNHIFLMPGALTAGGSSRACVSRFYTEGSAQM
TLTLNVDGETVTSRDGGCLDISVPQRPNSKADVYINIRYPQCVWERHMKVRVSTGRVVIV
HTERARYRPGDLVRVRALVVKADLTPAHTAYFVVFVRYPQCVWERHMRVRVSTGRVVIVH
TERARYRPGDLVRVRALVVKADLTPAHTAIDEIWLEGPGGWGVKTAQWLKLRPRLGLVQV
QHQLDDSAPPGKWRVRTRLADGAQGSSSFLVGNYELPPFQLTVRHSPRILRTSERLVWTV
CVRYPWSEAVEGMLVIRLRGAGGGDGAGIRTAVRLKAPRACHRHAAAAKRIGLNGDNPPD
VVVADFSFEEEGTGVWQNTTVVSQVVDEAVTLEFLTKHRVIISPGLPHKIKIKATRWDNK
PAAGERVNVCRSASSIIASEFNSTEAICVNATTDEKGIARVMFNADNSPYYNLQASINNT
RTTTQVLVVRSGRAALGALRSEQHGAALLPLYIDLKVVAPLTVHFVVITRGGIIFRWGAT
TQCPITSPTDKIITSPRNSICPNTNPYSIDKILNNNLESNSTELETLLDNYLSKVMLPIK
VSPQMCPESHLLAYFYHNDELITASKHFEMEDCFVRKVDASWSPRLVAPGSLATLQLTTP
GPALCALTVLDTASKWIQYENIRELVMNGLRTLMDSHRNLTEHDAAWECFLTSESPVLST
SRDLLSWWLASAGVRLLGDHPSSCEAPELMIDDVLPRSDFSESWLWKLAPVSSRGSLSVT
SRAPHTVTRYEATALCVSRAGLAISSPAVLQVFREVFIHASGPRRVRRGDAILVPYRVFN
YLYTPHAVEIIITTNHVVDGLTREVVCLSARTSTARRMMVTCQDSDLLSIRATGVKDANC
STDYREFSDEVVIHIQVDPEGVPVREMKSALLCGVDSVNFTSSSEVTWDWSSERALPGTE
SLTVWTTTDLMGPLLAHADGLVDLPRGCGEQNMARLATALLALRLLEPHSPAADDAKDQV
ARGFTRQLQYAHVGGGFSAFGKNDPTPSTWLTAFSLRYMRKAYEVISGSGPLPPVLELSR
DWLLNQQLENGCFSNTGHVFHHLLKGGLDEDGEIANVALTAYVIASLTETSLPYKILNNS
LPCLRALVPMRTKTNSRVYAQALITYAFMKLRKYEELGNDTLMGSLEEDYLRELIELLRI
AKRSGDFVWWETGNLATSIESTGYALLSLSECPPRRGCEVAAAGSLRWLAAHRGTSGGFL
STQDTLVALEGMSRLSPLPAGGLVTLQSGDDTRIVTPTAVPELVTMKVDQLRVTVEGPGC
ALVQATHSYNTLEPHEYQLEPSSLSVHTNVQTDGPFDCVNDVCFCAAIVKVLSILYL
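
The relationship between the two listings above can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE`, `translate`, `FIRST_LINE` and `LAST_LINE` are illustrative and not part of the record. It translates the first and last lines of the nucleotide listing (each line starts in frame, since the preceding lines hold a multiple of three bases) and derives the expected protein length from the stated CDS length of 4134.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing, copied from the record.
FIRST_LINE = "ATGAAGTGGTTGTTGTTACTGTTCGTGTTACCGGCACAATTGACTGGGACGCCGCAGAAT"
LAST_LINE = "GACGTCTGCTTCTGTGCCGCCATAGTTAAGGTACTTTCTATTTTATATTTGTAA"

CDS_LENGTH = 4134                      # "CDS Length" field of the record
PROTEIN_LENGTH = CDS_LENGTH // 3 - 1   # codons minus the terminal stop codon

print(translate(FIRST_LINE))   # MKWLLLLFVLPAQLTGTPQN
print(translate(LAST_LINE))    # DVCFCAAIVKVLSILYL*
print(PROTEIN_LENGTH)          # 1377
```

Passing the full nucleotide listing to `translate` and dropping the trailing `*` should give a string that can be compared directly with the protein listing.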