New model in OGS2.0 | DPOGS206012  |
---|---|
Genomic Position | scaffold632:- 52-18777 |
See gene structure | |
CDS Length | 4134 |
Paired RNAseq reads   | 378 |
Single RNAseq reads   | 939 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012631 (0.0) |
Best Drosophila hit   | thiolester containing protein II, isoform C (5e-40) |
Best Human hit | alpha-2-macroglobulin-like protein 1 precursor (1e-47) |
Best NR hit (blastp)   | alpha2 macroglobulin isoform 2 [Fenneropenaeus chinensis] (1e-83) |
Best NR hit (blastx)   | alpha2 macroglobulin isoform 2 [Fenneropenaeus chinensis] (1e-73) |
GeneOntology terms    | GO:0005575 cellular_component GO:0003674 molecular_function GO:0008150 biological_process |
InterPro families    | IPR008930 Terpenoid cylases/protein prenyltransferase alpha-alpha toroid IPR011626 A-macroglobulin complement component IPR001599 Alpha-2-macroglobulin IPR002890 Alpha-2-macroglobulin, N-terminal IPR011625 Alpha-2-macroglobulin, N-terminal 2 IPR019565 Alpha-2-macroglobulin, thiol-ester bond-forming IPR019742 Alpha-2-macroglobulin, conserved site |
Orthology group | MCL11614 |
Nucleotide sequence:
ATGAAGTGGTTGTTGTTACTGTTCGTGTTACCGGCACAATTGACTGGGACGCCGCAGAAT
TTGACAAGCTCGCCATGCACCGACAGAAACCACATATTCCTGATGCCTGGCGCACTCACG
GCTGGTGGTTCAAGTCGGGCCTGTGTCTCCCGGTTCTACACAGAGGGTTCTGCTCAAATG
ACACTCACATTGAATGTTGATGGAGAAACTGTGACTTCGAGAGACGGTGGTTGTCTGGAT
ATTTCAGTTCCACAGCGACCAAATTCCAAAGCCGATGTCTATATTAATATAAGATATCCT
CAATGCGTGTGGGAGCGACATATGAAAGTTCGTGTTTCAACCGGCCGCGTGGTGATAGTA
CACACGGAGCGAGCTCGGTATAGACCTGGAGACTTGGTGAGGGTTAGAGCTCTCGTCGTT
AAAGCTGATCTGACGCCTGCACATACTGCTTATTTCGTTGTATTCGTCAGATATCCTCAA
TGCGTGTGGGAGCGACATATGAGAGTTCGTGTCTCAACCGGCCGCGTGGTGATAGTACAT
ACGGAGCGAGCTCGGTATAGACCTGGAGACTTGGTGAGGGTTAGAGCTCTCGTCGTTAAA
GCTGATCTGACGCCCGCACATACTGCTATTGACGAAATATGGCTGGAGGGACCTGGGGGG
TGGGGGGTTAAGACAGCACAGTGGCTCAAACTACGACCTCGACTGGGACTGGTCCAGGTT
CAACATCAGTTGGACGACAGCGCTCCGCCAGGGAAATGGAGAGTGAGGACTCGCCTTGCT
GATGGAGCTCAGGGTTCCTCTTCCTTTTTGGTGGGTAACTACGAGCTACCACCATTTCAA
CTTACAGTTCGCCACTCTCCAAGAATATTAAGGACCAGCGAGAGACTTGTGTGGACCGTG
TGTGTGAGGTATCCCTGGTCTGAAGCTGTGGAAGGCATGTTGGTGATACGTCTCCGCGGT
GCGGGTGGTGGAGATGGTGCTGGAATCAGAACTGCTGTCCGTCTGAAGGCGCCTCGAGCA
TGCCACAGACACGCCGCAGCTGCCAAACGAATAGGACTGAACGGTGACAACCCTCCTGAT
GTGGTTGTCGCCGACTTTTCCTTTGAAGAGGAAGGCACCGGTGTGTGGCAGAATACTACA
GTGGTGTCCCAAGTCGTTGATGAAGCGGTTACTTTGGAGTTCCTCACTAAACATCGCGTC
ATCATATCACCCGGGCTGCCTCATAAGATTAAAATAAAGGCTACTCGTTGGGACAACAAG
CCTGCTGCTGGCGAGCGAGTGAACGTATGCCGGTCTGCATCTTCCATTATTGCATCCGAA
TTTAATTCCACTGAAGCTATTTGTGTGAACGCTACCACAGACGAGAAGGGAATAGCAAGA
GTCATGTTCAATGCCGACAATAGTCCATATTACAACTTACAGGCCAGTATTAACAACACT
CGAACGACCACTCAAGTCCTCGTAGTTCGATCTGGCCGAGCAGCCCTCGGAGCCCTTCGC
TCAGAACAGCACGGTGCTGCCTTATTGCCGCTGTACATAGACCTGAAAGTCGTCGCACCG
CTTACGGTGCATTTCGTGGTCATCACTCGCGGCGGCATCATCTTTCGATGGGGAGCGACA
ACTCAATGTCCCATAACAAGTCCAACAGACAAAATAATAACGTCCCCGAGAAATAGCATC
TGCCCTAACACAAACCCATATTCAATTGATAAGATCCTTAATAACAACCTGGAGTCCAAC
TCCACCGAACTGGAGACGCTTCTAGATAACTATTTATCTAAAGTAATGTTACCAATAAAA
GTCAGCCCTCAAATGTGTCCGGAAAGTCATCTCCTAGCGTACTTCTACCACAATGATGAA
CTTATCACGGCCAGCAAACATTTTGAAATGGAAGACTGCTTCGTCAGAAAGGTGGATGCT
TCGTGGTCACCACGTTTGGTTGCACCTGGTTCTCTGGCAACGCTGCAACTCACCACTCCT
GGACCAGCATTATGTGCTCTTACAGTTTTGGATACGGCTTCTAAATGGATTCAATACGAA
AATATCAGGGAGCTGGTGATGAACGGCCTGCGAACGCTCATGGACAGTCACAGGAATTTG
ACAGAACATGATGCTGCGTGGGAGTGCTTTCTAACATCGGAGAGCCCGGTTTTGTCTACA
AGTCGCGACCTCTTGAGCTGGTGGCTGGCCTCGGCAGGAGTGAGACTCCTGGGAGACCAT
CCATCATCCTGCGAAGCCCCAGAGCTGATGATAGATGACGTATTACCTAGAAGTGATTTC
TCAGAGTCGTGGTTATGGAAGCTGGCTCCAGTCAGCTCGCGAGGTTCCTTATCTGTGACG
TCCCGCGCACCTCATACAGTAACGAGATATGAAGCGACTGCTCTCTGTGTGTCTCGCGCT
GGTCTAGCCATTTCTTCACCCGCCGTGCTTCAGGTGTTCCGCGAGGTGTTTATTCACGCG
AGCGGTCCTCGCCGCGTGCGTCGCGGTGACGCCATTCTTGTTCCATACAGAGTTTTTAAT
TATCTGTACACACCACACGCAGTGGAAATCATCATAACGACGAATCACGTTGTGGATGGA
TTAACTCGTGAGGTTGTATGTCTGTCTGCTCGCACGTCCACCGCGCGCCGTATGATGGTC
ACGTGTCAAGACTCGGACCTCCTCAGCATCAGAGCGACAGGAGTGAAAGATGCCAACTGC
AGTACGGACTATAGAGAGTTCAGCGATGAAGTTGTAATCCACATCCAGGTTGATCCCGAG
GGTGTTCCAGTGCGGGAGATGAAGTCTGCACTGTTATGCGGAGTTGATAGTGTGAACTTC
ACGAGTTCGTCGGAAGTGACGTGGGACTGGTCATCGGAGAGAGCGCTGCCAGGCACCGAG
TCCCTGACCGTATGGACAACCACAGACCTCATGGGACCTCTACTGGCTCATGCAGATGGA
TTAGTGGATCTCCCGCGAGGCTGCGGCGAACAGAATATGGCCCGCCTCGCTACCGCTCTG
CTCGCATTGCGGCTTTTGGAACCTCACTCACCCGCCGCAGACGACGCGAAGGATCAAGTA
GCTAGAGGGTTCACCCGTCAGCTACAGTACGCTCACGTGGGTGGGGGGTTCAGCGCCTTT
GGTAAGAACGACCCTACTCCCTCCACCTGGCTCACAGCATTCAGCTTGAGATACATGAGG
AAGGCGTATGAGGTCATATCAGGTTCCGGTCCGCTGCCTCCAGTGTTAGAATTGTCTCGA
GACTGGTTGCTCAACCAGCAACTCGAAAATGGTTGTTTCAGTAACACTGGACACGTCTTC
CATCATCTACTCAAGGGCGGTCTAGACGAGGATGGAGAAATAGCCAATGTGGCCCTCACA
GCCTACGTCATCGCCTCACTCACAGAAACCTCCCTCCCTTACAAGATCCTCAACAATTCC
CTCCCATGTCTTCGCGCCCTGGTTCCCATGAGGACCAAAACTAATTCAAGAGTATACGCA
CAAGCCTTGATAACTTACGCGTTCATGAAGTTGAGGAAATATGAGGAATTAGGTAATGAT
ACCTTGATGGGGAGTTTGGAGGAAGATTACTTGAGAGAACTGATCGAGTTACTAAGGATC
GCTAAGAGGAGTGGAGATTTTGTATGGTGGGAAACTGGTAACCTGGCTACATCCATCGAG
TCCACGGGATACGCTCTCCTGTCTCTATCTGAGTGTCCGCCGAGGAGAGGCTGCGAGGTA
GCCGCGGCCGGTTCGCTGAGATGGCTGGCGGCCCACAGAGGCACTTCAGGGGGATTCCTC
TCAACTCAGGACACGTTAGTGGCCCTGGAGGGTATGTCTCGTCTGTCGCCGCTACCTGCT
GGAGGGCTGGTGACACTACAGTCTGGAGACGACACGAGGATCGTGACTCCTACAGCCGTC
CCCGAGCTGGTGACGATGAAGGTGGACCAGCTGAGAGTCACCGTTGAAGGCCCCGGATGT
GCTCTAGTTCAGGCCACTCACAGTTACAACACGCTTGAGCCACACGAGTACCAACTCGAG
CCGAGCTCTCTCTCCGTACACACGAACGTTCAAACCGACGGTCCCTTCGATTGTGTCAAC
GACGTCTGCTTCTGTGCCGCCATAGTTAAGGTACTTTCTATTTTATATTTGTAA
Protein sequence:
MKWLLLLFVLPAQLTGTPQNLTSSPCTDRNHIFLMPGALTAGGSSRACVSRFYTEGSAQM
TLTLNVDGETVTSRDGGCLDISVPQRPNSKADVYINIRYPQCVWERHMKVRVSTGRVVIV
HTERARYRPGDLVRVRALVVKADLTPAHTAYFVVFVRYPQCVWERHMRVRVSTGRVVIVH
TERARYRPGDLVRVRALVVKADLTPAHTAIDEIWLEGPGGWGVKTAQWLKLRPRLGLVQV
QHQLDDSAPPGKWRVRTRLADGAQGSSSFLVGNYELPPFQLTVRHSPRILRTSERLVWTV
CVRYPWSEAVEGMLVIRLRGAGGGDGAGIRTAVRLKAPRACHRHAAAAKRIGLNGDNPPD
VVVADFSFEEEGTGVWQNTTVVSQVVDEAVTLEFLTKHRVIISPGLPHKIKIKATRWDNK
PAAGERVNVCRSASSIIASEFNSTEAICVNATTDEKGIARVMFNADNSPYYNLQASINNT
RTTTQVLVVRSGRAALGALRSEQHGAALLPLYIDLKVVAPLTVHFVVITRGGIIFRWGAT
TQCPITSPTDKIITSPRNSICPNTNPYSIDKILNNNLESNSTELETLLDNYLSKVMLPIK
VSPQMCPESHLLAYFYHNDELITASKHFEMEDCFVRKVDASWSPRLVAPGSLATLQLTTP
GPALCALTVLDTASKWIQYENIRELVMNGLRTLMDSHRNLTEHDAAWECFLTSESPVLST
SRDLLSWWLASAGVRLLGDHPSSCEAPELMIDDVLPRSDFSESWLWKLAPVSSRGSLSVT
SRAPHTVTRYEATALCVSRAGLAISSPAVLQVFREVFIHASGPRRVRRGDAILVPYRVFN
YLYTPHAVEIIITTNHVVDGLTREVVCLSARTSTARRMMVTCQDSDLLSIRATGVKDANC
STDYREFSDEVVIHIQVDPEGVPVREMKSALLCGVDSVNFTSSSEVTWDWSSERALPGTE
SLTVWTTTDLMGPLLAHADGLVDLPRGCGEQNMARLATALLALRLLEPHSPAADDAKDQV
ARGFTRQLQYAHVGGGFSAFGKNDPTPSTWLTAFSLRYMRKAYEVISGSGPLPPVLELSR
DWLLNQQLENGCFSNTGHVFHHLLKGGLDEDGEIANVALTAYVIASLTETSLPYKILNNS
LPCLRALVPMRTKTNSRVYAQALITYAFMKLRKYEELGNDTLMGSLEEDYLRELIELLRI
AKRSGDFVWWETGNLATSIESTGYALLSLSECPPRRGCEVAAAGSLRWLAAHRGTSGGFL
STQDTLVALEGMSRLSPLPAGGLVTLQSGDDTRIVTPTAVPELVTMKVDQLRVTVEGPGC
ALVQATHSYNTLEPHEYQLEPSSLSVHTNVQTDGPFDCVNDVCFCAAIVKVLSILYL