New model in OGS2.0 | DPOGS202912  |
---|---|
Genomic Position | scaffold273:+ 134938-149059 |
See gene structure | |
CDS Length | 5385 |
Paired RNAseq reads   | 5897 |
Single RNAseq reads   | 13874 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004193 (0.0) |
Best Drosophila hit   | macroglobulin complement-related (0.0) |
Best Human hit | CD109 antigen isoform 1 precursor (1e-67) |
Best NR hit (blastp)   | PREDICTED: similar to Macroglobulin complement-related CG7586-PA, partial [Apis mellifera] (0.0) |
Best NR hit (blastx)   | GJ20388 [Drosophila virilis] (0.0) |
GeneOntology terms    | GO:0004866 endopeptidase inhibitor activity GO:0005615 extracellular space GO:0043499 eukaryotic cell surface binding GO:0006911 phagocytosis, engulfment GO:0001878 response to yeast GO:0005515 protein binding |
InterPro families    | IPR008930 Terpenoid cylases/protein prenyltransferase alpha-alpha toroid IPR009048 Alpha-macroglobulin, receptor-binding IPR002172 Low-density lipoprotein (LDL) receptor class A repeat IPR011626 A-macroglobulin complement component IPR011625 Alpha-2-macroglobulin, N-terminal 2 IPR001599 Alpha-2-macroglobulin IPR002890 Alpha-2-macroglobulin, N-terminal |
Orthology group | MCL16070 |
Nucleotide sequence:
ATGAATCTCAAAGTGATATCAATATTATTATTTATAATAACAAATGCAACATGGGCTCAA
GACTATTCATCTCAAACATACAAACCTGACTACCAAGTGAACCCAACAAACAATTATCTC
AACACCAACGATCCAAAATATGACAATCCGTATGATCCTAACAAAAGATTCTCAAATCGT
AATCAATACGATATAAATCGTAATCCATACAGTACTGAGCAAAGATACAATCAGTTCGAC
CGTAATGACCTAGGGACCAATCAGAACCCATATAAGACGGATACCGGAAGACAGACTTAC
GACACCAGTAGAACATACAGTACTTCATACACGGGAGCCGAGTTGGAACATGATAGTGTT
ATTATAAATGAAGCCACCTATTTCATTGTTGCATCTCGAATGATCCGGCCGGGGCAAATA
TATAAGATCTCAGCTAACATTCTGAAAGCTCGTCTCCAGATGACGATCAGAGCATCCATC
TCATGTAACGGTGTGGAAATAGCTGACGTTATAGAGAGAGTGAAGGAAGGTGTTCCTGAG
GTTTTGAATATGAGGGTGCCCGCTACAACGGTGCCGGGAGACTACAGGCTGCGCGTGGAA
GGTCTCTACCTGGACGATCCATTTGGAGGGAGAGCTTTCGTTAACGAAACAAAATTGACT
TTCTCACAGAGATTCATGACCATCTTCATACAGATGGACAAGCCGGTGTACATGCAGTCG
CAGACTGTTAAGTTCCGTGTGATTCCAATCAACACGGAGTTGAAGGCTTTCGGCCGAGCC
ATCGACGTCTTCATACTGGATCCCAACAAGCGTATCATGAAGAGGTGGCTCTCCAGACAG
AGTAATTTTGGTACGGTGAGCCTGCAGTATCCCCTGTCGGACCAGCCTGTGTTCGGTGAC
TGGACGGTGAGGGTGGAGGCGCTGGGGCAGAGAGAGGAGGCCAGCTTCCTCGTGGAGGAG
TACTATCAGACCAGGTTTGAGGTGAACGTGACTATGCCGGCCTTCTTCTTCAACACGGAC
GGGTTCATCCACGGTCGCGTGATGGCCAACTACACGTCGGGCGCTCCTGTGCGGGGCAAC
CTCACTCTTCGCGCCACCCTGCGGCCGGTGCCGTCGTACAGGCCGAGACAGCAACACAGA
TACAACACACAACGTCCGAACTACAATCCCTACTTATACAACGAGACGGAGCAGTACGAC
CAGGGACAGGAGGATAGACGCTGGCAGCTCAACCACCGACCCGGGCAGCCCAACCAGCTC
GACCATCCCGACTGGTGGTATGACACGCAGAAGGTCGTCTCCAGGGTGTTTAACTTTGAT
GAGGAGTTCCCGTTCTGGATGCCGAAACCAGACCCTGTTGAGATCGCACAAATTAATATG
AACATCAACAACAACAACATGAACAACATGAACAACATGAACAACTACCCAACAACTACA
TCATATAATAATTATTACAACAATCCCTACAACGACAAGCTGCCTTATTTAAGATTTTTC
AACGGCACGTACGACTTCAAGTACCCGATGTCGGAGCTGGCGCAGCTGGTGCCGTCGTTG
GAGGGCGTGGAGGTGATCGTGACGGCGACCGTGGGCGACCCGTTCCTGGACGAGGTGGTG
GAGGGCTACAGCGTCGCCCGCATCTTCAACTCCTCGCTGGCCGTCACCTTCCTCGGCGGG
GAGCTGCAAGTCTTCAAGCCGCACATGCCTTTCGATGTTTATATGGCGGTGTCTTTCCAC
GACGGGTCTCGTCTCCCGTGGTGGCTGGCTCGGGGCGCGGTTCTGTCCGTGTCGGCGTCG
CTGGAGGGCCGCGGCGGTGCCATCGAAGTGCCTCGGCCGACGCTGGTGCCGGGGGACACC
GCCGTGTGGCATCTCAAACTAGACCTTTATAAGCTCCTCAAACTGGACAACGACCCCAAC
TACCGCGAGGTGTTGAACGGCATCACCGGAGTCCGTCTGTCGGCGTCGCTGGTGGACGCT
TCGGGAGGCCGAGCCACCGCTGACGCACACTTCGTCGCCCACCAGAGCCCTAACCATCAC
CACCTCCGAGTGTCCACATCAACTACAGACGCCAGGGTGGGCGAGTACATCGTCTTCCAC
GTGCAAAGTAACTTCTACATGGAGTCGTTCAACTACGTCGTCATGTCCAAGGGAATCATT
CTGACGAGTGGACAAGAAATGATGCAGGAGGGTGTGCGTACATTTTCGGTGGGCGTGAGC
GCTGAGATGGCTCCCGTGGGCACGCTGGTCGTGTGGGCGCATCGCAGACGAGCGCTCGTT
CTGGCCGACTCCATCACATTCCCCGTTAACGGCATTTCCACTAACAACTTCACAGTGCAC
ATCAACAACCGTAAGCACCGTACAGGCGAGCGCGTGGAGGTGGCCATCTACGGTGAGCCG
GGGGCGTACGTCGGCCTGTCCGGCATCGACCACGCCTTCTACACCATGCAGGCCGGCAAC
GAACTCACCTACGCTAAGGTGTTATCGAAGATGTCTCATTTCGACGAGTCCACTAACGGC
ACGTTCTCGTACACGTGGCGCTCGCACTTCGGTGACGCGGACGAGCTGGTGTACTTCCCC
TCCGCCAGCTTCGGCATCGACGCCAACAGGACATTCGAATACGTGGGTCTGATAGTGCTG
AGCGACGTACCGGTGTGGCGGCGGCCGTCCCCCTGCAACTCGTCCTTGGGTCTCGGCGAG
TGTCTGGACGGCTCCTGCTACCCTCTCGCCAAGCGGTGTGACGGCAGCCCGGACTGCGCC
GACCGCACCGACGAGGCTAACTGTCAGCGCGAGGACTCATTCGAGCTGTCCCACTTCCGC
AAGTTCCGGTTCAACCGCATCCAGCGCCAGTACGACAACGCGTGGCTGTGGCGGGACGTG
AACATCGGCCCGCACGGCCGGTACGTGTTCACGGCGGACGTGCCCGCCGCGCCCGCGCAC
TGGACGATCTCCGCCTTCGCCATGTCGCCCACCAGCGGCATCGGCATGCTGGCCAAGCCG
CTACACTACGTGGGTATCTTGCCGTTCTTCATGAAGCTGGAGGGTCCCGACAAGTGTCGC
CAGGGGGAGCAGCTAGGCCTGCGTGTGGCGGTGTTCAACTATCAGCCCCAGGACATCGAG
GCGGTGGTCGTGCTGGCCAGCTCACCCGACTACAAGTTCGTGCACGTCGAGGAGAATGGC
ATCGTCCGCTCGTACAACCCCCGCACGTCTTTCGGCGAGCACCAGTTCTTCGTGTACATC
AAGGCGGGCGACGCGGCCGTGGTGCACATCCCCGTGGTGGCGGCCCGCCTCGGCACCATC
AGGGTGGACGTGCTGGCCTCCACGCTGCAGGGACAGCACCGAGAACACAAATACATCACC
GTAGAGGCGGACGGTATACCTCAGTACCGCCACCAGTCGGTGCTGCTGGACCTGTCCAAC
CGCGCCTACGTGTTCCAGTACATGCACGTGAACGTGACGGAGACTCCCATCATCCCGTAC
GAGGTGGACCGCTACTACGTGTTCGGCTCCAACAAGGCCAGGATCTCCGTAGTAGGAGAC
GTCGTGGGCCCGCTCTTCCCCACCATGCCCGTCAACGCCACCAGCCTGCTGGACTTACCG
ATGGATTCTGCGGAACAGAACATGTTCAGTTTCGCGGCCAACATGTACCTGACGCTGTAC
ATGCGGCTCATCAACCAGAGGAACCGCACGCTGGAGAAGGAGGCCTTCTACCACATGAAC
ATCCTGTATCAGAGACAGCTGTCCTTCATGAAGGCGGACGGATCCTTTGGACACTTCAGG
AGTGACTGGAACCAGTCGTCCTCGAGCGTGTGGTTGACATCATTCTGCGCTAAGATATTC
CAGGAAGCTTCCTTCAATGAATGGGAGAACTACATTTACATCGATCCCGATGTGATATCT
ATGGCCGTGTCCTGGATCCTGGAGCGCCAGGCTCCGTGGGGCGCCTTCATGGAGGTGCAT
AGGGCGGCCGACCGGAACGCTAACAGCACCATCGTCGTGCCGCTCGACTCACACCTGTAC
CGAGACGTTGGCAACCAGCTGCAGGGAGTCAAGGCGAACAACTCTATAATACTACAGCGG
AATATAACGCTCACGGCGCAGGTCGTCATCACCCTGGAGTCTGTCAAGAACCTCAAAGAC
TTCGGGATACGAGAGGGTCTGTCGGCTCGCGTGTCGACGGCGCAGCAGCTGGGAGTGTCG
TGGTTGGAACGGAACCTGCAGCTCGTCCACGAGTTCGGCTCCCCGCTGACGATGGCGCTC
CTCTCGTACGCGCTCACCGCGGCCAAGGCGCCCTCCGCCGAGCACGCCTTCAGATTACTC
AAGAGGCATCAGCGATCCGAGGGTGGTCTGGTGTACTGGGGCAAGGAACCCGTGCCAGCT
CCTCCATATAAAATGGAAAACCAGAAGCCCTTCCTGCTTCCTCGGCTGCCCTACAACTAC
GACTCCAATAACATCGCCGCCACGGCCTACGCTCTGCTCGCCTGCATGGACCATCAGGAC
AACAACGAACCAATCGTTATGTGGCTTAATTCACAACGTCTCAAAGATGGCGGCTGGGCT
TCCACACAGGACACGTACATAGCCCTCCGCGCGCTGATCGAGTACACCAACAGGAAGCGG
TTGCGTGACGTGAGCTCGCTGGCGGTGAGCGTGGAAGCCGTGGCGCTGAGCGGCGAGCCC
AGGACGCTCACCGTCAGGAATCAAGACCTCGCTACGCTACAAACGGTCGATATTCCCAGC
GCTTGGGGTACAGTGAAGGTGACAGCTCGCGGCGCGGGGTACGCCATCCTTCAGATGTCC
GTCCAGTACAACGTGGACACGCCGCGGTTCCAGACACAGCCCCCGCGCCCGGCCTTCGCC
CTCCTCGCGCGCGCTCACTACTACGGCCGGAATCAGTCGCACATACACTACCAGACATGC
GCCAGCTGGACTCACCTGTCGGAGAGTCCGGTGTCGGGGCTGGCGGTGCTGGAGGCAGCT
CTGCCGACCGGCTACATGATACAACAGCAACGACTTGACGGATACGTCATGAGTGGAACA
GTTCCCAAACTACAGCGAGCCAGATACCAGCCCGGGAAGATTCTGTTCTACTTTGATTAC
TTGGATCACGAGTTGACCTGTGTGAACTTCACCATCGAGCGCTGGTACCCCGTCGCCAAC
ATGTCGCGATACCTCCCCATACGAGTGTACGACTACTACGCACCCGAACGTTTCAACGAG
ACGATATTCGACGCTCTCCCCACTTACTTATTGAACATCTGCGAGGTGTGCGGCTCCTCG
CAGTGTCCGTACTGCGCCATCTACAACGCCGGGGCTCCGCTGACCGCCTCCGCGCTCCTG
CTACTAGCGACCGTCGCCGCGATCATACGGAACGTACGAACGTAA
Protein sequence:
MNLKVISILLFIITNATWAQDYSSQTYKPDYQVNPTNNYLNTNDPKYDNPYDPNKRFSNR
NQYDINRNPYSTEQRYNQFDRNDLGTNQNPYKTDTGRQTYDTSRTYSTSYTGAELEHDSV
IINEATYFIVASRMIRPGQIYKISANILKARLQMTIRASISCNGVEIADVIERVKEGVPE
VLNMRVPATTVPGDYRLRVEGLYLDDPFGGRAFVNETKLTFSQRFMTIFIQMDKPVYMQS
QTVKFRVIPINTELKAFGRAIDVFILDPNKRIMKRWLSRQSNFGTVSLQYPLSDQPVFGD
WTVRVEALGQREEASFLVEEYYQTRFEVNVTMPAFFFNTDGFIHGRVMANYTSGAPVRGN
LTLRATLRPVPSYRPRQQHRYNTQRPNYNPYLYNETEQYDQGQEDRRWQLNHRPGQPNQL
DHPDWWYDTQKVVSRVFNFDEEFPFWMPKPDPVEIAQINMNINNNNMNNMNNMNNYPTTT
SYNNYYNNPYNDKLPYLRFFNGTYDFKYPMSELAQLVPSLEGVEVIVTATVGDPFLDEVV
EGYSVARIFNSSLAVTFLGGELQVFKPHMPFDVYMAVSFHDGSRLPWWLARGAVLSVSAS
LEGRGGAIEVPRPTLVPGDTAVWHLKLDLYKLLKLDNDPNYREVLNGITGVRLSASLVDA
SGGRATADAHFVAHQSPNHHHLRVSTSTTDARVGEYIVFHVQSNFYMESFNYVVMSKGII
LTSGQEMMQEGVRTFSVGVSAEMAPVGTLVVWAHRRRALVLADSITFPVNGISTNNFTVH
INNRKHRTGERVEVAIYGEPGAYVGLSGIDHAFYTMQAGNELTYAKVLSKMSHFDESTNG
TFSYTWRSHFGDADELVYFPSASFGIDANRTFEYVGLIVLSDVPVWRRPSPCNSSLGLGE
CLDGSCYPLAKRCDGSPDCADRTDEANCQREDSFELSHFRKFRFNRIQRQYDNAWLWRDV
NIGPHGRYVFTADVPAAPAHWTISAFAMSPTSGIGMLAKPLHYVGILPFFMKLEGPDKCR
QGEQLGLRVAVFNYQPQDIEAVVVLASSPDYKFVHVEENGIVRSYNPRTSFGEHQFFVYI
KAGDAAVVHIPVVAARLGTIRVDVLASTLQGQHREHKYITVEADGIPQYRHQSVLLDLSN
RAYVFQYMHVNVTETPIIPYEVDRYYVFGSNKARISVVGDVVGPLFPTMPVNATSLLDLP
MDSAEQNMFSFAANMYLTLYMRLINQRNRTLEKEAFYHMNILYQRQLSFMKADGSFGHFR
SDWNQSSSSVWLTSFCAKIFQEASFNEWENYIYIDPDVISMAVSWILERQAPWGAFMEVH
RAADRNANSTIVVPLDSHLYRDVGNQLQGVKANNSIILQRNITLTAQVVITLESVKNLKD
FGIREGLSARVSTAQQLGVSWLERNLQLVHEFGSPLTMALLSYALTAAKAPSAEHAFRLL
KRHQRSEGGLVYWGKEPVPAPPYKMENQKPFLLPRLPYNYDSNNIAATAYALLACMDHQD
NNEPIVMWLNSQRLKDGGWASTQDTYIALRALIEYTNRKRLRDVSSLAVSVEAVALSGEP
RTLTVRNQDLATLQTVDIPSAWGTVKVTARGAGYAILQMSVQYNVDTPRFQTQPPRPAFA
LLARAHYYGRNQSHIHYQTCASWTHLSESPVSGLAVLEAALPTGYMIQQQRLDGYVMSGT
VPKLQRARYQPGKILFYFDYLDHELTCVNFTIERWYPVANMSRYLPIRVYDYYAPERFNE
TIFDALPTYLLNICEVCGSSQCPYCAIYNAGAPLTASALLLLATVAAIIRNVRT