New model in OGS2.0 | DPOGS210690  |
---|---|
Genomic Position | scaffold317:+ 2766-46276 |
See gene structure | |
CDS Length | 4971 |
Paired RNAseq reads   | 301 |
Single RNAseq reads   | 676 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006306 (2e-34) |
Best Drosophila hit   | ND |
Best Human hit | C3 and PZP-like alpha-2-macroglobulin domain-containing protein 8 (2e-50) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC000808 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC000808 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005615 extracellular space GO:0004867 serine-type endopeptidase inhibitor activity GO:0005515 protein binding GO:0005576 extracellular region GO:0005886 plasma membrane |
InterPro families    | IPR008930 Terpenoid cylases/protein prenyltransferase alpha-alpha toroid IPR009048 Alpha-macroglobulin, receptor-binding IPR011626 A-macroglobulin complement component IPR011625 Alpha-2-macroglobulin, N-terminal 2 IPR001599 Alpha-2-macroglobulin |
Orthology group | MCL21912 |
Nucleotide sequence:
ATGAGCACGGAGAAGAGAGTGAGCGTTCGCGTGAAGCTCTACGATGATAAAACCGACATC
TACAGCCAAGATATTGATATGTCGACCGGAGAAGGCGGTTTCATAGTCCCGACTGTGATG
GCTGATTCACCATTTATAAATTTGCAGGCTGAGTTAGTAGCTGTGGAAGGCAAAGAGATA
GAAACACACTATGTGTTGGCCCGGGAGAAAATACGTCGGTGGAATTCTACAACAAAATGT
TACTTGCTCGTGGAAAACTTGCCTACGCCATTACAGGCTGGTGGAATTGCAAGCGCTAGT
GTTTGGTCATCATGTGGGTGCCGTCAGCGTCTGTTAGCGGCGGTCACCAACGGTGGCCGT
GCACTGCACTGGGCAGCTGTACCAGCACCGAAATCTGCCGATGAAAACGATTTGTGCCGT
TTTAATTACACATTTCCAGTGACGGCTGACATGGCACCGATCAGTTCTCTGCTAGTTTAT
TACGTCACCGAGCTAGGCGAGCCAGTGAGTGACGTAGCCAGCTTCCACGTCAAACTACTA
CATAAGGAAGTGGCCGTAGCTATAGAAGACCGTCGATGGTGGTACCCACGAACCGCACTA
CAGCTTCGAGTGCTGGCACCGCCGGATTCGTTGATGTGTTTGATCGGAGCGCGTGCACTC
ACTGACTCAAGATTTGATACACATCAAGGTGAACCAGAACACGAAGAACAACCCGGGCCT
GAGTTCGTGTCAGCGGGAGTATCACTGTTCGTTGGTGGTGGCACTTGCGGTGGTGGGGTG
CTCTACCGACAGAAGACAGTCACACCTCGTGCACCAGCCCATTTAGTACCGCCCGCGTCC
CACGACAGGCTTTGGATGTGGAAATGCTTTAATTATACAACTCAATTGTCAACTGATGGA
GTAACAATAGCGGCGCCTTCAGAAGCTGGGCGCTGGTCTCTTTGGGCACTTTCTTTGTCT
AATCGTGGTCTTCGATTCTCGGCTCCAAAAACCATTAACGTCTTTCGTCCGATACAACTG
GACTTCTCCCTGCCACCTGCCCTGAAAGTCGGCGAAACAGTAGAAGTTGACGTCAAAATC
ACCAATAACATCAACAATTGTATGGACGTGACAGCCCTATTAGCGCTAAGTGCTGGGGCA
GCTTTCGCGAGTACTGGTGCTTTATATGTCACTGAACGATTGAGACTCGGCCCGCGTGGT
GGGACTCAGTTAGTTGTGAGAGTCGCTGTTAATACTCCCGGAAGGAAGAATATTACTGTG
GAAGTAACTGGATATAGCGCTGATAACTGTACGGTGTCTTACACATCTTTCAACAACGAG
ACCCTTGTCGGTTCTGTAATTCGATCAGCGAGTGTATTGGTCCTACCGGAGGGACTACAC
CGGAGCGATACTCAGAGCGCATACTTCTGTGCTAACGAACATCTCGCGGTTTCTTCCCGT
GGTTCATGGGAGTGGCAATGGGTGGCAGCGCCTCGTAACAGGGCAGGTCTTGTATTAGAA
TTGAAGGCACAGGGCGCAGCACATGTTGCATTATCGGCCGTGAGGGAACCATCCGATGAT
ATGTATAGAGTTGTGATCGAGAGGAGTCGAGTATGGATTGCGAAAGGAAAACATGGTTAT
GACGTACACCTTGCCAGTGCGGAACAAACTGAAAGCGACGCGGACTGCTCTGGTGAGGAC
TCTTGGTGCGCTTGGTGGGTGTGGTGGGAGGGCGGTCGTCTTTCCGTCGGTAGAGGAGCA
TCTCCTTCAGAAAGAAGGTTGTTAGTATGGCCCCTTACAGCAGATATGAGGATAAAGTAT
GTCGGTTTCAGTGCGCTTTGGGGAGATCAAGCTGATTTTAGAATATGGAACTTCAATGAA
GAAGCTGGATTTTCCCAAGTATTAGAATTAGGTCTACCCCATGGAGTGGTACCTGGTTCA
GCGAGTGGGACGTTATTAATTTCCGGAGGTCTTCATCTTCCTTTATATAGTTTCCAAACG
GATGCTTCAGATATATGGTCAGATGTTTGGAAAGATTCTCAATTATCAGCAGCTTCAGCT
AGTTTGGCACCGTTATTAGCATTGGAACATATACCTCATTTAGTGGACGAAATGGAGAAG
GAAAGAATATTGAATAAGCTACCTGAACAGGTACAAATACTACTTTCATTTCGTAAAAGC
GATAACTCGTTCAGCGATCATCCAGCAGTAAGCAGTCATTTATCTACAATCAAAATCTTA
GAAATTTTAAACAAAATTCAATCATATTACCCAGTGGATCCGGAACTTCTACAATCCATA
AAATCTTGGATACAATCTAGGCAAAATCCAGATGGTTCCTTTACCCCACTTGCTGCAGAC
AAGGAAGTCGATTATTATCCTGTTGAAATAAAAAATGTAAACGGCACAGACGCTGAGTTT
GATGTAAATGAATACTACTATTATGACAAAGATGGTAATATGACGCAAGAAGTAATTGAA
TATGAGAGAACCGTAGAAGTTACAGCAGAAACTTTGGTATCATTACTAGAAGTTGGAGTA
GAAAATCAAGTAGATGCAGATGTTGCAAAACTAGCGCAAACGTACTTAGAGAATAATGTC
CGGAATCTGACCTCGCCAGCCACTTTAGCAGCCACTGTTTTAGCGCTTGTTTTGGCAAGA
AGTCCTATCGTACCTGAAGCGTTACTTATACTACGTAATGCATCAACTACTGAAGAAGGA
GAGTTCGGTTGGCCAGCTCCCAGAAAGGATGCAGCAGATTGGCTCCTTGAAGAAACCTCT
AGAAACATCAAAACCACTTCCTACGAAGCGGTTACAATGGAGCAGTATGTGGCTGGCGTG
CGTGTGTTACTAGCTGCGTGTGCGCGGGGAGCCTTGGCGGAAGGAGAAGCGGCGGCTCGG
TTCTTATACTATCGGGCATCAACTTTACAAAGGCATCCCAGTCTAGCATACCAAGCTACA
AAAGCAGCTGCGCAGTACGCTGCACTGGCTCATGATAGACATAGAGCACTGACAGTATCT
CTGGCTACAGCTGGAATGGAATTAACAGACACGTTAGAACTACGCGCGTTGACACCACCT
CGGCCACTACAACTTCCAGGTCTACCTACTAAGGTGTTCGTATACGCCACCGGCGCTGGA
TGTGCCACTGTACAGGGCACAATATCATATTCGACGTATAATCCTAAAGCAGAAAATGCG
CTGCTGAACATCCAAGCAGCTATTATTGAAGAGATAAGACCTGAACGAAGCAGCATCGAA
GATTTGCAAGGAAACTTGCCGACATTGATCATTAAATCTTGCTTCAAATGGAAAGGAAAA
GAGCGCTCCGGAATTCTTCGTTTAGAATCTTCTCTTTTCTCTGGCTATGAATTACATTCA
GTAAATCCTGTTGTTCTTGATGGGGCCACGTTTGCTGACTTACATTACGGTTCGCGTGGA
GAATCAGTGTGGTTTGTGTTTACTAATATTAGCTCCACTTGTCCGGTTTGCGTAACTTAC
GAAGCGAGATCAAAGTTCGTCATAACAAGCCTCCGTCCAGCATTTGCTAAAATTTATCCT
TCAAGCAGACCAGATTTAGCTGTTGAAACATTCTTCCACGCAAGACCCGGAAGTCCTCTG
TTAAGGGGTATCACAGATGATGATTTTATAACTTGGTTCGATAAAACCCAACGTGCTAGT
CTAAAAACAAACACAAATATTGACAATATTTGTGAATGTGGTCGTATATGTAGTAGAGAT
TATGAATTTAGAAAGGATTACAAGAAAATGATGGAATCAACAACAACAGAGGAGACGACA
ACAGTAAAAATTACAGAACCAACTTTAACAACAGACTATAAGATATCAACAACAGATGTA
GTAACAGACATTCAAAGTGACTTACCTTCTACTTTATCCTCAACTTCAATTACCATAGAA
ACCCAAGATATATCAACAACAAGTATGCCAGCCCCTGGAAATGATACATCGAAAGTTTCT
AATGCCACAATATCTATAAATACTGACGATCCAATAATCATTCCCACTATAACATACGCA
ACGCAAACTGAAAACCAAAATATCAGTAATAACATTCTGCCAGCTGTCCCTATAATAAAC
GGTGAACTCATCGTACAAAAGCTTCCTGTTAATAAAAATTATGCAAAAAAACCTGACTTC
AGTAAGAAACCATTGCCGCGACGTAAAGGTACATTAAAAGCAACCTACGGGGATAAACAT
GAAAAGTTCTTTTCAAAATCTAAAATCCCTGATGATTTGAATCTTATAAAAACTATAAAA
CCGGTTTATCAAGTTACCGAAACTTCTACAATAAAAGGTTTGACTACGACTACGTCAACA
CTAAAAGGTATAACTAGTTCTGAAGCAAAAACTCCTGAACACGATATTTCATCCACTATG
AAGACGGAATTGAGGACTGTCACAGTATTTAATACAAATTCTACTATTATAACTCCAAGT
AGTGTAACTGAAGACAAAATTAAGTCTAATAAAACTCTTATTTTCACTCAACCCGAAATA
ACTACGGTCCCCCACTCAATAACAGCAATCACAAAAAGCAATATTAAAACAATACACTAT
AGGACTAAGAAACCTAAGCCTAAGACACAAATTAAGAAGCCCAACATAAATACGAATAAC
ACTAACGAGAAGCCTCTGAAAAATAATAAAACCACGAAACCTGAGATTGTTCTTAACACG
ACAAAAATAAGATTGGATTCGACCGAAAAATATGTATCAAAATCCTTAAAATCTATCAAT
AAGGAAATTCATAAAATACCGTTCACACCTGTTTCAGAAACCACTAAATCAAACAATATC
CCTACGAAATCAGATATCGCACCTGAAAATAGAGAAGGGTACGAAATTTTAGACAAAAAT
AATCTTTGGGAGCTTCTTAAAGAAGGTCCGGATGATACTAAAATAGAAGATAAAATTAAT
GTTCACAATCGATTGAATGAAGTGTCATCTGTCAATAATCGTTCTTTATAA
Protein sequence:
MSTEKRVSVRVKLYDDKTDIYSQDIDMSTGEGGFIVPTVMADSPFINLQAELVAVEGKEI
ETHYVLAREKIRRWNSTTKCYLLVENLPTPLQAGGIASASVWSSCGCRQRLLAAVTNGGR
ALHWAAVPAPKSADENDLCRFNYTFPVTADMAPISSLLVYYVTELGEPVSDVASFHVKLL
HKEVAVAIEDRRWWYPRTALQLRVLAPPDSLMCLIGARALTDSRFDTHQGEPEHEEQPGP
EFVSAGVSLFVGGGTCGGGVLYRQKTVTPRAPAHLVPPASHDRLWMWKCFNYTTQLSTDG
VTIAAPSEAGRWSLWALSLSNRGLRFSAPKTINVFRPIQLDFSLPPALKVGETVEVDVKI
TNNINNCMDVTALLALSAGAAFASTGALYVTERLRLGPRGGTQLVVRVAVNTPGRKNITV
EVTGYSADNCTVSYTSFNNETLVGSVIRSASVLVLPEGLHRSDTQSAYFCANEHLAVSSR
GSWEWQWVAAPRNRAGLVLELKAQGAAHVALSAVREPSDDMYRVVIERSRVWIAKGKHGY
DVHLASAEQTESDADCSGEDSWCAWWVWWEGGRLSVGRGASPSERRLLVWPLTADMRIKY
VGFSALWGDQADFRIWNFNEEAGFSQVLELGLPHGVVPGSASGTLLISGGLHLPLYSFQT
DASDIWSDVWKDSQLSAASASLAPLLALEHIPHLVDEMEKERILNKLPEQVQILLSFRKS
DNSFSDHPAVSSHLSTIKILEILNKIQSYYPVDPELLQSIKSWIQSRQNPDGSFTPLAAD
KEVDYYPVEIKNVNGTDAEFDVNEYYYYDKDGNMTQEVIEYERTVEVTAETLVSLLEVGV
ENQVDADVAKLAQTYLENNVRNLTSPATLAATVLALVLARSPIVPEALLILRNASTTEEG
EFGWPAPRKDAADWLLEETSRNIKTTSYEAVTMEQYVAGVRVLLAACARGALAEGEAAAR
FLYYRASTLQRHPSLAYQATKAAAQYAALAHDRHRALTVSLATAGMELTDTLELRALTPP
RPLQLPGLPTKVFVYATGAGCATVQGTISYSTYNPKAENALLNIQAAIIEEIRPERSSIE
DLQGNLPTLIIKSCFKWKGKERSGILRLESSLFSGYELHSVNPVVLDGATFADLHYGSRG
ESVWFVFTNISSTCPVCVTYEARSKFVITSLRPAFAKIYPSSRPDLAVETFFHARPGSPL
LRGITDDDFITWFDKTQRASLKTNTNIDNICECGRICSRDYEFRKDYKKMMESTTTEETT
TVKITEPTLTTDYKISTTDVVTDIQSDLPSTLSSTSITIETQDISTTSMPAPGNDTSKVS
NATISINTDDPIIIPTITYATQTENQNISNNILPAVPIINGELIVQKLPVNKNYAKKPDF
SKKPLPRRKGTLKATYGDKHEKFFSKSKIPDDLNLIKTIKPVYQVTETSTIKGLTTTTST
LKGITSSEAKTPEHDISSTMKTELRTVTVFNTNSTIITPSSVTEDKIKSNKTLIFTQPEI
TTVPHSITAITKSNIKTIHYRTKKPKPKTQIKKPNINTNNTNEKPLKNNKTTKPEIVLNT
TKIRLDSTEKYVSKSLKSINKEIHKIPFTPVSETTKSNNIPTKSDIAPENREGYEILDKN
NLWELLKEGPDDTKIEDKINVHNRLNEVSSVNNRSL