New model in OGS2.0 | DPOGS203359  |
---|---|
Genomic Position | scaffold1196:+ 18710-30405 |
See gene structure | |
CDS Length | 3300 |
Paired RNAseq reads   | 1360 |
Single RNAseq reads   | 3832 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011888 (0.0) |
Best Drosophila hit   | alpha catenin (1e-157) |
Best Human hit | catenin alpha-2 isoform 2 (5e-131) |
Best NR hit (blastp)   | PREDICTED: similar to alpha Catenin CG17947-PA [Acyrthosiphon pisum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Catenin CG17947-PA [Apis mellifera] (8e-176) |
GeneOntology terms    | GO:0005915 zonula adherens GO:0016342 catenin complex GO:0007016 cytoskeletal anchoring at plasma membrane GO:0008092 cytoskeletal protein binding GO:0003779 actin binding GO:0030720 oocyte localization involved in germarium-derived egg chamber formation GO:0005912 adherens junction GO:0005914 spot adherens junction GO:0030055 cell-substrate junction GO:0007155 cell adhesion GO:0045296 cadherin binding GO:0007163 establishment or maintenance of cell polarity GO:0005198 structural molecule activity GO:0015629 actin cytoskeleton GO:0005913 cell-cell adherens junction GO:0005886 plasma membrane |
InterPro families    | IPR006077 Vinculin/alpha-catenin IPR001033 Alpha-catenin |
Orthology group | MCL10484 |
Nucleotide sequence:
ATGGCTCGACTGACGTTATGGCTTATCGTTCTTGCAATAGAAAGTTCCATTGCATCAAAA
CATCGAAGCGTCAATCTGGCTTTGGAAACTGTCGTTGAATTCGACAAGCTCGATGACGTG
AAGGTCGTCAACAGAGTCCAAGATTTCCTGACAAGAGTACAAAGTTTATTGAACAAAAAT
CACAGAAGCGATCTGTCTACTCGGAGGAAGAATCTATACCAATTCTCTGATTTTTTAGAA
GACGTTATCATAGAAGTACAGTCGTTGGACGACGATGATTTATTGGAAATATTCAATATC
GTCAAAAAAAATGTCGAGAAACGAAAAAAAAATGACATCCTGGACGACAACGGGAGGAGA
TATTTATCTGGCAAGTTATCGGTATTGAGACAGCGGCCGACTGTTGAAATACGCAGCGAT
TTAGACGAGCTGTTAAGAGAGTTGAGGAACGATAGGCAAAGTAAAAGCAACTACATACTC
GGCTTCATAAACAAACTTTACGAACGTGACGCACAAAACAAACTTAAGAAATTTGTTAAG
AGATTGAACATATTCAGGCGGAGTGCGAGGAAATCCAAAGTCGACTTGAAGAATATCATC
GAAAAAGTTATCAATGACGTCATATACGAACATTACAATAATTTGGAGGGCGACGAAAGA
CGCTATATAAAGAAATATGTGAGATCATTCTTTGTAGAGGACGCACCAAAAGAAGTAACA
ACCAAACCTGCAACGGACGAAGAGAAGACCGAAAACGTAAATACTGACGAGGAATCAAGC
GAATCCGTAGAACAAGCGGTCGATGTCAGTCCATTCGCGGACAACACAAAACAAAACACA
AAACCAAACAATTTGTATCCAAAAACATCAAAAGTAACGAGTACCCAGAAACATAAGTTC
AGAACCGGCATTCCGGTAAGAGAACATCAATTGGAAGAGGTAGCAGACAAGCCCACAGAA
GAAGATCGCGGCGAGGAGTCTGGACAAACATTTAAAGCTCAAGACGTCATCGCTACTCGT
GAAATCTTCGCAAATCATTCGCTTGAAATAAACGTTAACTTAATGATAAAAAAACATAAA
GACTCGCAGGAGGAAAACGTCCCCAATCAGAATACCTATAAACACAGACTCTCTCAGAGG
GCTTCGACGAAAAAAACAAGAAAACCCAAAAACCATCGCACAACTACAGCAGCTCTGTCC
GCTGAGACATTTGGTAGTTTACATAAAAAAACGACAAAAAAACATATCACCAAAAAAAAG
TTACGCGCTCCCAAAACTAGGACTACAGCGAAAGAAGCTATTCTTAGGACGATGATGAAT
GTTGACACTGACGAAAAAGATTCAGAGCCACTCTCTGACGATAATCGGCGCAAAAACTTT
AAAAATATCCACACAGAAGACCACAAGTCGGTTAAAACTACTATGAAACAGGCGATCAGA
CGCGCAAATATACCCATCGAGAGCTATTACAACGTCACTCTAACTACTAGAGTGAATAGT
GTCAGTGTCCCAACTATGAGAAAACATCGCACAACCACGACTTCGCGTCCCGTCGTCACA
ACTACAGAAGATACGACTTCATCACCAACAACTACAGTAACAACAACTACTACAACTACA
CCTAGCGATATATTCAATGAAATACAACATAAAAACGTTACTGACGCAGAATTATCCAGC
GAAATCGATAATTTAGACGTCAATCATCCAGAAGATACAAATTTGACCGATTCAAACAAA
ACGATCGGCAATGAAACTGAGACTTTCAATATTGACGAATATTTAAAAAAATCAGATTTA
CGAAACGCCGTAAACGAAGACAAGATGAAGAACATAGAGGACACAATGAGAGATGATTTT
AAAATGTTAATGTCTTCGCCCGCCACCACCACCACGGAGGCGTTCACGGATTGTATTTTG
GAAGTGAGCGCCGAATATATCAGTACAATGGCAACTCTAACGGATCCGTACGGGACGACC
GGTCAATTGACTCTCAAGTGGGACCCCAAGAACTTGGAGATAAGGACCATGTCGGTGGAG
AGGACCCTCGAGCCGTTGGTGTTGCAAGTTACCACACTCGTCAACAGCAAGGACAAGACC
GCTAAGAAGAAGAGACCTGGCAAATCGAAAAGGGCGAGTGCTCTTGTGGCGACAGTGGAG
CGTGCTACTGAGATCTTCATAGAGCGTGGTCAGATCATAGCGTACGAGAACCCTGAGATC
ACACAGGAGATGCTGGCGGCGGTGGAGGAGGTCCGGAAAGCTGGTGCGGCCATGAGTCTA
GCTGCTCGCGAGTTCTCCGAGGAGCCGTGCGCTTCCTCCGTCCGCTCGGGCATGGTCCGG
GCCGCTCGCAGTCTGCTGTCCGCGGTCACTCGACTGCTCATACTGGCGGACATGGTGGAT
GTACATCTTCTGCTCACCAACCTTAGGACGGTGGAGAACGATCTGGACAAGCTGAAGTCG
GCCTCCTCTCAATCGGAGCTGTTAGAATCGGCTCGTCAGTTCGGCAGGTCAGCCAACGAG
CTGGCGGCGCAGGCTGCCAGGAGACAGAAGGAGCTCAAGGAGCCGAGGATGAAGGACGAG
CTCGCGGCGGCCAGGGCCGTGCTCAAGAAACACTCCACTATGCTACTCACTGCGTCCAAA
GTGTACGTCCGTCACCCCGAGCTGGCCGCGGCCAAGGCCAACCGTGACTTCGTACTGCGC
GCCGTGTGCTCCGCCGTGGACACCATCTCGTGCGTGGCGCAAGGAAGACCGCTCCCGCCG
GCCGGGTCAAACCGGGTGCCCGTGGAAGGTCCGGGGGAACTGGCCCAAGCCTTGGATGAC
TTTGATGAGCGGATGGTGATGGAGCCCATGTCGTACTCCGAGCTGAGGACCAGGCCCTCG
CTGGAGGAGCGTCTCGAGAGCATCATCTCCGGGGCGGCGCTGATGGCGGACAGCTCCTGC
ACCAGGGACGAGCGCCGCGAGCGCATCGTGGCGGAGTGCAACGCCGTGAGGCAAGCGCTG
CAGGACCTGTTGCACGAGTACATGAGTAACGCCGGCAGACAGGAACAGTCTGAGGGTCTG
GAGCGAGCCCTGGAACAGATGTGCCGCAAGACGAGGGACCTCCGGAGGCAGCTGAGGAAG
GCCGTCGTGGACCACGTGTCCGACAGTTTCCTGGAGACGAACGTGCCGCTCCTGGTGCTG
CTGGAGGCGGCCCGCGGCGGCAACGAGAAGGAGGTGGAGGAGTACGCGCTGGTGTTCACG
GAACACGCCAACAAGCTGGTCGAGTCAAAATCTATTCACTTCAAGTCTAATATTTTATAA
Protein sequence:
MARLTLWLIVLAIESSIASKHRSVNLALETVVEFDKLDDVKVVNRVQDFLTRVQSLLNKN
HRSDLSTRRKNLYQFSDFLEDVIIEVQSLDDDDLLEIFNIVKKNVEKRKKNDILDDNGRR
YLSGKLSVLRQRPTVEIRSDLDELLRELRNDRQSKSNYILGFINKLYERDAQNKLKKFVK
RLNIFRRSARKSKVDLKNIIEKVINDVIYEHYNNLEGDERRYIKKYVRSFFVEDAPKEVT
TKPATDEEKTENVNTDEESSESVEQAVDVSPFADNTKQNTKPNNLYPKTSKVTSTQKHKF
RTGIPVREHQLEEVADKPTEEDRGEESGQTFKAQDVIATREIFANHSLEINVNLMIKKHK
DSQEENVPNQNTYKHRLSQRASTKKTRKPKNHRTTTAALSAETFGSLHKKTTKKHITKKK
LRAPKTRTTAKEAILRTMMNVDTDEKDSEPLSDDNRRKNFKNIHTEDHKSVKTTMKQAIR
RANIPIESYYNVTLTTRVNSVSVPTMRKHRTTTTSRPVVTTTEDTTSSPTTTVTTTTTTT
PSDIFNEIQHKNVTDAELSSEIDNLDVNHPEDTNLTDSNKTIGNETETFNIDEYLKKSDL
RNAVNEDKMKNIEDTMRDDFKMLMSSPATTTTEAFTDCILEVSAEYISTMATLTDPYGTT
GQLTLKWDPKNLEIRTMSVERTLEPLVLQVTTLVNSKDKTAKKKRPGKSKRASALVATVE
RATEIFIERGQIIAYENPEITQEMLAAVEEVRKAGAAMSLAAREFSEEPCASSVRSGMVR
AARSLLSAVTRLLILADMVDVHLLLTNLRTVENDLDKLKSASSQSELLESARQFGRSANE
LAAQAARRQKELKEPRMKDELAAARAVLKKHSTMLLTASKVYVRHPELAAAKANRDFVLR
AVCSAVDTISCVAQGRPLPPAGSNRVPVEGPGELAQALDDFDERMVMEPMSYSELRTRPS
LEERLESIISGAALMADSSCTRDERRERIVAECNAVRQALQDLLHEYMSNAGRQEQSEGL
ERALEQMCRKTRDLRRQLRKAVVDHVSDSFLETNVPLLVLLEAARGGNEKEVEEYALVFT
EHANKLVESKSIHFKSNIL