DPGLEAN10043 in OGS1.0

New model in OGS2.0  DPOGS203359
Genomic Position  scaffold1196:+ 18710-30405
CDS Length  3300
Paired RNAseq reads  1360
Single RNAseq reads  3832
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011888 (0.0)
Best Drosophila hit  alpha catenin (1e-157)
Best Human hit  catenin alpha-2 isoform 2 (5e-131)
Best NR hit (blastp)  PREDICTED: similar to alpha Catenin CG17947-PA [Acyrthosiphon pisum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Catenin CG17947-PA [Apis mellifera] (8e-176)
GeneOntology terms
GO:0005915 zonula adherens
GO:0016342 catenin complex
GO:0007016 cytoskeletal anchoring at plasma membrane
GO:0008092 cytoskeletal protein binding
GO:0003779 actin binding
GO:0030720 oocyte localization involved in germarium-derived egg chamber formation
GO:0005912 adherens junction
GO:0005914 spot adherens junction
GO:0030055 cell-substrate junction
GO:0007155 cell adhesion
GO:0045296 cadherin binding
GO:0007163 establishment or maintenance of cell polarity
GO:0005198 structural molecule activity
GO:0015629 actin cytoskeleton
GO:0005913 cell-cell adherens junction
GO:0005886 plasma membrane
InterPro families
IPR006077 Vinculin/alpha-catenin
IPR001033 Alpha-catenin
Orthology group  MCL10484
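
The genomic position listed in this record (scaffold1196:+ 18710-30405) can be split into scaffold, strand, and coordinates for downstream use. The following is a minimal sketch, not the database's own tooling: the names GenomicPosition and parse_position are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    """Hypothetical container for a 'scaffold:strand start-end' string."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "scaffold1196:+ 18710-30405".
POSITION_RE = re.compile(
    r"^(?P<scaffold>\S+?):(?P<strand>[+-])\s*(?P<start>\d+)-(?P<end>\d+)$"
)


def parse_position(text: str) -> GenomicPosition:
    """Parse a position string in the layout used by this record."""
    match = POSITION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    return GenomicPosition(
        scaffold=match["scaffold"],
        strand=match["strand"],
        start=int(match["start"]),
        end=int(match["end"]),
    )


pos = parse_position("scaffold1196:+ 18710-30405")
print(pos.scaffold, pos.strand, pos.span)
```

Under the inclusive-coordinate assumption the model spans 11696 bp on the scaffold, compared with a CDS length of 3300, so most of the span would lie outside the coding sequence (for example in introns).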

Nucleotide sequence:

ATGGCTCGACTGACGTTATGGCTTATCGTTCTTGCAATAGAAAGTTCCATTGCATCAAAA
CATCGAAGCGTCAATCTGGCTTTGGAAACTGTCGTTGAATTCGACAAGCTCGATGACGTG
AAGGTCGTCAACAGAGTCCAAGATTTCCTGACAAGAGTACAAAGTTTATTGAACAAAAAT
CACAGAAGCGATCTGTCTACTCGGAGGAAGAATCTATACCAATTCTCTGATTTTTTAGAA
GACGTTATCATAGAAGTACAGTCGTTGGACGACGATGATTTATTGGAAATATTCAATATC
GTCAAAAAAAATGTCGAGAAACGAAAAAAAAATGACATCCTGGACGACAACGGGAGGAGA
TATTTATCTGGCAAGTTATCGGTATTGAGACAGCGGCCGACTGTTGAAATACGCAGCGAT
TTAGACGAGCTGTTAAGAGAGTTGAGGAACGATAGGCAAAGTAAAAGCAACTACATACTC
GGCTTCATAAACAAACTTTACGAACGTGACGCACAAAACAAACTTAAGAAATTTGTTAAG
AGATTGAACATATTCAGGCGGAGTGCGAGGAAATCCAAAGTCGACTTGAAGAATATCATC
GAAAAAGTTATCAATGACGTCATATACGAACATTACAATAATTTGGAGGGCGACGAAAGA
CGCTATATAAAGAAATATGTGAGATCATTCTTTGTAGAGGACGCACCAAAAGAAGTAACA
ACCAAACCTGCAACGGACGAAGAGAAGACCGAAAACGTAAATACTGACGAGGAATCAAGC
GAATCCGTAGAACAAGCGGTCGATGTCAGTCCATTCGCGGACAACACAAAACAAAACACA
AAACCAAACAATTTGTATCCAAAAACATCAAAAGTAACGAGTACCCAGAAACATAAGTTC
AGAACCGGCATTCCGGTAAGAGAACATCAATTGGAAGAGGTAGCAGACAAGCCCACAGAA
GAAGATCGCGGCGAGGAGTCTGGACAAACATTTAAAGCTCAAGACGTCATCGCTACTCGT
GAAATCTTCGCAAATCATTCGCTTGAAATAAACGTTAACTTAATGATAAAAAAACATAAA
GACTCGCAGGAGGAAAACGTCCCCAATCAGAATACCTATAAACACAGACTCTCTCAGAGG
GCTTCGACGAAAAAAACAAGAAAACCCAAAAACCATCGCACAACTACAGCAGCTCTGTCC
GCTGAGACATTTGGTAGTTTACATAAAAAAACGACAAAAAAACATATCACCAAAAAAAAG
TTACGCGCTCCCAAAACTAGGACTACAGCGAAAGAAGCTATTCTTAGGACGATGATGAAT
GTTGACACTGACGAAAAAGATTCAGAGCCACTCTCTGACGATAATCGGCGCAAAAACTTT
AAAAATATCCACACAGAAGACCACAAGTCGGTTAAAACTACTATGAAACAGGCGATCAGA
CGCGCAAATATACCCATCGAGAGCTATTACAACGTCACTCTAACTACTAGAGTGAATAGT
GTCAGTGTCCCAACTATGAGAAAACATCGCACAACCACGACTTCGCGTCCCGTCGTCACA
ACTACAGAAGATACGACTTCATCACCAACAACTACAGTAACAACAACTACTACAACTACA
CCTAGCGATATATTCAATGAAATACAACATAAAAACGTTACTGACGCAGAATTATCCAGC
GAAATCGATAATTTAGACGTCAATCATCCAGAAGATACAAATTTGACCGATTCAAACAAA
ACGATCGGCAATGAAACTGAGACTTTCAATATTGACGAATATTTAAAAAAATCAGATTTA
CGAAACGCCGTAAACGAAGACAAGATGAAGAACATAGAGGACACAATGAGAGATGATTTT
AAAATGTTAATGTCTTCGCCCGCCACCACCACCACGGAGGCGTTCACGGATTGTATTTTG
GAAGTGAGCGCCGAATATATCAGTACAATGGCAACTCTAACGGATCCGTACGGGACGACC
GGTCAATTGACTCTCAAGTGGGACCCCAAGAACTTGGAGATAAGGACCATGTCGGTGGAG
AGGACCCTCGAGCCGTTGGTGTTGCAAGTTACCACACTCGTCAACAGCAAGGACAAGACC
GCTAAGAAGAAGAGACCTGGCAAATCGAAAAGGGCGAGTGCTCTTGTGGCGACAGTGGAG
CGTGCTACTGAGATCTTCATAGAGCGTGGTCAGATCATAGCGTACGAGAACCCTGAGATC
ACACAGGAGATGCTGGCGGCGGTGGAGGAGGTCCGGAAAGCTGGTGCGGCCATGAGTCTA
GCTGCTCGCGAGTTCTCCGAGGAGCCGTGCGCTTCCTCCGTCCGCTCGGGCATGGTCCGG
GCCGCTCGCAGTCTGCTGTCCGCGGTCACTCGACTGCTCATACTGGCGGACATGGTGGAT
GTACATCTTCTGCTCACCAACCTTAGGACGGTGGAGAACGATCTGGACAAGCTGAAGTCG
GCCTCCTCTCAATCGGAGCTGTTAGAATCGGCTCGTCAGTTCGGCAGGTCAGCCAACGAG
CTGGCGGCGCAGGCTGCCAGGAGACAGAAGGAGCTCAAGGAGCCGAGGATGAAGGACGAG
CTCGCGGCGGCCAGGGCCGTGCTCAAGAAACACTCCACTATGCTACTCACTGCGTCCAAA
GTGTACGTCCGTCACCCCGAGCTGGCCGCGGCCAAGGCCAACCGTGACTTCGTACTGCGC
GCCGTGTGCTCCGCCGTGGACACCATCTCGTGCGTGGCGCAAGGAAGACCGCTCCCGCCG
GCCGGGTCAAACCGGGTGCCCGTGGAAGGTCCGGGGGAACTGGCCCAAGCCTTGGATGAC
TTTGATGAGCGGATGGTGATGGAGCCCATGTCGTACTCCGAGCTGAGGACCAGGCCCTCG
CTGGAGGAGCGTCTCGAGAGCATCATCTCCGGGGCGGCGCTGATGGCGGACAGCTCCTGC
ACCAGGGACGAGCGCCGCGAGCGCATCGTGGCGGAGTGCAACGCCGTGAGGCAAGCGCTG
CAGGACCTGTTGCACGAGTACATGAGTAACGCCGGCAGACAGGAACAGTCTGAGGGTCTG
GAGCGAGCCCTGGAACAGATGTGCCGCAAGACGAGGGACCTCCGGAGGCAGCTGAGGAAG
GCCGTCGTGGACCACGTGTCCGACAGTTTCCTGGAGACGAACGTGCCGCTCCTGGTGCTG
CTGGAGGCGGCCCGCGGCGGCAACGAGAAGGAGGTGGAGGAGTACGCGCTGGTGTTCACG
GAACACGCCAACAAGCTGGTCGAGTCAAAATCTATTCACTTCAAGTCTAATATTTTATAA

Protein sequence:

MARLTLWLIVLAIESSIASKHRSVNLALETVVEFDKLDDVKVVNRVQDFLTRVQSLLNKN
HRSDLSTRRKNLYQFSDFLEDVIIEVQSLDDDDLLEIFNIVKKNVEKRKKNDILDDNGRR
YLSGKLSVLRQRPTVEIRSDLDELLRELRNDRQSKSNYILGFINKLYERDAQNKLKKFVK
RLNIFRRSARKSKVDLKNIIEKVINDVIYEHYNNLEGDERRYIKKYVRSFFVEDAPKEVT
TKPATDEEKTENVNTDEESSESVEQAVDVSPFADNTKQNTKPNNLYPKTSKVTSTQKHKF
RTGIPVREHQLEEVADKPTEEDRGEESGQTFKAQDVIATREIFANHSLEINVNLMIKKHK
DSQEENVPNQNTYKHRLSQRASTKKTRKPKNHRTTTAALSAETFGSLHKKTTKKHITKKK
LRAPKTRTTAKEAILRTMMNVDTDEKDSEPLSDDNRRKNFKNIHTEDHKSVKTTMKQAIR
RANIPIESYYNVTLTTRVNSVSVPTMRKHRTTTTSRPVVTTTEDTTSSPTTTVTTTTTTT
PSDIFNEIQHKNVTDAELSSEIDNLDVNHPEDTNLTDSNKTIGNETETFNIDEYLKKSDL
RNAVNEDKMKNIEDTMRDDFKMLMSSPATTTTEAFTDCILEVSAEYISTMATLTDPYGTT
GQLTLKWDPKNLEIRTMSVERTLEPLVLQVTTLVNSKDKTAKKKRPGKSKRASALVATVE
RATEIFIERGQIIAYENPEITQEMLAAVEEVRKAGAAMSLAAREFSEEPCASSVRSGMVR
AARSLLSAVTRLLILADMVDVHLLLTNLRTVENDLDKLKSASSQSELLESARQFGRSANE
LAAQAARRQKELKEPRMKDELAAARAVLKKHSTMLLTASKVYVRHPELAAAKANRDFVLR
AVCSAVDTISCVAQGRPLPPAGSNRVPVEGPGELAQALDDFDERMVMEPMSYSELRTRPS
LEERLESIISGAALMADSSCTRDERRERIVAECNAVRQALQDLLHEYMSNAGRQEQSEGL
ERALEQMCRKTRDLRRQLRKAVVDHVSDSFLETNVPLLVLLEAARGGNEKEVEEYALVFT
EHANKLVESKSIHFKSNIL
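
The nucleotide and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the helper name translate is hypothetical, and only the first and last lines of the nucleotide sequence are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 60-nt lines of the nucleotide sequence in this record.
first_line = "ATGGCTCGACTGACGTTATGGCTTATCGTTCTTGCAATAGAAAGTTCCATTGCATCAAAA"
last_line = "GAACACGCCAACAAGCTGGTCGAGTCAAAATCTATTCACTTCAAGTCTAATATTTTATAA"

print(translate(first_line))  # MARLTLWLIVLAIESSIASK
print(translate(last_line))   # EHANKLVESKSIHFKSNIL
```

Both outputs match the start and end of the protein sequence listed above. A CDS of 3300 nt corresponds to 1100 codons, that is 1099 residues plus the terminal TAA stop codon.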