New model in OGS2.0 | DPOGS208589  |
---|---|
Genomic Position | scaffold2965:+ 337-10923 |
See gene structure | |
CDS Length | 3495 |
Paired RNAseq reads   | 2129 |
Single RNAseq reads   | 5635 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005731 (3e-13) |
Best Drosophila hit   | adherens junction protein p120, isoform A (2e-141) |
Best Human hit | plakophilin-4 isoform b (2e-77) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC001464 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC001464 [Tribolium castaneum] (2e-159) |
GeneOntology terms    | GO:0005912 adherens junction GO:0005737 cytoplasm GO:0005813 centrosome GO:0007155 cell adhesion GO:0005488 binding GO:0001745 compound eye morphogenesis GO:0034334 adherens junction maintenance |
InterPro families    | IPR016024 Armadillo-type fold IPR011989 Armadillo-like helical IPR000225 Armadillo IPR019394 Predicted transmembrane/coiled-coil 2 protein |
Orthology group | MCL10994 |
Nucleotide sequence:
GCGGGTGATTACCCTGTCGGTGGCGGTGTGGAGCCGGACTATGGCCAGTACGTGTCGTCA
CCAGCCGCACATTACAATCACATGGGACACCTCACCATGGCGCCATCACAGAAATACCCT
CATGTGATGGAGACTGTGTCTGTGAGTAGCGGTTACGCGGGCGGAGTGGGGGCGGGTCAC
TACGGCACGTACGCTCACTACGGCGCCGCGTATCCAACTCAGCCCGCTTACTTAGTTGAA
CCTCAAGTACCCGCTTATATGGCACAGGAATTTGTAGAGGGCGGTGGTAGCGCTTCCCCT
CGCAGTGCTTCACCCGGGGCCATACCTCCGCAACACAATATGCATCTGCAACAGAGGTAC
GACTCAGCGTCTCTAGAACAATTAGGTCGACACTACTGTGTGACGTCACCGCGCGGTGAA
TACGCCCCGGACGCGTACGGCTACCAGCACTACTCCGCCGCCTACGACACTACCCACCAG
CCAACGGCGTTTAAGGATTCACAAAACGGTTTGAGTTTAGGCAGCACCGGAGGGCAGTCT
ATGTATGGCGATGAAGAGGAGCTACAAAAGCAAATGGCCAATATGGCATTAGTTCACGGT
AGTGTTGGGGTGGGAGGTCGGGAGGAGGGTGGTGGTCTTCAGTGGCGAGACCCCAACCTG
CCAGAGGTCATCGGTTTCCTGAACTCGCCGTCGGATGTCGTGAAGGCAAACGCCGCCGCT
TACTTACAACACCTCACCTATATGGATGACCCTAACAAACAGAAAACTAGAAGCTTAGAG
AACGTGAACGAGTACCTGAAGCTAGCAGCGAACGCTGACAAGCAGCAGCTGGCGAGGATC
AAAGCTGTGTTCGAGAAGAAGAATCAGAAGAGTGCGCTCTGTATCGTACAGCTGCAGAAG
AAGCTGGAGGGGTACAACAAGAGAATTAAGAGCTGGGAGATCAAGGAGTTGGTGACCGGC
GTCATATGGAACATGTCTTCCTGCGAGGATCTCAAGCAGTCCATTATAGACGACGCGGCT
CAAGTTATATTCAACAAGGTCATCATACATCACTCCGGCTGGCATCCGACAAACCCTGGG
GACACGTACTGGTCCAACGTGTTCCGTAACGCGTCCGGTGTGCTCCGTAACGCGTCCTCG
GCCGGGGAGTACGCTCGCAGACGTCTCCGTTCCCTGGCGGGGCTGGCCGAGGCTCTGCTG
CACACAGTGCGGGTGGCGCTCGTCAAGAACGCTATAGGGACCAAGGTCGTCGAAAATTGC
GTCTGTGTCCTAAGAAATCTGTCGTACAGGTGTCAGGAAATAGAGGACCCGCTGTATGAC
ACTCGAGCGCCTCCGACACAGTCATCGGGACAGGCGAGGATTCAAGCGAGTGCGTCGAAA
GGAGAAAATCTCGGATGTTTCGGTGGAAGTAAAAAGAAAAAAGAGGGTTCGTCATCAAAT
TCTACGAGTCCGCTCGGCAAGAGCGATCCTGAACCACAGACGGACACGAATACTACACAG
TTAGGGTACAGCGTACCCAAAGGGACGGAGATGCTTTGGTCGCCTGAGGTGGTTCCTCTA
TACATGGCGTTACTCCAAACATGTTCCAATCCTGAGACCCTGGAGGCCGCGGCCGGAGCT
CTACAAAACCTCGCAGCTTGTTACTGGCAACCCTCCATAGATATACGTGCAGCTGTTAGG
AAAGAAAAAGGGTTACCAATTCTCGTAGAACTGTTACGTATGGAAGTCGACAGGGTGGTG
TGTGCTGTGGCCACAGCTCTACGTAATTTGGCCATCGATCAACGCAACAAAGAACTCATA
GGAAAATACGCGATGCGTGATTTAGTACAGAAATTACCCAGCGGTAATCAACAACACGAT
CAGGGTACATCGGACGACACCATAGCCGCGGTACTGGCTACATTAAACGAGGTTATAAAG
AAGAGTGCCGAGTTCTCACGTTCGTTGTTAGAGGCGGGCGGGGTCGAGCGCTTGTTGAAT
CTGACCAAACAACGTCATCGCCACACGCCCAGAGTACTAAAGTTTGCTGGCCAAGTCCTA
ATGACGATGTGGTCTCACGTTGAGCTGCGTGAGGTGTATCGTAAGCACGGATGGCGTGAA
GCAGATTTCCTCACACCGGCCAGGGCTGCTCAACCCCGGGCAGCATCTAACACTAGCGTT
CATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCAT
TTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCT
CACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGC
TCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTG
ACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGC
TCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATA
CTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCAT
ACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCA
CTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATT
CAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACG
AAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAAC
GATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTC
ACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATG
TTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCAT
GTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTA
ATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCAC
TTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCA
GTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACT
CATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCG
GCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTC
GGCTCAGTTCATAACGATGACATGCCACCCATGGATGTGAATTACGTCGACGGTCTGGGT
ATGGGGAACCCTAACGGCCGTCCCATGAACCTACAGGGGGTCCAAGCCCAAATGCCCCCA
CCAAGCAGCCGTTAA
Protein sequence:
AGDYPVGGGVEPDYGQYVSSPAAHYNHMGHLTMAPSQKYPHVMETVSVSSGYAGGVGAGH
YGTYAHYGAAYPTQPAYLVEPQVPAYMAQEFVEGGGSASPRSASPGAIPPQHNMHLQQRY
DSASLEQLGRHYCVTSPRGEYAPDAYGYQHYSAAYDTTHQPTAFKDSQNGLSLGSTGGQS
MYGDEEELQKQMANMALVHGSVGVGGREEGGGLQWRDPNLPEVIGFLNSPSDVVKANAAA
YLQHLTYMDDPNKQKTRSLENVNEYLKLAANADKQQLARIKAVFEKKNQKSALCIVQLQK
KLEGYNKRIKSWEIKELVTGVIWNMSSCEDLKQSIIDDAAQVIFNKVIIHHSGWHPTNPG
DTYWSNVFRNASGVLRNASSAGEYARRRLRSLAGLAEALLHTVRVALVKNAIGTKVVENC
VCVLRNLSYRCQEIEDPLYDTRAPPTQSSGQARIQASASKGENLGCFGGSKKKKEGSSSN
STSPLGKSDPEPQTDTNTTQLGYSVPKGTEMLWSPEVVPLYMALLQTCSNPETLEAAAGA
LQNLAACYWQPSIDIRAAVRKEKGLPILVELLRMEVDRVVCAVATALRNLAIDQRNKELI
GKYAMRDLVQKLPSGNQQHDQGTSDDTIAAVLATLNEVIKKSAEFSRSLLEAGGVERLLN
LTKQRHRHTPRVLKFAGQVLMTMWSHVELREVYRKHGWREADFLTPARAAQPRAASNTSV
HVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILG
SVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQI
LGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTI
QILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMF
TIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHL
MFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLT
HLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHNDDMPPMDVNYVDGLG
MGNPNGRPMNLQGVQAQMPPPSSR