DPGLEAN08875 in OGS1.0

New model in OGS2.0  DPOGS208589
Genomic Position  scaffold2965:+ 337-10923
CDS Length  3495
Paired RNAseq reads  2129
Single RNAseq reads  5635
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005731 (3e-13)
Best Drosophila hit  adherens junction protein p120, isoform A (2e-141)
Best Human hit  plakophilin-4 isoform b (2e-77)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC001464 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC001464 [Tribolium castaneum] (2e-159)
GeneOntology terms
GO:0005912 adherens junction
GO:0005737 cytoplasm
GO:0005813 centrosome
GO:0007155 cell adhesion
GO:0005488 binding
GO:0001745 compound eye morphogenesis
GO:0034334 adherens junction maintenance
InterPro families
IPR016024 Armadillo-type fold
IPR011989 Armadillo-like helical
IPR000225 Armadillo
IPR019394 Predicted transmembrane/coiled-coil 2 protein
Orthology group  MCL10994

Nucleotide sequence:

GCGGGTGATTACCCTGTCGGTGGCGGTGTGGAGCCGGACTATGGCCAGTACGTGTCGTCA
CCAGCCGCACATTACAATCACATGGGACACCTCACCATGGCGCCATCACAGAAATACCCT
CATGTGATGGAGACTGTGTCTGTGAGTAGCGGTTACGCGGGCGGAGTGGGGGCGGGTCAC
TACGGCACGTACGCTCACTACGGCGCCGCGTATCCAACTCAGCCCGCTTACTTAGTTGAA
CCTCAAGTACCCGCTTATATGGCACAGGAATTTGTAGAGGGCGGTGGTAGCGCTTCCCCT
CGCAGTGCTTCACCCGGGGCCATACCTCCGCAACACAATATGCATCTGCAACAGAGGTAC
GACTCAGCGTCTCTAGAACAATTAGGTCGACACTACTGTGTGACGTCACCGCGCGGTGAA
TACGCCCCGGACGCGTACGGCTACCAGCACTACTCCGCCGCCTACGACACTACCCACCAG
CCAACGGCGTTTAAGGATTCACAAAACGGTTTGAGTTTAGGCAGCACCGGAGGGCAGTCT
ATGTATGGCGATGAAGAGGAGCTACAAAAGCAAATGGCCAATATGGCATTAGTTCACGGT
AGTGTTGGGGTGGGAGGTCGGGAGGAGGGTGGTGGTCTTCAGTGGCGAGACCCCAACCTG
CCAGAGGTCATCGGTTTCCTGAACTCGCCGTCGGATGTCGTGAAGGCAAACGCCGCCGCT
TACTTACAACACCTCACCTATATGGATGACCCTAACAAACAGAAAACTAGAAGCTTAGAG
AACGTGAACGAGTACCTGAAGCTAGCAGCGAACGCTGACAAGCAGCAGCTGGCGAGGATC
AAAGCTGTGTTCGAGAAGAAGAATCAGAAGAGTGCGCTCTGTATCGTACAGCTGCAGAAG
AAGCTGGAGGGGTACAACAAGAGAATTAAGAGCTGGGAGATCAAGGAGTTGGTGACCGGC
GTCATATGGAACATGTCTTCCTGCGAGGATCTCAAGCAGTCCATTATAGACGACGCGGCT
CAAGTTATATTCAACAAGGTCATCATACATCACTCCGGCTGGCATCCGACAAACCCTGGG
GACACGTACTGGTCCAACGTGTTCCGTAACGCGTCCGGTGTGCTCCGTAACGCGTCCTCG
GCCGGGGAGTACGCTCGCAGACGTCTCCGTTCCCTGGCGGGGCTGGCCGAGGCTCTGCTG
CACACAGTGCGGGTGGCGCTCGTCAAGAACGCTATAGGGACCAAGGTCGTCGAAAATTGC
GTCTGTGTCCTAAGAAATCTGTCGTACAGGTGTCAGGAAATAGAGGACCCGCTGTATGAC
ACTCGAGCGCCTCCGACACAGTCATCGGGACAGGCGAGGATTCAAGCGAGTGCGTCGAAA
GGAGAAAATCTCGGATGTTTCGGTGGAAGTAAAAAGAAAAAAGAGGGTTCGTCATCAAAT
TCTACGAGTCCGCTCGGCAAGAGCGATCCTGAACCACAGACGGACACGAATACTACACAG
TTAGGGTACAGCGTACCCAAAGGGACGGAGATGCTTTGGTCGCCTGAGGTGGTTCCTCTA
TACATGGCGTTACTCCAAACATGTTCCAATCCTGAGACCCTGGAGGCCGCGGCCGGAGCT
CTACAAAACCTCGCAGCTTGTTACTGGCAACCCTCCATAGATATACGTGCAGCTGTTAGG
AAAGAAAAAGGGTTACCAATTCTCGTAGAACTGTTACGTATGGAAGTCGACAGGGTGGTG
TGTGCTGTGGCCACAGCTCTACGTAATTTGGCCATCGATCAACGCAACAAAGAACTCATA
GGAAAATACGCGATGCGTGATTTAGTACAGAAATTACCCAGCGGTAATCAACAACACGAT
CAGGGTACATCGGACGACACCATAGCCGCGGTACTGGCTACATTAAACGAGGTTATAAAG
AAGAGTGCCGAGTTCTCACGTTCGTTGTTAGAGGCGGGCGGGGTCGAGCGCTTGTTGAAT
CTGACCAAACAACGTCATCGCCACACGCCCAGAGTACTAAAGTTTGCTGGCCAAGTCCTA
ATGACGATGTGGTCTCACGTTGAGCTGCGTGAGGTGTATCGTAAGCACGGATGGCGTGAA
GCAGATTTCCTCACACCGGCCAGGGCTGCTCAACCCCGGGCAGCATCTAACACTAGCGTT
CATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCAT
TTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCT
CACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGC
TCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTG
ACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGC
TCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATA
CTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCAT
ACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCA
CTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATT
CAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACG
AAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAAC
GATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTC
ACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATG
TTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCAT
GTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTA
ATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCAC
TTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCA
GTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACT
CATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCG
GCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTC
GGCTCAGTTCATAACGATGACATGCCACCCATGGATGTGAATTACGTCGACGGTCTGGGT
ATGGGGAACCCTAACGGCCGTCCCATGAACCTACAGGGGGTCCAAGCCCAAATGCCCCCA
CCAAGCAGCCGTTAA

Protein sequence:

AGDYPVGGGVEPDYGQYVSSPAAHYNHMGHLTMAPSQKYPHVMETVSVSSGYAGGVGAGH
YGTYAHYGAAYPTQPAYLVEPQVPAYMAQEFVEGGGSASPRSASPGAIPPQHNMHLQQRY
DSASLEQLGRHYCVTSPRGEYAPDAYGYQHYSAAYDTTHQPTAFKDSQNGLSLGSTGGQS
MYGDEEELQKQMANMALVHGSVGVGGREEGGGLQWRDPNLPEVIGFLNSPSDVVKANAAA
YLQHLTYMDDPNKQKTRSLENVNEYLKLAANADKQQLARIKAVFEKKNQKSALCIVQLQK
KLEGYNKRIKSWEIKELVTGVIWNMSSCEDLKQSIIDDAAQVIFNKVIIHHSGWHPTNPG
DTYWSNVFRNASGVLRNASSAGEYARRRLRSLAGLAEALLHTVRVALVKNAIGTKVVENC
VCVLRNLSYRCQEIEDPLYDTRAPPTQSSGQARIQASASKGENLGCFGGSKKKKEGSSSN
STSPLGKSDPEPQTDTNTTQLGYSVPKGTEMLWSPEVVPLYMALLQTCSNPETLEAAAGA
LQNLAACYWQPSIDIRAAVRKEKGLPILVELLRMEVDRVVCAVATALRNLAIDQRNKELI
GKYAMRDLVQKLPSGNQQHDQGTSDDTIAAVLATLNEVIKKSAEFSRSLLEAGGVERLLN
LTKQRHRHTPRVLKFAGQVLMTMWSHVELREVYRKHGWREADFLTPARAAQPRAASNTSV
HVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILG
SVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQI
LGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTI
QILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMF
TIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHL
MFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLT
HLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHNDDMPPMDVNYVDGLG
MGNPNGRPMNLQGVQAQMPPPSSR
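
The stated CDS length and the two sequence listings can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1), and the names `translate`, `first_line` and `last_line` are hypothetical helpers introduced only for illustration. It translates the first and last lines of the nucleotide listing and checks that 3495 nt corresponds to the listed protein plus one stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record does not state which table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing in this record.
first_line = "GCGGGTGATTACCCTGTCGGTGGCGGTGTGGAGCCGGACTATGGCCAGTACGTGTCGTCA"
last_line = "CCAAGCAGCCGTTAA"

print(translate(first_line))  # compare with the start of the protein listing
print(translate(last_line))   # compare with the end of the protein listing

# Length check: 19 protein lines of 60 residues plus a final line of 24,
# assuming every full line holds exactly 60 residues.
cds_length = 3495
protein_length = 19 * 60 + 24
print(cds_length == 3 * (protein_length + 1))  # +1 for the stop codon
```

Both full 60-character lines are in frame because 60 is a multiple of 3, so any single line of the nucleotide listing can be translated on its own and compared with the matching 20 residues of the protein listing.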