DPGLEAN19575 in OGS1.0

New model in OGS2.0DPOGS215171 
Genomic Positionscaffold59:- 68813-174117
See gene structure
CDS Length1950
Paired RNAseq reads  181
Single RNAseq reads  678
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008647 (6e-72)
Best Drosophila hit  dystrophin, isoform C (1e-66)
Best Human hitdystrophin Dp427c isoform (7e-27)
Best NR hit (blastp)  PREDICTED: similar to dystrophin CG31175-PA, isoform A [Apis mellifera] (3e-91)
Best NR hit (blastx)  PREDICTED: similar to dystrophin CG31175-PA, isoform A [Apis mellifera] (5e-81)
GeneOntology terms















  
GO:0008586 imaginal disc-derived wing vein morphogenesis
GO:0005856 cytoskeleton
GO:0005198 structural molecule activity
GO:0008307 structural constituent of muscle
GO:0016010 dystrophin-associated glycoprotein complex
GO:0008092 cytoskeletal protein binding
GO:0003779 actin binding
GO:0005509 calcium ion binding
GO:0008270 zinc ion binding
GO:0007517 muscle organ development
GO:0005200 structural constituent of cytoskeleton
GO:0007274 neuromuscular synaptic transmission
GO:0048172 regulation of short-term neuronal synaptic plasticity
GO:0030010 establishment of cell polarity
GO:0046716 muscle cell homeostasis
GO:0050699 WW domain binding
GO:0007474 imaginal disc-derived wing vein specification
InterPro families  IPR018159 Spectrin/alpha-actinin
Orthology groupMCL19697

Nucleotide sequence:

ATGCGTGGCGTCGCGACGGCAGTCGGCCGGCGGACGCCGCGCGCGACGGACGCCTCGCGC
CATGTGACCCTGCCGGCGCCCGCCATGACCGAGGGGGCGGGCAGGCCTAGGCCCGCGCCG
CCCCGGAAGCCGGACTCCCTGACCCTTTGGCCCTGGGACGACCCTCCGCCGCCGGCCCCT
GAACCCACGAAGCCTCCGCTGCCGCTCGCAGACTTCGGCGGCTTCTCTCCGTACACGCCG
CCCGCTCCCATCCAGCCCGAGCGCCGACCGCGGCGACCTCAGAGCGCGGTGACGGGTCGC
CAGGCGCCACCGGTCCCGAGGAGACCTCACTCCCTTGGCGCCGTAGACGATGACTGGCTC
GGGCCAGGTCCGGGGCTGGTGGCGCCCGTGCCTCTCAGGCCCGCGCCTCACGCCACGTTG
CCTCGCGTGCTCACCATGCCGTCCTCCCGGAGTGACTATCAGTTGCAACGGCCCGCGATT
ATGACGAACGGTTATGGTGCTGTGCCGCCGTCGACTTCTAGAAGGTTGTCTTTGCCCGGG
GTCCAGAACGCCCCAAAACTCTCCGCGACGTTCCATGGCCTGTCCGGGCTCATGGGACCC
GTACCCTCCCTGGGGACCCCCGGGACCCCGAGCACGCCGGCCGCCGTGCGGGAGGCCGTA
CGTCACCTCCTCCAGCAGCCCCGCAACGGTTTCCCCATCATGGATCACAAGCTGTCTCTG
TTTATCGACATACTCGACGCCCAAGAAAGATTTTCACAGCGAGCCGCTTACATAATAAGA
GCGATGATGGTGTATTTCAATTCAAATGACAATGATGACGTCACAGGGTCTCAGCTGCCG
CCGTCTCCTCGGATGCTTATCGGCCTGAGGGGCGACATCGAATCAGTGATCTCGAAATGG
AGATGGCGGCCCGAGCCAGCTGATAGGAACATCGAAGAGTGGAAGCTGCAAAGGAGTCCA
CAAGGGCTTCGGCCTCGGAACAGCTCGGGGACATGGCCCGGCCGCCCTCGCTCGGGGGCG
GCCTGGCCTCTGGAGAAGTTCGTCACTGAAGGCCTGTGTCATCGATGCGGTGGTGACTAC
AGTTACAAAGGTGCTCTTAAAGTGAACGTTGACAGTTCGGAGTATACGGGTTACAAGTAT
TTTTATAACAAGCCTGGCTATCGAGTGAGAGCCCGGTCCTACGAAGACGCCAGGTTCATT
CATATCAACTTCCTCCTGCTGGCTGCGATGAGACATTTCGAATCGGATCTACAATCAGAG
ATAGAAACTCATCGCGATGTGTACGCGTCTCTCACGGGAACTGGCCGCCGTCTGCTGGGC
TCGCTCTCATCCCAGGAAGACGCCGTGATGCTGCAGAGAAGATTAGACGAGATGAATCAG
AGATGGCATCACCTTAAAGCGAAGAGCATGGCCATCAGGAACCGTCTGGAGAGCAACGCT
GAACACTGGTCCGCGCTGTTGCTGTCGTTACGAGAACTCACCGAGTGGGTCATCAGGAAG
GACACCGAGCTGAACGCCCTGGCTCCGCCGAGAGGAGACCTCAACGCTCTCATAAAACAA
CAGGACGACCACCGTGCCTTCCGCCGCCAATTGGAGGATAAGCGCCCAGTGGTTGAGAGT
AACCTGCTCTCTGGGAGGCAGTACGTGGCCAACGAACCTCCGCTCTCTGACACCAGTGAC
ACGGAACCGAGTCGTGACTCAGAAGGTGACTCCCGAGGATACCGTTCTGCTGAGGAGCAG
GCTCGGGAATTGGCGAGGTCCATCCGAAGAGAGGTCGCAAAGTTAGCTGATAAATGGAAC
TCCCTGGTCGATAGGAGCGACGCCTGGGGCCGCTGTCTCGATGATGCCGTGCAGGAGTCG
CCTGTAGTGATTGTGCGAGGTGAGACTCTCTCTGTGAATACTATTATTAATACAGCGGAC
CCGGAGTCGGCGGACAGATGCTGTCGCTAA

Protein sequence:

MRGVATAVGRRTPRATDASRHVTLPAPAMTEGAGRPRPAPPRKPDSLTLWPWDDPPPPAP
EPTKPPLPLADFGGFSPYTPPAPIQPERRPRRPQSAVTGRQAPPVPRRPHSLGAVDDDWL
GPGPGLVAPVPLRPAPHATLPRVLTMPSSRSDYQLQRPAIMTNGYGAVPPSTSRRLSLPG
VQNAPKLSATFHGLSGLMGPVPSLGTPGTPSTPAAVREAVRHLLQQPRNGFPIMDHKLSL
FIDILDAQERFSQRAAYIIRAMMVYFNSNDNDDVTGSQLPPSPRMLIGLRGDIESVISKW
RWRPEPADRNIEEWKLQRSPQGLRPRNSSGTWPGRPRSGAAWPLEKFVTEGLCHRCGGDY
SYKGALKVNVDSSEYTGYKYFYNKPGYRVRARSYEDARFIHINFLLLAAMRHFESDLQSE
IETHRDVYASLTGTGRRLLGSLSSQEDAVMLQRRLDEMNQRWHHLKAKSMAIRNRLESNA
EHWSALLLSLRELTEWVIRKDTELNALAPPRGDLNALIKQQDDHRAFRRQLEDKRPVVES
NLLSGRQYVANEPPLSDTSDTEPSRDSEGDSRGYRSAEEQARELARSIRREVAKLADKWN
SLVDRSDAWGRCLDDAVQESPVVIVRGETLSVNTIINTADPESADRCCR