DPGLEAN11742 in OGS1.0

New model in OGS2.0DPOGS210287 
Genomic Positionscaffold1382:+ 17121-22290
See gene structure
CDS Length1386
Paired RNAseq reads  458
Single RNAseq reads  1317
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000030 (5e-153)
Best Drosophila hit  sarcoglycan alpha (3e-55)
Best Human hitepsilon-sarcoglycan isoform 2 (1e-14)
Best NR hit (blastp)  PREDICTED: similar to Sarcoglycan CG7851-PA [Apis mellifera] (3e-73)
Best NR hit (blastx)  PREDICTED: similar to Sarcoglycan CG7851-PA [Apis mellifera] (2e-71)
GeneOntology terms


  
GO:0016012 sarcoglycan complex
GO:0008307 structural constituent of muscle
GO:0005509 calcium ion binding
GO:0005515 protein binding
InterPro families

  
IPR008908 Sarcoglycan alphaepsilon
IPR006644 Dystroglycan-type cadherin-like
IPR015919 Cadherin-like
Orthology groupMCL13689

Nucleotide sequence:

ATGAGAGCGGTTGTATGTCATGTTTATAATGCTGTCGAAACTGAAATGTTCTCTATACCC
ATCAGTCCTAACTTATTCAACTGGACTTATCAAGAATTTGATCAGCAGTACCGTTTCCAC
GCGTCCTTGATCGGTAAACCTGAATTGCCGATATGGCTTCGTTACATCTACAGCGGGCGA
CATCACTCGGGATTCATTTTTGGCACGCCGCCCCGAAATACTGAATCTCCTATTACGTTA
GAGGTGATAGGGTTGAACCGTCAGGACTATGAAACCCGCCGGGTGCTGTTAACCCTGAAG
GTTCTTCCCAAGGAGAAGATGGCTCGCCACGAGGTCGAGTTCAAGATAGACAATCTTAAT
GTTGAAGATCTTCTCGATGAGCATAGAATGAGCCGTCTGAAGGACATACTACGTACTAAA
CTATGGTTTGAGAGCAGCGAGGATCTGTATCCGACGTTCCTTGCATCAGCTATAGACTTG
GGAGCCAGGCTACCGTTGAAGCCCAGCGATGGAGAAGGTCTGGTGATACGTCTGGGTAGT
TCTCACCCGTTCTCGTCGGAGATGAAACGTCTCAGAGAGGAGGTACGCCCTCTCAGCAGA
CTACCCAGCTGTCCGAGGGAATACAAGAGAACAACCGTGGAGAGACTGTTCAGAGACGCC
GGCTTCACACTGGACTGGTGTAACTTTGAGCTGTACAATACAATATACGGTCCACGGTCC
ACGGATCACTTGGAATACTTAACTGAGATTCCTTCACCCATAAATCGCGTCCGATCTGAA
AGTCGCGAAGTGTGGACGGCGCCTAACAAGCAATCCTTGCCGACGAGGAGTTACGCGAAA
CAATTGACAGCAGCGATAGTGGGACCGTTGATTTTGCTGCTGCTATCGGTAGCAGCACTA
ACCGGTGTGCTGTGCTTCCATTATGCTGCTATAGCGCACAAGTCATCTAACGTTGAAATA
TGCAAATATGGTACTAGCAACACAGAGCAAACTCAATTAGCTGACAACACCAGCACTAAA
AGTTTAGGAATCAGTCCAAGCAGCAGTCTAGCGCGGCCCTACAGTCCTAAATCGACGACA
AACTTAGCCGGCAGCTACAACCGACCTCAACCACCGCCGTACGGGACCCTCCATCATAGG
AAACTGGACAAAACACCCGACAAAAGGTCGCGTTCACTGGAAGAATCATTAAAATTATTA
AACGAAGCCAACATAGCTACGGAGTACGAGAGGAATCCGATCATAGACTACGCCGACAGC
ACGGACGATTACATATCAATAAAACCTGACACTGATTACATTATTAATAAAATGCAAAAC
GATTTGGACGACATAGTGGTTCCTGAACTCGCTAAATACGGCATATCCGGCATAGGACCG
ATTTGA

Protein sequence:

MRAVVCHVYNAVETEMFSIPISPNLFNWTYQEFDQQYRFHASLIGKPELPIWLRYIYSGR
HHSGFIFGTPPRNTESPITLEVIGLNRQDYETRRVLLTLKVLPKEKMARHEVEFKIDNLN
VEDLLDEHRMSRLKDILRTKLWFESSEDLYPTFLASAIDLGARLPLKPSDGEGLVIRLGS
SHPFSSEMKRLREEVRPLSRLPSCPREYKRTTVERLFRDAGFTLDWCNFELYNTIYGPRS
TDHLEYLTEIPSPINRVRSESREVWTAPNKQSLPTRSYAKQLTAAIVGPLILLLLSVAAL
TGVLCFHYAAIAHKSSNVEICKYGTSNTEQTQLADNTSTKSLGISPSSSLARPYSPKSTT
NLAGSYNRPQPPPYGTLHHRKLDKTPDKRSRSLEESLKLLNEANIATEYERNPIIDYADS
TDDYISIKPDTDYIINKMQNDLDDIVVPELAKYGISGIGPI