New model in OGS2.0 | DPOGS210287  |
---|---|
Genomic Position | scaffold1382:+ 17121-22290 |
CDS Length | 1386 |
Paired RNAseq reads   | 458 |
Single RNAseq reads   | 1317 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000030 (5e-153) |
Best Drosophila hit   | sarcoglycan alpha (3e-55) |
Best Human hit | epsilon-sarcoglycan isoform 2 (1e-14) |
Best NR hit (blastp)   | PREDICTED: similar to Sarcoglycan CG7851-PA [Apis mellifera] (3e-73) |
Best NR hit (blastx)   | PREDICTED: similar to Sarcoglycan CG7851-PA [Apis mellifera] (2e-71) |
GeneOntology terms    | GO:0016012 sarcoglycan complex; GO:0008307 structural constituent of muscle; GO:0005509 calcium ion binding; GO:0005515 protein binding |
InterPro families    | IPR008908 Sarcoglycan alpha/epsilon; IPR006644 Dystroglycan-type cadherin-like; IPR015919 Cadherin-like |
Orthology group | MCL13689 |
Nucleotide sequence:
ATGAGAGCGGTTGTATGTCATGTTTATAATGCTGTCGAAACTGAAATGTTCTCTATACCC
ATCAGTCCTAACTTATTCAACTGGACTTATCAAGAATTTGATCAGCAGTACCGTTTCCAC
GCGTCCTTGATCGGTAAACCTGAATTGCCGATATGGCTTCGTTACATCTACAGCGGGCGA
CATCACTCGGGATTCATTTTTGGCACGCCGCCCCGAAATACTGAATCTCCTATTACGTTA
GAGGTGATAGGGTTGAACCGTCAGGACTATGAAACCCGCCGGGTGCTGTTAACCCTGAAG
GTTCTTCCCAAGGAGAAGATGGCTCGCCACGAGGTCGAGTTCAAGATAGACAATCTTAAT
GTTGAAGATCTTCTCGATGAGCATAGAATGAGCCGTCTGAAGGACATACTACGTACTAAA
CTATGGTTTGAGAGCAGCGAGGATCTGTATCCGACGTTCCTTGCATCAGCTATAGACTTG
GGAGCCAGGCTACCGTTGAAGCCCAGCGATGGAGAAGGTCTGGTGATACGTCTGGGTAGT
TCTCACCCGTTCTCGTCGGAGATGAAACGTCTCAGAGAGGAGGTACGCCCTCTCAGCAGA
CTACCCAGCTGTCCGAGGGAATACAAGAGAACAACCGTGGAGAGACTGTTCAGAGACGCC
GGCTTCACACTGGACTGGTGTAACTTTGAGCTGTACAATACAATATACGGTCCACGGTCC
ACGGATCACTTGGAATACTTAACTGAGATTCCTTCACCCATAAATCGCGTCCGATCTGAA
AGTCGCGAAGTGTGGACGGCGCCTAACAAGCAATCCTTGCCGACGAGGAGTTACGCGAAA
CAATTGACAGCAGCGATAGTGGGACCGTTGATTTTGCTGCTGCTATCGGTAGCAGCACTA
ACCGGTGTGCTGTGCTTCCATTATGCTGCTATAGCGCACAAGTCATCTAACGTTGAAATA
TGCAAATATGGTACTAGCAACACAGAGCAAACTCAATTAGCTGACAACACCAGCACTAAA
AGTTTAGGAATCAGTCCAAGCAGCAGTCTAGCGCGGCCCTACAGTCCTAAATCGACGACA
AACTTAGCCGGCAGCTACAACCGACCTCAACCACCGCCGTACGGGACCCTCCATCATAGG
AAACTGGACAAAACACCCGACAAAAGGTCGCGTTCACTGGAAGAATCATTAAAATTATTA
AACGAAGCCAACATAGCTACGGAGTACGAGAGGAATCCGATCATAGACTACGCCGACAGC
ACGGACGATTACATATCAATAAAACCTGACACTGATTACATTATTAATAAAATGCAAAAC
GATTTGGACGACATAGTGGTTCCTGAACTCGCTAAATACGGCATATCCGGCATAGGACCG
ATTTGA
Protein sequence:
MRAVVCHVYNAVETEMFSIPISPNLFNWTYQEFDQQYRFHASLIGKPELPIWLRYIYSGR
HHSGFIFGTPPRNTESPITLEVIGLNRQDYETRRVLLTLKVLPKEKMARHEVEFKIDNLN
VEDLLDEHRMSRLKDILRTKLWFESSEDLYPTFLASAIDLGARLPLKPSDGEGLVIRLGS
SHPFSSEMKRLREEVRPLSRLPSCPREYKRTTVERLFRDAGFTLDWCNFELYNTIYGPRS
TDHLEYLTEIPSPINRVRSESREVWTAPNKQSLPTRSYAKQLTAAIVGPLILLLLSVAAL
TGVLCFHYAAIAHKSSNVEICKYGTSNTEQTQLADNTSTKSLGISPSSSLARPYSPKSTT
NLAGSYNRPQPPPYGTLHHRKLDKTPDKRSRSLEESLKLLNEANIATEYERNPIIDYADS
TDDYISIKPDTDYIINKMQNDLDDIVVPELAKYGISGIGPI
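
The CDS length and the two sequences above can be cross-checked against each other: 1386 nt is 462 codons, i.e. 461 residues plus a stop codon. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the `translate` helper and the variable names are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First line of the nucleotide sequence and the final four codons of the record.
first_line = "ATGAGAGCGGTTGTATGTCATGTTTATAATGCTGTCGAAACTGAAATGTTCTCTATACCC"
last_codons = "GGACCGATTTGA"

print(translate(first_line))   # MRAVVCHVYNAVETEMFSIP
print(translate(last_codons))  # GPI*
print(1386 // 3 - 1)           # 461 residues expected before the stop codon
```

Passing the full nucleotide sequence to `translate` and stripping the trailing `*` should reproduce the protein sequence listed above, if the assumption about the genetic code holds.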