Monarch geneset OGS2.0

DPOGS200585
Transcript: DPOGS200585-TA  978 bp
Protein: DPOGS200585-PA  325 aa
Genomic position: DPSCF300076 - 875891-884904
RNAseq coverage: 1344x (Rank: top 9%)
Annotation
Heliconius: HMEL004570  2e-106  63.82%
Bombyx: BGIBMGA009552-TA  5e-46  67.97%
Drosophila: Scgbeta-PA  1e-43  36.75%
EBI UniRef50: UniRef50_D6WBL6  3e-68  43.73%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WBL6_TRICA
NCBI RefSeq: XP_968260.1  6e-69  43.73%  PREDICTED: similar to beta-sarcoglycan [Tribolium castaneum]
NCBI nr blastp: gi|91076388  1e-67  43.73%  PREDICTED: similar to beta-sarcoglycan [Tribolium castaneum]
NCBI nr blastx: gi|91076388  2e-66  43.73%  PREDICTED: similar to beta-sarcoglycan [Tribolium castaneum]
Group
Gene Ontology: GO:0016021  7.4e-55  integral to membrane
  GO:0007010  7.4e-55  cytoskeleton organization
  GO:0016012  7.4e-55  sarcoglycan complex
KEGG pathway: tca:656657  2e-68
  K12566 (SGCB) maps to:
    Dilated cardiomyopathy
    Viral myocarditis
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
InterPro domain: [57-309]  IPR006875  7.4e-55  Sarcoglycan complex subunit protein
Orthology group: MCL11562  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200585-TA
ATGGCTCCCGACCCTAGCGGAGGGTCGCCTTCAGACATGACGGATCATGTTGATAATCGCGGCAAAGCTCTTTTGACACCTATCCCATCTAACAGCCAGGCCTTTCTTCATACTTTCGCTAAGAACTACTCCGACAAAAACGGGAATGATGTCACCAGGGATGTCAGAAAGGGTCGCAGGACCTTCGTTTTCTGGATCCTCGTCTTCCTATTACTGGTGACGGCTGTTGGCAACCTCGTTCTCACCTTCAGCATCCTTGGAGTTTTACGTCTTGGAAGCGGTCTGGAGAGCATGGAGTTCCTCCCGCTTCATGACGCTGTGAAGTTCCTTGGGGACACGAATCTTGATAACATCTACAAGAAGGACGGCCTGATTGAGAGCTTCCGGGACACGCCCTTGAGCATCACCAGCGAGAACGGATCAGTTCTGATAACACTTCAGACGAGGGCTCCTAGATCTGAGACCAAGCTCATAGTCAACACCACGGGCGTGTTCATCAAGGGCGTCACCACTTTCAACATCAACGATCCCGACTCTGGCGTCCAACTCTTCAGTTCTGGGAATCCAGAAGTAACTGTTAACGAAAATCTTAACAGTTTAAACGCAAGACAAGTATCAACCAAGAGGATTTCGTCGCCGGTAGACGAGGACCTGGTGTTCAGGTCGGACGCCTCCGCCTACCTGCGAGGGGCCGAGGGGACGCACATGGAATCCAAGGAGTTATATTGGAGTGCTGACCAGGACATACATCTTAGATCCGCCAACGGTTCAGTGATACTCAGCGGCAAGGAGGGAGTTTTCGTCGACGTGCGTTACCTCCCTATAGCCTTGCCGTCAAAAGAACACCACGGCACTGGTCAGTTTAAAGTATGCGTTTGTATGCCTCAAGGCAAGCTGTTCAGGATCGCTGTCCCCGACGGTCAGAGAGTCACATGCTCGCACATCAACACCACCGGCGACTTAAATCCTTGCGGGTAG

Protein sequence:

>DPOGS200585-PA
MAPDPSGGSPSDMTDHVDNRGKALLTPIPSNSQAFLHTFAKNYSDKNGNDVTRDVRKGRRTFVFWILVFLLLVTAVGNLVLTFSILGVLRLGSGLESMEFLPLHDAVKFLGDTNLDNIYKKDGLIESFRDTPLSITSENGSVLITLQTRAPRSETKLIVNTTGVFIKGVTTFNINDPDSGVQLFSSGNPEVTVNENLNSLNARQVSTKRISSPVDEDLVFRSDASAYLRGAEGTHMESKELYWSADQDIHLRSANGSVILSGKEGVFVDVRYLPIALPSKEHHGTGQFKVCVCMPQGKLFRIAVPDGQRVTCSHINTTGDLNPCG-
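
Consistency check (illustrative sketch, not part of the original record): the transcript length (978 bp) and protein length (325 aa) agree if the transcript is assumed to be pure coding sequence ending in a single stop codon, since 978 / 3 = 326 codons = 325 residues + 1 stop. The Python sketch below shows one way to verify this kind of relationship and to translate a coding sequence with the standard genetic code (NCBI translation table 1). The function names translate and expected_protein_length are hypothetical helpers introduced here for illustration; the stop codon is rendered as "-" to match the record's protein sequence.

```python
# Standard genetic code (NCBI translation table 1).
# Codons are indexed with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence (A/C/G/T only) into a protein string.

    Stop codons are written as stop_symbol. Assumes the sequence starts
    in frame; raises ValueError for a partial codon or a non-ACGT base.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        b1, b2, b3 = (BASES.index(base) for base in cds[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_protein_length(transcript_bp):
    """Residue count for a transcript that is all CDS plus one stop codon."""
    return transcript_bp // 3 - 1


# First six codons of DPOGS200585-TA, copied from the record above.
print(translate("ATGGCTCCCGACCCTAGC"))   # MAPDPS
print(expected_protein_length(978))      # 325
```

Passing the full DPOGS200585-TA sequence to translate should give a string comparable to DPOGS200585-PA, including the trailing "-".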