Monarch geneset OGS2.0

DPOGS211974
TranscriptDPOGS211974-TA942 bp
ProteinDPOGS211974-PA313 aa
Genomic positionDPSCF300011 + 1314208-1315668
RNAseq coverage264x (Rank: top 40%)
Annotation
HeliconiusHMEL0180552e-11765.92% 
BombyxBGIBMGA000920-TA9e-11560.83% 
DrosophilaScgdelta-PD7e-4937.55% 
EBI UniRef50UniRef50_D2A3J99e-5642.86%Putative uncharacterized protein GLEAN_07505 n=3 Tax=Tribolium castaneum RepID=D2A3J9_TRICA
NCBI RefSeqXP_001869861.12e-5742.11%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700712773e-5642.11%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|2700054472e-5742.86%hypothetical protein TcasGA2_TC007505 [Tribolium castaneum]
Group
Gene OntologyGO:00160215.9e-60integral to membrane
GO:00070105.9e-60cytoskeleton organization
GO:00160125.9e-60sarcoglycan complex
KEGG pathwaydre:3249611e-39 
 K12563 (SGCD)maps-> Dilated cardiomyopathy
    Viral myocarditis
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[47-298] IPR0068755.9e-60Sarcoglycan complex subunit protein
Orthology groupMCL26405 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211974-TA
ATGAAGGTTGAGGATGCCAACAGTGAAGGTATCCGGGGTTGGGGCTGTACACCCCCCGGGGAGCCCCCGCCGGCTACAGTGACCACCTCCGCCACCGTTACCACTACACCGCCCGCCTGTCTACCGCCGTTCGTCACACGAGGATGGAGACGAAACGCGCTCTATGGAATCATAGTGTTCCTCATCATACTCGTATTTCTTAATATCGCCTTGACATTGTGGATTATAAGTACCTTGAAACTGAGCGCGAGAGGTATAGGACCGATCACTATCATCAAGGAGGGGATCCGGTTGGATGGACAGGCTTGGGTCCTTGACAATCTGATCGCGTCCACGCTGTCTTCACAAACGGGTCAACCGCTCACCCTGCACTCATACCGCAATTTCACTATCATAGTGTCCGATGAAAACCACAAGGAAGCAGCAAAACTATTCCTGAAACGAGACAGCCTGGAATGCAGTGGTCGTTCGTTCCACGTGCAAGACGCTCGCGGCGAGGAGGTGTTCCACGCGTCTCGAGAGGAGGTGCGCGTCTTCTCGGAGACTCTGTCAGTGAGTGGAGCGGGCGGGCTGGCAGTGAGGGGAGCTCTGCAGACGCCGGTAGTACGAGCTCCGCCGGCTGCAGACTTACAGTTGGAATCATTGACGAGGCGACTCGACCTGCGAGCCCCGCAGTCCATCCACCTGGAGAGTAGAGCGGGGAGTATCGACATAACATCACACAGCGACATTCAATTAGACTCAGTCGTCGGAGCTATTAAGATCGACGCTCCTAACATCATAATAAGCAATTTAAAAGAAGCCAAAATCGCAGACAATCCCGCTAAGAATATGAGAAAAAAGGTGTACCAGCTGTGCGCCTGCGGCTCGGGGAAGCTGTTCCTGGCACCCTCGGACGGGAGGTGTAGTGTGGAGGAAGAAGACCAAGAACTCTGTAGATGA

Protein sequence:

>DPOGS211974-PA
MKVEDANSEGIRGWGCTPPGEPPPATVTTSATVTTTPPACLPPFVTRGWRRNALYGIIVFLIILVFLNIALTLWIISTLKLSARGIGPITIIKEGIRLDGQAWVLDNLIASTLSSQTGQPLTLHSYRNFTIIVSDENHKEAAKLFLKRDSLECSGRSFHVQDARGEEVFHASREEVRVFSETLSVSGAGGLAVRGALQTPVVRAPPAADLQLESLTRRLDLRAPQSIHLESRAGSIDITSHSDIQLDSVVGAIKIDAPNIIISNLKEAKIADNPAKNMRKKVYQLCACGSGKLFLAPSDGRCSVEEEDQELCR-