Monarch geneset OGS2.0

DPOGS208370
TranscriptDPOGS208370-TA633 bp
ProteinDPOGS208370-PA210 aa
Genomic positionDPSCF300146 - 10738-14067
RNAseq coverage110x (Rank: top 59%)
Annotation
HeliconiusHMEL0072314e-5375.00% 
BombyxBGIBMGA011593-TA1e-2835.56% 
DrosophilaSdhC-PA2e-1838.10% 
EBI UniRef50UniRef50_F4WXF22e-2141.44%Succinate dehydrogenase cytochrome b560 subunit, mitochondrial n=10 Tax=Apocrita RepID=F4WXF2_ACREC
NCBI RefSeqXP_972464.18e-2643.55%PREDICTED: similar to succinate dehydrogenase [Tribolium castaneum]
NCBI nr blastpgi|910771841e-2443.55%PREDICTED: similar to succinate dehydrogenase [Tribolium castaneum]
NCBI nr blastxgi|910771842e-2443.55%PREDICTED: similar to succinate dehydrogenase [Tribolium castaneum]
Group
Gene OntologyGO:00060997.9e-23tricarboxylic acid cycle
GO:00001047.9e-23succinate dehydrogenase activity
GO:00452827.9e-23plasma membrane succinate dehydrogenase complex
GO:00166275.6e-21oxidoreductase activity, acting on the CH-CH group of donors
KEGG pathwayame:4095496e-23 
 K00236 (SDHC, SDH3)maps-> Huntington's disease
    Citrate cycle (TCA cycle)
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[90-209] IPR0143147.9e-23Succinate dehydrogenase, cytochrome b556 subunit
[87-206] IPR0007015.6e-21Succinate dehydrogenase/Fumarate reductase, transmembrane subunit
Orthology groupMCL34749 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208370-TA
ATGTTGAATATAGTAATTAAATATCATTGCTGCAAAGAAACATCGAGTTTTATGAGATTATTGAATAAAAACAGGTCGATTTTCGGGCCACAGTCATCGATTGTGGGTTGCAATTTTCTGAGATATAAAGGAGTTTGTCCTCCCGCAAGTTCAGGGCCTGGTGCAGCCAAAGGCTCAGGTCCTGGAAAGGGCGCCAAACATAAGATAACATATCAACCTTACACCGCGCCGCCACCAACGTGCCATGATTTTAAAAATATGAGTTTAAACCGCCCTATGTCTCCACATCTAACGATTTTCGCTCCCACCCTACCCGCTATGACATCCATTGTTCAGCGTATTACAGGCATGATAATAACGTTTTACGCTCTCCTCCTATCCTCTGGAAGTTTGTTCCTGTCGAACGGCGTAGAAACATACGTGTCTATTATCCAGAGCTTTGATTTTTCCTTACCTATGGTATTTATTATTAAGATGATGTTAGGTGCGCCGTTTGTCTATCATTATTTCAACGGTATCCGTTTCGTTATGTGGAATGCTGGTAAGTGGTTGTCCATCAAAGAAGTTTACGATTCAGCTAAGAAGAGTTTTGTTGCGACAGCCGTATTAACATTGCTTTTCTCCATAATTTAA

Protein sequence:

>DPOGS208370-PA
MLNIVIKYHCCKETSSFMRLLNKNRSIFGPQSSIVGCNFLRYKGVCPPASSGPGAAKGSGPGKGAKHKITYQPYTAPPPTCHDFKNMSLNRPMSPHLTIFAPTLPAMTSIVQRITGMIITFYALLLSSGSLFLSNGVETYVSIIQSFDFSLPMVFIIKMMLGAPFVYHYFNGIRFVMWNAGKWLSIKEVYDSAKKSFVATAVLTLLFSII-