Monarch geneset OGS2.0

DPOGS202024
TranscriptDPOGS202024-TA876 bp
ProteinDPOGS202024-PA291 aa
Genomic positionDPSCF300053 - 828753-829628
RNAseq coverage89x (Rank: top 63%)
Annotation
HeliconiusHMEL0080477e-13183.66% 
BombyxBGIBMGA009206-TA2e-11574.09% 
DrosophilaSdhB-PA8e-8656.25% 
EBI UniRef50UniRef50_Q095452e-7753.17%Succinate dehydrogenase [ubiquinone] iron-sulfur subunit, mitochondrial n=579 Tax=root RepID=DHSB_CAEEL
NCBI RefSeqXP_002433069.14e-8755.51%succinate dehydrogenase, iron-sulfur subunit [Pediculus humanus corporis]
NCBI nr blastpgi|2420253118e-8655.51%succinate dehydrogenase, iron-sulfur subunit [Pediculus humanus corporis]
NCBI nr blastxgi|3214683487e-8759.77%hypothetical protein DAPPUDRAFT_231048 [Daphnia pulex]
Group
Gene OntologyGO:00060991e-70tricarboxylic acid cycle
GO:00551141e-70oxidation-reduction process
GO:00164911e-70oxidoreductase activity
GO:00515362.1e-41iron-sulfur cluster binding
GO:00090552.7e-28electron carrier activity
KEGG pathwayphu:Phum_PHUM6122301e-86 
 K00235 (SDHB, SDH2)maps-> Huntington's disease
    Citrate cycle (TCA cycle)
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[44-263] IPR0044891e-70Succinate dehydrogenase/fumarate reductase iron-sulphur protein
[139-266] IPR0122852.1e-41Fumarate reductase, C-terminal
[139-271] IPR0090514.7e-35Alpha-helical ferredoxin
[38-138] IPR0126751.1e-31Beta-grasp fold, ferredoxin-type
[37-138] IPR0010412.7e-28Ferredoxin
Orthology groupMCL25588 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202024-TA
ATGCACAGTATTAAGAAAGCAATATGTTCAAAACTGCCATGTTTGATATCAATAAGACTCTACTCGGGTCCAAAACCAGCAGCTGCCGCGGCTAAAAAGCCTTCGGATCCAAGAAAGGTGTTTAAAATTTACAGGTTTGGAGGAATTCTAAGTAATGAGAAACCAACATTAAAAAGTTACGATTTAGACATCAATACTTGCGGACGGATGGTTTTAGACGCTCTCATTAAAATTAAAGATATGGATCCTACGCTTGTATTTCGAAGATCCTGTCGTGAGGGTATCTGTGGGTCATGTGCTATTAATCTTCAAGGTCAAAATTGTCTCGCTTGTATAACTGCAATTCCTTCTGACAAAGTCATAACCATACATCCCATTCCTCATATGTATGTCATCAGGGATCTGGTCGTAGATATGACACATTTCTTTGATGTTTACAATAGCCTCCGTCCATATTTAATCCGAAACAATTCTGGGGCCCTCGGAAAATTTCAGTATGCACAAAGTGAGAAGGATAACTCCAAATTAGTTGGGCTGTACGAGTGTGTTCTGTGCTCTTGTTGTGCTACAGCTTGTCCTAGTTATTGGTGGAATGGCCGACGTTTCATGGGGCCAGCGTCTTTGCTTCACGCATACAGATGGATTATAGATTCTCGAGATGAAGAATCCGAACAAAGATTATTTGAACTACGAGATGACTTTAAAGCTTTTCGATGTCACACTATATATAATTGTACTCTGGCATGTCCTAAAGGATTACACCCAGCTCTAGCTATAGCAAAATTAAAAAGATTAATTTCAGGATTAGATAAAAAACCTTTACCCGAAATGGATCCTATGAAATTTGCTTCGGGTTCGCTGTCCGGTTGTAAATGA

Protein sequence:

>DPOGS202024-PA
MHSIKKAICSKLPCLISIRLYSGPKPAAAAAKKPSDPRKVFKIYRFGGILSNEKPTLKSYDLDINTCGRMVLDALIKIKDMDPTLVFRRSCREGICGSCAINLQGQNCLACITAIPSDKVITIHPIPHMYVIRDLVVDMTHFFDVYNSLRPYLIRNNSGALGKFQYAQSEKDNSKLVGLYECVLCSCCATACPSYWWNGRRFMGPASLLHAYRWIIDSRDEESEQRLFELRDDFKAFRCHTIYNCTLACPKGLHPALAIAKLKRLISGLDKKPLPEMDPMKFASGSLSGCK-