Monarch geneset OGS2.0

DPOGS212973
TranscriptDPOGS212973-TA927 bp
ProteinDPOGS212973-PA308 aa
Genomic positionDPSCF300057 + 839805-843303
RNAseq coverage7292x (Rank: top 2%)
Annotation
HeliconiusHMEL0113125e-15190.58% 
BombyxBGIBMGA011618-TA5e-14988.64% 
DrosophilaCG4769-PA5e-11767.22% 
EBI UniRef50UniRef50_Q9VRL07e-11567.22%CG4769 n=41 Tax=Coelomata RepID=Q9VRL0_DROME
NCBI RefSeqNP_001106230.15e-14888.96%cytochrome c1 [Bombyx mori]
NCBI nr blastpgi|1638386949e-14788.96%cytochrome c1 [Bombyx mori]
NCBI nr blastxgi|1638386943e-16288.96%cytochrome c1 [Bombyx mori]
Group
Gene OntologyGO:00090551.5e-186electron carrier activity
GO:00200371.5e-186heme binding
GO:00055061.5e-186iron ion binding
KEGG pathwaydmo:Dmoj_GI133372e-120 
 K00413 (CYT1, CYC1, petC)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Cardiac muscle contraction
    Parkinson's disease
InterPro domain[26-308] IPR0023261.5e-186Cytochrome c1
[68-261] IPR0090561e-82Cytochrome c domain
[262-303] IPR0211573.5e-17Cytochrome c1, transmembrane anchor, C-terminal
Orthology groupMCL14849 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212973-TA
ATGGCGGCCGCCGTAGGCAGGATTTGTGGGGCACGACTTTTGCAAAAATATGGGATAGTGGCTCCCCAGATTTGTCAGCTGTCCACTAATGCTCAGGCATGGACTAAGAGTAGGAAATTGTGGTTGACCACATTAGGTGTGGGTGCGGGAGGTGTGGGAGCCTTACTGTTCGCATTGGAACAGTCAGTACAGGCTTCAGGCACGGAAGCACATCCCTACCACCAGCCTTGGAGCCACAATGGATGGTTCCAATCTCTTGATCATGCCAGTATTCGGCGAGGCTATGAAGTTTACAAGCAAGTATGCAAGGCTTGCCACTCTTTACAGTACATTGCCTACCGTAACTTAGTCAATGTGAGCCATACTGAGGAGGAGGCGAAGGCAGAAGCTGCTGAGGTCATGATTAAGGACGGTCCTGATGAGGAGGGCAACTATTTTGAACGTCCAGGAAAACTATCCGACTACTTCCCATCACCATACCCCAATGAAAATGCAGCTCGAGCTGCTAACAACGGTGCCTATCCTCCGGATCTATCTTTGATAGTGTCTGGTCGTAAGGGCGGCGAGGATTACATCTTCGCTCTCCTGACGGGTTACATGGAAGCACCGGCTGGGGTCATACTGCGTGAGGGACAGAACTACAACCCATACTTCCCTGGTGGTGCCATTTCTATGGCACAAGTTTTGTTCGATGAGGCTGCGGAATACACTGATGGTACTCCAGCGACGGCTTCCCAGTTGGCAAAGGACGTGGCAACTTTCCTACGTTGGTGCTCTGAACCAGAGCTGGACGATCGCCGCCTCATGACGATCAAGGCCATTGGAATATTTTCCTTCCTAGCCGCCATCGTGTACTACTATAAACGGCACAAGTGGTCCACAATGAAGTCCCGCAAACTAGCCTACAAAGCGGTATCTAAGAAGTAA

Protein sequence:

>DPOGS212973-PA
MAAAVGRICGARLLQKYGIVAPQICQLSTNAQAWTKSRKLWLTTLGVGAGGVGALLFALEQSVQASGTEAHPYHQPWSHNGWFQSLDHASIRRGYEVYKQVCKACHSLQYIAYRNLVNVSHTEEEAKAEAAEVMIKDGPDEEGNYFERPGKLSDYFPSPYPNENAARAANNGAYPPDLSLIVSGRKGGEDYIFALLTGYMEAPAGVILREGQNYNPYFPGGAISMAQVLFDEAAEYTDGTPATASQLAKDVATFLRWCSEPELDDRRLMTIKAIGIFSFLAAIVYYYKRHKWSTMKSRKLAYKAVSKK-