Monarch geneset OGS2.0

DPOGS201113
TranscriptDPOGS201113-TA525 bp
ProteinDPOGS201113-PA174 aa
Genomic positionDPSCF300137 - 200029-201384
RNAseq coverage13673x (Rank: top 1%)
Annotation
HeliconiusHMEL0179855e-8683.33% 
BombyxBGIBMGA013680-TA3e-8379.21% 
DrosophilaCG10664-PA1e-5154.14% 
EBI UniRef50UniRef50_E9GI311e-5663.51%Putative uncharacterized protein n=3 Tax=Coelomata RepID=E9GI31_DAPPU
NCBI RefSeqNP_001073120.17e-8279.21%cytochrome c oxidase polypeptide IV [Bombyx mori]
NCBI nr blastpgi|3545495111e-8180.90%cytochrome c oxidase polypeptide IV [Antheraea yamamai]
NCBI nr blastxgi|3545495116e-8280.90%cytochrome c oxidase polypeptide IV [Antheraea yamamai]
Group
Gene OntologyGO:00041295.3e-64cytochrome-c oxidase activity
KEGG pathwayaga:AgaP_AGAP0087271e-59 
 K02263 (COX4)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Cardiac muscle contraction
    Parkinson's disease
InterPro domain[31-174] IPR0042035.3e-64Cytochrome c oxidase subunit IV
[46-59] IPR0132881.6e-21Cytochrome c oxidase subunit IV, subgroup
Orthology groupMCL11225 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201113-TA
ATGGCCAGCCATCTTTTGCGCCGAGCGCTGTTTGATGCTATCCGTGTCCCAGCTGGAACCCGTGCAGCGAGTGAAATTGCCAAGATCGGCGATCGTGAGTGGGTGGGCTATGGTTTCAATGGCCGTCCTAATTACGTAGACAGGAACGATTTCCCCTTACCCGCTGTTAGGTTCAGGGCTGACACACCCGATGTTAAGGCTCTCCGTGAAAAGGAAAAGGGAGACTGGCGTAAGTTGACCCTGGAGGAGAAGAAAGCCTTGTACCGAGCTTCATTCTGTCAGACGTTTGCCGAGTTCCAAGCACCCACCGGGGAGTGGAAGGGAGCGCTCGGCTGGGCGCTTGTTATGGCGTCCATGTCTTTATGGTTCTACATGGCCATGAAGAAGTTTGTATACAACCCCTTGCCCGAGTCCTTCAGCGAGGAGTCTCAGAAGGCGCAGCTGAAACGTATGTTGGACTTGAAGGTGAACCCTGTGGACGGCCTCTCCTCCAAGTGGGACTACGAGAACAACCGCTGGAAGTAA

Protein sequence:

>DPOGS201113-PA
MASHLLRRALFDAIRVPAGTRAASEIAKIGDREWVGYGFNGRPNYVDRNDFPLPAVRFRADTPDVKALREKEKGDWRKLTLEEKKALYRASFCQTFAEFQAPTGEWKGALGWALVMASMSLWFYMAMKKFVYNPLPESFSEESQKAQLKRMLDLKVNPVDGLSSKWDYENNRWK-