Monarch geneset OGS2.0

DPOGS201117
Transcript          DPOGS201117-TA    744 bp
Protein             DPOGS201117-PA    247 aa
Genomic position    DPSCF300137 + 209227-214607
RNAseq coverage     66x (Rank: top 67%)
Annotation
Heliconius        HMEL017983          2e-56    65.22%
Bombyx            BGIBMGA013666-TA    2e-58    74.63%
Drosophila        CG10664-PA          7e-42    47.13%
EBI UniRef50      UniRef50_A6N9X3     2e-41    50.96%    Cytochrome c oxidase polyprotein IV n=6 Tax=Arthropoda RepID=A6N9X3_ORNPR
NCBI RefSeq       XP_314839.4         7e-49    56.85%    AGAP008727-PA [Anopheles gambiae str. PEST]
NCBI nr blastp    gi|354549511        4e-48    58.50%    cytochrome c oxidase polypeptide IV [Antheraea yamamai]
NCBI nr blastx    gi|354549511        1e-47    58.90%    cytochrome c oxidase polypeptide IV [Antheraea yamamai]
Group
Gene Ontology      GO:0004129             5.6e-55    cytochrome-c oxidase activity
KEGG pathway       aga:AgaP_AGAP008727    2e-48
                   K02263 (COX4) maps->
                       Huntington's disease
                       Oxidative phosphorylation
                       Alzheimer's disease
                       Cardiac muscle contraction
                       Parkinson's disease
InterPro domain    [30-171] IPR004203     5.6e-55    Cytochrome c oxidase subunit IV
                   [44-57]  IPR013288     2.1e-15    Cytochrome c oxidase subunit IV, subgroup
Orthology group    MCL26110               Lepidoptera specific

Nucleotide sequence:

>DPOGS201117-TA
ATGAATCCCGTGTTAAAATTCCCCTACATATTTAAACCAAATGTCCAAAACATCAGAACGACTTACTGGTATTGTAGGACAGGTTACAGAGATGTAGTGGGCCACGGTGTTAATGGTATAGCCGGCTACAAAGATGATTGCCATTTTCCATTCCCAGCTGTGAGGTTTAAGGAGAACACTCAGGACATATGTGCCCTTCGCGAGAAGGAACGTTGTGATTGGCGGATGCTCTGCTGTGAGGAGAAGAAGGCACTGTACCGCGCTTCCTTCTGCCAGACCTTCGCGGAGTTCCAGGCGCCAACTGGCCAATGGAAGTTCATCATGGGATGGGTGTTCGTCGTTACTTCCTTCACATTCTGGGCAGCCATGTTTTATCATCATTACGTGTATGAGCCGTTGCCAAGTACATTCTCCAAGGAGTCCCAGAAGGCTCAACTGCGCCGCATGCTGGAGCTACGTGTCAATCCCATAGACGGAATTTCATCCCTCTGGGACTACGACAACGACAAATGGCTGGTAGCTATTGTCCTAGCCTTTATTGGCATGTCCTTGGCCAATCCCGTCCCATCTGGTTTAGGGGGTCTGATGGAACACGAGCAGATAATGAGAAGTATGAACGATAAAGAGTGGCAGTATGAAGGAATACCTATAGAGTTGCCGCTCAGAAGAAGCGAAACTGTAAAGAAAACGACTCCCAAAAAAAAAATTACCAGTAAGGGTTTTGTTTATGTAAGCTTTGAGTGA

Protein sequence:

>DPOGS201117-PA
MNPVLKFPYIFKPNVQNIRTTYWYCRTGYRDVVGHGVNGIAGYKDDCHFPFPAVRFKENTQDICALREKERCDWRMLCCEEKKALYRASFCQTFAEFQAPTGQWKFIMGWVFVVTSFTFWAAMFYHHYVYEPLPSTFSKESQKAQLRRMLELRVNPIDGISSLWDYDNDKWLVAIVLAFIGMSLANPVPSGLGGLMEHEQIMRSMNDKEWQYEGIPIELPLRRSETVKKTTPKKKITSKGFVYVSFE-
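
Consistency check (illustrative sketch, not part of the original record): the listed lengths agree if the 744 bp transcript is read as 247 codons plus one stop codon, and the trailing "-" in the protein sequence appears to mark that stop. The Python below assumes the standard genetic code and that "-" is the stop symbol; the translate function and its parameter names are hypothetical helpers, not a tool used by the database. Only the first 18 and last 12 nucleotides of DPOGS201117-TA are used here.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as stop_symbol (assumed to be "-" to match
    the record); codons with unexpected characters become "X".
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length bookkeeping from the record: 247 aa + 1 stop codon = 744 bp.
assert 3 * (247 + 1) == 744

# First 18 and last 12 nucleotides of DPOGS201117-TA.
print(translate("ATGAATCCCGTGTTAAAA"))  # MNPVLK
print(translate("AGCTTTGAGTGA"))        # SFE-
```

Passing the full nucleotide sequence to translate should reproduce the protein sequence above, under the same assumptions.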