Monarch geneset OGS2.0

DPOGS205606
Transcript         DPOGS205606-TA   1368 bp
Protein            DPOGS205606-PA   455 aa
Genomic position   DPSCF300167 + 159143-161652
RNAseq coverage    18x (Rank: top 80%)
Annotation
Heliconius         HMEL017461         4e-135   53.45%
Bombyx             BGIBMGA007211-TA   3e-99    48.48%
Drosophila         CG4169-PA          2e-25    26.32%
EBI UniRef50       UniRef50_E9H2T2    8e-34    27.58%   Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9H2T2_DAPPU
NCBI RefSeq        NP_001165809.1     5e-34    28.92%   cytochrome b-c1 complex subunit 2, mitochondrial [Nasonia vitripennis]
NCBI nr blastp     gi|307205462       1e-35    27.39%   Cytochrome b-c1 complex subunit 2, mitochondrial [Harpegnathos saltator]
NCBI nr blastx     gi|307205462       3e-35    26.77%   Cytochrome b-c1 complex subunit 2, mitochondrial [Harpegnathos saltator]
Group
Gene Ontology      GO:0046872   3.7e-23   metal ion binding
                   GO:0003824   3.7e-23   catalytic activity
                   GO:0006508   4.3e-11   proteolysis
                   GO:0004222   4.3e-11   metalloendopeptidase activity
                   GO:0008270   1.7e-05   zinc ion binding
KEGG pathway       nvi:100124277   1e-33
                   K00415 (QCR2, UQCRC2) maps->
                       Huntington's disease
                       Oxidative phosphorylation
                       Alzheimer's disease
                       Cardiac muscle contraction
                       Parkinson's disease
InterPro domain    [27-232]   IPR011249   3.7e-23   Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
                   [36-231]   IPR011237   1.3e-21   Peptidase M16, core
                   [46-188]   IPR011765   4.3e-11   Peptidase M16, N-terminal
Orthology group    MCL25322   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205606-TA
ATGAAGAGATTTGCTAATCAAATTCCTCAAAGCCTGAGGAATTCCACTAGCAGGAGTGTGAAGTCAGCAATGTCCAAGCCGAGGTTTGGTCCGACTGGTATAGTGAGATCGGAGATGAAAAATGGAATTAAGATAACCTCGGCTCAAGATACAGGCTTTATGTTTGCAGCTTGCACCATCATGTTTCAGGCTGGTTCTCGCTACGAAACTCATGACGATTTGGGAGCTACTCACTTCCTGCGCTTGGCTTCAACAGGCGGCGGATGTAGGGCTACAGCGTTTTCCAAACTCAGAGTATTACAGCAGGCTGGAGCTTATATAACTACGACCTTGGACAGACAAACCGTAGCCTACACCTTGCGATGTCCACTCCATATGTTTAGTGATCTCAAATATTATTTATTGGACACAGCAGTGGGTTGCTGCTACCATGATTGGGAAATAACGGATCTCAAGCGGATCGTGCGAGGTGATCTTAACAGGATCCATCCAGAGCAGAGAGTCATCGACTTGATACAAAAAGCGGCCTGGGCTGGTCCTTTGGCAAATTCTGTATACTGCGAAGAAGATAGAATTGACAGTATGGATGGAGAAAAGCTGAGGAATTACGTCTCAATGAACTTTATTCCTCCTATGTGTAGTGTAGCAAGTGTTGGAGTTCCGTTTGAGGAAACTATGAAGATTGCTGAAAAGATAGAAAGAAGCAGCGAAGAGCATTTGCCACAGAGTCAGAAGGTTTACCCTCGTATGGGCCATGAATATTACGACCTTGGTCCAGATAGTGACACATGGATAGCAGTCGCTGTTCCAAGCTGTGACTCATGGGATTTCCAGGGTGTCTTTAAGCACGCAATTATTGCATCAGCGTGCGGCACAGATAACATGCAAGAGGGCATGACATCGATGAACCGTATTCCTCAATCACCTCTGGGTCTAATAAGTGATAAAGATGTACACACAGATTGCAGAGCTTTCAACATATCGTATATAGACACTGGATTATTTGGTATCTTAGCAAAGACACCTTCATGCTCCGCCTATAGGATATCAAAGATGATAGCTGAGTTCCTCGCTCGTGTCGGTGAACTCGGCGTTACACAAATCGAAGAAGGGAAGGAGAGACTTTTGCTCAACCTTGCTATTCACGATGAAGACTGTGTCAGGATATGCGAAGGGTTGGCGCTGCAGTCTCTATACAACAGTCAGATAGACTGCGCTGAAAACTCTGCAAAGATTATACAAGGAATAAGTTCCGAGGAAGTATCAGCAACAGCTAAATTATTGTCCTCCAAATGTCACCAAATGGCTATAGCCATTGTTGGCGATATCGGTGTTGTACCAGTCGATCAGGACATATTTAGACGATAA

Protein sequence:

>DPOGS205606-PA
MKRFANQIPQSLRNSTSRSVKSAMSKPRFGPTGIVRSEMKNGIKITSAQDTGFMFAACTIMFQAGSRYETHDDLGATHFLRLASTGGGCRATAFSKLRVLQQAGAYITTTLDRQTVAYTLRCPLHMFSDLKYYLLDTAVGCCYHDWEITDLKRIVRGDLNRIHPEQRVIDLIQKAAWAGPLANSVYCEEDRIDSMDGEKLRNYVSMNFIPPMCSVASVGVPFEETMKIAEKIERSSEEHLPQSQKVYPRMGHEYYDLGPDSDTWIAVAVPSCDSWDFQGVFKHAIIASACGTDNMQEGMTSMNRIPQSPLGLISDKDVHTDCRAFNISYIDTGLFGILAKTPSCSAYRISKMIAEFLARVGELGVTQIEEGKERLLLNLAIHDEDCVRICEGLALQSLYNSQIDCAENSAKIIQGISSEEVSATAKLLSSKCHQMAIAIVGDIGVVPVDQDIFRR-
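
The transcript and protein lengths listed above can be cross-checked with a short script. The following is a minimal sketch, not part of the original record: it assumes the transcript is a coding sequence only (no UTRs), that the standard genetic code applies, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated over the base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' denotes a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript encodes the protein plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths taken from the record: 1368 bp transcript, 455 aa protein.
print(lengths_consistent(1368, 455))    # True
# First six codons of DPOGS205606-TA.
print(translate("ATGAAGAGATTTGCTAAT"))  # MKRFAN
# Last three codons of DPOGS205606-TA.
print(translate("AGACGATAA"))           # RR*
```

Under these assumptions, 1368 bp corresponds to 455 codons plus one stop codon, and the translated ends agree with the start (MKRFAN) and end (RR followed by the stop marker) of DPOGS205606-PA.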