Monarch geneset OGS2.0

DPOGS204174
Transcript: DPOGS204174-TA, 1002 bp
Protein: DPOGS204174-PA, 333 aa
Genomic position: DPSCF300034 - 82487-87283
RNAseq coverage: 448x (Rank: top 27%)
Annotation
Heliconius: HMEL022062; 2e-98; 89.78%
Bombyx: BGIBMGA005089-TA; 1e-99; 84.69%
Drosophila: CG5045-PA; 1e-74; 72.32%
EBI UniRef50: UniRef50_O88696; 2e-73; 70.39%; Putative ATP-dependent Clp protease proteolytic subunit, mitochondrial n=9 Tax=Opisthokonta RepID=CLPP_MOUSE
NCBI RefSeq: XP_001599209.1; 4e-86; 77.44%; PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|156541196; 7e-85; 77.44%; PREDICTED: putative ATP-dependent Clp protease proteolytic subunit, mitochondrial-like [Nasonia vitripennis]
NCBI nr blastx: gi|156541196; 2e-80; 79.79%; PREDICTED: putative ATP-dependent Clp protease proteolytic subunit, mitochondrial-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0004252; 1.8e-74; serine-type endopeptidase activity
 GO:0006508; 1.8e-74; proteolysis
KEGG pathway: nvi:100114615; 1e-85
 K01358 (clpP, CLPP); maps-> Cell cycle - Caulobacter
InterPro domain: [143-321] IPR023562; 5.6e-116; Peptidase S14/S49
 [142-312] IPR001907; 1.8e-74; Peptidase S14, ClpP
Orthology group: MCL12421; Single-copy universal gene

Nucleotide sequence:

>DPOGS204174-TA
ATGCCCGTGTCTCCCAGTGTTGCCAACAACAGACCATCTACCAGTGGAATGTCATTGGGACGATTTAAATCTATGCCGTTCAATCAGAGCAGTGACAATATTTTAGATGGCTGTTCAATAAAGTATACTCAAGATGAAATAAAAAAGAAACATCAACAAGCAAGAGAAAAACTTCTCGCTAAAGGAATGTTGCCATTATTATCTCAAAAACAAACTCCACAAATATTGCCAGCTCAAAAAGATCAACCAAGAAACAATATCCATAATAAAGGTGACAACAGACAAAATAAAGTCACAGATAAAAACAAGAATGTTATGAACAAGAACCCTAGTTCAGAATCAGACTTAAAACCTGACATAAAAACACTGATTGAAAAAAAAAGACAGGAAGCATTGATGAAATTAAGGAAGAGACAGGCCCAGTGTAGGTTACTTCGAGAACGTATTATTTGCCTGATGGGACCAATTAACGATGAAATCAGTTCCCTCGTTGTTGCTCAGCTGTTGTTCCTTCAATCTGAATCGAGTAAAAAGCCTGTGCATCTTTATATAAATTCTCCTGGTGGAAATGTTACGGCCGGTCTCGGTATTTACGATACGATGCAATATATCACTCCGCCCGTAGCCACTTGGTGTGTTGGTCAAGCCTGCAGTATGGCGTCCCTCCTGTTAGCAGCTGGGGCTCCTGGTATGCGTCATGCACTTCCAAATTCGCGCATCATGATTCACCAACCTTCCGGAGGAGTCAGAGGTCAAGCAACAGATATTCAAATTCAGGCGGAGGAAATTTTAAAACTGAAGTCTCAAATAAACAATCTATATGTCCGTCACACCGGTCTTCCAATAGAACGTATTCAAACATCCATGGAACGTGACTGTTTCATGTCACCAATAGAAGCAAAGAGTTTTGGCTTGATCGATAATGTGTTGGAACATCCGCCGTCTCATGTGGTGGAAGGTGATAATGTTTCATCAACCCCAGTGAACACCACAACTACTTAG

Protein sequence:

>DPOGS204174-PA
MPVSPSVANNRPSTSGMSLGRFKSMPFNQSSDNILDGCSIKYTQDEIKKKHQQAREKLLAKGMLPLLSQKQTPQILPAQKDQPRNNIHNKGDNRQNKVTDKNKNVMNKNPSSESDLKPDIKTLIEKKRQEALMKLRKRQAQCRLLRERIICLMGPINDEISSLVVAQLLFLQSESSKKPVHLYINSPGGNVTAGLGIYDTMQYITPPVATWCVGQACSMASLLLAAGAPGMRHALPNSRIMIHQPSGGVRGQATDIQIQAEEILKLKSQINNLYVRHTGLPIERIQTSMERDCFMSPIEAKSFGLIDNVLEHPPSHVVEGDNVSSTPVNTTTT-
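
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The names `CODON_TABLE` and `translate` are hypothetical helpers. The sketch translates the first 30 and last 15 nucleotides of DPOGS204174-TA and confirms that 1002 bp corresponds to 333 codons plus one stop codon. Note that the record writes the stop as `-`, while the sketch uses `*`.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 nucleotides copied from >DPOGS204174-TA above.
head = "ATGCCCGTGTCTCCCAGTGTTGCCAACAAC"
tail = "ACCACAACTACTTAG"

print(translate(head))  # compare with the start of >DPOGS204174-PA
print(translate(tail))  # compare with the end of >DPOGS204174-PA

# Lengths stated in the record: 1002 bp transcript, 333 aa protein.
transcript_bp, protein_aa = 1002, 333
print(transcript_bp == 3 * (protein_aa + 1))  # 333 codons plus one stop codon
```

Under these assumptions, the translated fragments should agree with the first ten residues (MPVSPSVANN) and the last four residues (TTTT, followed by the stop) of the protein sequence listed above.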