Monarch geneset OGS2.0

DPOGS200178
TranscriptDPOGS200178-TA1239 bp
ProteinDPOGS200178-PA412 aa
Genomic positionDPSCF300128 + 640879-643113
RNAseq coverage1079x (Rank: top 12%)
Annotation
HeliconiusHMEL0094391e-15686.89% 
BombyxBGIBMGA002929-TA5e-10288.32% 
DrosophilaCG5037-PA2e-12663.07% 
EBI UniRef50UniRef50_Q9VKZ12e-12463.07%Protoheme IX farnesyltransferase, mitochondrial n=20 Tax=Metazoa RepID=Q9VKZ1_DROME
NCBI RefSeqXP_002089084.13e-12660.80%GE18922 [Drosophila yakuba]
NCBI nr blastpgi|1954736075e-12560.80%GE18922 [Drosophila yakuba]
NCBI nr blastxgi|1954736072e-12160.80%GE18922 [Drosophila yakuba]
Group
Gene OntologyGO:00160213.7e-83integral to membrane
GO:00084953.7e-83protoheme IX farnesyltransferase activity
GO:00480343.7e-83heme O biosynthetic process
GO:00046598.6e-27prenyltransferase activity
KEGG pathwaydya:Dyak_GE189227e-126 
 K02257 (COX10)maps-> Oxidative phosphorylation
    Porphyrin and chlorophyll metabolism
InterPro domain[90-361] IPR0063693.7e-83Protohaem IX farnesyltransferase
[99-331] IPR0005378.6e-27UbiA prenyltransferase family
Orthology groupMCL13199 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200178-TA
ATGTCCTACTTACAGCTGACAAAATGTTCCTTATGTTTAAAACATGGATTCTTAGATAAGAAAGTGCTGCTTAAATTTTGTTTAAACAGTCCAATACAGCATGTCTCCAGAATATCAACATCATGCTACCTCAATAGTAATCTGCCACCACCTCTCACAACTCAGACGACTATAAAGAAAACAAAAAAAAATGTACCGCAAGATACAAGAGTGTGGAGGGAAACACCAACTCATGATATAAAAAATAATCTAGGCCAATACTGCATGATGCTATCTAAATTCAGACTTACTTCACTGGTTGTTATGACATCAATGGCCGGATATGCTTTAGCACCAGCTCCATTTGACCTCACAACATTTACACTGTGCGCACTTGGTACAGGACTAGTCAGCTCAGCAGCCAATTCCATTAATCAATATCATGAAGTACCATTTGATGCTCAGATGTCTCGGACAAAGAACAGGGTGCTGGTGAAGGGATTATTAGAACCAGTACATGCCATAGGTTTTGCCGCTGCAACAAGCATGACAGGCCTCGGTTTGCTATACTTTGGTGTGAATCCTCTGACCGCAGCTTTAGGAGCTGGCAACCTTGTATTGTATACATCAATATACACACCATTAAAGAGAATATCGATATTAAATACATGGCTGGGATCTGTTGTTGGTGCTATTCCTCCCATGATGGGGTGGGCGGGATGTAGCGGTCATTTGGACGCGGGTGCTTTAGTTCTAGCTGTGTTGCTATACTCATGGCAGTTCCCACACTTCAATGCATTATCTTGGAACCTAAGGCCAGACTACTCGCGAGCGGGTTACAGGATGATGGCGGTAACTGATCCCGCTCTCTGCAGACGGGTGGCCTTACGACACACTGGCGTCATAACTGCCACTTGTCTGGCTTCTTCCTATTTCGAAGTTACCAATATGTGGTTTGCACTCGAGTCATTACCACTCAATATTTATTTTATGTACTTAGCTTGGAATTTTTACAAGAACTCGGACAGTGGCAGTTCAAGGAAGTTATTCAGATTTTCATTGATACATCTTCCGGCACTGATGTTACTCATGCTAGTGAACAAGAAATATTGGAGTTCAAGTGAACCACAGGAGAACAGCGAAATTATTAGGACACATGATGTCAATAAGCTACCGGAGACAAAAAGAATAACAGTATTGCCGAGAGGACCTTATGTTTCAGCACAGGATTCTGATATAGAACGGAGTGCTAATCAGTAG

Protein sequence:

>DPOGS200178-PA
MSYLQLTKCSLCLKHGFLDKKVLLKFCLNSPIQHVSRISTSCYLNSNLPPPLTTQTTIKKTKKNVPQDTRVWRETPTHDIKNNLGQYCMMLSKFRLTSLVVMTSMAGYALAPAPFDLTTFTLCALGTGLVSSAANSINQYHEVPFDAQMSRTKNRVLVKGLLEPVHAIGFAAATSMTGLGLLYFGVNPLTAALGAGNLVLYTSIYTPLKRISILNTWLGSVVGAIPPMMGWAGCSGHLDAGALVLAVLLYSWQFPHFNALSWNLRPDYSRAGYRMMAVTDPALCRRVALRHTGVITATCLASSYFEVTNMWFALESLPLNIYFMYLAWNFYKNSDSGSSRKLFRFSLIHLPALMLLMLVNKKYWSSSEPQENSEIIRTHDVNKLPETKRITVLPRGPYVSAQDSDIERSANQ-