Monarch geneset OGS2.0

DPOGS209982
TranscriptDPOGS209982-TA1245 bp
ProteinDPOGS209982-PA414 aa
Genomic positionDPSCF300148 + 301032-305161
RNAseq coverage533x (Rank: top 24%)
Annotation
HeliconiusHMEL0135452e-14186.04% 
BombyxBGIBMGA011268-TA0.076.19% 
DrosophilaCoprox-PA1e-14469.23% 
EBI UniRef50UniRef50_G3SQY56e-12661.78%Uncharacterized protein n=29 Tax=cellular organisms RepID=G3SQY5_LOXAF
NCBI RefSeqNP_001040239.10.074.64%coproporphirynogen oxidase [Bombyx mori]
NCBI nr blastpgi|1140523300.074.64%coproporphirynogen oxidase [Bombyx mori]
NCBI nr blastxgi|1140523300.074.82%coproporphirynogen oxidase [Bombyx mori]
Group
Gene OntologyGO:00041096e-222coproporphyrinogen oxidase activity
GO:00551146e-222oxidation-reduction process
GO:00067796e-222porphyrin biosynthetic process
KEGG pathwaynvi:1001199366e-155 
 K00228 (E1.3.3.3, hemF)maps-> Porphyrin and chlorophyll metabolism
InterPro domain[61-405] IPR0012606e-222Coproporphyrinogen III oxidase, aerobic
Orthology groupMCL14017 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209982-TA
ATGCCGTACAAAGTAGCAGTGAGAAGTTTTAAAACTTTGAGTTATATTGGAAGTTATAAAAAAATGAGAGATCGACCATTAGCTTTCTACTGTGCAGCAATACTTGGTGCTGGGTATGCAGCATACAGCCAACATAAACAAAACAAAGCGGAGATGAAAGAAATAATACAACTCAAAAATTATATGGCTGAACCCATAACACCGATTAGTGAGTTAGAGAAGAACCAAGATGATATGAAGACACAGATGGAATTGCTTATCATGAGAATACAAGCAGAGTTCTGCAGAGCCTTGGAGAAAGAAGAGGATGAGGCTTGGGAAGATGAGAAAACCACCGATGACTTTGATTTTGACAACGATGTATTCTTACCCTCTCCGGAATCAAAATTCAAAGTAGACCGTTGGAAACGTAAAGAGGGTGGCGGAGGGATCACGTGCGTGTTGCAAGATGGCCGCGTGTTTGAGAAGGCCGGCGTTAATATATCAGTGGTGTCCGGGATCCTGCCGCCGCCCGCCGTGCAGCAGATGAAGAGCAGGGGGAAGAATTTCGAGAACAAGGAGCTGCCGTTCTTCGCAGCGGGAGTGAGCGCCGTCATCCATCCCCGGAACCCGATGGTCCCCACCATACACTTCAACTACAGATACTTCGAGGTCCAGGATAAAAACGAGGTCCACTGGTGGTTCGGCGGCGGCACCGACCTGACCCCCTACTACCTCAACGAGGACGACGCGGTGCACTTCCACCGCACCCTCAAACAGGCCTGCGATGAACACGACCCCACTTATTACGACAAATTCAAGAAGTGGTGTGATGACTATTTCGTCATAAGTCACCGCGGCGAGCGGCGCGGGGTGGGCGGCATCTTCTTCGATGACGTGGACTATCCGGACCAGCAGAGCGCATTCAAATTTGTGACCTCCTGCGCCGAGGCCGTCATACCCAGTTACATCCCCCTCGTCCAGAAGCACGCCGACGCCGGGTACGGGTACCATGAGCGTCAGTGGCAGCTACTCCGACGAGGCCGCTACGTGGAGTTCAACCTCATCTATGACCGAGGAACCAAGTTCGGTCTCCACACGCCCGGCGCTCGATACGAGTCCATACTTATGTCGCTGCCGCTGAACGCGAAATGGGAGTACATGCACGATCCCAAACCGAACTCTCCGGAAGAAAAACTCATGAAAGTTCTCAAAGAACCAAGAGATTGGTTGAATTTCCAGCAGTCGAAGAACACTTCGACATGA

Protein sequence:

>DPOGS209982-PA
MPYKVAVRSFKTLSYIGSYKKMRDRPLAFYCAAILGAGYAAYSQHKQNKAEMKEIIQLKNYMAEPITPISELEKNQDDMKTQMELLIMRIQAEFCRALEKEEDEAWEDEKTTDDFDFDNDVFLPSPESKFKVDRWKRKEGGGGITCVLQDGRVFEKAGVNISVVSGILPPPAVQQMKSRGKNFENKELPFFAAGVSAVIHPRNPMVPTIHFNYRYFEVQDKNEVHWWFGGGTDLTPYYLNEDDAVHFHRTLKQACDEHDPTYYDKFKKWCDDYFVISHRGERRGVGGIFFDDVDYPDQQSAFKFVTSCAEAVIPSYIPLVQKHADAGYGYHERQWQLLRRGRYVEFNLIYDRGTKFGLHTPGARYESILMSLPLNAKWEYMHDPKPNSPEEKLMKVLKEPRDWLNFQQSKNTST-