Monarch geneset OGS2.0

DPOGS202307
TranscriptDPOGS202307-TA810 bp
ProteinDPOGS202307-PA269 aa
Genomic positionDPSCF300032 + 285506-287328
RNAseq coverage311x (Rank: top 36%)
Annotation
HeliconiusHMEL0047233e-13282.90% 
BombyxBGIBMGA004979-TA2e-12681.41% 
DrosophilaDph5-PA6e-9660.74% 
EBI UniRef50UniRef50_Q5E9825e-9660.74%Diphthine synthase n=12 Tax=Opisthokonta RepID=DPH5_BOVIN
NCBI RefSeqXP_001604120.18e-10668.63%PREDICTED: similar to ENSANGP00000003767 [Nasonia vitripennis]
NCBI nr blastpgi|1565511472e-10468.63%PREDICTED: diphthine synthase [Nasonia vitripennis]
NCBI nr blastxgi|1565511471e-10268.63%PREDICTED: diphthine synthase [Nasonia vitripennis]
Group
Gene OntologyGO:00171831.1e-159peptidyl-diphthamide biosynthetic process from peptidyl-histidine
GO:00041641.1e-159diphthine synthase activity
GO:00081522.1e-60metabolic process
GO:00081682.1e-60methyltransferase activity
KEGG pathway 
InterPro domain[2-269] IPR0045511.1e-159Diphthine synthase
[112-268] IPR0147762.1e-60Tetrapyrrole methylase, subdomain 2
[1-269] IPR0008789e-58Tetrapyrrole methylase
[1-111] IPR0147778.5e-34Tetrapyrrole methylase, subdomain 1
Orthology groupMCL13128 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202307-TA
ATGTTTTACTTAATTGGTTTAGGATTAGGAGATGCGAAGGATGTTACAGTTAAAGGCTTAGAAATTATCAGAAAGTGTGATAAAGTTTTTTTGGAAGCTTACACTTCTATTTTAACCGTTGGGAAAGAAGTTCTGGAAGAATTTTATCAAAGACCTCTAATTATAGCAGATAGAGATTTATGTGAAAGTAACATAGATGAAATATTAAAAGAAGCAAAAGTTCAAGACATAGCTCTTTTAGTAGTGGGTGATCCTTTAGGAGCCACAACTCACACGGATATGTTGCTCCGGGCAAAGGATTTTGGTGTTGAGACAATGATTGTGCACAATGCTTCCATAATGAATGCAGTTAGTTGTTGTGGGTTACAGCTTTATAATTTCGGAGAAACTGTATCAATACCATTTTGGTCAGACACATGGAAGCCAGATAGTTTCTTTGAAAAAATTATAGGAAATTACTCAAGAAATTTACACACCCTTTGTTTGTTAGATATAAAAGTTAAGGAACCCACGGAAGAATCATTGACAAAAAAGGTTAGACAATACATGGATCCAAAATTTATGTCAGTCAAAGATGCAGCAAGACAATTAGTAGAGATAATAGCAAACAAGAATAATGATAAACAAGATCTAACACCAGACACGCTTGTGGTCGGCTTATCCAGAGTCGGTGCACTCGATCAAAAGATAGTCGCTGGTAGCTTAGAATATATGCAAAAATGTGACCTAGGACCCCCTCTACATAGTCTTGTTATCCCAGCTCCTAACTTGCACCCACTTGAACTGGAGTACCTTGCTCAGTTCAGGTAG

Protein sequence:

>DPOGS202307-PA
MFYLIGLGLGDAKDVTVKGLEIIRKCDKVFLEAYTSILTVGKEVLEEFYQRPLIIADRDLCESNIDEILKEAKVQDIALLVVGDPLGATTHTDMLLRAKDFGVETMIVHNASIMNAVSCCGLQLYNFGETVSIPFWSDTWKPDSFFEKIIGNYSRNLHTLCLLDIKVKEPTEESLTKKVRQYMDPKFMSVKDAARQLVEIIANKNNDKQDLTPDTLVVGLSRVGALDQKIVAGSLEYMQKCDLGPPLHSLVIPAPNLHPLELEYLAQFR-