Monarch geneset OGS2.0

DPOGS201785
TranscriptDPOGS201785-TA1260 bp
ProteinDPOGS201785-PA419 aa
Genomic positionDPSCF300145 - 308936-310288
RNAseq coverage191x (Rank: top 48%)
Annotation
HeliconiusHMEL0083440.081.27% 
BombyxBGIBMGA013148-TA0.081.25% 
DrosophilaCG11652-PA2e-14961.26% 
EBI UniRef50UniRef50_A7SLX54e-13461.41%Diphthamide biosynthesis protein 1 n=13 Tax=Eukaryota RepID=DPH1_NEMVE
NCBI RefSeqNP_001040240.10.080.77%diphthamide synthesis protein [Bombyx mori]
NCBI nr blastpgi|1140523660.080.77%diphthamide synthesis protein [Bombyx mori]
NCBI nr blastxgi|1140523660.080.77%diphthamide synthesis protein [Bombyx mori]
Group
Gene OntologyGO:00171831.1e-222peptidyl-diphthamide biosynthetic process from peptidyl-histidine
GO:00057371.1e-222cytoplasm
KEGG pathway 
InterPro domain[18-417] IPR0027281.1e-222Diphthamide synthesis, DPH1/DHP2
[6-395] IPR0164352.9e-134Diphthamide synthesis, DHP1
[72-370] IPR0224281.3e-61Diphthamide synthesis, DHP1, archaea
Orthology groupMCL14927 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201785-TA
ATGGAGATTCTTGAAACAAACCATAGTGTCGTAGTAGTCCGTGCTAAGCCTGAAGGACAGCGGAAAACATTTAAACCAAACATTCGTTCGATAAATAAAATTCCCGATGAGTTACTCAATGACCCGCTTTTAAACAGAGCATGTGAAGCATTACCTCAGAATTATAATTTTGAGATACACAAGACGATATGGAGGATCAGATCTCTTGGTTCTAAAGTAGTAGCTCTACAGCTACCTGAAGGTTTGACGATGTTTGCGACTACATTGTGTGATATAATTGAGGCATTTACTGAAGCCGAGACTATAATTATGGGTGATGTGACGTATGGCGCATGTTGTATTGACGATTTCACTGCAATTTCCTTGGATGCTGATTTATTAATTCACTATGGACACTCGTGCCTTATACCCATTGATCAAACAAGCAACATAAAAGTTTTATACATATTTGTAGACATAAAAATAGATCCATCTCATTTCATAGAAACTATAAAGTTAAATTTGCCTAAACCAACACATTTGGCTATCATTAGTACGATTCAGTTTGTGACGACGCTACATTCTGTTGCTAAAAGTTTGAGAAATGACGGTTATACAGTCTCAGTTCCTCAGAGTAAGCCGCTGTCACCTGGTGAAATTCTTGGTTGTACTGCACCTAAATTGAATGCTGATGTCATCGTATACCTTGGCGACGGAAGATTTCATTTAGAGTCTATAATGATAGCAAACCCAACCGTTCCCGCTTATAAATATGATCCTTATGACAAGAAATTTACATTGGAAACTTACGAGCATGAATTAATGCAAACAAACAGGAGGAATCAAATAAAGACCGCCGAAAATGCCGCAAGTTTTGGACTCATTCTTGGCACTTTGGGAAGACAAGGTAGTACAAAGGTTTTAGCTAACTTGGAAAAGCAAATACAAAATGCTGACAAAAGCTATGTTAAGATACTGTTATCCGAGATATTTCCTAGTAAATTGGCGTTATTCGATTTAGATGCCTATGTCCAAATAGCCTGCCCGAGACTATCAATTGACTGGGGCACAGCTTTTGCAAGGCCATTGCTAACTCCATATGAATTTTCAGTGGCACTAGGCAATAGTAAGTGGCTTAAAGATGACGGCACATACCCTATGGATTTTTATGCTAATGATAGTCTAGGGCCTTGGACACCAAATTATAAACCAGTTTTATGTTCAGCAACCGACAAAAAATGTGAAAACTGTTGCGGTGGTAAAGAAAAAGATATAAAATAA

Protein sequence:

>DPOGS201785-PA
MEILETNHSVVVVRAKPEGQRKTFKPNIRSINKIPDELLNDPLLNRACEALPQNYNFEIHKTIWRIRSLGSKVVALQLPEGLTMFATTLCDIIEAFTEAETIIMGDVTYGACCIDDFTAISLDADLLIHYGHSCLIPIDQTSNIKVLYIFVDIKIDPSHFIETIKLNLPKPTHLAIISTIQFVTTLHSVAKSLRNDGYTVSVPQSKPLSPGEILGCTAPKLNADVIVYLGDGRFHLESIMIANPTVPAYKYDPYDKKFTLETYEHELMQTNRRNQIKTAENAASFGLILGTLGRQGSTKVLANLEKQIQNADKSYVKILLSEIFPSKLALFDLDAYVQIACPRLSIDWGTAFARPLLTPYEFSVALGNSKWLKDDGTYPMDFYANDSLGPWTPNYKPVLCSATDKKCENCCGGKEKDIK-