Monarch geneset OGS2.0

DPOGS202219
TranscriptDPOGS202219-TA1377 bp
ProteinDPOGS202219-PA458 aa
Genomic positionDPSCF300149 + 250310-252272
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0091967e-17864.37% 
BombyxBGIBMGA013494-TA2e-17861.44% 
DrosophilaCG7265-PA7e-7034.10% 
EBI UniRef50UniRef50_Q1HQ938e-17059.48%Diptheria toxin resistance protein n=2 Tax=Obtectomera RepID=Q1HQ93_BOMMO
NCBI RefSeqNP_001040357.12e-17059.48%diptheria toxin resistance protein [Bombyx mori]
NCBI nr blastpgi|1140515923e-16959.48%diptheria toxin resistance protein [Bombyx mori]
NCBI nr blastxgi|1140515922e-16759.48%diptheria toxin resistance protein [Bombyx mori]
Group
Gene OntologyGO:00171834e-132peptidyl-diphthamide biosynthetic process from peptidyl-histidine
GO:00057374e-132cytoplasm
KEGG pathway 
InterPro domain[2-459] IPR0027284e-132Diphthamide synthesis, DPH1/DHP2
[2-459] IPR0100144e-132Diphthamide synthesis, DHP2
Orthology groupMCL14194 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202219-TA
ATGTCTAATTTTACAACAGATGGTAAAATCTGTATAGAAAGAGAGTTGGAAGTGGCTAAATCCGAAATAGAATTTGACAACTTGGAACAACGTTACAGTGTTACAGAAATTTGTAATTGGCTTAAGGAACACAACTTTTCTAAAGTGTGCTTACAATTTCCTGATGAACTAATTGGGGTCAGTGCTGCTATTTATCAAGAAATTAAAAAAAATATAAATGTAGACTTATACATTCTCGGTGACACATCCTATGCTAGTTGTTGTGTTGACTCCGTGGCTGCTATGCATGTTCAGAGTGATGCAGTGATTCACTTTGGACATTCGTGTTTCACAAAGACAAACATCCCCGTGTTCACAGTACTGCCAAAACGGAACTTATCCACTGATGCCATAGAATCAGTATTACGAGATCATTTCAAATCAGATGACACTAAACTCTGTTTATTTTACGATGCAGAATATGAACACTGTAAAGCTCTAAGATCAAAGACATTTATTTCTTCTTTATTACAACAGTTTTGTTTATTCTTTGATTTAGATAACATCCAGACGATTATATTGGGGAGACTTGTTAAAAATGAAACCGGCGTTGAATACCCAATGGAAAGTTTAAGAGATTGTATTTGTATATACATTGGATCAAAAGGACAGACAGTGTTTAATTTTAGTGTTTCAGTTCCAGCTTTGAAATGGTTCCTACTGGATCCAGAGGACAAAAAACTTGAACATCTAGAAGAAACAATTTGGTTTAAAAGGAGAAGATTTTTGATAGAGAAATGCAAAGATGCAAATGTCATCGGCATACTGGTGTGTAAACTTGCCGGTGAGCAGACGAAACAAATAGTAAAAAGAATGAAACAAATATGTAAAGCTAATGGGAAAAAAAGTTACATAGTGTCAGTGGGGAAGCCGAATGTTGCCAAGTTGGCGAATTTCCCAGAGATTGATATATATGTTATGATAGCTTGTCCGGAAAATGACTTGTATAACAATCGAGACTTCTACAGACCAATAATATACCCTTTTGAATTGGAAGTGGCGCTCAATTCTAATAGGGAGCAATACTACAACTATCATGTTACAGACTATGACGATCTCTTGCCAGGGAAACGCCATCACTTAGAAATAGATCATACCAAGCAAGCAACTGACGTCAGTCTTGTGACTGGCAAAATAAGGGAAAATAAAATACATAGCAATGAAGAGGGTGGCATGGAGGTGGCTGAAAAACAAAATTGGGCATTAGAGAGCATCGGTCAGAACCTTCAAGAGAGGTCATGGAAAGGATTGGAACAAAAATTTGGAGAGACTGATGTCAAAAATGTTGAAGAAGGTCGAAAAGGAATCCCCTTACAGTACAGCAATGAACCAGAATAG

Protein sequence:

>DPOGS202219-PA
MSNFTTDGKICIERELEVAKSEIEFDNLEQRYSVTEICNWLKEHNFSKVCLQFPDELIGVSAAIYQEIKKNINVDLYILGDTSYASCCVDSVAAMHVQSDAVIHFGHSCFTKTNIPVFTVLPKRNLSTDAIESVLRDHFKSDDTKLCLFYDAEYEHCKALRSKTFISSLLQQFCLFFDLDNIQTIILGRLVKNETGVEYPMESLRDCICIYIGSKGQTVFNFSVSVPALKWFLLDPEDKKLEHLEETIWFKRRRFLIEKCKDANVIGILVCKLAGEQTKQIVKRMKQICKANGKKSYIVSVGKPNVAKLANFPEIDIYVMIACPENDLYNNRDFYRPIIYPFELEVALNSNREQYYNYHVTDYDDLLPGKRHHLEIDHTKQATDVSLVTGKIRENKIHSNEEGGMEVAEKQNWALESIGQNLQERSWKGLEQKFGETDVKNVEEGRKGIPLQYSNEPE-