Monarch geneset OGS2.0

DPOGS203876
TranscriptDPOGS203876-TA1530 bp
ProteinDPOGS203876-PA509 aa
Genomic positionDPSCF300402 - 81770-83299
RNAseq coverage960x (Rank: top 13%)
Annotation
HeliconiusHMEL0081320.092.17% 
BombyxBGIBMGA003833-TA0.087.50% 
DrosophilaNop60B-PC0.081.61% 
EBI UniRef50UniRef50_O440810.081.61%H/ACA ribonucleoprotein complex subunit 4 n=200 Tax=root RepID=DKC1_DROME
NCBI RefSeqXP_318082.40.079.30%AGAP004739-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582979690.079.30%AGAP004739-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582979690.074.70%AGAP004739-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00063961.4e-155RNA processing
GO:00037235.6e-81RNA binding
GO:00094515.6e-81RNA modification
GO:00099825.6e-81pseudouridine synthase activity
GO:00015225.6e-81pseudouridine synthesis
KEGG pathway 
InterPro domain[55-380] IPR0048021.4e-155Pseudouridine synthase-related
[47-294] IPR0201035.6e-81Pseudouridine synthase, catalytic domain
[48-105] IPR0129602.4e-28Dyskerin-like
[294-378] IPR0159472.1e-24PUA-like domain
[296-370] IPR0024782.7e-23Pseudouridine synthase/archaeosine transglycosylase
[109-225] IPR0025011e-19Pseudouridine synthase II
[285-366] IPR0045211.1e-09Uncharacterised domain CHP00451
Orthology groupMCL13436 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203876-TA
ATGACGGAAGTAATGCAATCCGGCTCGGAGGTGTTCTCGGAGAAGAAGAAAAAGAAAAATAAGGAAACAGTGAGCTTGGGTGCGTTCCAAAAAATTGGTGACTTTAAAATCGAGCCATCCGAAAGTGTGACCAAGTTGGATACTGCGTATTGGCCTTTGTTATTGAAGAACTTCGACCGTCTTAACGTCCGAACCAACCACTACACGCCGCTTCCATTCGGCAGTTCACCTTTAAAACGTCCGATTTCGGAGTACGTTAAGGCTGGATTCATCAACGTGGACAAACCTAGCAATCCCAGCTCCCACGAAGTCGTGTCTTGGATTAAAAGAATCCTCAAAGTAGAGAAGACTGGCCATTCCGGAACACTTGATCCTAAGGTTACTGGTTGCCTTATCGTGTGTATAGACAGAGCAACGAGGCTGGTGAAATCCCAGCAGAATGCTGGTAAGGAATATGTCGCTGTATTCAACCTGCACTCGCCCGTAGAAAACCTTAAGAAAGTAACACAGGGTTTGGAGAAATTACGAGGTGCTCTTTTCCAACGTCCGCCATTGATATCAGCAGTCAAACGACAGCTCCGAGTGCGTTCTGTGTACGACAGCAAGTTACTCGATTATGACCAGGAACGTAATATTGGTGTTTTCTGGGTTAGTTGCGAGGCTGGTTCCTATATACGTACGATGTGCGTACATTTGGGTCTCATGCTCGGAGTTGGAGGACAAATGATTGAATTACGCAGAGTCCGCTCTGGTATACAGGGGGAGAAGGAGGGCATGGTTACCATGCACGACATATTGGACGCTCAATGGGCGTATGAGAACCATAAGGATGAAACCTATTTGAGAAGAGTCATTAAGCCATTAGAAGGTCTGCTGGTAGCTCACAAGAGGATCTTTATCAAGGACAGTGCGGTTAACGCAGTATGTTACGGAGCCAAAGTACTTTTGCCTGGTATCCTAAGATACGAGGATGGTATTGAAGTCGACCAAGAAATTGTTATAGTAACAACAAAGGGAGAAGCTGTGGCATTGGCTATAGCCCTTATGACCACGTCCACTATGGCATCCTGTGATCATGGGGTAGCGGCCAAACTGAAACGTGTTATCATGGAAAGAGACACATACCCTCGCAAATGGGGCTTAGGTCCGAAAGCATCTCAAAAGAAAATGCTTATCCAGCAAGGGAAATTAGATAAATTTGGAAAACCCAACGAAAACACACCGTCCGAATGGTTGAATAGCTATGTAGACTACAAAGCTAAGAAGGACACAGAGAACGGTGATGCACAGGAAGATGCAGGTAGAAAGAGAACCGCTAGCACAGCGAACGCTGACAACCCGAATAACTCGACAGAAATCAAGTCGGAGAAAAAGAAGAAAAAGAAAAAACGTGACACCGACGTAGAAATGGATAATGAAGCTGATACAACGGTAGACCCGGATCAGACAATAGAAGGGGATGAGTCGGTGCGCAAAGAAAAGAAAAAAAAGAAAAAGAAGGATAAAGATCAAGAGAGACAGGACGAGTAA

Protein sequence:

>DPOGS203876-PA
MTEVMQSGSEVFSEKKKKKNKETVSLGAFQKIGDFKIEPSESVTKLDTAYWPLLLKNFDRLNVRTNHYTPLPFGSSPLKRPISEYVKAGFINVDKPSNPSSHEVVSWIKRILKVEKTGHSGTLDPKVTGCLIVCIDRATRLVKSQQNAGKEYVAVFNLHSPVENLKKVTQGLEKLRGALFQRPPLISAVKRQLRVRSVYDSKLLDYDQERNIGVFWVSCEAGSYIRTMCVHLGLMLGVGGQMIELRRVRSGIQGEKEGMVTMHDILDAQWAYENHKDETYLRRVIKPLEGLLVAHKRIFIKDSAVNAVCYGAKVLLPGILRYEDGIEVDQEIVIVTTKGEAVALAIALMTTSTMASCDHGVAAKLKRVIMERDTYPRKWGLGPKASQKKMLIQQGKLDKFGKPNENTPSEWLNSYVDYKAKKDTENGDAQEDAGRKRTASTANADNPNNSTEIKSEKKKKKKKRDTDVEMDNEADTTVDPDQTIEGDESVRKEKKKKKKKDKDQERQDE-