Monarch geneset OGS2.0

DPOGS202470
Transcript        DPOGS202470-TA  2103 bp
Protein           DPOGS202470-PA  700 aa
Genomic position  DPSCF300174 + 415100-420383
RNAseq coverage   266x (Rank: top 40%)
Annotation
Heliconius      HMEL017302        2e-84   48.70%
Bombyx          BGIBMGA010884-TA  1e-42   42.86%
Drosophila      CG6745-PA         2e-100  36.07%
EBI UniRef50    UniRef50_B0VZ88   6e-135  44.50%  tRNA pseudouridine synthase D n=5 Tax=Culicidae RepID=B0VZ88_CULQU
NCBI RefSeq     XP_001841699.1    1e-135  44.50%  tRNA pseudouridine synthase D [Culex quinquefasciatus]
NCBI nr blastp  gi|170027628      2e-134  44.50%  tRNA pseudouridine synthase D [Culex quinquefasciatus]
NCBI nr blastx  gi|170027628      6e-137  43.79%  tRNA pseudouridine synthase D [Culex quinquefasciatus]
Group
Gene Ontology    GO:0003723  2.6e-159  RNA binding
                 GO:0009451  2.6e-159  RNA modification
                 GO:0009982  2.6e-159  pseudouridine synthase activity
                 GO:0001522  2.6e-159  pseudouridine synthesis
KEGG pathway 
InterPro domain  [4-677]   IPR017091  2.6e-159  Pseudouridine synthase TruD, eukaryotic
                 [44-669]  IPR001656  9.9e-150  Pseudouridine synthase, TruD
                 [58-663]  IPR020103  7.1e-89   Pseudouridine synthase, catalytic domain
Orthology group  MCL13352  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202470-TA
ATGCGAGGTGGGCATAGAGGTGGTCGCGGTCAAAACCGTGGGAATAGAGGATTCAGAGGTTCGAGAGGTAATTATTCGCGTGGTGACAGACAAAATAATTTGGGATATAATAATTACAATAAGCCATGGCATTCGCAAACCAAACGTGATATTCCAACTAAACGACTCTCTGAAGAGGATATTGGAGTCACCGAGTACCTCAGTGAGCATCAAGGATTTAACGGAATCATAAAGTCGAGATTTTCAGATTTTCAAGTGTCAGAAATAAATGAGGCGGGTGAAGTTGCTCAGCTGACAGATGAGTCTCCACCCGAAGCACCATTAGAGGAGCTGGTTGAAGACGACGAAGAACTGCTTCTCAATAAATACAATTTGGAAATTCTACCTATGGAAACTTGGGACAAAATAAACAAGTTGGCTGTCTCTACTGAACAAACTTCAGAAAAAATTGAGGTTGATGTATCAGGAATGTCCAAAGAGGACAGGACAAAAATCCACGAGGCTGTCAAAAAAGCGTTCGGTGAAAGCATCATTGGTAGCACAGTCACGGTAGAAGACAAGAAGTTTGTTACATTCACCAAGTATCGCAAGGGAGTCAGATTTGATAATCGTGTGAAGTGGGTGTGGGGTGGTGAGTATGTACATTTTGTGGTGTACAAGGAGAACTGCGATACCATGGAAGCTGCATCAGTACTGGCGCAAAGGTTACATATAAGCCCTTCAATGCTGGGTTATGCTGGAACTAAAGATAGGAGAGCTAAGACCAGCCAGTGGTTCAGCATCCGTAGGTTCGAACCTCGTCGTATAGCGTCAGCGGCCCAGCAGAATCGGATGCCTAGAGTGCGGGTTGGCAATTTCTGCTTCAAGGATTACCATCTCAAGCTGGGCATGCTGAAAGGAAACAAATTTCGTATATGTCTGCGTAACGTGACGGCGCCTGACGACTGTGTCGACGTGGCGTGCAGGTTGTTGAACGAGAACGGATTCATTAATTACTACGGTCTGCAGAGGTTCGGTTCGCGCGCTGATGTGCCGACGCACGAGATCGGTCTGAAACTGCTGCAGGGGAATTTTAAAAAGGCTATAGACAGTATACTGACCCAGCGGCCTGATGGACCCCTGAGTGCGGCGTTACAGCAGTACTCGTCGGGGGACGTGCGGGCGGCGGGGAGGGCGGTGCCGCCGCGTGACGTCACACCGGAAGCCCGACTCCTGAGGGCGCTCGCAGCTCGACCCAACGACCTGCTGGGGGCGCTGCAACAGGTGTCCCGTAACTGTCGTCTCCTCTACATACATTCGTACCAGTCATTGATATGGAACCGAGCGGCCACACATCGTCTGACACAATCTAGGACACCTTTAGTGGGAGATCTAGTGCCTTTAGAGCCGGTGGCTCAGGTTACTGAAGACGAACTAGATGAAGAATCTGATAATGAGGACAGCGAAAATGTTACACAAGATCAGAACACAGATAACATTGAGAAAACAGAGGAAGCGACTTGTAATGGCACAGAAAATAACACTGATAGCAATGAGAACACAGATAAAAAAAAGAAGGGCAACACTCCCAAAAAAGTGATACCAGTTAAAGTGTTAACAAAGGAAGACATTGATAGCGGGAAATATAGTATCTTTGATATAGTGTTGCCATTACCCGGCTACAGTATAGAGTACCCTCCTAACATGAGGGAGTTCTATGAAGAGTGTTTAACTAAGGACGAACTGACAATGGATATGAAGCATAAGTATAAATCGTACAACCTGAGCGGTGGTTACCGTCACGTGGTGTGTCGCGTGTCGGACATGTCCTGGAGGTGTGTGAGGTACGACCGGCCCACCGACGACCTCGTGCTCTCAGACGCCGACCTCATTGACGGGGTCAACATCACTCACAATGAGGATGGTAAATACAAGGCTTTGTTGCTCACAATGACCTTGCCCACGAGCTGTTACGCGACTATGGCGCTGCGGGAATTGTTGCGTGTGGGCACTTCGAGTGA
CGTGCAAGCCTTACAGAACGACTACCACGTCGACAAAGACGCTGTCAAAAGAAAGATGGCCGACGACGAAACAGACGCCAAGAAAACTAAAGTTGACGAATAA

Protein sequence:

>DPOGS202470-PA
MRGGHRGGRGQNRGNRGFRGSRGNYSRGDRQNNLGYNNYNKPWHSQTKRDIPTKRLSEEDIGVTEYLSEHQGFNGIIKSRFSDFQVSEINEAGEVAQLTDESPPEAPLEELVEDDEELLLNKYNLEILPMETWDKINKLAVSTEQTSEKIEVDVSGMSKEDRTKIHEAVKKAFGESIIGSTVTVEDKKFVTFTKYRKGVRFDNRVKWVWGGEYVHFVVYKENCDTMEAASVLAQRLHISPSMLGYAGTKDRRAKTSQWFSIRRFEPRRIASAAQQNRMPRVRVGNFCFKDYHLKLGMLKGNKFRICLRNVTAPDDCVDVACRLLNENGFINYYGLQRFGSRADVPTHEIGLKLLQGNFKKAIDSILTQRPDGPLSAALQQYSSGDVRAAGRAVPPRDVTPEARLLRALAARPNDLLGALQQVSRNCRLLYIHSYQSLIWNRAATHRLTQSRTPLVGDLVPLEPVAQVTEDELDEESDNEDSENVTQDQNTDNIEKTEEATCNGTENNTDSNENTDKKKKGNTPKKVIPVKVLTKEDIDSGKYSIFDIVLPLPGYSIEYPPNMREFYEECLTKDELTMDMKHKYKSYNLSGGYRHVVCRVSDMSWRCVRYDRPTDDLVLSDADLIDGVNITHNEDGKYKALLLTMTLPTSCYATMALRELLRVGTSSDVQALQNDYHVDKDAVKRKMADDETDAKKTKVDE-
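
The transcript and protein lengths in this record are consistent with each other: 700 residues plus one stop codon correspond to 3 x 701 = 2103 bp. The minimal Python sketch below illustrates that check and translates the first ten codons of DPOGS202470-TA. The function names and the partial codon table are illustrative assumptions, not part of the database; the table covers only the codons that occur in the 30 nt prefix, using the standard genetic code.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2103
PROTEIN_AA = 700

# First 30 nt of DPOGS202470-TA and first 10 residues of DPOGS202470-PA.
CDS_PREFIX = "ATGCGAGGTGGGCATAGAGGTGGTCGCGGT"
PROTEIN_PREFIX = "MRGGHRGGRG"

# Partial standard genetic code: only the codons present in CDS_PREFIX.
# (Hypothetical helper table; a full table has 64 entries.)
CODON_TABLE = {
    "ATG": "M", "CGA": "R", "CGC": "R", "AGA": "R",
    "GGT": "G", "GGG": "G", "CAT": "H",
}


def expected_cds_length(protein_aa: int) -> int:
    """Length in bp of a CDS encoding protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def translate_prefix(cds: str) -> str:
    """Translate complete codons; raises KeyError for codons not in CODON_TABLE."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)  # True
    print(translate_prefix(CDS_PREFIX) == PROTEIN_PREFIX)    # True
```

For a full-length translation, a complete codon table (or an established library) would be needed in place of the partial table used here.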