Monarch geneset OGS2.0

DPOGS211709
Transcript: DPOGS211709-TA (1332 bp)
Protein: DPOGS211709-PA (443 aa)
Genomic position: DPSCF300423 - 19748-24050
RNAseq coverage: 434x (Rank: top 28%)
Annotation
Heliconius: HMEL009679; 3e-167; 70.73%
Bombyx: BGIBMGA008689-TA; 7e-125; 56.25%
Drosophila: CG4159-PB; 1e-81; 44.70%
EBI UniRef50: UniRef50_E1JIR1; 2e-79; 44.70%; Pseudouridine synthase n=6 Tax=melanogaster subgroup RepID=E1JIR1_DROME
NCBI RefSeq: XP_002054173.1; 4e-82; 43.91%; GJ24293 [Drosophila virilis]
NCBI nr blastp: gi|195391039; 7e-81; 43.91%; GJ24293 [Drosophila virilis]
NCBI nr blastx: gi|195391039; 1e-77; 43.85%; GJ24293 [Drosophila virilis]
Group
Gene Ontology: GO:0003723; 5.4e-71; RNA binding
GO:0009451; 5.4e-71; RNA modification
GO:0009982; 5.4e-71; pseudouridine synthase activity
GO:0001522; 5.4e-71; pseudouridine synthesis
KEGG pathway 
InterPro domain: [63-363] IPR001406; 5.4e-71; Pseudouridine synthase I, TruA
[83-315] IPR020103; 2.6e-45; Pseudouridine synthase, catalytic domain
[85-158] IPR020094; 1.5e-17; Pseudouridine synthase I, TruA, N-terminal
[159-296] IPR020095; 3.7e-15; Pseudouridine synthase I, TruA, C-terminal
[91-176] IPR020097; 8e-10; Pseudouridine synthase I, TruA, alpha/beta domain
Orthology group: MCL13833; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211709-TA
ATGTCCCTTAAATTTATCAGAGTATTCGTTAGTTCATTGAGGCAGCATCTACCACGACAGAACAATTTATTATCAGGTTTTCAAATACGAACTATAACAGCAATGGAGGTTGCGCCTGTTGATAAAAAGGACAATTTAGTTCAGGATAAAAATCGCACTAGGTATAAACGAAGGCGACAATGGGACGATAGAAAAGAAAATGGTGAGAGTCAAGAGAAAAAAACATGCGATAAACCTTTCGAGAGAATTAAAAGAAAAAAAATGGCCATGCTACTCGGCTATTGCGGAGTTGATTATTATGGTATGCAAAGGAACCCCGGAGTTCCGACTATTGAAGAGGATCTCCTAAAAGCTTTATATGAAGCGAAATATATCACGGAAGATGATTTTAACAACCAGCAGAATGCACAGTTCCAAAGAAGTTCTAGGACCGACAAGGGTGTCTCAGCGGCTAGGCAGGTGGTAACATACAGCTACACGCTACCTACTTACGTGTTTGAGTCAAATGTCGCATCGGAAGATGAGAGGAAAGCATACAGAATCACGTCTCAGAAGATAGATCAAGTAAATGAAGTGCTTGGTTACTATAAAGGCACAAAGAGTTACCATAACTTCACCGAGAAGAAACATTTCCAAGATCCGTCATCTCTGAGGTACATGATGAGTTTTGTATTGGAGAGGGTTTTCATGGAGTCGGAAATGGAGTTCGCGGAATTGCTAGTTAAAGGACAAAGTTTCATGTTGCATCAAATAAGAAAGATGATAGGGTTGATGATCGCTGTAGTTAGAGGTCATACAGACATTTCTACATTGGAGAAGTCGTTTGGCAAAGAAAAGGTCATGATACCAACAGCGCCTGGCCTGGGATTGGTTCTCGATAAGGTACATTACGAGAGATATGACGCCAAATTCAAAGACAGCCACGAAAGCTTAACCTGGGACGAAGAGGAAGACGCAGTGGAGAAATTCAAACGAGAAAAAATATTCCCTAACATTGTTAAAGGTGAATTGGAATCAAATTCGATGGGATTGTGGCTCGAGAAGATGAAGAATCATTCTTATGAACCATCTGAAGATGCTAATGATGAAAGAGTTAAAGACGAAGAGAAAGATTGTGGTGATGATGATGGTGATGATGAAATAAAAGATGATGATGACGATGAGGTTGAGGTTAAAGACCTTGGGAATAAAGTAGAAGATGCTAGTGACGCTGAAGTAAAAGATAATGGTTATGAAGTAAAAGATAATGGTGATGAAGTAAAAGATAATGGTGATGAAGTAAAAGATAAAAGTGATGTTAGTGAAATAGAAGCTAAGAAGGTGAACGTGTAA

Protein sequence:

>DPOGS211709-PA
MSLKFIRVFVSSLRQHLPRQNNLLSGFQIRTITAMEVAPVDKKDNLVQDKNRTRYKRRRQWDDRKENGESQEKKTCDKPFERIKRKKMAMLLGYCGVDYYGMQRNPGVPTIEEDLLKALYEAKYITEDDFNNQQNAQFQRSSRTDKGVSAARQVVTYSYTLPTYVFESNVASEDERKAYRITSQKIDQVNEVLGYYKGTKSYHNFTEKKHFQDPSSLRYMMSFVLERVFMESEMEFAELLVKGQSFMLHQIRKMIGLMIAVVRGHTDISTLEKSFGKEKVMIPTAPGLGLVLDKVHYERYDAKFKDSHESLTWDEEEDAVEKFKREKIFPNIVKGELESNSMGLWLEKMKNHSYEPSEDANDERVKDEEKDCGDDDGDDEIKDDDDDEVEVKDLGNKVEDASDAEVKDNGYEVKDNGDEVKDNGDEVKDKSDVSEIEAKKVNV-
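
Consistency check (illustrative sketch):

The transcript and protein lengths listed above (1332 bp, 443 aa) fit together if the transcript is a pure coding sequence running from the start codon to the stop codon: 1332 / 3 = 444 codons, one of which is the stop. The Python sketch below shows one way such a check might be done. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; the function names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, mirroring the trailing "-"
    in the protein record (an assumed convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS that ends with exactly one stop codon."""
    return cds_length_bp // 3 - 1


# First six codons and last four codons of DPOGS211709-TA.
print(translate("ATGTCCCTTAAATTTATC"))   # MSLKFI
print(translate("GTGAACGTGTAA"))         # VNV-
print(expected_protein_length(1332))     # 443
```

Running `translate` on the full DPOGS211709-TA sequence and comparing the result with DPOGS211709-PA would extend the same check to the whole record.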