Monarch geneset OGS2.0

DPOGS211835
TranscriptDPOGS211835-TA1647 bp
ProteinDPOGS211835-PA548 aa
Genomic positionDPSCF300031 + 881401-895251
RNAseq coverage398x (Rank: top 30%)
Annotation
HeliconiusHMEL0033100.085.71% 
BombyxBGIBMGA006020-TA0.079.71% 
DrosophilaRluA-1-PB0.062.81% 
EBI UniRef50UniRef50_B4LVB50.063.36%GJ13872 n=10 Tax=Neoptera RepID=B4LVB5_DROVI
NCBI RefSeqXP_553504.30.065.50%AGAP009693-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582985720.065.50%AGAP009693-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571165270.066.18%ribosomal pseudouridine synthase [Aedes aegypti]
Group
Gene OntologyGO:00037231.8e-68RNA binding
GO:00094511.8e-68RNA modification
GO:00099821.8e-68pseudouridine synthase activity
GO:00015221.8e-68pseudouridine synthesis
KEGG pathwaydme:Dmel_CG61876e-166 
 K01718 (E4.2.1.70)maps-> Pyrimidine metabolism
InterPro domain[86-333] IPR0062251.8e-68Pseudouridine synthase, RluC/RluD
[153-537] IPR0201039.2e-52Pseudouridine synthase, catalytic domain
[171-318] IPR0061454.8e-33Pseudouridine synthase, RsuA and RluB/C/D/E/F
Orthology groupMCL11180 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211835-TA
ATGGCTGTTAACTTCACGTCTCGAGGGGGTGAGAAAGACAACGCCAAAAACTGCATACTATCCTGCGAGAAGCGAAAAGCAGACGATTCGGACGTCAACAAAGATTTGAAAAAGGCTAAACTCGAAACAAAAGCTCTGAAGGCAAAAAGACCTGGGTTCACGGATGACAGATACAACGAAACTTCTTATTATATAGAAAATGGACTTAGAAAAGTTTACCCCTACTATTTTACGTTCACAACCTTCACTAAAGGGAGATGGGTAGGTGAGAAGATACTCGACGTTTTCGCGAGGGAGTTTAGAGCTCATCCCGCTGCGGAGTACGAGAGATGCATCAGGGCTGGCACATTAACGGTCAACTATGAACGTGTCGATCCAGATTATAGGCTGAAACACAACGATCTGTTAGCGAACGTCGTCCACAGGGCTTTTAATAATGTTCTGTGGTGGGCCAGGCACGAGACGCCAGTGCTCGCTAGTACCTTGCGCATCATACACGCGGATGAGGAGATTCTCGTGCTAGACAAGCCCTGTTCGTTGCCGGTACACCCATGCGGGCGGTATCGACACAACACAGTTGTTTTCATACTCGCCAAGGAATATAACCTCAAAAACCTTAGAACCATTCATAGACTCGACAGGCTAACTTCCGGTCTCCTATTATTCGGGAGGAGTCCAAAGAAAGCCAGGCAAATGGAACATCAGATAAGGAACAGACAGGTGCAGAAGGAATATGTTTGTCGAGTCGATGGAGAGTTTCCTGACGAGGAGATCGAATGTACGGAACCAATAGAGGTAGTGAGTTATAAAATCGGCGTTTGTAAAGTGTCACAAAAAGGGAAAGACTGTTCCACTACTTTCAAGCGATTAGGATACAATACGAAAAGCAACACCAGCGTTGTGCTCTGTCGACCAAAAACAGGCAGGATGCATCAGATCAGGGTTCATTTACAATACCTAGGTTACCCGGTCGTGAACGATCCATTGTATAATCATCCTGTATTCGGACCCCTTCGTGGTAAGGGTGGTGATACAGGTGGTAAGACTGACGAACAGCTAGTGAGAGACCTGATCGCTATACACAACGCCGAGAACTGGTTAGGTGTTGACGCCGGTGACGATGATATGCTGTTCTCCAAACCGGTAGCTGGTGACAAAGTTGAGGATGAATGTGAGGCTGGGATGGCGTCGAGGGAGTCGTCTCCGAGGTTGGAGTCACCAGCCCCCGGGCTCACACCGGCCACTGTAATGACTGCAATACTGGCGAGTCCATCGAGTGGATCGGAGGCCCCGGTAGAGGTCAGGTCCCCCGCACACTCACCCTCACTTAACGAGGACTCGAATGACGCCAAGTCAGACAAAGTGACAGTAGCGACACAGACCGGGTGCACACCGGCGCATGTCGTACCAAACGTGTCCACTGGTGTTTCGACAAGTGTTTCCAATGTCACGGGCGTGTACACCACCAGCCAGGAACTGACGGTGGACCCGCACTGTTACGAGTGTCGCGTGAGGTACAGAGACCCACGGCCTAGAGACCTTGTTATGTTCCTGCACGCTTGGAAATACAAGGGGCCGGGTTGGGAATACGAAACGGAACTTCCACAGTGGGCCGACATAGACTGGGAAGAATCGGAGAGCTCGTAG

Protein sequence:

>DPOGS211835-PA
MAVNFTSRGGEKDNAKNCILSCEKRKADDSDVNKDLKKAKLETKALKAKRPGFTDDRYNETSYYIENGLRKVYPYYFTFTTFTKGRWVGEKILDVFAREFRAHPAAEYERCIRAGTLTVNYERVDPDYRLKHNDLLANVVHRAFNNVLWWARHETPVLASTLRIIHADEEILVLDKPCSLPVHPCGRYRHNTVVFILAKEYNLKNLRTIHRLDRLTSGLLLFGRSPKKARQMEHQIRNRQVQKEYVCRVDGEFPDEEIECTEPIEVVSYKIGVCKVSQKGKDCSTTFKRLGYNTKSNTSVVLCRPKTGRMHQIRVHLQYLGYPVVNDPLYNHPVFGPLRGKGGDTGGKTDEQLVRDLIAIHNAENWLGVDAGDDDMLFSKPVAGDKVEDECEAGMASRESSPRLESPAPGLTPATVMTAILASPSSGSEAPVEVRSPAHSPSLNEDSNDAKSDKVTVATQTGCTPAHVVPNVSTGVSTSVSNVTGVYTTSQELTVDPHCYECRVRYRDPRPRDLVMFLHAWKYKGPGWEYETELPQWADIDWEESESS-