Monarch geneset OGS2.0

DPOGS204924
TranscriptDPOGS204924-TA1515 bp
ProteinDPOGS204924-PA504 aa
Genomic positionDPSCF300160 - 713861-718640
RNAseq coverage140x (Rank: top 55%)
Annotation
HeliconiusHMEL0071923e-16354.44% 
BombyxBGIBMGA011417-TA2e-3568.75% 
DrosophilaCG3045-PA1e-10447.09% 
EBI UniRef50UniRef50_B0XGH82e-10749.00%Pseudouridine synthase n=4 Tax=Culicidae RepID=B0XGH8_CULQU
NCBI RefSeqXP_001602396.13e-11354.30%PREDICTED: similar to pseudouridylate synthase [Nasonia vitripennis]
NCBI nr blastpgi|1565370856e-11254.30%PREDICTED: tRNA pseudouridine synthase 3-like [Nasonia vitripennis]
NCBI nr blastxgi|1565370852e-10854.30%PREDICTED: tRNA pseudouridine synthase 3-like [Nasonia vitripennis]
Group
Gene OntologyGO:00037231.3e-93RNA binding
GO:00094511.3e-93RNA modification
GO:00099821.3e-93pseudouridine synthase activity
GO:00015221.3e-93pseudouridine synthesis
KEGG pathway 
InterPro domain[47-371] IPR0014061.3e-93Pseudouridine synthase I, TruA
[73-347] IPR0201032.8e-58Pseudouridine synthase, catalytic domain
[75-195] IPR0200941.5e-25Pseudouridine synthase I, TruA, N-terminal
[198-325] IPR0200951.1e-23Pseudouridine synthase I, TruA, C-terminal
[217-330] IPR0200973e-20Pseudouridine synthase I, TruA, alpha/beta domain
Orthology groupMCL14032 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204924-TA
ATGTCGAAACAAATTAATAAATTACCACCAAAACAGAGAAAAACTAAAGGCTTATCTAGAGAGGAGCTTATGAATATGGATAAAAATGAATTAGTTGATAGAATAATACAGTTGGAAGCTCACACTACGCAACTTAAAAATATAATAAGCAAAAGTGAACCAGTTACAGAGAATATACAGGGTTACAATAATCAAAGAAAATTTGATTTCACGAAGTGTACCTTCCGACGGGTTCTGCTACATATAATATACTTCGGCTGGGATTACCACGGGCTGGCCGTCCAGGAGGATTCAACGCACACAATTGAGCATTACCTCTTCAATGCCCTCGTCAAGTCATGTCTCATTGAGAGTAGAGAACAGTCACAGTACCATCGATGTGGAAGGACAGATAAAGGCGTCAGTGCCTTCGGACAGATAATATCTATATCATTACGGAGCAAACTGGAACCGTCTTCAACCGACTACTCATCCGAGATCCAATACTGCAAGATCCTTAACAGATTGTTTCCGAGGGATATTAAAGCGGTAGCCTGGATGCCCATCCCTGATGATAGACCAGATTTCAGTGCAAGATTCGACTGTAAGGGCCGGCAGTACAAGTACTATTTCCCGAAATCTAATCTCAATATAACCGCTATGAGGGAGGCCTGTCGCCAGCTCATCGGTTCACACGACTTCCGCCACCTCTGCAAGATGGACGTGGGGAACGGCGTCACTGAGTTCACAAGGCGTGTCGTATCAGCTGACATTATAGCTCTGGATAAGGATTGCGAACAGACAACATCGATGTACGCATTAGTGATAGAAGGTAATGCATTTCTGTGGCATCAGATCAGGTGTATAATGGGCGTGTTGTTGCTCGTGGGCCAAGGACACGAGAGCCCGGGTATCATAGCCGAATTACTGGACGTCGAAGCAAATCCACGCAAACCTCAATACAATATGGCTCTGGATTTGCCGTTGAACCTGTTCTGCTGCAGATATGATGTGAAGAGCCGCTGGGTTTATGACGACGAGGAGCTCAAATACATCATCACCAACTTACAGGCGGACTGGACCTTGTATAATGTCAAATCCACCATGATAAAAGATGCTCTGGAACATCTGGAAGGTGTCCTATATGACTTGAGCAAGGAGGGGAAAAAGTGTGACAGAGACGGAGAATATAACGACGTGGGAAGGAGAGAGATGCAAGATAAGGAAGATGTAGCGTTAGAGGGAGACCAAGATAAAATATGTGACATAGACAGAAATTTAGAAGAGTTGGGAGAGAAAGAGAAAGAAGATGATAACAATAAGTGCGAGAGAGACAGGGGATTAAAAGAGTTGGAAGGGAAAGAGACGGGAGATAGAATAATATCGCACGCAGAATGCCTGCTACAAGGAGTCAAACCAAAAATATACACACCGCTGTTGAAAAGACAAACCTGCTCGAGTCTGCAGGAACGATTGCAATACTACAGGAAGAAAAGGAAAGTGGAGAGCGGTTCTGATGATGAAGAAATAAAATAA

Protein sequence:

>DPOGS204924-PA
MSKQINKLPPKQRKTKGLSREELMNMDKNELVDRIIQLEAHTTQLKNIISKSEPVTENIQGYNNQRKFDFTKCTFRRVLLHIIYFGWDYHGLAVQEDSTHTIEHYLFNALVKSCLIESREQSQYHRCGRTDKGVSAFGQIISISLRSKLEPSSTDYSSEIQYCKILNRLFPRDIKAVAWMPIPDDRPDFSARFDCKGRQYKYYFPKSNLNITAMREACRQLIGSHDFRHLCKMDVGNGVTEFTRRVVSADIIALDKDCEQTTSMYALVIEGNAFLWHQIRCIMGVLLLVGQGHESPGIIAELLDVEANPRKPQYNMALDLPLNLFCCRYDVKSRWVYDDEELKYIITNLQADWTLYNVKSTMIKDALEHLEGVLYDLSKEGKKCDRDGEYNDVGRREMQDKEDVALEGDQDKICDIDRNLEELGEKEKEDDNNKCERDRGLKELEGKETGDRIISHAECLLQGVKPKIYTPLLKRQTCSSLQERLQYYRKKRKVESGSDDEEIK-