Monarch geneset OGS2.0

DPOGS212407
TranscriptDPOGS212407-TA1344 bp
ProteinDPOGS212407-PA447 aa
Genomic positionDPSCF300258 - 175922-178255
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0086379e-9864.71% 
BombyxBGIBMGA002807-TA4e-5554.70% 
DrosophilaCG34140-PA9e-4337.31% 
EBI UniRef50UniRef50_B0WVR98e-8037.58%Pseudouridine synthase n=5 Tax=Culicidae RepID=B0WVR9_CULQU
NCBI RefSeqXP_001861491.11e-8037.58%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700508373e-7937.58%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700508371e-7637.58%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00037231.7e-34RNA binding
GO:00094511.7e-34RNA modification
GO:00099821.7e-34pseudouridine synthase activity
GO:00015221.7e-34pseudouridine synthesis
GO:00081384.3e-32protein tyrosine/serine/threonine phosphatase activity
GO:00064704.3e-32protein dephosphorylation
KEGG pathway 
InterPro domain[2-317] IPR0201031.7e-34Pseudouridine synthase, catalytic domain
[310-444] IPR0204224.3e-32Dual specificity phosphatase, subgroup, catalytic domain
[318-440] IPR0003409.3e-30Dual specificity phosphatase, catalytic domain
[124-284] IPR0200957.8e-20Pseudouridine synthase I, TruA, C-terminal
[4-122] IPR0200945e-14Pseudouridine synthase I, TruA, N-terminal
[175-254] IPR0200974.6e-08Pseudouridine synthase I, TruA, alpha/beta domain
Orthology groupMCL15279 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212407-TA
ATGAGAGCTAGATTTGCGACATTCTTTTCATATATTGGAACACGGTTTAGATCGTCTGAAAAAATATGGCTTAAGGAGGGTCGTTACTATCCGGACCCTGAGAGCATTCAAGGTTTAATGGAACTGGCTCTACTTAAAATTAAACCATTAAATTACCCTAACGTATTTCTATCAAGCCGCACAGATGGTGGAGTACATGCTCTGAACACATGTGCACATTTTGATTTGGAGAGACATAAGGATTTAGTTTACGAGCCAGCATACCTAGAACATGAAATCAATAAATACTTCTTTGAGCACAACATAAGCATACAAGTTAACAAATGTCTTCAAGTCAACGATGATTTCAACGCTAGACATGACGCTTTATCACGGACATACCTCTACAGGTTAGCAGTTATAAAACCTGAAATAAAATTAGACGAGAAAATAAACATTGCAACATTGATACCAATCGAAGAATGGAGGAGATGTCACTTCATTAAACCAAAAATTTTTGATATAGAGCTGTTTAAAGAAGGTGCTAAGTACTTTGTGGGATATCACGACTTCACAACATTCAAAAGATTCGACAAATTGAAGGCCCACAAACACAACAGGAGAGAAATTAAGAGTATAAACATAAAACAGGGCTCTCCGTGTGTCACAAGTTATTCAACTAAAAGGGATAGTTGTTTTAATTACTGGGATATTGAGGTACATGGAAGATCCTTTGTACATAATCAGATACGTCGGATGATAGGTACATTAATATCTGTGGCTTCCGTGGACATGAGTTTCTTGAGCGAATTGAAAAAGAAAAAGGAGGAATTGAGGAACGCCGTGACTGTCGTGACTCGTGCTGACGGACTCAAGTTCATCGAAGGTTCTGACGACGTTTTGACAGAAACTTGTTATGGCTTCGTCGTGGACACGAAACCAGATGACGTTCCTGTACTAATCATAGATTATCTGTATATAGGCTCACAGGATTGCGCTGTTGAAAGTGTAGTCGAAGCCTTTAATATAAAACACGTTCTAAGCTTAGGAGTAAATGTGAATGTAGCCAATATTGACCAAAAATATGTTGCCGTTCTGGACTTGCCAGACTTTGATATAAAACCAGTGCTAGCGGAGTGTTTGCCGTACATCAGGGAATGTTTGTCGATGATGCAGAATGTATTGGTTCACTGCAACGCTGGCGTGTCTCGGACAGCCGTTGTGGCCATTGCGTATTTGATGCATTACGAACTAATGACATACAACGACGCTTATGATCTCGTTAAACAAAAACGACCGGCCATTCACCCGAACACTGGATTCAAAAAGCAATTACAGGACGCGACACCCGGAGAATTAATTTGA

Protein sequence:

>DPOGS212407-PA
MRARFATFFSYIGTRFRSSEKIWLKEGRYYPDPESIQGLMELALLKIKPLNYPNVFLSSRTDGGVHALNTCAHFDLERHKDLVYEPAYLEHEINKYFFEHNISIQVNKCLQVNDDFNARHDALSRTYLYRLAVIKPEIKLDEKINIATLIPIEEWRRCHFIKPKIFDIELFKEGAKYFVGYHDFTTFKRFDKLKAHKHNRREIKSINIKQGSPCVTSYSTKRDSCFNYWDIEVHGRSFVHNQIRRMIGTLISVASVDMSFLSELKKKKEELRNAVTVVTRADGLKFIEGSDDVLTETCYGFVVDTKPDDVPVLIIDYLYIGSQDCAVESVVEAFNIKHVLSLGVNVNVANIDQKYVAVLDLPDFDIKPVLAECLPYIRECLSMMQNVLVHCNAGVSRTAVVAIAYLMHYELMTYNDAYDLVKQKRPAIHPNTGFKKQLQDATPGELI-