Monarch geneset OGS2.0

DPOGS200450
TranscriptDPOGS200450-TA2274 bp
ProteinDPOGS200450-PA757 aa
Genomic positionDPSCF300260 - 261078-270120
RNAseq coverage125x (Rank: top 57%)
Annotation
HeliconiusHMEL0127862e-14567.37% 
BombyxBGIBMGA011230-TA6e-11854.94% 
DrosophilaCG3709-PA2e-5032.98% 
EBI UniRef50UniRef50_E0VC889e-6738.85%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VC88_PEDHC
NCBI RefSeqXP_002423732.12e-6738.85%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420057653e-6638.85%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|2420057653e-6237.63%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00037232.6e-08RNA binding
GO:00094512.6e-08RNA modification
GO:00099822.6e-08pseudouridine synthase activity
GO:00015222.6e-08pseudouridine synthesis
KEGG pathway 
InterPro domain[601-748] IPR0201032.6e-08Pseudouridine synthase, catalytic domain
Orthology groupMCL14821 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200450-TA
ATGGATGAAAAGGCTATATATAATTATTGTAGAAACCTCGGCTGCTGTGTAGCTTGCTGCCTTCGATATCTTGGTATTAAAAACTCGAACGCGTATGCCAATCCAACAAATTTTGCATTAAAATTTCAAGCCGAGGAGACTTTAAATAAAGCGTCAGAAGACGCTGAAAAATGTACGGGTAAAATTATTGAGGGCCTTAGTGATGCAGTAAATGAATCACAAAGCGAAGAAATAAAGATACCAGATGATATAAGTATTGAAAATACACAAGAGAATAATTTAAACTCAAGTAATACTATAAATGAATTGAGTAACGGTCTTGAAGTTTTTGGTAACGGTTGCTCTCCACCATCTAAGAAACTTAAATTAGCGACATGCATCAGTTGCTTAGGTGTTTTACAGGAATTCACTTGGCAAGAGAGCATTAATATGGTGAAAGAAGTCCTCGATAAGAAGGGTTATGCTTGCAACACATTCGCCTGCGCCCTCTCTGCACCAATAGCCACTATGTTACGTGAACGAGTCATCACCCTGCAACTTACCGATGAGTTCCCGGGATATGATCATAATTCTCTCACTCCGTTAAAGGAAGCCTGGAAATGGTCGTTTGGTACACATTTATCGAGACACATCCATAAAACTCTGGACTCGGGAGCCATTTCTCCGTTGTTGGTCACACTGAATATTGACTACGATGATGATATACAGGAATTGGAGATCCTCAAGAAGGTTTCACCGCAGCTGTTCGAAGAGAGAAGCAGGCAGAGGAGAAGATTTGCTACAGAATTCACTCGTAGATCAGTCGAACAGGCGCTAGAGACCGCCACGCTGCCCTCTTTCCGTGCGGCGGGTGATGCTCTATCCCGGGTTAGCTCTAACGCACACTGCGTTAGTGTCATATGCACACACGCGCCCATTTACGTGTGCGGGAGGTATGTAAAGCTGAGTAGAAACCTTCCCCAATCTCCATGGGTCTTGGATGGTAAGAGGGTCCTGCCGTCGTCAGTGCAGGAGATAATATTTGGACCCATAGCCAAATTTTACAATTTCAGCGATGTTGACTGTGAGGGTAGATTGAAATTTATAGCAGCCGGTAGAGAGGACGTTGATGTGAGGTGCCTGGGTGACGGCAGGCCATTTGCTATTGAGGAAGCCTGGAAATGGTCGTTTGGTACACATTTATCGAGACACATCCATAAAACTCTGGACTCGGGAGCCATTTCTCCGTTGTTGGTCACACTGAATATTGACTATGATGATGATATACAGGAATTGGAGATCCTCAAGAAGGTTTCACCGCAGCTGTTCGAAGAGAGAAGCAGGCAGAGGAGAAGATTTGCTACAGAATTCACTCGTAGATCAGTCGAACAGGCGCTAGAGACCGCCACGCTGTCCTCTTTCCGTGCGGCGGGTGATGCTCTATCCCGGGTTAGCTCTAATGCACACTGCGTTAGTGTCATATGCACACACGCGCCCATTTACGTGTGCGGGAGGTATGTAAAGCTGAGTAGAAACCTTCCCCAATCTCCATGGGTCTTGGATGGTAAGAGGGTCCTGCCGTCGTCAGTGCAGGAGATAATATTTGGACCCATAGCCAAATTTTACAATTTCAGCGATGTTGACTGTGAGGGTAGATTGAAATTTATAGCAGCCGGTAGAGAGGACGTTGATGTGAGGTGCCTGGGTGACGGCAGGCCATTTGCTATTGAGATAACAGATCCCAAGCGAGAGCTGACATCTGAAGAATTGAATGAAGCCTGTGACATGATATCTACAAGTGAGGAGGTGGTTGTTAAGAAACTTGTGCCCATCAATAGAGAGGACCTCGCGTTACTGAAGAAAGGGGAGGAGACTAAGAGCAAGACTTATGAGGCGTTGTGCATCAAACTGACTCACTCCAAACATGATGATTGCCCTCCAGACAGTCCGATAACAGTGACGCCCACAGACTTGCAACTGATTAACGACTATAAGGAATCGGATGATGTAACGATGACGATATCACAGAAGACGCCTCTTCGAGTACTACACCGACGACCTCTCCTGGTCAGGAAAAGAGAGATTCTGGAAATGTCCGCTGTGCCTGTACCCGAACATCCTCAGTTGTTTCGTTTGTTTGTCCGCACGTCCGCGGGGACTTACGTCAAGGAGTGGGTGCACGGCGAGTTGGGGCGGAGCACTCCGTCCTTAGGAGACGTGATGGGATCGCGAGTAGACCTGCTGGCGCTGGACGTTACAGCTGTACATTTACAATGGCCGCCAGTCAAAGAATAA

Protein sequence:

>DPOGS200450-PA
MDEKAIYNYCRNLGCCVACCLRYLGIKNSNAYANPTNFALKFQAEETLNKASEDAEKCTGKIIEGLSDAVNESQSEEIKIPDDISIENTQENNLNSSNTINELSNGLEVFGNGCSPPSKKLKLATCISCLGVLQEFTWQESINMVKEVLDKKGYACNTFACALSAPIATMLRERVITLQLTDEFPGYDHNSLTPLKEAWKWSFGTHLSRHIHKTLDSGAISPLLVTLNIDYDDDIQELEILKKVSPQLFEERSRQRRRFATEFTRRSVEQALETATLPSFRAAGDALSRVSSNAHCVSVICTHAPIYVCGRYVKLSRNLPQSPWVLDGKRVLPSSVQEIIFGPIAKFYNFSDVDCEGRLKFIAAGREDVDVRCLGDGRPFAIEEAWKWSFGTHLSRHIHKTLDSGAISPLLVTLNIDYDDDIQELEILKKVSPQLFEERSRQRRRFATEFTRRSVEQALETATLSSFRAAGDALSRVSSNAHCVSVICTHAPIYVCGRYVKLSRNLPQSPWVLDGKRVLPSSVQEIIFGPIAKFYNFSDVDCEGRLKFIAAGREDVDVRCLGDGRPFAIEITDPKRELTSEELNEACDMISTSEEVVVKKLVPINREDLALLKKGEETKSKTYEALCIKLTHSKHDDCPPDSPITVTPTDLQLINDYKESDDVTMTISQKTPLRVLHRRPLLVRKREILEMSAVPVPEHPQLFRLFVRTSAGTYVKEWVHGELGRSTPSLGDVMGSRVDLLALDVTAVHLQWPPVKE-