Monarch geneset OGS2.0

DPOGS215299
Transcript: DPOGS215299-TA, 921 bp
Protein: DPOGS215299-PA, 306 aa
Genomic position: DPSCF300120 - 402675-404116
RNAseq coverage: 302x (Rank: top 37%)
Annotation
Heliconius: HMEL011859, 4e-137, 72.22%
Bombyx: BGIBMGA007612-TA, 2e-76, 75.76%
Drosophila: CG7849-PB, 1e-68, 38.26%
EBI UniRef50: UniRef50_E2AVX7, 3e-85, 49.01%, Probable tRNA pseudouridine synthase 2 n=5 Tax=Formicidae RepID=E2AVX7_CAMFO
NCBI RefSeq: XP_973881.1, 6e-81, 47.84%, PREDICTED: similar to AGAP002077-PA [Tribolium castaneum]
NCBI nr blastp: gi|307169890, 1e-84, 49.01%, Probable tRNA pseudouridine synthase 2 [Camponotus floridanus]
NCBI nr blastx: gi|307169890, 1e-82, 49.01%, Probable tRNA pseudouridine synthase 2 [Camponotus floridanus]
Group
Gene Ontology:
GO:0003723, 5.3e-35, RNA binding
GO:0009451, 5.3e-35, RNA modification
GO:0009982, 5.3e-35, pseudouridine synthase activity
GO:0001522, 5.3e-35, pseudouridine synthesis
GO:0006396, 1.7e-16, RNA processing
KEGG pathway 
InterPro domain:
[13-292] IPR020103, 5.3e-35, Pseudouridine synthase, catalytic domain
[97-227] IPR002501, 1.7e-16, Pseudouridine synthase II
Orthology group: MCL13746, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215299-TA
ATGGTAAACATTAAAGATGCTGCACTCGCATACAAAACTTTAAATGGAATTATTTGTGTTTATAAACCATCCTGCGTTAGTGTGCGTCAAGTTCAGAATACAATTTTAACTCATTTATGTAGAGATTTAAACTCCTTACCCGACCGACAGCTTGAAAAACGAGTTGAGATTATTGGTGCAACAAATGAACCAATGCAAGTTAAATTGGCACCAAACTATGCAGACCATCCACTTGCATGTGGTCCACGATACATAAAAGAAGACTTCCGATGCAGTTGGGCTACTCATTTAGGCTTATTTAGTAGTGGGGTTTTATTGTTAGGGATCAATGATGGAACAAAACTTACATACCAAATAAATACAGCGAGACCAACAAGGGCATTCAAAGTACACGGGCAGTTTGGAAAAGCAACGGATACATATTTTTGGAATGGACGGACAATGGAACGAGCTTCTTACAAACATGTTACAAGGGAAAAACTAGATAGAGTTGTAGCTCATATACAGGCCGCTCATCAGAAAACTATGTTCGAATTATCAGGTCTTCATATGGAGTCCCAAACAGCTTATGACTTGGCATCACAGGGTCTCATAAGACCAGCTAACAGTAAATTACCGATCATTTATGGAGTCAAATGTGTGCATTTCGATTCCCCCAATTTCACGTTAGAGATTCAGTCAGTGAATGAATATGATAAATATTTGTGGACTCTAGTCCATGATTTAGGCATTCAATTGAAGACAGCAGCTCACTGCACAGGGTTGCAGTGTATCAGACAAGGACGGTTCAATCTACAATTAGCACTGTTAAGAAAACACTGGCAACTCAATCATATATTAAATAATATGGATCAGTGTCGCCAACTGTTAGAAGAAAATGAAAACTTATTGAAGCCTAAATCAGCCCATCTAACTGTTTGA

Protein sequence:

>DPOGS215299-PA
MVNIKDAALAYKTLNGIICVYKPSCVSVRQVQNTILTHLCRDLNSLPDRQLEKRVEIIGATNEPMQVKLAPNYADHPLACGPRYIKEDFRCSWATHLGLFSSGVLLLGINDGTKLTYQINTARPTRAFKVHGQFGKATDTYFWNGRTMERASYKHVTREKLDRVVAHIQAAHQKTMFELSGLHMESQTAYDLASQGLIRPANSKLPIIYGVKCVHFDSPNFTLEIQSVNEYDKYLWTLVHDLGIQLKTAAHCTGLQCIRQGRFNLQLALLRKHWQLNHILNNMDQCRQLLEENENLLKPKSAHLTV-
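
Consistency check (sketch):

The transcript and protein records above can be cross-checked against each other. The Python sketch below is not part of the original record. It assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The names `translate`, `CODON_TABLE` and `stop_symbol` are illustrative only. It translates the first and last codons of DPOGS215299-TA and confirms that 921 bp corresponds to 306 residues plus one stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" here, an assumption
    chosen to match the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# First 18 nt and last 24 nt of DPOGS215299-TA, copied from the record.
first = translate("ATGGTAAACATTAAAGAT")
last = translate("AAATCAGCCCATCTAACTGTTTGA")

# 921 bp = 307 codons; one of them is the stop codon.
expected_aa = 921 // 3 - 1

print(first, last, expected_aa)
```

Passing the full nucleotide sequence to `translate` in the same way would allow a complete comparison with DPOGS215299-PA.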