Monarch geneset OGS2.0

DPOGS206643
TranscriptDPOGS206643-TA954 bp
ProteinDPOGS206643-PA317 aa
Genomic positionDPSCF300048 - 388799-390749
RNAseq coverage117x (Rank: top 58%)
Annotation
HeliconiusHMEL0111423e-13968.06% 
BombyxBGIBMGA008341-TA2e-5281.82% 
DrosophilaCG3021-PA9e-7441.36% 
EBI UniRef50UniRef50_E2C8U01e-8644.97%Mitochondrial tRNA-specific 2-thiouridylase 1 n=10 Tax=Endopterygota RepID=E2C8U0_HARSA
NCBI RefSeqXP_972131.17e-9147.66%PREDICTED: similar to tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase [Tribolium castaneum]
NCBI nr blastpgi|910859871e-8947.66%PREDICTED: similar to tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase [Tribolium castaneum]
NCBI nr blastxgi|910859872e-8847.79%PREDICTED: similar to tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase [Tribolium castaneum]
Group
Gene OntologyGO:00167401.5e-119transferase activity
GO:00057371.5e-119cytoplasm
GO:00080331.5e-119tRNA processing
KEGG pathway 
InterPro domain[1-311] IPR0045061.5e-119tRNA-specific 2-thiouridylase
[3-217] IPR0147296.4e-80Rossmann-like alpha/beta/alpha sandwich fold
Orthology groupMCL12145 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206643-TA
ATGTTTAAGAAAATAGCTGTAGGCATTTCAGGCGGTGTTGATAGTTCAGTCGCTGCTTTATTACTGAAAAGAGCAAATTATAAAGTGGAAGGAGTATTCATGCGAAATTGGGATAGCAATTACGAAGTTGGAAGCTGTTCTGATGAAAAAGACTTTGAAGATGCGTCATTCGTGTGCCGTAAGCTTGATATTCCTTTGCATAGAGTTCACTTCATAAAAGAATATTGGAATGATGTCTTCACAGTTCTTCTTAAGGAATATGAAACTGGTTTAACACCAAACCCAGATATACTGTGCAACAGATACATTAAATTTGATAGTTTCTTTGAACATTGCAGGAATAATTTAGAGGTTGATGCTATAGCAACTGGTCATTATGCTAATACATCATTTGGACCTTTCTTAGAGAATTATTTGGAAAATGAAGGTGTTAAATTGCTCCGTCCAGTAGATAAACACAAAGATCAAACATTTTTCCTGTCACAAGTTAAGCAATTCTCACTAAGGAAATGTATGTTCCCAATAGCGAATCTAATGAAAAGTGAAGTCAGAGACCTAGCAAGGAAAGAAGGTTTATTGCCAGTCGCCGACAAAAAAGATAGCACTGGTATTTGTTTTATTGGAAAAAGAAGATTTAAAGACTTTATTGATGATGTATCTGGTACTAGCCATCCAGCACTGTGGAACAACATCTGTATATCTGACAAACCTCATTGGATAAATGAAGTACCTGAGGAACTAAATGATAACAATGTACTGAATTGTACATTCAGATTTCAACATACCAAACCTCTAGAACCTTGTAGGATAGTAAATAATCCTGAAGGATTGACTATTATATTAAATAACAGTCTAAGAGCCCTGACAGAGGGCCAGTTTGCTTGTTTCTACAGAGGTGATGAATGCCTCGGAAGTGCTAAAATAAAACATGTTTGTCATAATCTAGTGTACTAA

Protein sequence:

>DPOGS206643-PA
MFKKIAVGISGGVDSSVAALLLKRANYKVEGVFMRNWDSNYEVGSCSDEKDFEDASFVCRKLDIPLHRVHFIKEYWNDVFTVLLKEYETGLTPNPDILCNRYIKFDSFFEHCRNNLEVDAIATGHYANTSFGPFLENYLENEGVKLLRPVDKHKDQTFFLSQVKQFSLRKCMFPIANLMKSEVRDLARKEGLLPVADKKDSTGICFIGKRRFKDFIDDVSGTSHPALWNNICISDKPHWINEVPEELNDNNVLNCTFRFQHTKPLEPCRIVNNPEGLTIILNNSLRALTEGQFACFYRGDECLGSAKIKHVCHNLVY-