Monarch geneset OGS2.0

DPOGS212408
Transcript: DPOGS212408-TA, 1374 bp
Protein: DPOGS212408-PA, 457 aa
Genomic position: DPSCF300258 - 165737-167895
RNAseq coverage: 184x (Rank: top 49%)
Annotation
Heliconius: HMEL008639 | 0.0 | 70.97%
Bombyx: BGIBMGA002808-TA | 2e-105 | 90.82%
Drosophila: CG8078-PA | 6e-153 | 77.37%
EBI UniRef50: UniRef50_Q7JWW5 | 9e-151 | 77.37% | Cytoplasmic tRNA 2-thiolation protein 1 n=82 Tax=root RepID=CTU1_DROME
NCBI RefSeq: XP_001959100.1 | 2e-152 | 77.68% | GF12710 [Drosophila ananassae]
NCBI nr blastp: gi|194753600 | 5e-151 | 77.68% | GF12710 [Drosophila ananassae]
NCBI nr blastx: gi|91093715 | 2e-147 | 79.69% | PREDICTED: similar to CG8078 CG8078-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [46-225] IPR014729 | 1.5e-20 | Rossmann-like alpha/beta/alpha sandwich fold
                 [53-229] IPR011063 | 3.9e-18 | tRNA(Ile)-lysidine/2-thiocytidine synthase
Orthology group: MCL12370 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212408-TA
ATGCCTGTACTATGCAAAGCAGGATGTGGAAAAAATGCTATGTTAAAGCGTCCTAAAACGGGGGATACTCTATGTAAAGAATGCTTTTACGAAGCCTTTGAAACAGAAATCCATTTTACAATAACAAAAGCAGAATTATTTAATAGAGGAGATTCTGTCGCCATCGCCGCCTCTGGTGGCAAGGATTCAACCGTATTGGCACATGTACTTAAAACATTAAATCAAAGATATGACTATGGACTTAATCTTATGTTGCTGTCCATAGACGAGGGCATAACCGGCTACAGAGATGACAGCTTGGAAACAGTCAAACAAAACAGAGATGATTACGAGATGAATCTCAAAATATTATCATATAAAGATTTATATGGTTGGACCATGGACGAAATTGTAGCCCAAATCGGAAGAAAGAATAATTGTACATTCTGTGGAGTATTCAGGAGGCAAGCTTTGGATAGAGGTGCCGCCATGCTTAATGTGAAATGTATAGCAACAGGACACAATGCTGATGACATAGCAGAGACGGTGCTGATGAATGTGCTGAGAGGAGACATAGCTCGGCTCAAGAGATGCACTGCTATATCTACTGGCAGCGAGGGCACAATTCCGAGAGTGAAGCCGTTGAAGTATACGTATGAAAAGGAGATTGTTATGTACGCTCATTACAAGAAGCTGGTGTACTTCTCAACAGAATGTGTGTTTGCTCCAAACGCCTACAGAGGTCATGCTAGGGCTCTGTTAAAAGATCTGGAAAAAATTAGACCTACTTGCATTATGGATATCATATACTCAGGCGAAACAATGGCTGTGAAAGAGGAAGTGTCACTGCCCACACAGAGAATTTGCACGAGATGCAAATTTGTCTCCTCTCAAGAGGTATGCAAGGCTTGCGTTCTTCTGGAAGGATTGAACAAGGGTTTACCAAAACTTGGCATTGGAAAGAGTTCCAAAGCCAAGAAGATGCTAGAAGAATACAACGCAAACCAAAATAGTACGAATAAAGCTATCGATGAAATTAATGTCGACTGCCAGAAAAATAATTGTGTCTCTAGAGGAAAAGCGTGCAGGTCGAATCGAAATAAAACAAATGATAACGAAGTCAACAGCCGAAACGGAGAAAAGTGCTGTAGTACACAGGAAAAGACACATGACAGTGCTAATATAAGCAATACTAAATTAAACACACTTTTACAAGACTACGGCATACCAGAAAATGATTGTGGACACGATAAAAACTCATCAATAGGGGAAAGTTCGGAAAATCATAATTTCGAAGTCGATTTACACAACGAAGATGTTACATCCTTAGCAGAGGAGACTGACGCGTGTGGCGGAGCATGCGGCAAGATGGACTCCATGCATATAGGTTTCTGA

Protein sequence:

>DPOGS212408-PA
MPVLCKAGCGKNAMLKRPKTGDTLCKECFYEAFETEIHFTITKAELFNRGDSVAIAASGGKDSTVLAHVLKTLNQRYDYGLNLMLLSIDEGITGYRDDSLETVKQNRDDYEMNLKILSYKDLYGWTMDEIVAQIGRKNNCTFCGVFRRQALDRGAAMLNVKCIATGHNADDIAETVLMNVLRGDIARLKRCTAISTGSEGTIPRVKPLKYTYEKEIVMYAHYKKLVYFSTECVFAPNAYRGHARALLKDLEKIRPTCIMDIIYSGETMAVKEEVSLPTQRICTRCKFVSSQEVCKACVLLEGLNKGLPKLGIGKSSKAKKMLEEYNANQNSTNKAIDEINVDCQKNNCVSRGKACRSNRNKTNDNEVNSRNGEKCCSTQEKTHDSANISNTKLNTLLQDYGIPENDCGHDKNSSIGESSENHNFEVDLHNEDVTSLAEETDACGGACGKMDSMHIGF-
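
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1) and follows this record's convention of writing the stop codon as "-". The names translate and expected_cds_length are hypothetical helpers introduced here for illustration only.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
# Assumption: the record was translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    at the end of the DPOGS212408-PA sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Nucleotides needed for protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First seven codons of DPOGS212408-TA.
    print(translate("ATGCCTGTACTATGCAAAGCA"))  # MPVLCKA
    # Last four codons of DPOGS212408-TA, including the TGA stop.
    print(translate("ATAGGTTTCTGA"))  # IGF-
    # 457 aa plus a stop codon should account for the full transcript.
    print(expected_cds_length(457))  # 1374
```

With the two fragments taken from the sequences above, the translation reproduces the start (MPVLCKA) and end (IGF-) of the protein record, and the 457 aa protein plus one stop codon accounts for the full 1374 bp transcript. Passing the complete DPOGS212408-TA sequence to translate would allow a residue-by-residue comparison with DPOGS212408-PA.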