Monarch geneset OGS2.0

DPOGS205624
TranscriptDPOGS205624-TA999 bp
ProteinDPOGS205624-PA332 aa
Genomic positionDPSCF300023 - 506358-507692
RNAseq coverage279x (Rank: top 39%)
Annotation
HeliconiusHMEL0062074e-15079.73% 
BombyxBGIBMGA001137-TA5e-13082.61% 
DrosophilaTs-PA4e-11763.96% 
EBI UniRef50UniRef50_P048181e-11970.38%Thymidylate synthase n=496 Tax=root RepID=TYSY_HUMAN
NCBI RefSeqNP_001040281.12e-14682.58%thymidylate synthase isoform 1 [Bombyx mori]
NCBI nr blastpgi|1140532794e-14582.58%thymidylate synthase isoform 1 [Bombyx mori]
NCBI nr blastxgi|1140532791e-14182.58%thymidylate synthase isoform 1 [Bombyx mori]
Group
Gene OntologyGO:00062313.7e-117dTMP biosynthetic process
GO:00047993.7e-117thymidylate synthase activity
KEGG pathwaycqu:CpipJ_CPIJ0113591e-123 
 K00560 (E2.1.1.45, thyA)maps-> One carbon pool by folate
    Pyrimidine metabolism
InterPro domain[45-332] IPR0234513.5e-139Thymidylate synthase/dCMP hydroxymethylase domain
[48-332] IPR0003983.7e-117Thymidylate synthase
Orthology groupMCL11505 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205624-TA
ATGTTCTTATTTTTTCGCGCCAAAACGTGCTTACAACTAAAATCATACAATTTGCTAAAGTGTATCTTTACTAAAGTGTATCTACCATTACCTCGATCAACAAATATGTCATTAAGCAATGGTTTTCATACACAACATGATGAATATCAATATCTAAACCTTATTAGAGATATTATTAAAACAGGTGACAAACGTAATGACAGAACAGGTGTTGGAACCTTATCTGTATTTGGATCAGTACAAAGGTATTCTTTAAAAAATAATACTCTGCCTTTACTTACAACAAAACGAGTTTTTGTCAGAGGAGTTATAGAAGAACTTTTGTGGATGATATCCGGTTCGACTGATAGTAAAAAACTAGCTAGCAAAGGTGTGCATATATGGGATGCGAATGGTTCCAGAGCTTTTCTTGATAATTTGGGGTTCAAAGATAGAGAAGAAGGTGATCTTGGTCCTGTTTATGGTTTTCAGTGGAGACACAGTGGTGCCAAATATATAGACGCTAAGACTGATTACACTGGACAAGGAGTAGATCAGCTCCAAAATGTCATTGACACTATCAAAAAAAACCCTGGTGATAGACGTATAATAATGTGCGCATGGATTCCTGCTGACCTGAGCTTAATGGCTCTCCCACCTTGTCACTGCCTTGCACAATTCTATGTATCCAATGGAAAGCTGAGTTGTCTTTTGTACCAAAGAAGTGCAGATATGGGTCTTGGAGTACCATTCAATATTGCCAGTTATTCATTGTTGACACATATGATTGCACATATCACAAATTTGGAGGCTGGTGAGTTTGTTCATACAACAGGAGATACCCATGTATATCTGAACCACATAGAACCATTAAATAAGCAACTTGAAAGAGAACCGAGACCCTTCCCAACTCTAGAGTTTAAACGCAAAATTGATTCAATTGATGACTTTAAGTATGAAGACTTTGTTGTTAAGAACTATAACCCTTACCCTAAGATAGACATGGAATTGGCTGTATAG

Protein sequence:

>DPOGS205624-PA
MFLFFRAKTCLQLKSYNLLKCIFTKVYLPLPRSTNMSLSNGFHTQHDEYQYLNLIRDIIKTGDKRNDRTGVGTLSVFGSVQRYSLKNNTLPLLTTKRVFVRGVIEELLWMISGSTDSKKLASKGVHIWDANGSRAFLDNLGFKDREEGDLGPVYGFQWRHSGAKYIDAKTDYTGQGVDQLQNVIDTIKKNPGDRRIIMCAWIPADLSLMALPPCHCLAQFYVSNGKLSCLLYQRSADMGLGVPFNIASYSLLTHMIAHITNLEAGEFVHTTGDTHVYLNHIEPLNKQLEREPRPFPTLEFKRKIDSIDDFKYEDFVVKNYNPYPKIDMELAV-