Monarch geneset OGS2.0

DPOGS209035
TranscriptDPOGS209035-TA1869 bp
ProteinDPOGS209035-PA622 aa
Genomic positionDPSCF300102 - 113836-119869
RNAseq coverage6312x (Rank: top 2%)
Annotation
HeliconiusHMEL0060980.084.30% 
BombyxBGIBMGA014211-TA0.083.58% 
DrosophilaCG8036-PC0.068.78% 
EBI UniRef50UniRef50_Q9H0I90.058.62%Transketolase-like protein 2 n=226 Tax=cellular organisms RepID=TKTL2_HUMAN
NCBI RefSeqNP_001040158.10.082.96%transketolase [Bombyx mori]
NCBI nr blastpgi|1140508330.082.96%transketolase [Bombyx mori]
NCBI nr blastxgi|1140508330.082.96%transketolase [Bombyx mori]
Group
Gene OntologyGO:00081523.9e-31metabolic process
GO:00038243.9e-31catalytic activity
KEGG pathwayaag:AaeL_AAEL0044340.0 
 K00615 (E2.2.1.1, tktA, tktB)maps-> Pentose phosphate pathway
    Biosynthesis of ansamycins
    Carbon fixation in photosynthetic organisms
InterPro domain[13-272] IPR0054749.3e-81Transketolase, N-terminal
[315-479] IPR0054754.4e-44Transketolase-like, pyrimidine-binding domain
[490-620] IPR0159413.9e-31Transketolase-like, C-terminal
[486-622] IPR0090141.4e-29Transketolase, C-terminal/Pyruvate-ferredoxin oxidoreductase, domain II
[494-612] IPR0054766.8e-24Transketolase, C-terminal
Orthology groupMCL10524 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209035-TA
ATGAAGGGTGAAAAGAACGTTGATATACAAGCTTTAAAAGATATCGCCAATAAGCTGAGGATCGACAGCATCGTTGCTACAAATGCATCAAAATCCGGTCACCCCACGTCATGTGCGTCCATGGCGGAGATCATGTCCGTGCTGTTCTTCCACACTATGAGGTACAAGGTCTCCGCACCCAAGGACCCCTCGGCCGACAGATTCATACTCTCTAAGGGTCACGCAGCGCCAATCCTATACGCCGCCTGGGCGGAGGCCGGCCTGTTCCCTCTGGACGACCTGAAGAACCTCCGCAAATTGACCTCCGACCTCGAGGGTCACCCCACGCCCAGGCTGAACTTCGTAGACGTCGGCACCGGCTCGCTGGGTCAGGGGCTGGCGGTCGCCGCCGGTATGGCGTACGTCGGGAAATACTTCGACCAGGCGCCGTACAGGGTGTACTGTCTGGTGGGTGACGGCGAGGCGGCCGAGGGCAGTGTGTGGGAGGCGCTGCACTTCGCAGGTCACTACAAGTTGGACAACCTGGTCGTGGTGTTCGACGTCAACCGCCTGGGACAGTCCGAGCCCACCTCGCTCCAGCATCAGATGGATGTGTACAAGGCTCGCCTCCAAGCGTTCGGATCTCACACGCTGGTCGTGGACGGACATGACGTCACGGAGCTCGTGAAGGCCTTCGACGAAGCCGCCAACACCAGCGGACGACCCACTGCCATCGTCGCCAAGACATACAAAGGAAAAGGGTTCCCCGGGATAGAGGATAAGGACAACTGGCACGGGAAGGCGCTAGGAGCTGACGGAGAGAAGATCATTAAGCACCTCCAGTCGCTGATGAAGTCCCAGTCCGTATCGTTGAAGCCCCGGGCTCCGCTAGCGGCCGCGCCCCGGGTCCACCTGGAGGACCTTACGCTGTCCTCTCCGCCAGCATACAAGCTCGGGGAGCTGGTCGCCACCCGCCTGGCTTACGGACACGGACTCAAGAAACTCGCCGACAATAACCAGAGAGTAATCGCCTTGGACGGGGACACCAAGAACTCCACATTCAGTGACAAACTTCGTAACGCCTACCCCGACAGATATATCGAATGTTTCATCGCGGAGCAGACGCTCGTGGGTGTGGCGACCGGCGCCGCGTGTCGAGACCGCGCCGTGGTGTTCGCCTCCACCTTCGCCGCCTTCTTCACTAGGACCTTCGACCAGATCCGCATGGGCGCCATCAGTCAGAGCAACATGAACCTGGTGGGGTCTCACTGTGGCGTAAGCATCGGAGAGGACGGACCCTCGCAGATGGGGCTCGAGGACCTGGCCATGTTCCGCGCCGTTCCCACCGCCACTGTCTTCTATCCCTCTGACGCGGTGAGCACGGAGCGCGCGGTGGAGCTGGCGGCCGGCACGCGCGGCATCTGTTACATCCGCACCTCGAGACCGAACACGCCGGTTCTGTACGAAAACGACGCCGTCTTCAAGGTGGGCGAGGCCCGCGTGGTGGTGCAGTCTGCCGCGGACCAGGCGCTCGTCATCGGAGCAGGCGTCACCTTACACGAGGCGATGGCGGCCGTGGAGTCTCTCCGGGCGGAAGGTGTGTCGGTGCGCGTGATGGATCCCTTCACCATCAAGCCGCTGGACGAGGCGGCGGTGCGCGCGCACGCGGCGGCGGTCGGCGGACGAGTGGTGGTGGTCGAGGATCACTACCAGGCCGGTGGTCTGGGCGAGGCGGTGATGTCGGCCCTGGCTCTGGTCCGAGGGTCGGTGGTCCGTCACCTGTGTGTCCGCGAGGTTCCTCGCTCGGGCGCTCCTCAGGAGCTGTTGGACCACTACGGCCTGTCCGCCAGACACGTGGCCGCCGCCATCAGGGAGATATTGAAGGCTTAG

Protein sequence:

>DPOGS209035-PA
MKGEKNVDIQALKDIANKLRIDSIVATNASKSGHPTSCASMAEIMSVLFFHTMRYKVSAPKDPSADRFILSKGHAAPILYAAWAEAGLFPLDDLKNLRKLTSDLEGHPTPRLNFVDVGTGSLGQGLAVAAGMAYVGKYFDQAPYRVYCLVGDGEAAEGSVWEALHFAGHYKLDNLVVVFDVNRLGQSEPTSLQHQMDVYKARLQAFGSHTLVVDGHDVTELVKAFDEAANTSGRPTAIVAKTYKGKGFPGIEDKDNWHGKALGADGEKIIKHLQSLMKSQSVSLKPRAPLAAAPRVHLEDLTLSSPPAYKLGELVATRLAYGHGLKKLADNNQRVIALDGDTKNSTFSDKLRNAYPDRYIECFIAEQTLVGVATGAACRDRAVVFASTFAAFFTRTFDQIRMGAISQSNMNLVGSHCGVSIGEDGPSQMGLEDLAMFRAVPTATVFYPSDAVSTERAVELAAGTRGICYIRTSRPNTPVLYENDAVFKVGEARVVVQSAADQALVIGAGVTLHEAMAAVESLRAEGVSVRVMDPFTIKPLDEAAVRAHAAAVGGRVVVVEDHYQAGGLGEAVMSALALVRGSVVRHLCVREVPRSGAPQELLDHYGLSARHVAAAIREILKA-