Monarch geneset OGS2.0

DPOGS210330
Transcript         DPOGS210330-TA   1464 bp
Protein            DPOGS210330-PA   487 aa
Genomic position   DPSCF300025 - 621774-624872
RNAseq coverage    81x (Rank: top 64%)
Annotation
Heliconius         HMEL013847         1e-117   57.30%
Bombyx             BGIBMGA011967-TA   1e-99    64.52%
Drosophila         (no hit listed)
EBI UniRef50       UniRef50_Q4T049    2e-58    30.59%   Xylose isomerase n=4 Tax=Chordata RepID=Q4T049_TETNG
NCBI RefSeq        XP_001633389.1     1e-60    33.84%   predicted protein [Nematostella vectensis]
NCBI nr blastp     gi|340377122       3e-70    33.73%   PREDICTED: xylose isomerase-like [Amphimedon queenslandica]
NCBI nr blastx     gi|340377122       3e-67    33.73%   PREDICTED: xylose isomerase-like [Amphimedon queenslandica]
Group
Gene Ontology      GO:0005975   1.3e-17   carbohydrate metabolic process
                   GO:0009045   1.3e-17   xylose isomerase activity
KEGG pathway       cin:100179799   2e-65
                   K01805 (xylA) maps to: Pentose and glucuronate interconversions
                                          Fructose and mannose metabolism
InterPro domain    [21-253]    IPR013022   6.9e-60   Xylose isomerase-like, TIM barrel domain
                   [110-132]   IPR001998   1.3e-17   Xylose isomerase
Orthology group    MCL25906   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210330-TA
ATGTCCCACGTCCAAGCCGGGAAAAGACAGAAAACTAGGGATGTAAGGGCTGAAAACATTGACTACTTTCAAGGCATAGACAGAATAGAGTACAACAACATGGCCACTGTAACAGACACGGCTTACTATCGCCATTACAACAGCGGAGAGAAAATACTGTCCAAAGATATGGAAGAATGGCTTAAATATAGTGTATCCTTCACAGAATTTAAATATGACGGTTCAGATGCTCAAGGAAGACCTACTTTTAACCGGCAATGGGATGATCACACAAACACTATAGATAATTGCAAACGTTGTATCAGAGCTTTCTACGACTTCTGTACAAAGCTCGGAGTGAAGTACTGGACCGCGTTTGATAATGATCTAGTGCCACAAACAGACAACTGGGATGAAAACAAAAGTAATTGGGATGAAATCACTGATTATATAACGGAAATGGCGCAAAAAACACAGATCAAATTGTTATGGATGGCGCCCGACTTGCATTCCCATCAGAGATATTCGTCAGGAGCTTTCACGAGTAACGAAGCGACAACTTTCTTGCAAGCAGCCAGTCAGGTCAAGAAATGTCTGGAAGTGTCTCAGCGTCTGAACGCCGAATGCTTCCTCCTCTGGCCGTACAGGGAGGGCTACGACGCTGTGTTCCAAACAGACGTCGCCAGAGAGATCAAATTGTTCGCTAAACTTTTAAAGATAACAGCGGAATACAGGGACAGGTTGAGTTATAAATGTCAGCTGCTGCTGATGCCGTACCCGAGCTTTGGTAAGAACTTCAGAAGTACGGAGATATGGCGACCGAGATGTGACTTTGATAGCGATACACTGAATAGGTACATGTGGGACGTGACCAGCTGCTTGTACTTCCTGAAGTTCCATAGTTTAGATCGCTATTATAAAGTGTGCTCTCCTCCAGGACAACATGTCTATTTAGCTGGAGTTTACAACGCGTTCGGAGGAGTGACAATGACTAATACATTCGATCCCTCGGACATTAAGACTATAACGCTCATGTTGAAATGTATTATTGATCAAGGTTCCGCCCCCCCGGGTGGGATATCCCTGCGAGTGTCGTGTCCCCGCGGCGGTACCATCCGTGACCAGCTAGCGATGTACATTAATTATATCGACGAGTGCGCCAGAGGACTCAGGGTCGCAGCCGCTGTGCTCGCAGAACAGGTGTTCGTGAAGCATGTACAGCAACGTTACTCCTCATACTACAGCGGCTTCGGAGCTCGACTAGTGAGCGGAGACGTGTCTATGGAAGAGTGTAAGCATGTACAGCAACGTTACTCGTCATACTACAGCGGCTTCGGAGCTCGACTCGTGAGTGGAGACGTGTCTATGGAAGAGTGTGAAGAATTATACAAGAAAAATCAAGCGCAAACTGAAATACAAAGCGGGAGGCGCGGGAACTACGAATTGGTCTTCCAACGATACCTAGACGCGTGCGACCACGTGTGA

Protein sequence:

>DPOGS210330-PA
MSHVQAGKRQKTRDVRAENIDYFQGIDRIEYNNMATVTDTAYYRHYNSGEKILSKDMEEWLKYSVSFTEFKYDGSDAQGRPTFNRQWDDHTNTIDNCKRCIRAFYDFCTKLGVKYWTAFDNDLVPQTDNWDENKSNWDEITDYITEMAQKTQIKLLWMAPDLHSHQRYSSGAFTSNEATTFLQAASQVKKCLEVSQRLNAECFLLWPYREGYDAVFQTDVAREIKLFAKLLKITAEYRDRLSYKCQLLLMPYPSFGKNFRSTEIWRPRCDFDSDTLNRYMWDVTSCLYFLKFHSLDRYYKVCSPPGQHVYLAGVYNAFGGVTMTNTFDPSDIKTITLMLKCIIDQGSAPPGGISLRVSCPRGGTIRDQLAMYINYIDECARGLRVAAAVLAEQVFVKHVQQRYSSYYSGFGARLVSGDVSMEECKHVQQRYSSYYSGFGARLVSGDVSMEECEELYKKNQAQTEIQSGRRGNYELVFQRYLDACDHV-
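
The transcript and protein entries in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the database itself, that translates a coding sequence with the standard genetic code and writes the stop codon as "-", as the protein sequence in this record does. The names translate and expected_cds_length are hypothetical helpers introduced here for illustration, and the sketch assumes the transcript is a complete coding sequence containing only A, C, G and T.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position
# (third base varies fastest). "*" marks the three stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol. Raises ValueError if the
    length is not a multiple of three, and KeyError on any codon that
    contains a character other than A, C, G or T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Nucleotides needed to encode protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


# First 30 nt and last 18 nt of DPOGS210330-TA, copied from the record above.
print(translate("ATGTCCCACGTCCAAGCCGGGAAAAGACAG"))  # MSHVQAGKRQ
print(translate("GCGTGCGACCACGTGTGA"))              # ACDHV-
print(expected_cds_length(487))                     # 1464
```

The two fragments translate to the first ten and last six characters of DPOGS210330-PA, and 487 residues plus one stop codon account for the 1464 bp listed for the transcript.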