Monarch geneset OGS2.0

DPOGS202385
Transcript        DPOGS202385-TA    978 bp
Protein           DPOGS202385-PA    325 aa
Genomic position  DPSCF300104 + 559593-562050
RNAseq coverage   409x (Rank: top 30%)
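The lengths listed above can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the database record: the variable names are illustrative, and it assumes that the transcript is coding sequence only with a single terminal stop codon and that the genomic coordinates are inclusive.

```python
# Values copied from the record; variable names are illustrative only.
transcript_bp = 978
protein_aa = 325
genomic_start, genomic_end = 559593, 562050

# Assumption: the transcript is pure coding sequence, so its length
# should be a whole number of codons.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3

# Assumption: exactly one terminal stop codon that is not translated.
coding_codons = codons - 1

# Assumption: coordinates are inclusive on both ends.
genomic_span_bp = genomic_end - genomic_start + 1

# Bases within the genomic span that are not part of the transcript.
non_transcript_bp = genomic_span_bp - transcript_bp

print(f"codons (incl. stop): {codons}")
print(f"coding codons:       {coding_codons} (record lists {protein_aa} aa)")
print(f"genomic span:        {genomic_span_bp} bp")
print(f"outside transcript:  {non_transcript_bp} bp")
```

Under these assumptions the codon count agrees with the listed protein length, and the genomic span is longer than the transcript, as would be expected for a multi-exon gene model.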
Annotation
Heliconius      HMEL009532        9e-60   76.69%
Bombyx          BGIBMGA014492-TA  4e-158  78.88%
Drosophila      Gmer-PA           1e-114  59.44%
EBI UniRef50    UniRef50_F6ZAF0   4e-119  63.16%  Uncharacterized protein n=9 Tax=root RepID=F6ZAF0_MONDO
NCBI RefSeq     XP_967089.1       6e-134  68.83%  PREDICTED: similar to nad dependent epimerase/dehydratase [Tribolium castaneum]
NCBI nr blastp  gi|91085271       1e-132  68.83%  PREDICTED: similar to nad dependent epimerase/dehydratase [Tribolium castaneum]
NCBI nr blastx  gi|91085271       5e-129  68.83%  PREDICTED: similar to nad dependent epimerase/dehydratase [Tribolium castaneum]
Group
Gene Ontology   GO:0005488  6.6e-62  binding
                GO:0044237  2.9e-56  cellular metabolic process
                GO:0003824  2.9e-56  catalytic activity
                GO:0050662  2.9e-56  coenzyme binding
KEGG pathway    tca:655452  2e-133
                K02377 (E1.1.1.271, fcl) maps-> Amino sugar and nucleotide sugar metabolism
                                                Fructose and mannose metabolism
InterPro domain  [8-242]  IPR016040  6.6e-62  NAD(P)-binding domain
                 [8-245]  IPR001509  2.9e-56  NAD-dependent epimerase/dehydratase
Orthology group  MCL12134  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202385-TA
ATGAGTGAAGATAAGAGCGTTATACTCGTCACCGGTGGTTCAGGGCTAGTGGGACGAGCAATAAGGACTGTTATAGAAGAAGAGAGAAAATTAGACAGTAAACTTGTTAGAAATGAGACTTGGATTTTTTGTGGCTCCAACGATGGGGATTTGAGAGATAAAGCCCAAACTGAGGCATTGTTTGAAAAACACAGACCAACTCATGTAATACATTTAGCGGCCATGGTAGGAGGCCTATTTCACAACATGGCTCATAATTTGGATTTCTTTAGAGAAAACATGTCTATAAATGATAATGTTTTGAATGTGAGCTACAATTATAAAGTTAAAAAAGTAGTGTCCTGTCTCTCGACATGCATATTCCCTGATAAGGTCACCTATCCCATTGATGAGACTATGATACACAATGGTCCACCCCATCCATCCAACTATGGCTACAGTTATGCAAAGAGGATGGTTGATATTCTGAACAGAGGGTATCACTCTCAACACGGTTGCATGTTCACATCAGTGATACCATGTAATGTTTTTGGACCTCATGATAACTTTTCTCTGGAGTCGAGTCATGTCATCCCTGCCTTGATACGGAGAATGGATGATACTATTAAAAAGGGTGAGCCAACCTTCACTGTACTGGGGAGTGGAAAACCGTTGCGCCAGTTTATCTACTCCCTGGACCTGGCCAAGCTGTTTGTGTGGGTGTTGAGGAATTACAATAGCGTGGAACCAATCATCTTGTCAGTTGATGAAGAAGATGAAGTCACGATCAGTCGAGTGGCGGAGGCGATAAAGGAAGCTCACGGTTATCAAGGAGAGATTGTATATGACACAAGCAAGGCGGACGGACAACACAAGAAAACAGCTTCAAACATGAAACTGAGATCGCTGTACAAAGAATTCAACTTCACACCTTTTGATAAAGCTATTAAGGATACGGTTGTCTGGTTCAGGGACAACCGAGACCGAGCTCGGTTATAG

Protein sequence:

>DPOGS202385-PA
MSEDKSVILVTGGSGLVGRAIRTVIEEERKLDSKLVRNETWIFCGSNDGDLRDKAQTEALFEKHRPTHVIHLAAMVGGLFHNMAHNLDFFRENMSINDNVLNVSYNYKVKKVVSCLSTCIFPDKVTYPIDETMIHNGPPHPSNYGYSYAKRMVDILNRGYHSQHGCMFTSVIPCNVFGPHDNFSLESSHVIPALIRRMDDTIKKGEPTFTVLGSGKPLRQFIYSLDLAKLFVWVLRNYNSVEPIILSVDEEDEVTISRVAEAIKEAHGYQGEIVYDTSKADGQHKKTASNMKLRSLYKEFNFTPFDKAIKDTVVWFRDNRDRARL-