Monarch geneset OGS2.0

DPOGS203073
Transcript: DPOGS203073-TA (1137 bp)
Protein: DPOGS203073-PA (378 aa)
Genomic position: DPSCF300294 - 28605-31987
RNAseq coverage: 484x (Rank: top 26%)
Annotation
Heliconius      HMEL011729        8e-38   67.52%
Bombyx          BGIBMGA007768-TA  0.0     86.77%
Drosophila      CG7979-PA         5e-161  71.24%
EBI UniRef50    UniRef50_Q7QAZ6   2e-162  72.43%  AGAP004268-PA n=38 Tax=cellular organisms RepID=Q7QAZ6_ANOGA
NCBI RefSeq     XP_001651838.1    9e-164  74.25%  dtdp-glucose 4-6-dehydratase [Aedes aegypti]
NCBI nr blastp  gi|157112666      2e-162  74.25%  dtdp-glucose 4-6-dehydratase [Aedes aegypti]
NCBI nr blastx  gi|347971590      2e-158  72.43%  AGAP004268-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0005488  5e-74    binding
                 GO:0044237  1.9e-45  cellular metabolic process
                 GO:0003824  1.9e-45  catalytic activity
                 GO:0050662  1.9e-45  coenzyme binding
KEGG pathway     dme:Dmel_CG7979  4e-159
                 K01710 (E4.2.1.46, rfbB, rffG) maps->
                     Streptomycin biosynthesis
                     Polyketide sugar unit biosynthesis
                     Biosynthesis of vancomycin group antibiotics
InterPro domain  [51-243]  IPR016040  5e-74    NAD(P)-binding domain
                 [53-277]  IPR001509  1.9e-45  NAD-dependent epimerase/dehydratase
Orthology group  MCL13759  Single-copy universal gene

Nucleotide sequence:

>DPOGS203073-TA
ATGGATTCAGAAAGCAATGTTAATAAAGATAAGGAAGAATTGAAAGAAGCCAAAACTAGAATCGAACAGCTGGAGAAGAAAATAGCAATTCTGGAGGGAAGAATACCAGAAAAGTATCCTGATGTTAAATATCTTGGATACAAGGAAAGAAAGAGAATATTGATCACAGGGGGAGCTGGTTTTGTTGGATCACATCTTGTTGACATCCTCATGATACAAGGACACGAAGTCATAGTTGTGGACAACTTCTTCACAGGCAGAAAACGGAATGTGGAACACTGGTTTGGACACAGACACTTTGAAATGATTCATCACGACATTGTGAATCCCTTATATGTGGAGGCCGATGAAATATATCATTTAGCGAGTCCTGCTAGTCCACCACACTATATGCAGAATCCGGTCAAAACAATTAAAACAAATACTTTGGGCACAATCAATATGCTTGGTTTGGCAAGGAGAGTCGGTGCAAAAATCCTAATAGCCAGTACATCAGAGGTTTATGGTGACCCCATGGTACATCCACAACCAGAATCATATTGGGGTCATGTCAATCCTATAGGACCACGCGCGTGTTACGATGAGGGCAAAAGAGTGGCCGAAACACTTGCGTACTCATACGCTAAGCAGGAGAATGTGTCGGTACGTGTTGCGAGGATATTTAACACGTATGGACCAAGAATGCATGTCTCTGACGGAAGGGTGGTGTCAAACTTCGTTATGCAGGCATTACAGAACCTTACAATTACTGTTTACGGCAACGGTAAACAGACGCGTTCATTTTGTTACGTGTCAGATCTAGTCGATGGTCTCATAGCTTTGATGGCGTCTAGCTACACGTTACCTGTCAATTTGGGCAATCCCGTAGAACACACTATAGAAGGCGATTACGTTGCACTGGTTAACTTAGTCCCTGGTTGTCGCAGTACAGTGGCGACGGGCGCTGCAGTAGAAGACGATCCTCAGCGCCGCAGACCAGACATCACGTTAGCCAACACACATCTCAAATGGAAGCCAAAGGTTTCATTAGAAGAGGGACTCCAAAGAACAATAGAATACTTCAGAGAAGAATTATCCAGAACAACATTTTATAATAATCAGACATATATTGATGTTAAGGTAAAAAACAAACACTAA

Protein sequence:

>DPOGS203073-PA
MDSESNVNKDKEELKEAKTRIEQLEKKIAILEGRIPEKYPDVKYLGYKERKRILITGGAGFVGSHLVDILMIQGHEVIVVDNFFTGRKRNVEHWFGHRHFEMIHHDIVNPLYVEADEIYHLASPASPPHYMQNPVKTIKTNTLGTINMLGLARRVGAKILIASTSEVYGDPMVHPQPESYWGHVNPIGPRACYDEGKRVAETLAYSYAKQENVSVRVARIFNTYGPRMHVSDGRVVSNFVMQALQNLTITVYGNGKQTRSFCYVSDLVDGLIALMASSYTLPVNLGNPVEHTIEGDYVALVNLVPGCRSTVATGAAVEDDPQRRRPDITLANTHLKWKPKVSLEEGLQRTIEYFREELSRTTFYNNQTYIDVKVKNKH-
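
The transcript and protein records above can be checked against each other: 1137 bp is 379 codons, that is 378 amino acids plus one stop codon, which this record writes as a trailing "-". The following is a minimal sketch of such a check, not the pipeline used to build the geneset. It assumes the standard genetic code (NCBI translation table 1) and a coding sequence that starts in frame; the names `translate`, `CODON_TABLE`, and `stop_symbol` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions. Assumption: the
# geneset was translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into a protein string.

    Stop codons are written as `stop_symbol` ("-" matches this record's
    convention). Codons containing anything other than A/C/G/T become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons of DPOGS203073-TA.
print(translate("ATGGATTCAGAAAGC"))  # prints MDSES
```

Passing the full DPOGS203073-TA sequence to `translate` and comparing the result with the DPOGS203073-PA string is a quick way to confirm that the two FASTA records agree.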