Monarch geneset OGS2.0

DPOGS205846
TranscriptDPOGS205846-TA954 bp
ProteinDPOGS205846-PA317 aa
Genomic positionDPSCF300081 + 126157-127110
RNAseq coverage45x (Rank: top 71%)
Annotation
HeliconiusHMEL0099283e-11462.15% 
BombyxBGIBMGA010866-TA4e-11562.15% 
DrosophilaCG5955-PA5e-1323.65% 
EBI UniRef50UniRef50_G6D6T80.0100.00%NAD-dependent epimerase/dehydratase n=11 Tax=cellular organisms RepID=G6D6T8_DANPL
NCBI RefSeqXP_320583.35e-1523.34%AGAP011948-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2933930903e-10658.04%NAD-dependent epimerase/dehydratase [Serratia odorifera DSM 4582]
NCBI nr blastxgi|2933930904e-10158.60%NAD-dependent epimerase/dehydratase [Serratia odorifera DSM 4582]
Group
Gene OntologyGO:00054881.1e-29binding
GO:00442371.1e-19cellular metabolic process
GO:00038241.1e-19catalytic activity
GO:00506621.1e-19coenzyme binding
KEGG pathwaybpd:BURPS668_A20183e-79 
 K00043 (E1.1.1.61)maps-> Butanoate metabolism
InterPro domain[1-176] IPR0160401.1e-29NAD(P)-binding domain
[3-194] IPR0015091.1e-19NAD-dependent epimerase/dehydratase
Orthology groupMCL20950 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205846-TA
ATGAACGTAGTTATAACTGGTGCTTCGGGATTTTTAGGCTCTCGATTAACGGAAGCACTTCTTTCAGAAAATTCAGCAGTTCCGGTGGCTAAGCTGTTATTAGCAGATGTCGTGCGTCCACAACCGCACCTAGATCCCAGAGTATCTACAGTGGCTATTGATTTAAACCAACCTTCAGCTGCAAATTTAATTATAAAGCCAGATGTTCACATACTATTTCATCTTGCAGCAATAGTCAGTGGTCAAGCAGAAGCAGACTTCGATCTCGGTTTAAGTGTCAATTTCGACGCAACACGAGCTCTCGCTGATGCAGCCCGACACCAGGTCCCTTCTCTACGCTTCATATTTACCAGCACAGTTGGCGCGTTCGGTGGAAATTTGCCACCAATCATAGATGACCTGACGGCTGTTACTCCTCAGAATTCATATGGTTCTCAAAAAGCTATGTGTGAACTATTGCTCAATGATTATGGACGTCGGGGTTTCATGGATGCTCGTATTGTACGCCTTCCCACAATCAGTATAAGGGCAGGAGTGGCGAATAAAGCGGTCACTGCATTCGCTAGTGGAATAATAAGAGAACCTTTGAATGGACTGGAGAGCATTTGCCCAGTGGATAGGGATCTTAAATTATGGTTGTCTAGTCCGAATACGGTGATCAGAAATATTATACACGCCGCTACTCTGCCGAAGGATGTTTTGGGCCCTTGGCGCGTCATAAACTTACCAGGAATAAGTGTTTCGGTGGATGAGATGATAAAAGCACTGCATACAGTGGCAGGTGATAAAGCGACGTCACTTATTCGCTTTGAGCACAATGAGCTGATCGCGCGCATCGTTGGCAGTTTTCCGAATAAATTTGATAACACACGAGCATTTGGCATGGGATTCATTGCTGATGAAAATTTCGAACAAATGATCCGCATGTACATAAGAGACGATCTGAAGCGTTGA

Protein sequence:

>DPOGS205846-PA
MNVVITGASGFLGSRLTEALLSENSAVPVAKLLLADVVRPQPHLDPRVSTVAIDLNQPSAANLIIKPDVHILFHLAAIVSGQAEADFDLGLSVNFDATRALADAARHQVPSLRFIFTSTVGAFGGNLPPIIDDLTAVTPQNSYGSQKAMCELLLNDYGRRGFMDARIVRLPTISIRAGVANKAVTAFASGIIREPLNGLESICPVDRDLKLWLSSPNTVIRNIIHAATLPKDVLGPWRVINLPGISVSVDEMIKALHTVAGDKATSLIRFEHNELIARIVGSFPNKFDNTRAFGMGFIADENFEQMIRMYIRDDLKR-