Monarch geneset OGS2.0

DPOGS215024
TranscriptDPOGS215024-TA957 bp
ProteinDPOGS215024-PA318 aa
Genomic positionDPSCF300256 + 280200-284099
RNAseq coverage1464x (Rank: top 9%)
Annotation
HeliconiusHMEL0148215e-14477.36% 
BombyxBGIBMGA012152-TA2e-14981.42% 
DrosophilaCG6084-PB2e-10461.76% 
EBI UniRef50UniRef50_D6WCD52e-10754.40%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WCD5_TRICA
NCBI RefSeqXP_969456.15e-13165.92%PREDICTED: similar to CG6084 CG6084-PA [Tribolium castaneum]
NCBI nr blastpgi|3286708731e-14274.53%aldo-keto reductase [Helicoverpa armigera]
NCBI nr blastxgi|3286708734e-13974.53%aldo-keto reductase [Helicoverpa armigera]
Group
Gene OntologyGO:00551143e-56oxidation-reduction process
GO:00164913e-56oxidoreductase activity
KEGG pathwaytca:6579431e-130 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[2-314] IPR0013951.1e-155Aldo/keto reductase
[8-314] IPR0232108.4e-124NADP-dependent oxidoreductase domain
[38-62] IPR0204713e-56Aldo/keto reductase subgroup
Orthology groupMCL10202 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215024-TA
ATGGCGTCCAAAAATGTTATGGTTAGGTTCAGTGATGGGAGAAAATACCCGCAGCTGGGTCTGGGCACTTGGAAGTCCAAACCTGGTGAAGTGAGTCAAGCGGTGAAAGATGCCATCGACATCGGCTACAGGCACATTGACTGCGCCTACGTGTACGGCAACGAGAAGGAAGTGGGAGACGCCATCACAGAGAAGATTAAGGAGGGAGTTGTCAAGAGGGAGGAGCTGTTCATAACGTCCAAACTGTGGAACACCTTCCACCGCCCGGACCTGGTGCGAGGAGCGCTCATGAAGAGTCTCCAGAACCTGAACTTGGACTACTTGGATCTGTACCTCATACACTGGCCTCAAGCCTATAAGGAGGACGGAGAATTGTTCCCCGTGGACGAGAGCGGTAAGATCCAATTCTCTGATGTGGACTACGTGGACACCTGGAAGGCGTTGGAGCCGCTCCAGGCCGAGGGACTGATTCGGAGCCTCGGAGTGTCCAACTTCAACTCGCGCCAGCTGGACCGAGTGCTGGAGTCCGCCAGCGTCAAGCCCGTCGTCAACCAGGTGGAATGCCACCCCTACTTAGTCCAGAAAAAGTTGAAGGAGTTCTGCGCGGCCCGAGGGGTGCTGCTCGCGGCCTACTCCCCGCTGGGCTCCCCGGACAGGCCCTGGGCCAAACCCGACGACCCACGGCTCCTGGACGACCCTCGACTGAAGGCAATAGCGGACAGGCTGGGCAGGACCGTGGCACAGGTGTTGATCAGGTATCAGCTGGAGCGCGGCAACATCGTGCTGCCCAAGTCGGTGACTCGCTCCCGGATCGAGTCCAACTTCGCAGTGATGGACTTCCAGCTGTCCAAGGCAGACCTGGAGCTGATTGACTCCTTCGACTGCAACGGACGCTTCGTGCCCATGACGGCGTCTCTCGGCCACAAACACCATCCGTTTGAGAACGACGCCTTCTGA

Protein sequence:

>DPOGS215024-PA
MASKNVMVRFSDGRKYPQLGLGTWKSKPGEVSQAVKDAIDIGYRHIDCAYVYGNEKEVGDAITEKIKEGVVKREELFITSKLWNTFHRPDLVRGALMKSLQNLNLDYLDLYLIHWPQAYKEDGELFPVDESGKIQFSDVDYVDTWKALEPLQAEGLIRSLGVSNFNSRQLDRVLESASVKPVVNQVECHPYLVQKKLKEFCAARGVLLAAYSPLGSPDRPWAKPDDPRLLDDPRLKAIADRLGRTVAQVLIRYQLERGNIVLPKSVTRSRIESNFAVMDFQLSKADLELIDSFDCNGRFVPMTASLGHKHHPFENDAF-