Monarch geneset OGS2.0

DPOGS214630
Transcript         DPOGS214630-TA   978 bp
Protein            DPOGS214630-PA   325 aa
Genomic position   DPSCF300050 + 432718-434712
RNAseq coverage    149x (Rank: top 54%)
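
The genomic interval is wider than the transcript, which can be checked with a little arithmetic. The sketch below is only illustrative: `parse_position` is a hypothetical helper, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
# Hypothetical helper (not part of the database): split a position string
# of the form "<scaffold> <strand> <start>-<end>" into its parts.
def parse_position(text):
    scaffold, strand, coords = text.split()
    start, end = (int(x) for x in coords.split("-"))
    return scaffold, strand, start, end

scaffold, strand, start, end = parse_position("DPSCF300050 + 432718-434712")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
genomic_span = end - start + 1

transcript_length = 978  # bp, from the Transcript row of this record

# Genomic bases inside the interval that are absent from the transcript.
non_transcript = genomic_span - transcript_length

print(scaffold, strand, genomic_span, non_transcript)
```

Under that assumption the interval spans 1995 bp, 1017 bp more than the 978 bp transcript, presumably intronic sequence.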
Annotation
Heliconius       HMEL009913         2e-91    74.63%
Bombyx           BGIBMGA005047-TA   4e-144   74.61%
Drosophila       CG2767-PB          8e-75    44.38%
EBI UniRef50     UniRef50_E0VSM5    8e-92    53.94%   Aldose reductase, putative n=3 Tax=Pancrustacea RepID=E0VSM5_PEDHC
NCBI RefSeq      XP_001600536.1     1e-104   58.86%   PREDICTED: similar to CG2767-PA [Nasonia vitripennis]
NCBI nr blastp   gi|270010290       3e-103   57.14%   hypothetical protein TcasGA2_TC009671 [Tribolium castaneum]
NCBI nr blastx   gi|270010290       6e-101   57.14%   hypothetical protein TcasGA2_TC009671 [Tribolium castaneum]
Group
Gene Ontology     GO:0055114   4.3e-45   oxidation-reduction process
                  GO:0016491   4.3e-45   oxidoreductase activity
KEGG pathway      dme:Dmel_CG2767   6e-73
                  K00002 (E1.1.1.2, adh) maps-> Glycolysis / Gluconeogenesis
                                                Glycerolipid metabolism
                                                Caprolactam degradation
InterPro domain   [1-314]   IPR001395   2.1e-109   Aldo/keto reductase
                  [1-295]   IPR023210   4.9e-94    NADP-dependent oxidoreductase domain
                  [23-47]   IPR020471   4.3e-45    Aldo/keto reductase subgroup
Orthology group   MCL17781   Insect specific

Nucleotide sequence:

>DPOGS214630-TA
ATGCCTACATTAGGTCTGGGTACCTGGCAGGCTTCTCCAGAAGTGATAGAGCAGACTGTGTACAAAGCATTAGATTTAGGATATCGTCACATTGACACCGCATTCAATTATAACAATGAAGAGGCAATTGGAAATGCGATAAAAAAATGGATAGATAATGGAAAAGGAACGAGGAAGGATTTATTTATAACTACAAAGCTACCCCACGTGGGTTTATCTCGTCCACTACAGTTTTTATTCTTAAATTTGCAATTACAAAGGCTACAAATGGATTACGTAGACTTATACTTAATTCACATGCCATTCGGGTTCCACTGCAACCCGGATACAATGACGCCGTTGGTGAAGAGCAGCGGGGAGTATGACCTTGACCTTGACACAAATCACATCACCACGTGGAAGATCATGGAGGAGTGCCAGAAGGAAGGCCGTATTCGTAATCTGGGACTATCAAACTTCAACGAAAACCAGATAGCGAGGATCATGTCCGCTTCGACGCTCAAACCTCAAGTGCTGCAGGTGGAGTTGCACGCTTCCTTCCAGCAGTTGGAACTGAGGAAGTTTTGCGCCGAAAATGAGATAGTGGTCACGGCATACGCCCCGTTAGGGTCTCCGGGGGCGAAGGACCATTTCGTTAACAAGTATAATTATAGTCCTGGCGCATTTCCCGATTTACTCGGACATCCCGAGGTCGCGGACATAGCAAAAAGTCACGGCAAAACGACAGCACAAGTTTTATTGCGTTTCCTGGTCCAGCAGAAGGTGGTTGTGATACCAAAGAGCACCAGCGAAACGAGGCTAAAGGAGAACTCCGAGTTGTACGACTTTGAGCTGACGCCGTCCGAAATGAACCGTCTGAAGAAATTGGATACGGGAGAGAAAGGACGCATCTTTAATTTCTTGTTCTGGAAAGGCGTCGAAAAACATCCTGAGTACCCGTTCAAATTAGACAAGGTCGTGGAAGTCGAGAAGAATTGA

Protein sequence:

>DPOGS214630-PA
MPTLGLGTWQASPEVIEQTVYKALDLGYRHIDTAFNYNNEEAIGNAIKKWIDNGKGTRKDLFITTKLPHVGLSRPLQFLFLNLQLQRLQMDYVDLYLIHMPFGFHCNPDTMTPLVKSSGEYDLDLDTNHITTWKIMEECQKEGRIRNLGLSNFNENQIARIMSASTLKPQVLQVELHASFQQLELRKFCAENEIVVTAYAPLGSPGAKDHFVNKYNYSPGAFPDLLGHPEVADIAKSHGKTTAQVLLRFLVQQKVVVIPKSTSETRLKENSELYDFELTPSEMNRLKKLDTGEKGRIFNFLFWKGVEKHPEYPFKLDKVVEVEKN-
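
The two sequences above can be checked against each other by translating the transcript. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and that the transcript is the complete coding sequence starting in frame at position 1. The names `CODON_TABLE` and `translate` are illustrative, and the stop codon is written as "-" only to match the convention used in the protein record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, starting at position 1."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)

# First 30 and last 15 nucleotides of DPOGS214630-TA, copied from the record.
first_codons = "ATGCCTACATTAGGTCTGGGTACCTGGCAG"
last_codons = "GTCGAGAAGAATTGA"

print(translate(first_codons))
print(translate(last_codons))

# Length check: 325 residues plus one stop codon should account for 978 bp.
print(3 * (325 + 1) == 978)
```

Passing the full DPOGS214630-TA sequence to `translate` in place of the two fragments would allow a residue-by-residue comparison with DPOGS214630-PA.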