Monarch geneset OGS2.0

DPOGS201485
Transcript: DPOGS201485-TA (1389 bp)
Protein: DPOGS201485-PA (462 aa)
Genomic position: DPSCF300006 + 146624-150041
RNAseq coverage: 844x (Rank: top 15%)
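The lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive and that the transcript length covers the coding sequence including the stop codon.

```python
def genomic_span(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def codon_count(transcript_bp: int) -> int:
    """Number of complete codons; raises if the length is not a multiple of 3."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3


# Values copied from the record.
span = genomic_span(146624, 150041)   # bases covered on the scaffold
codons = codon_count(1389)            # codons in the transcript
extra = span - 1389                   # genomic bases not present in the transcript

print(span, codons, extra)
```

Under these assumptions, the codon count is one more than the 462 aa protein length, which matches a terminal stop codon, and the genomic span is longer than the transcript, which would be consistent with introns.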
Annotation
Heliconius | HMEL015943 | 0.0 | 74.68%
Bombyx | BGIBMGA002674-TA | 1e-129 | 83.76%
Drosophila | T3dh-PA | 8e-156 | 56.68%
EBI UniRef50 | UniRef50_Q8R0N6 | 2e-156 | 58.57% | Hydroxyacid-oxoacid transhydrogenase, mitochondrial n=87 Tax=cellular organisms RepID=HOT_MOUSE
NCBI RefSeq | XP_968236.1 | 2e-177 | 64.81% | PREDICTED: similar to Type III alcohol dehydrogenase CG3425-PA [Tribolium castaneum]
NCBI nr blastp | gi|91092172 | 4e-176 | 64.81% | PREDICTED: similar to Type III alcohol dehydrogenase CG3425-PA [Tribolium castaneum]
NCBI nr blastx | gi|91092172 | 3e-178 | 64.81% | PREDICTED: similar to Type III alcohol dehydrogenase CG3425-PA [Tribolium castaneum]
Group
Gene Ontology | GO:0046872 | 2.8e-101 | metal ion binding
              | GO:0055114 | 2.8e-101 | oxidation-reduction process
              | GO:0016491 | 2.8e-101 | oxidoreductase activity
KEGG pathway 
InterPro domain | [49-450] IPR001670 | 2.8e-101 | Alcohol dehydrogenase, iron-type
Orthology group | MCL12600 | Multiple-copy universal gene

Nucleotide sequence:

>DPOGS201485-TA
ATGGTTTCCCGAAAAAGAGTATTTGATTTATTTAGAACTATGAACGTAGCTGCATGCCAATGTCCAGCTCATGGGTTTAGAGGAAATTTTCAAGTTTCGAATGCAACTCCAACTAAGGATTATGCTTTTGAGATTAAATCCTCAACAGTAAGGTATGGATTGGGAGTCACAAGAGAAGTCGGTCAGGACTTGGTCAATATTGGTGCTAAAAATGTTTGTGTTATGACAGACCCTAATGTAGTCTCTCTGAGTCCAATGAAAGCTGTATTGGAATCACTAACAAGGAATGGTGTGAAATATAAAGTATATGACAGAGTTCGAGTAGAACCTACTGATACAAGTTTTAAAGATGCCATAAAGTTTGCAAAGGAGGGCAACTTCGACAGTTTCGTGGCTGTGGGAGGTGGTTCTGTAATGGACACGGCTAAAGCTGCTAATCTTTATTACTGCGACCCTAGTGCTGACTTCCTTGATTACGTCAACCAGCCTGTGGGTAAAGGGAAACCGGTTGTTGTACAGCTGAAACCTTTGATAGCAATTCCAACAACCAGCGGTACTGGCAGTGAAACAACTGGCATCAGTATTTTTGATTTTGAAGAAATACATGCTAAGACGGGCATATCCCATTTAAATATACGCCCATTGTTGGCCCTCATAGATCCTCTGCATACACTAACAATGCCTAAAAACGTGGCCACTTATTCAGGATTCGATATATTTTGTCATGCATTAGAAAGTTTTACTACAATACCATACAACGAAAGAGAACCGGCTCCGGAAAATCCATCCTTAAGGCCAGTGTATCAAGGAAGTAATCCTATATCGGATGTGTGGGCTAGATTTTGTTTGCAAGCACTGAATAAATACTTCCACCGGTCTGTCAACAATCCTGACGATGTAGAGGCACGCTCGAGTATGCATCTGGCAGCTACCATGGCGGGTGTGGGCATTGGCAACGCTGGCATACACTTATGTCACGGTCTAGCATATCCCATAGCTGGGAATGTCAAAGATTTCGTTCCTAAGGATTATGGTTCAAAGCCAATGATACCTCATGGCCTGGCCGTGTCTATGACAGCACCAGCGGTTTTCAGATTCACAGCTGTCAGTGATCCCGAGAAGCATCTTGAGGCGGCTAGTTTACTTGGAACCGATATAACTAATAAGAAAAAAGAAGATGCTGGCAAAGTACTGTCTGATATTATATTGCAGTACATGGACAAAATGGGTATTGATGATGGATTGTCAGCTCTCGGCTTCAACTCTGGTGACATTCCTAGTCTGGTTAAGGGAGCTTTACCTCAGCAACGGCTGCTGAAAATGGCTCCATTGCCACAAAGCGAGGAGGACTTGAGCAGACTCTTGGAGGATTCATTGACTATATATTGA

Protein sequence:

>DPOGS201485-PA
MVSRKRVFDLFRTMNVAACQCPAHGFRGNFQVSNATPTKDYAFEIKSSTVRYGLGVTREVGQDLVNIGAKNVCVMTDPNVVSLSPMKAVLESLTRNGVKYKVYDRVRVEPTDTSFKDAIKFAKEGNFDSFVAVGGGSVMDTAKAANLYYCDPSADFLDYVNQPVGKGKPVVVQLKPLIAIPTTSGTGSETTGISIFDFEEIHAKTGISHLNIRPLLALIDPLHTLTMPKNVATYSGFDIFCHALESFTTIPYNEREPAPENPSLRPVYQGSNPISDVWARFCLQALNKYFHRSVNNPDDVEARSSMHLAATMAGVGIGNAGIHLCHGLAYPIAGNVKDFVPKDYGSKPMIPHGLAVSMTAPAVFRFTAVSDPEKHLEAASLLGTDITNKKKEDAGKVLSDIILQYMDKMGIDDGLSALGFNSGDIPSLVKGALPQQRLLKMAPLPQSEEDLSRLLEDSLTIY-
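The relationship between the two sequences above can be checked by translating the transcript. This is a minimal sketch, assuming the standard genetic code (the record does not state which table was used); `translate` is a hypothetical helper name. It writes the stop codon as `*`, whereas the protein record above ends in `-`.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons and last five codons of DPOGS201485-TA, copied from above.
print(translate("ATGGTTTCCCGAAAAAGAGTATTTGATTTA"))
print(translate("TTGACTATATATTGA"))
```

Translating the full 1389 bp transcript this way should give 462 residues followed by one stop symbol, in line with the lengths listed at the top of the record.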