Monarch geneset OGS2.0

DPOGS210828
Transcript: DPOGS210828-TA (861 bp)
Protein: DPOGS210828-PA (286 aa)
Genomic position: DPSCF300027 - 355672-363345
RNAseq coverage: 965x (Rank: top 13%)
Annotation
Heliconius       HMEL021919        4e-98    67.87%
Bombyx           BGIBMGA007144-TA  5e-143   88.46%
Drosophila       CG15093-PA        3e-64    43.21%
EBI UniRef50     UniRef50_B3GQU6   5e-140   88.11%   3-hydroxyisobutyrate dehydrogenase n=2 Tax=Obtectomera RepID=B3GQU6_BOMMO
NCBI RefSeq      NP_001124349.1    1e-140   88.11%   3-hydroxyisobutyrate dehydrogenase [Bombyx mori]
NCBI nr blastp   gi|195963353      2e-139   88.11%   3-hydroxyisobutyrate dehydrogenase [Bombyx mori]
NCBI nr blastx   gi|195963353      2e-141   88.11%   3-hydroxyisobutyrate dehydrogenase [Bombyx mori]
Group
Gene Ontology
GO:0055114   8.5e-114   oxidation-reduction process
GO:0016491   8.5e-114   oxidoreductase activity
GO:0008442   4.9e-104   3-hydroxyisobutyrate dehydrogenase activity
GO:0051287   4.9e-104   NAD binding
GO:0016616   8.1e-44    oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:0050662   8.1e-44    coenzyme binding
GO:0005488   2.8e-42    binding
GO:0004616   2.1e-40    phosphogluconate dehydrogenase (decarboxylating) activity
GO:0006098   2.1e-40    pentose-phosphate shunt
KEGG pathway
ame:551208   2e-74
K00020 (E1.1.1.31, mmsB) maps to: Valine, leucine and isoleucine degradation
InterPro domain
[1-284]     IPR015815   8.5e-114   3-hydroxyacid dehydrogenase/reductase
[1-281]     IPR011548   4.9e-104   3-hydroxyisobutyrate dehydrogenase
[153-284]   IPR013328   8.1e-44    Dehydrogenase, multihelical
[1-152]     IPR016040   2.8e-42    NAD(P)-binding domain
[1-151]     IPR006115   2.1e-40    6-phosphogluconate dehydrogenase, NADP-binding
[153-283]   IPR008927   3.9e-35    6-phosphogluconate dehydrogenase, C-terminal-like
Orthology group: MCL13113 (single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210828-TA
ATGGGTGGATTTATGGCCGCTAACCTCGTGAAAAAGGGCTTCAATGTTCGAGGATACGATCCTTCAAAGGAGGCTCAGAACGCAGCAGCCAAAGAAGGCGTGGCCAGCGCCAGTTCCATAGCTGCAGCGGTGGACGGAGTCGATGCTGTGGTGTCGATTCTGCCCAGCAACAAAGTCGTTCTCGACGCTTACCTCGGAAAAGATGGCGTGGTAAAACATGCACCAAAAGGTTCACTTCTCATTGATTCGAGTACTGTGGACCCGAATGTTCCCAAGGAGATATTCCCAGTGGCCATCGAGAGCGGTGTCAGCTTCATAGATGCCCCGGTTTCTGGAGGTGTAATGGGGGCCCAAAATGCCACTTTAGCATTCATGGTAGGGGGCCGCAAAGAGGATTTCGAAAGATCCCTGCCAATGCTGAAAGCCATGGGCCTCAAACATTTTCATTGTGGCGACAGCGGTGCCGGTCAAGTGGCTAAGTTAGCAAATAATATGCTAATGGGAATCACTGGTATGGCAACAGCCGAATGTATGAATATGGGCATTAAAATGGGCTTAGATCCCAAAGTTTTGCTGGACGTGTTGAACAACTCATCAGCGCGCTCGTGGTCTACTGAGGTTTACTGCCCTGTGCCCGGACTGGTACCGACCGCTCCATCCAGCAGAAACTATGATGGTGGTTTCAAGAATGAACTCATGGTCAAGGATTTGGAGCTGGCTAGCGGCATGGCTTTGGGCATCCGCTCCCCCATCCCTCTCGGTGCAGTTGCCACACAGTTGTACCGTATTGTACAGTCACGTGGATATGGACAGAAAGACTTCTCATTCGTATATCAGCTGCTTAAGGATGAGAAAAAATAA

Protein sequence:

>DPOGS210828-PA
MGGFMAANLVKKGFNVRGYDPSKEAQNAAAKEGVASASSIAAAVDGVDAVVSILPSNKVVLDAYLGKDGVVKHAPKGSLLIDSSTVDPNVPKEIFPVAIESGVSFIDAPVSGGVMGAQNATLAFMVGGRKEDFERSLPMLKAMGLKHFHCGDSGAGQVAKLANNMLMGITGMATAECMNMGIKMGLDPKVLLDVLNNSSARSWSTEVYCPVPGLVPTAPSSRNYDGGFKNELMVKDLELASGMALGIRSPIPLGAVATQLYRIVQSRGYGQKDFSFVYQLLKDEKK-
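
The transcript and protein lengths in this record are consistent with each other: 861 nt is 287 codons, i.e. 286 residues plus one stop codon, which the protein record writes as a trailing "-". A minimal Python sketch of that check follows. It assumes the standard genetic code applies to this nuclear gene. The `translate` helper and the `head` and `tail` variables are hypothetical names introduced for illustration, and only the first 30 and last 15 nucleotides of DPOGS210828-TA are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence.

    Stop codons are written as '-' to match the convention of this record.
    """
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", "-") for c in codons)


# Fragments copied from the nucleotide record above (illustrative only).
head = "ATGGGTGGATTTATGGCCGCTAACCTCGTG"  # first 30 nt
tail = "GATGAGAAAAAATAA"                 # last 15 nt, including the stop codon

print(translate(head))   # start of the protein
print(translate(tail))   # end of the protein, with the stop as '-'
print(861 // 3 - 1)      # residues implied by an 861 nt CDS with one stop
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the complete protein record, provided the sequence is pasted without line breaks.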