Monarch geneset OGS2.0

DPOGS213891
TranscriptDPOGS213891-TA2586 bp
ProteinDPOGS213891-PA861 aa
Genomic positionDPSCF300572 - 8440-11296
RNAseq coverage833x (Rank: top 15%)
Annotation
HeliconiusHMEL0161330.090.66% 
BombyxBGIBMGA003229-TA0.078.55% 
DrosophilaHmgcr-PB0.050.54% 
EBI UniRef50UniRef50_O768190.087.57%3-hydroxy-3-methylglutaryl-coenzyme A reductase n=5 Tax=Obtectomera RepID=HMDH_AGRIP
NCBI RefSeqNP_001093298.10.085.82%3-hydroxy-3-methylglutaryl-CoA reductase [Bombyx mori]
NCBI nr blastpgi|111328550.087.57%3-hydroxy-3-methylgluraryl coenzyme A reductase [Agrotis ipsilon]
NCBI nr blastxgi|111328550.087.57%3-hydroxy-3-methylgluraryl coenzyme A reductase [Agrotis ipsilon]
Group
Gene OntologyGO:00159368.1e-151coenzyme A metabolic process
GO:00044208.1e-151hydroxymethylglutaryl-CoA reductase (NADPH) activity
GO:00551148.1e-151oxidation-reduction process
GO:00506628.1e-151coenzyme binding
GO:00506614.3e-140NADP binding
GO:00082994.3e-140isoprenoid biosynthetic process
GO:00160214.3e-140integral to membrane
GO:00166165.9e-100oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
KEGG pathwaydan:Dana_GF162580.0 
 K00021 (E1.1.1.34, HMG1)maps-> Terpenoid backbone biosynthesis
InterPro domain[34-850] IPR0022020Hydroxymethylglutaryl-CoA reductase, class I/II
[438-844] IPR0045544.3e-140Hydroxymethylglutaryl-CoA reductase, eukaryotic/arcaheal type
[416-834] IPR0090295.9e-100Hydroxymethylglutaryl-CoA reductase, class I/II, substrate-binding
[677-846] IPR0230744.5e-84Hydroxymethylglutaryl-CoA reductase, class I/II, catalytic domain
[561-676] IPR0090231.5e-49Hydroxymethylglutaryl-CoA reductase, class I/II, NAD/NADP-binding
[427-511] IPR0232827.9e-26Hydroxymethylglutaryl-CoA reductase, N-terminal
Orthology groupMCL12272 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213891-TA
ATGTTGTTCGACCTCTGCCCTAAGATTGTTTTGAAAATATCACCCGCTGTAATACACAGGCTGCACAGCGGCACAGTATCTATGATGAAGGTCTGGGGAGCGCACGGGGAGTTCTGTGCGAAACATCAATGGGAGGTCATAGTGGCGACCCTCGCCCTCCTCGCCTGTGCTGCGAGTGTCGAGAGACACGGCACAGGCGTCAGATCTGAACACTGTGCGGGCTGGGCTAGAGCGTGTCCAGGGTTAGAGGCGGAGTATCAAGCGGCCGACGCTGTTATCATGACCTTCGTCCGCTGCGCCGCCTTGCTCTACGCCTATTACCAAGTGTCGAATCTTCAGAAAATTGCTTCGAAATATCTTCTAATAATTGCTGGCGTGTTCTCAACATTCGCCAGTTTCATATTCACGTCGGCATTGGCTAGTTTGTTTTGGAGCGAGCTGGCGAGTATTAAGGATGCGCCGTTTCTGTTCCTATTGGTTGCTGATGTGGCTAGAGGGGCCAGGATGGCTAAGGCTGGATGGAGCGCAGGAGAGGATCAGGGGAAGAGGGTCGGCAAAGCGCTTTCATTGCTGGGACCGACGGCGACTTTAGACACACTTCTAGCAGTTCTTCTAGTCGGTGTCGGTGGACTATCTGGTGTTCCAAGATTGGAGCACATGTGCACGTTCGCTTGTCTGGCTCTGTTAGTCGACTACTCCGTGTTCGTTACTTTCTACCCGGCCTGCTTATCGCTCGTGTCAGATTTCGCATCCGGCAGGAAAGAAATGAGACCGGATAGTCCGTTTGCTGAAAGTGATCTGAAACCTAACCCGGTGGTGCAGAGGGTCAAGATGATCATGGCTGCTGGTTTGTTGTGTGTACATCTGACAAGCAGGTTGCCGTGGGCGAAGGAGAACGGGATGATAGAAGGTTCCTTATCAAAAGATTTCAAGTCGACATCCGATGAGAATGTTTTATTCAATTCGTACGTCAAATGGTTCTCAGTGAGCGCCGATTATATCGTCATCGCCACGTTGTTGTGCGCTCTCATTATAAAATTCATCTTCTTCGAGGAGCAAAGGAACTGGGTCATCGATATGAATGATCTAACGGTGAAGGAAGTCACTAATAAATATGACAAGCCTAAATTCATTGTTGGCGAGGAATTTAAAGCGGAGATATGTACTCAAACCGATGACCTGTTGAATTGGGAGGAGACCGAATGGCCAGTGCTCTCCCCGAGCTCATCAGCCGCTAAACTTAACGCGAAAAAACGCCCAATGGCCGAATGTTTGGAAATATACCGTTCAGAGGGTGTCTGTACATCGCTTAGTGACGATGAAGTCGTAATGCTTGTGGAACAATCGCATATACCGTTGCATAGATTAGAGAATGTTCTCAACGATCCATTGCGCGGTGTCAGGCTACGCAGGAGAGTCATATCGGCGAGATTCGAAACGGAATCTGCCGTGAAGAAGCTGCCTTATCTCAACTACGACTATAGCAAAGTACTAAATGCGTGTTGCGAAAACGTGATTGGATATGTTGGTATTCCAGTAGGTTATGCTGGTCCCTTGGTCGTCGATGGTAAAGCTTATATGATCCCCATGGCGACCACTGAAGGGGCTTTGGTAGCATCCACGAATAGAGGCGCTAAGGCAATTGGAACCAGAGGAGTTACCAGTGTGGTTGAAGATGTGGGCATGACAAGAGCTCCAGCGGTGAAGTTACCTAATGTAGTGCGAGCTCACGAGTGCCGTCAGTGGCTCGACAATAAAGATAATTACGCTATTATCAAAACGGCTTTCGATTCAACATCTAGATTCGCGCGACTCCAGGAAGTGCACGTTGGCGTCGACGGCGCCATTTTATATTTGCGATTTAGAGCCACCACTGGCGACGCCATGGGAATGAACATGGTGTCTAAGGGTGCCGAAAACGCTCTCAAGCTACTCAAGAATTACTTCCCGGACATGGAAGTTATAAGTTTATCTGGCAATTACTGTTCTGATAAAAAAGCGGCTTCAATCAACTGGGTCAAAGGTAGAGGCAAACGTGTAATATGCGAGACGACAATATCGGCTACAAATTTGAAGAATATTTTCAAAACTGACGCCAAAACTATGACAAGGTGCAACAAAATAAAGAATTTGTCCGGATCGGCGTTGGCTGGTTCGATAGGAGGCAACAACGCTCACGCTGCTAACATGGTCACCGCTATCTTCATAGCTACCGGCCAAGATCCAGCTCAGAATGTGACGAGCAGCAACTGCTCCACCAGCATGGAGGTTTGTGGGGAGAACAACGAGGATCTGTATGTGACATGCACCATGCCTTCGTTGGAAGTAGGAACTGTTGGCGGCGGCACGGTTCTGACTGGTCAGGGTGCCTGCCTCGAGATCCTCGGCGTCAAAGGAGCGGCGGAACGACCGGCTGAGAACTCAGCCAGACTGGCTTCCCTAATATGCGCCACCGTCCTGGCCGGCGAGCTCAGCCTTATGGCCGCTTTAGTCAACTCGGACTTAGTGAAATCTCACATGCGGCACAACAGATCCACTATCAACGTACAGAACGCGTCCAACGAACTGAAAGTACCCACATTATAA

Protein sequence:

>DPOGS213891-PA
MLFDLCPKIVLKISPAVIHRLHSGTVSMMKVWGAHGEFCAKHQWEVIVATLALLACAASVERHGTGVRSEHCAGWARACPGLEAEYQAADAVIMTFVRCAALLYAYYQVSNLQKIASKYLLIIAGVFSTFASFIFTSALASLFWSELASIKDAPFLFLLVADVARGARMAKAGWSAGEDQGKRVGKALSLLGPTATLDTLLAVLLVGVGGLSGVPRLEHMCTFACLALLVDYSVFVTFYPACLSLVSDFASGRKEMRPDSPFAESDLKPNPVVQRVKMIMAAGLLCVHLTSRLPWAKENGMIEGSLSKDFKSTSDENVLFNSYVKWFSVSADYIVIATLLCALIIKFIFFEEQRNWVIDMNDLTVKEVTNKYDKPKFIVGEEFKAEICTQTDDLLNWEETEWPVLSPSSSAAKLNAKKRPMAECLEIYRSEGVCTSLSDDEVVMLVEQSHIPLHRLENVLNDPLRGVRLRRRVISARFETESAVKKLPYLNYDYSKVLNACCENVIGYVGIPVGYAGPLVVDGKAYMIPMATTEGALVASTNRGAKAIGTRGVTSVVEDVGMTRAPAVKLPNVVRAHECRQWLDNKDNYAIIKTAFDSTSRFARLQEVHVGVDGAILYLRFRATTGDAMGMNMVSKGAENALKLLKNYFPDMEVISLSGNYCSDKKAASINWVKGRGKRVICETTISATNLKNIFKTDAKTMTRCNKIKNLSGSALAGSIGGNNAHAANMVTAIFIATGQDPAQNVTSSNCSTSMEVCGENNEDLYVTCTMPSLEVGTVGGGTVLTGQGACLEILGVKGAAERPAENSARLASLICATVLAGELSLMAALVNSDLVKSHMRHNRSTINVQNASNELKVPTL-