Monarch geneset OGS2.0

DPOGS202685
Transcript: DPOGS202685-TA (3783 bp)
Protein: DPOGS202685-PA (1260 aa)
Genomic position: DPSCF300438, 69397-81460
RNAseq coverage: 2387x (Rank: top 5%)
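
The lengths above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def cds_length_from_protein(aa_count: int, include_stop: bool = True) -> int:
    """Coding-sequence length in bp implied by a protein length in residues."""
    codons = aa_count + (1 if include_stop else 0)
    return codons * 3


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3783
protein_aa = 1260
locus_start, locus_end = 69397, 81460

# 1260 residues plus one stop codon account for the full transcript length.
print(cds_length_from_protein(protein_aa) == transcript_bp)  # True
# Genomic span of the locus under the inclusive-coordinate assumption.
print(span_length(locus_start, locus_end))  # 12064
```

Since 3 x (1260 + 1) = 3783, the transcript length is fully accounted for by coding sequence, which suggests the transcript model contains no annotated UTR. The remaining genomic span would then correspond to introns.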
Annotation
Heliconius: HMEL014572; 0.0; 81.15%
Bombyx: BGIBMGA011223-TA; 0.0; 72.25%
Drosophila: ACC-PD; 0.0; 53.51%
EBI UniRef50: UniRef50_E9G1C9; 0.0; 51.16%; Putative uncharacterized protein n=16 Tax=Coelomata RepID=E9G1C9_DAPPU
NCBI RefSeq: XP_002429216.1; 0.0; 55.49%; acetyl-CoA carboxylase, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242017480; 0.0; 55.49%; acetyl-CoA carboxylase, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|189238375; 0.0; 55.42%; PREDICTED: similar to acetyl-coa carboxylase [Tribolium castaneum]
Group
Gene Ontology:
    GO:0016874; 5.5e-136; ligase activity
    GO:0003989; 2.5e-90; acetyl-CoA carboxylase activity
    GO:0005524; 2.5e-90; ATP binding
    GO:0006633; 2.5e-90; fatty acid biosynthetic process
KEGG pathway: phu:Phum_PHUM424350; 0.0
K11262 (ACAC) maps to:
    Propanoate metabolism
    Insulin signaling pathway
    Fatty acid biosynthesis
    Pyruvate metabolism
InterPro domain:
    [506-1147]; IPR000022; 5.5e-136; Carboxyl transferase
    [82-359]; IPR013537; 2.5e-90; Acetyl-CoA carboxylase, central domain
Orthology group: MCL10587; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202685-TA
ATGAAAATTTGTGATAGACATGATGATGAAGTATCCCCTGAGGAGTGGAAATTTCCTACAGTTGTATGGATATTTATATATTATACGGTGTGCAACGCTGCGTTGGAGGTTTACGTCCGGCGCGCTTACACCTCGTACGACATAACATGCCTGCAGCATCTGGCTTTGTCCGGGGAATTGGGTGTGGTGCACTTCCAGTTCGTGCTCCCGACGGGACATCCGAACAGTATTAACCAAACAGCTATCCGTAACCAGGAGTTGGAGCCGATCCACATCCTCATGATCGGTATCCGGGACAGCGGGGAGAGCGATGACGTCACCGTGTCCCGGCGCTTCGGTCACTTCTGCAGGGTCCACATGAGAGAGCTGCACCACAAACGTATAAGAAGGATCACATTCATGTTGCTCATCAAGCGTCAGTTCCCCAAGTTCTTCACGTACCGCGCGAGGAATGACTTCTCCGAGGACACTATCTACCGACACTTGGAGCCGGCGTCCGCCTTCCAGCTGGAGCTGTACCGGATGAGGAGCTACGAGCTGGAGGCGCTGCCGACCAGCAACCAGAAGATGCATCTGTACCTGGGCAAGGCCAAGGTGAAGAAAGGCCAGGAGGTGACGGATTACCGCTTCTTCATCAGATCCATCATCAGGCACCAGGATCTCATCACCAAGGAAGCGTCCTTCGAGTACCTACAGAATGAAGGGGAAAGGTCTCTGCTGGAAGCGATGGATGAACTGGAGGTGGCCTTCTCTCACCAGCTCGCCAAGAGAACGGACTGCAACCACATCTTCCTTAACTTTGGCCCAACCGTCATCATGGATATTGCCAAGATCGAGGAATCAGTGCTGGGAATGGTTATGAGGTATGGACCTCGTCTGTGGAAGCTCAGGGTACTGCAGGCGGAGATAAGATTCACTTTGAGAACAGGCCCCGGCGTGCCCACTAAGAACGTCCGTCTATGTCTGTCCAACGGTTCCGGATACTCCCTAGACGTGTACACATACGAGGAAGTAGTCGACCCGAGGACGGGAGTGATAATATTCCAGTCCTTCGGCCCCAAGCAAGGTCCAATGCACGGTCTTCCAATATCGACGCCCTACGTCACCAAGGACTACCTGCAGCAGAAGAGATTTTTGGCTACATCCCAGGGCACTACATACGTGTACGACATCCCGGATATGTTCAGGCAGGTGGTAGAGGGGCAGTGGAGGGAGAGCATCGAGGAGGGGGCTGTGGACGGTGAGATTCTTCCTTTGCCTCCTTTGTGTATCCAGGCTACATCCCAGGGCACTACATACGTGTACGACATCCCGGATATGTTCAGACAGGTGGTAGAGGGGCAGTGGAGGGAGAGCATCGAGGAGGGGGCCGTGGACGGTCCTATGCCCGACACCGTGATGGTGTCCTTGGAGCTGGTGGTGGAGACAGACGGTGAGAGACGCATCATGGAAGTCACAAGGCTACCTGGACAGAACACCGTCGGGATGGTCGCGTGGCGGATGACGCTCTACACTCCCGAAGTGCCGTCGGGTCGGGATGTGGTGCTGATAGCGAACGATCTGACACATTACATGGGGTCGTTCGGACCCCAAGAGGACTGGGTCTATTATCGCGCATCCGAATACGCCAGGGAACACAAGATACCCCGGTTGTACGTGAGCGTCAACTCCGGCGCCCGCATCGGCGTTGCTGAAGAAGTGAAATCAGAATTCAAAGTAGCCTGGCTAGATTCAGAGCGGCCCGAGAGGGGCTTTAAGTATCTGTACCTCAGCCCTGAGGCGTATTCCCGTCTGGGGGCATTGAACTCCGTGAAGACCGAATTGATCGATGACGAAGGAGAGTCCAGATACAAGATAACCGATATCATAGGCAAGGAGGACGGTCTGGGAGTGGAGTGTCTCCGGGACGCCGGGCTCATAGCGGGGGAGACGGCCCAGGCTTACGAGGACATCGTCACCATCTCTATAGTCACGTGCAGGGCTATCGGAATAGGGTCCTA
CGTTGTGAGATTGGGTCACCGTGTCATCCAAGTGGATTCCTCGTACATCATCCTGACCGGCTACATGGCGCTGAACAAGGTCCTCGGCCGCTCCGTGTACGCGTCTAACAACCAATTAGGCGGTGTACAGATCATGCATAACAACGGAGTGACACACGCAGTGGCGCCCTCCGACCTGGAGGCCGTGCGGACGGCGCTCAGGTGGCTGGCATATGTACCTAAGTATATGTTGATGAACGGAAAGCGCCATCTGTCGGCTGCGTCGCTGTACTACACCCCGCACAGCCGTTGGGAGGGGGTGGTTAAGTTTAACAGAATTAAGGGAAATATCATAGCGGATTATGACAAATTAAGTATGGTGCCGATAATAAGGAGCGTGGACCCCATAGACCGGCCGGTGGAGTGGGTCCCTCCCCGAGCGGCGCACGACCCTCGCCTCATGCTGACGGGAGACGGCATCCGGGCCGGCTTCCTCGACAACGGCTCCTTCGACGAGATCATGAAGCCCTGGGCACAAACTGTTGTTACAGGTCGTGGTCGCCTGGGCGGTATCCCCGTGGGCATCATCGCCGTGGAGACGAGGACTGTGGAGCTGACGCTGCCCGCCGACCCCGCCAACTACGACTCCGAGGCCAAGACTGTGCAACAGGCCGGGCAGGTGTGGTTCCCCGACTCCGCCTACAAGACCTCCCAAGCGATAAACGACTTCTCCAGAGAGAATCTCCCCATCATCATATTCGCCAATTGGAGAGGTTTCAGCGGAGGTCAGAAAGACATGTACGAGCAGATCTTGAAGTTCGGCGCCGAGATAGTCCGGGCCCTACGCGGGGCCACCGCTCCCGTGCTGGTGTACATACCGCCCGGGGCCGAGCTCCGCGGCGGGGCCTGGGCCGTCGTGGACCCCAGCGTCAACAACCTCAGGATGGAGATGTACGCCGACAACGAGGCACGGTCAGTTGAGCACGTCTATGTTTGTTACAGCTGTACTCAGAAACTGCTTCCAGTGGTTTCGGCGGCGTGTTGGATGTTGGAGGCGGAGGGGATCGTTGAGGTGAAATTCAAACAACGAGACATACTCAAGACCATGAACAGGCTGGACTCCAACCTACTCAGACTCAACTCCAGAGTTAGTGAAATTAAGGAACAAATCAAAGAAATCTCAAAGAACCTGGACAGGCGGGGATCCATAGACGACGTGTTGATAAAGACCGAGACCGGCAAACAAGCGGAAGCGAAAATACACGAATTGGAAGCAGAATTAGGCACCGCTGAGAAATCTATAAAGGCCAGGGAGAAGGAATTGAGTCCGATATACCATGAGATAGCGGTTCAGTTTGCTGAGCTCCACGACACAGCTGAGAGGATGTTAGAGAAGGGTTGTATATTTGATATAATACCCTGGCGCGAGTCCCGTCAGTTGTTGCACTGGAGGCTTCGAAGGTTGTTGCTGCAGAACGAACAGGAGAGGAGAGTGCAGGCAGCGACCCGACCAGCCAGGATGGACCAGAGGGCTGCAGCGGCCACACTGAGGAGGTGGTTCACAGAAGACCGGGGAGAGACACAGTCACACCAGTGGGAGAATGACAACCAAGCGGTCTGTAGCTGGCTGGAGAACCAGGTGAAGGACAACGAGTCCGTGCTGGAGAGGAACCTCAGAGCTATCACAGAGGATGCAGCGCTACAGGCCTGTAACGAGCTAGTCAGGAAACTGAGTCCATCACAGCGGGCAGAGTTCATCAGAAAGATCACAGCATTGGACATGGAGACGGAGTACAACAACTAG

Protein sequence:

>DPOGS202685-PA
MKICDRHDDEVSPEEWKFPTVVWIFIYYTVCNAALEVYVRRAYTSYDITCLQHLALSGELGVVHFQFVLPTGHPNSINQTAIRNQELEPIHILMIGIRDSGESDDVTVSRRFGHFCRVHMRELHHKRIRRITFMLLIKRQFPKFFTYRARNDFSEDTIYRHLEPASAFQLELYRMRSYELEALPTSNQKMHLYLGKAKVKKGQEVTDYRFFIRSIIRHQDLITKEASFEYLQNEGERSLLEAMDELEVAFSHQLAKRTDCNHIFLNFGPTVIMDIAKIEESVLGMVMRYGPRLWKLRVLQAEIRFTLRTGPGVPTKNVRLCLSNGSGYSLDVYTYEEVVDPRTGVIIFQSFGPKQGPMHGLPISTPYVTKDYLQQKRFLATSQGTTYVYDIPDMFRQVVEGQWRESIEEGAVDGEILPLPPLCIQATSQGTTYVYDIPDMFRQVVEGQWRESIEEGAVDGPMPDTVMVSLELVVETDGERRIMEVTRLPGQNTVGMVAWRMTLYTPEVPSGRDVVLIANDLTHYMGSFGPQEDWVYYRASEYAREHKIPRLYVSVNSGARIGVAEEVKSEFKVAWLDSERPERGFKYLYLSPEAYSRLGALNSVKTELIDDEGESRYKITDIIGKEDGLGVECLRDAGLIAGETAQAYEDIVTISIVTCRAIGIGSYVVRLGHRVIQVDSSYIILTGYMALNKVLGRSVYASNNQLGGVQIMHNNGVTHAVAPSDLEAVRTALRWLAYVPKYMLMNGKRHLSAASLYYTPHSRWEGVVKFNRIKGNIIADYDKLSMVPIIRSVDPIDRPVEWVPPRAAHDPRLMLTGDGIRAGFLDNGSFDEIMKPWAQTVVTGRGRLGGIPVGIIAVETRTVELTLPADPANYDSEAKTVQQAGQVWFPDSAYKTSQAINDFSRENLPIIIFANWRGFSGGQKDMYEQILKFGAEIVRALRGATAPVLVYIPPGAELRGGAWAVVDPSVNNLRMEMYADNEARSVEHVYVCYSCTQKLLPVVSAACWMLEAEGIVEVKFKQRDILKTMNRLDSNLLRLNSRVSEIKEQIKEISKNLDRRGSIDDVLIKTETGKQAEAKIHELEAELGTAEKSIKAREKELSPIYHEIAVQFAELHDTAERMLEKGCIFDIIPWRESRQLLHWRLRRLLLQNEQERRVQAATRPARMDQRAAAATLRRWFTEDRGETQSHQWENDNQAVCSWLENQVKDNESVLERNLRAITEDAALQACNELVRKLSPSQRAEFIRKITALDMETEYNN-
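
The nucleotide record above wraps across more than one line, and the protein record ends with a "-" that appears to mark the stop codon. A minimal sketch of reading such records follows. The helper names `read_fasta` and `strip_stop` are hypothetical, the toy input is invented for illustration, and treating a trailing "-" or "*" as a stop marker is an assumption about this file.

```python
def read_fasta(text: str) -> dict[str, str]:
    """Parse FASTA-formatted text into {record id: sequence}.

    Sequence lines belonging to one record are joined, so a sequence
    wrapped over several lines is returned as a single string.
    """
    chunks: dict[str, list[str]] = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:].split()[0]  # id is the first token after ">"
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


def strip_stop(protein: str) -> str:
    """Drop a trailing stop marker ("-" or "*"), if present (assumed convention)."""
    return protein[:-1] if protein.endswith(("-", "*")) else protein


# Toy records (not the real sequences) in the same layout as above.
toy = ">toy-TA\nATGAAA\nTAG\n\n>toy-PA\nMK-\n"
records = read_fasta(toy)
print(len(records["toy-TA"]))         # 9
print(strip_stop(records["toy-PA"]))  # MK
```

Applied to the records above, the joined nucleotide sequence should be 3783 bp long and the protein 1260 residues after the stop marker is removed, if the stated lengths are correct.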