Monarch geneset OGS2.0

DPOGS215498
TranscriptDPOGS215498-TA3495 bp
ProteinDPOGS215498-PA1164 aa
Genomic positionDPSCF300518 - 30056-44138
RNAseq coverage1244x (Rank: top 10%)
Annotation
HeliconiusHMEL0145720.086.59% 
BombyxBGIBMGA011222-TA0.080.39% 
DrosophilaACC-PD0.078.07% 
EBI UniRef50UniRef50_E9G1C90.071.10%Putative uncharacterized protein n=16 Tax=Coelomata RepID=E9G1C9_DAPPU
NCBI RefSeqXP_969851.20.079.12%PREDICTED: similar to acetyl-coa carboxylase [Tribolium castaneum]
NCBI nr blastpgi|2700089880.075.99%hypothetical protein TcasGA2_TC015613 [Tribolium castaneum]
NCBI nr blastxgi|2700089880.076.06%hypothetical protein TcasGA2_TC015613 [Tribolium castaneum]
Group
Gene OntologyGO:00039892.7e-96acetyl-CoA carboxylase activity
GO:00055242.7e-96ATP binding
GO:00066332.7e-96fatty acid biosynthetic process
GO:00168741.1e-93ligase activity
GO:00038248e-53catalytic activity
GO:00081523.8e-52metabolic process
KEGG pathwaytca:6583620.0 
 K11262 (ACAC)maps-> Propanoate metabolism
    Insulin signaling pathway
    Fatty acid biosynthesis
    Pyruvate metabolism
InterPro domain[833-1160] IPR0135372.7e-96Acetyl-CoA carboxylase, central domain
[367-632] IPR0138161.1e-93ATP-grasp fold, subdomain 2
[63-230] IPR0161858e-53PreATP-grasp-like fold
[300-485] IPR0054793.8e-52Carbamoyl-phosphate synthetase, large subunit, ATP-binding
[108-247] IPR0138175.2e-47Pre-ATP-grasp fold
[516-631] IPR0110543.8e-33Rudiment single hybrid motif
[108-227] IPR0054813.8e-28Carbamoyl-phosphate synthase, large subunit, N-terminal
[521-628] IPR0054821.5e-24Biotin carboxylase, C-terminal
[303-364] IPR0138155.3e-19ATP-grasp fold, subdomain 1
[751-833] IPR0110531.2e-13Single hybrid motif
[767-829] IPR0000897.4e-13Biotin/lipoyl attachment
Orthology groupMCL10587 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215498-TA
ATGTTGAAGAGGAGAAGCTCGAAGCGGTTCGTGCTTGGGGACACCTCGGTCGATGAAGGGGAGTGGGCTGATGAGGCAACAGACGTGATGCCAGCGCACGTCGTGCAGAAGTTCCAGGTGACGATCAACTCGGAACCTGAGGGGAATGTGAAAAATGAAGCGCTGGAGGTGGATAAAAGGCTACTCGGACCTCCCACGGCCAGCACCTTGCAGCCGTCGATGTCCCAGGGTACTGTGATCCACAGTCAGCGTTTCCAAGAGAAGGATTTCACTGTTGCCACCCCTGAAGAATTTGTTCGACGTTTCCAAGGGACCAAGGCCATTAACAAGGTGCTGATAGCGAATAATGGCATTGGAGCTGTGAAATGTATGCGTTCTATACGAAGGTGGTCCTATGAGATGTTCAAGAATGAGAGGGCTGTGCGCTTCGTTGTTATGGTGACCCCCGAGGACCTGAAAGCGAATGCTGAATACATAAAGATGGCTGATCATTATGTCCCTGTGCCCGGAGGCTCAAACAACAACAACTATGCCAACGTCGAACTCATAGTGGACATCGCTGTGCGGACCCAAGTACAGGCCGTATGGGCTGGTTGGGGTCATGCTTCAGAGAATCCCAAGCTTCCGGAACTGCTCCATCGAGCTGGTGTAGTTTTCATCGGACCGCCAGAGAAAGCGATGTGGGCTCTCGGAGACAAGATCGCCTCGTCCATAGTGGCCCAGACAGCTGATATACCCACGCTAGCATATATATATATATATATATATATATATATATATATATATATATATCTTTATTTATTTATTATTATATTAGAAAAAAGGTTCCCAAACTTTTTTTTTGAACTTAAAGCGGAGTACAATACAAAGAAGATAAAGATATCATCGGAACTGTTCGCTAGAGGCTGTGTGTCTACACCGGAGCAAGGACTCCAAGCTGCTAACAAGATCGGTTTCCCCGTCATGATAAAGGCTTCTGAGGGTGGAGGAGGTAAAGGTATCAGGAAGGTCGAGGTGCCTGATGACTTCAGCAACATGTTCAGACAGGTCCAAGCGGAAGTGCCAGGTTCTCCGATATTCGTGATGAAGCTGGCGCGGTCAGCTAGACATCTGGAAGTGCAGCTGTTGGCTGATCAGTACGGTAACGCTATATCTCTGTTCGGGAGAGACTGTTCCATCCAGCGACGGCATCAGAAGATCATTGAGGAGGCACCGGCTGCCGTCGCCAAGCCGGAAGTGTTCATTGAAATGGAGAAGGCGGCTGTCCGTCTCGCTAAGATGGTCGGCTACGTGAGTGCTGGCACCGTGGAGTATCTATATGAGCCGATAACGGGGGCTTACTACTTCCTGGAGCTAAACCCTCGCCTGCAGGTGGAACACCCCTGTACGGAGATGGTAGCGGACGTCAACCTACCAGCCGCGCAACTACAGATCGCCATGGGTCTCCCATTATACCACATCAAGGACATCCGTCTCCTGTATGGGGAGTCGCCCTGGGGTATGACTCAGATAGACTTCGACGAGCCCAAACAGAGACCCTCGCCCTGGGGACATGTCATCGCTGCCAGGATCACGTCGGAGAATCCTGATGAAGGTTTCAAACCTTCCTCGGGTACGGTCCAAGAGCTCAACTTCCGGTCGTCAAAGAACGTGTGGGGCTATTTCAGTGTGGCCGCATCCGGGGGCTTGCACGAGTTTGCCGATTCACAGTTCGGGCACTGCTTCTCCTTCGGGGAGACCAGGGAACAGGCTAGAGAGAACCTGGTGATAGCTCTCAAGGAGCTCAGCATCCGAGGAGATTTCAGAACCACGGTCGAATATCTGATCACATTGTTGGAGACGCCCGCCTTCCAGGACAACAACATTGATACTAGCTGGCTGGACGCGCTCATCGCGGAGAGGGTCCAGTCGGAGAAGCCGGACGTGATGCTGGGCGTCATCTGCGGGTCCATCCTCATCGCCGACAACCTCATCACGGCACATCTGCTGGAGTTTAAGAGCGCCCTGGAGAAGGGTCAGATCCAAGGGTCGAGTCAGCTGTCGAACACTGTAGAAGTGGAGCTGATACACTCCGGTCACAAGTACAAAGTTCAAGCAACTAAGTCTGGCCCCACCTCGTACTTCCTGGCTATGAACGGCAGCTTCAAGGAGCTTGAGGTTCATAAGCTAACTGACGGCGGTACCTTACTGTCAATCGACGGAGCGTCATACACGACGTATTTAAAGGAGGAGGTTGACAAGTACAGGATAGTGATCGGCAACCAAACCGTGGTCTTCGAGAAGGAGAAGGATCCGTCCAAGCTGAGAGCCCCCTCCGCTGGGAAGCTGGTCAACACGCTGGTTGAAGATGGAGGGCATGTGGACAAGGGTCAGCCTTACGCTGAGATAGAGGTGATGAAAATGGTGATGACTCTATCAGCGCCAGAATCGGGCAAGGTCTTCTGGAATCTAAGACCTGGTGCTGTGCTGGATATGGGAGCGTTGATGGGAACACTAGAGTTGGACGATCCGTCGCTAGTGACCACAGCCCAGCCCTACAAGGGCCAGTTCCCTCACGAGGACAACCCCAACCTGACGGAGAAACTAAACCACGCCCACAACAAATATAGATATATACTGGAGAACACACTTGCTGGTTACTGTCTGCCCGAGCCGTACAACACTCCTCGTCTCCGGGAGGTGGTGGAGAAATTCATGTCCAGTCTGAGAGACCCCTCGCTGCCCTTACTGGAGTTACAGGAGGTGTTATCATCAACGTCCGGTCGTATCCCGGTCTCCGTGGAGAAGAAGGTCCGCAAGCTGATGGCGCTCTACGAGAGCAACATCACGAGTGTGCTGGCACAGTTCCCCAGCCAGCAGATAGCCAGCGTCATTGACCATCACGCGGCGTCCCTCGCGAAGCGCGCTGACAGGGACGTGTTCTTCATGAGCACGCAGGCCTTGGTGGTGCTAGTGCAGCGGTACAGGAACGGTATCCGCGGTAGGATGAAGGCGGCCGTTCATGATCTCTTGAAGCAGTACTACCAGGTCGAGAGCAACTTCCAACTGGGTTCGTACGACAAATGTGTGATGGCGCTGCGGGATCGTTACAAGGACGACATGCAGGCCGTGGCTAACATCATCTTCTCACACAACCAAGTGGCTAAGAAGAATCTCCTGGTGACGATGTTGATAGACCACCTGTGGTCTAATGAACCAGGTCTCACCGATGAACTGGCGGCCACCCTCAACGAACTGACCTCGTTACACCGAGCCGAACACAGTCGGGTAGCGCTGAGAGCCAGACAGGTGCTGATCGCCGCCCACCAGCCCGCCTACGAGTTGCGGCACAATCAAATGGAGTCTATCTTTTTGTCAGCGGTCGATATGTACGGACACGACTTCCATCCGGAGAACCTTCAGAAGCTGATCCTCTCGGAGACTTCGATATTCGACATCCTTCACGATTTCTTCTATCACACGAACGCGGCGGTGAGTAGACCGAGTAGACCATAG

Protein sequence:

>DPOGS215498-PA
MLKRRSSKRFVLGDTSVDEGEWADEATDVMPAHVVQKFQVTINSEPEGNVKNEALEVDKRLLGPPTASTLQPSMSQGTVIHSQRFQEKDFTVATPEEFVRRFQGTKAINKVLIANNGIGAVKCMRSIRRWSYEMFKNERAVRFVVMVTPEDLKANAEYIKMADHYVPVPGGSNNNNYANVELIVDIAVRTQVQAVWAGWGHASENPKLPELLHRAGVVFIGPPEKAMWALGDKIASSIVAQTADIPTLAYIYIYIYIYIYIYIYLYLFIIILEKRFPNFFFELKAEYNTKKIKISSELFARGCVSTPEQGLQAANKIGFPVMIKASEGGGGKGIRKVEVPDDFSNMFRQVQAEVPGSPIFVMKLARSARHLEVQLLADQYGNAISLFGRDCSIQRRHQKIIEEAPAAVAKPEVFIEMEKAAVRLAKMVGYVSAGTVEYLYEPITGAYYFLELNPRLQVEHPCTEMVADVNLPAAQLQIAMGLPLYHIKDIRLLYGESPWGMTQIDFDEPKQRPSPWGHVIAARITSENPDEGFKPSSGTVQELNFRSSKNVWGYFSVAASGGLHEFADSQFGHCFSFGETREQARENLVIALKELSIRGDFRTTVEYLITLLETPAFQDNNIDTSWLDALIAERVQSEKPDVMLGVICGSILIADNLITAHLLEFKSALEKGQIQGSSQLSNTVEVELIHSGHKYKVQATKSGPTSYFLAMNGSFKELEVHKLTDGGTLLSIDGASYTTYLKEEVDKYRIVIGNQTVVFEKEKDPSKLRAPSAGKLVNTLVEDGGHVDKGQPYAEIEVMKMVMTLSAPESGKVFWNLRPGAVLDMGALMGTLELDDPSLVTTAQPYKGQFPHEDNPNLTEKLNHAHNKYRYILENTLAGYCLPEPYNTPRLREVVEKFMSSLRDPSLPLLELQEVLSSTSGRIPVSVEKKVRKLMALYESNITSVLAQFPSQQIASVIDHHAASLAKRADRDVFFMSTQALVVLVQRYRNGIRGRMKAAVHDLLKQYYQVESNFQLGSYDKCVMALRDRYKDDMQAVANIIFSHNQVAKKNLLVTMLIDHLWSNEPGLTDELAATLNELTSLHRAEHSRVALRARQVLIAAHQPAYELRHNQMESIFLSAVDMYGHDFHPENLQKLILSETSIFDILHDFFYHTNAAVSRPSRP-