Monarch geneset OGS2.0

DPOGS209348
Transcript        DPOGS209348-TA  2022 bp
Protein           DPOGS209348-PA  673 aa
Genomic position  DPSCF300336 + 217758-224591
RNAseq coverage   5134x (Rank: top 2%)
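The lengths and coordinates reported above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the transcript is coding sequence that includes one stop codon, and that the genomic coordinates are 1-based and inclusive. Neither convention is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 2022
protein_aa = 673
start, end = 217758, 224591

# Assumption: the transcript is CDS only and ends with one stop codon,
# so its length should be (residues + 1 stop) * 3 bases.
expected_cds_bp = (protein_aa + 1) * 3

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = end - start + 1

print(expected_cds_bp == transcript_bp)   # True
print(genomic_span_bp)                    # 6834
print(genomic_span_bp - transcript_bp)    # 4812 bp not present in the transcript
```

Under these assumptions the transcript and protein lengths agree, and the remaining 4812 bp of the genomic span would correspond to sequence absent from the transcript, such as introns.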
Annotation
Heliconius       HMEL013249        6e-160  88.82%
Bombyx           BGIBMGA014294-TA  9e-161  89.00%
Drosophila       ATPCL-PD          0.0     63.09%
EBI UniRef50     UniRef50_Q6AWP8   0.0     62.91%  RE70805p n=33 Tax=cellular organisms RepID=Q6AWP8_DROME
NCBI RefSeq      XP_001842482.1    0.0     63.45%  ATP-citrate synthase [Culex quinquefasciatus]
NCBI nr blastp   gi|312370903      0.0     62.91%  hypothetical protein AND_22915 [Anopheles darlingi]
NCBI nr blastx   gi|289724570      0.0     63.45%  ATP-citrate lyase [Glossina morsitans morsitans]
Group
Gene Ontology
GO:0046912  9.3e-58  transferase activity, transferring acyl groups, acyl groups converted into alkyl on transfer
GO:0044262  9.3e-58  cellular carbohydrate metabolic process
GO:0008152  2.1e-16  metabolic process
GO:0003824  2.1e-16  catalytic activity
GO:0005488  2.4e-06  binding
KEGG pathway  cqu:CpipJ_CPIJ000859  0.0
              K01648 (ACLY) maps-> Citrate cycle (TCA cycle)
                                   Reductive carboxylate cycle (CO2 fixation)
InterPro domain
[111-549]  IPR016141  9.3e-58  Citrate synthase-like, core
[66-244]   IPR016102  5.9e-56  Succinyl-CoA synthetase-like
[105-230]  IPR005811  2.1e-16  ATP-citrate lyase/succinyl-CoA ligase
[394-538]  IPR002020  9.8e-15  Citrate synthase-like
[443-539]  IPR016143  8.9e-11  Citrate synthase-like, small alpha subdomain
[28-45]    IPR005810  2.3e-06  Succinyl-CoA ligase, alpha subunit
[7-65]     IPR016040  2.4e-06  NAD(P)-binding domain
Orthology group  MCL11576  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209348-TA
ATGGATGTAGCTATGAGGAAACACCAAGAGGCGACCGTCCTTGTTAACTTCGCGTCCCTCCGGTCAGCTTACGACAGCACCATGCAGGCGATGCAGCATCCACAGATCAACACAGTCGTTATTATAGCGGAGGGCATACCGGAGAATATGACGAGGAAGATAATCAAGCTAGCCGACGACAGAGGGGTGAATATAATCGGTCCGGCCACAGTGGGTGGTATCAAACCCGGATGTTTCAAGATTGGCAACACTGCCGGCATGATCGACAACATTATAGGCAGCAAGTTGTACAGACCTGGCAGCGTGGCGTACGTGTCCCGTTCCGGCGGCATGAGTAACGAGCTGAACAACATCATATCCAAGGAGGCGGACGGCGTGTGCGAGGGAGTCGCCATAGGGGGCGACAGATACCCGGGGACCACCTTCATAGACCACCTGATGAGGTACGAGGCGGATCCTAACGTGAAGATGCTGGTGCTCCTGGGCGAGGTGGGCGGAGTTGAGGAGTACCACGTGTGCCGCGCCATCAAGGACGGGAGGATCACGAAGCCGCTCGTCGCCTGGTGTATAGGGACGTGCTCCGACATGTTCACGTCTGAGGTACAGTTCGGGCACGCGGGTTCGCTCGCCGGCTCGGCTCTAGAGAAGGCCGCTGCTAAGAATAAGGCTTTGGCCAAACACGGGGCGGTGGTGCCAGACTCGTTTGACACTCTGGGGGCGGCCATTAATAAGCATGTGAGTAATAAATTTTCTGTATCATCTTCAGACATGTTCACGTCTGAGGTACAGTTCGGGCACGCGGGTTCGCTCGCCGGCTCGGCTCTAGAGAAGGCCGCTGCTAAGAATAAGGCTTTGGCCAAACACGGGGCGGTGGTGCCAGACTCGTTTGACACTCTGGGGGCGGCCATTAATAAGGTTTACAAGAAGCTCGTCTCCGAAGGCAAGATAATTGAGAAGGAAGAGGTCGGCCCACCAAAAGTACCCATGGACTACGACTGGGCTCGGAAACTCGGTATAATTCGTAAGCCGGCAGCGTTTGTGAGCACTATATGCGACGAGCGAGGCCAGGAGTTGAGCTACTGCGGCGTGCCCATAACGTCCATACTGGAGAAACAGATGGGGGTCGGTGGTACCATCAGCCTGCTGTGGTTCCAGCGCGAGTTGCCGGACTGGGCGTGCAAGTTCTTCGAGCTGGTGCTGATAGTGACGGCCGACCACGGGCCCGCTGTCTCCGGGGCTCACAATACGATGGTCACCGCGCGGGCCGGCAAGGACCTTATATCGTCTGTGGTCTCCGGGCTGCTTACTATCGGAGATCGTTTCGGCGGTGCCCTGGACAGAGCTGCCGCGGATTTCTGTGCTGCTTACGATCGAGGGCAGCACCCCCAGGAGTTTGTTAACGAGAAACGTGCCAAGGGTGAACTTATTATGGGGATCGGACATCGCGTGAAGTCTATCAACAACCCAGACTCGCGCGTTCGCGAGCTGAAGGCGTATGTGACGTCACGTTGGCCCGCGTGGCCGGTGACGAGGTACGCCCTGGACGTGGAGGCGATCACCACGAGGAAGAAGCCCAACCTGATCCTGAACGTGGACGGCATAGTGGCGGCCGCCATGGTGGACCTGTTCCGACACTGCCAGCTGTTCTCGCACGTGAAGTCTATCAACAACCCAGACTCGCGCGTTCGCGAGCTGAAGGCGTATGTGACGTCACGCTGGCCCGCGTGGCCGGTGACGAGGTACGCCCTGGACGTGGAGGCGATCACCACGAGGAAGAAGCCCAACCTGATCCTGAACGTGGACGGCATAGTGGCGGCCGCCATGGTGGACCTGTTCCGACACTGCCAGCTCTTCTCGCAGGAGGAAGGTAACAGCTACATCAGTATGGGCTCTATAAACGCGTTGTTCGTGCTCGGCCGCACTATCGGCCTGGTCGGACACTACCTGGACCAGAAGCGTCTCAAGCAGCCTCTGTACCGTCATCCCTGGGACGACATCAC
CTACATGTCCCCTCTCAACTAA

Protein sequence:

>DPOGS209348-PA
MDVAMRKHQEATVLVNFASLRSAYDSTMQAMQHPQINTVVIIAEGIPENMTRKIIKLADDRGVNIIGPATVGGIKPGCFKIGNTAGMIDNIIGSKLYRPGSVAYVSRSGGMSNELNNIISKEADGVCEGVAIGGDRYPGTTFIDHLMRYEADPNVKMLVLLGEVGGVEEYHVCRAIKDGRITKPLVAWCIGTCSDMFTSEVQFGHAGSLAGSALEKAAAKNKALAKHGAVVPDSFDTLGAAINKHVSNKFSVSSSDMFTSEVQFGHAGSLAGSALEKAAAKNKALAKHGAVVPDSFDTLGAAINKVYKKLVSEGKIIEKEEVGPPKVPMDYDWARKLGIIRKPAAFVSTICDERGQELSYCGVPITSILEKQMGVGGTISLLWFQRELPDWACKFFELVLIVTADHGPAVSGAHNTMVTARAGKDLISSVVSGLLTIGDRFGGALDRAAADFCAAYDRGQHPQEFVNEKRAKGELIMGIGHRVKSINNPDSRVRELKAYVTSRWPAWPVTRYALDVEAITTRKKPNLILNVDGIVAAAMVDLFRHCQLFSHVKSINNPDSRVRELKAYVTSRWPAWPVTRYALDVEAITTRKKPNLILNVDGIVAAAMVDLFRHCQLFSQEEGNSYISMGSINALFVLGRTIGLVGHYLDQKRLKQPLYRHPWDDITYMSPLN-
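
The protein record above can be compared against a conceptual translation of the nucleotide record. The following is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon; the function name `translate` and its `stop_symbol` parameter are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Whitespace (for example FASTA line wraps) is removed first.
    Unknown codons are rendered as "X"; stop codons as `stop_symbol`.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons of DPOGS209348-TA.
print(translate("ATGGATGTAGCTATGAGG"))  # MDVAMR
# Last four codons of DPOGS209348-TA, including the stop codon.
print(translate("CCTCTCAACTAA"))        # PLN-
```

Passing the full DPOGS209348-TA sequence, line wrap included, and comparing the result with DPOGS209348-PA would be one way to confirm that the two records are consistent.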