Monarch geneset OGS2.0

DPOGS209347
Transcript        DPOGS209347-TA    1425 bp
Protein           DPOGS209347-PA    474 aa
Genomic position  DPSCF300336 + 208422-217172
RNAseq coverage   5273x (Rank: top 2%)
Annotation
Heliconius        HMEL013249         0.0      81.65%
Bombyx            BGIBMGA014446-TA   5e-136   90.69%
Drosophila        ATPCL-PD           7e-167   56.97%
EBI UniRef50      UniRef50_Q6AWP8    1e-164   56.97%   RE70805p n=33 Tax=cellular organisms RepID=Q6AWP8_DROME
NCBI RefSeq       XP_001808341.1     2e-173   59.34%   PREDICTED: similar to ATP-citrate synthase [Tribolium castaneum]
NCBI nr blastp    gi|270008566       1e-172   59.92%   hypothetical protein TcasGA2_TC015096 [Tribolium castaneum]
NCBI nr blastx    gi|270008566       2e-167   59.92%   hypothetical protein TcasGA2_TC015096 [Tribolium castaneum]
Group
Gene Ontology     GO:0005524    6.2e-11   ATP binding
                  GO:0016874    6.2e-11   ligase activity
KEGG pathway      tca:656728    4e-173
                  K01648 (ACLY) maps-> Citrate cycle (TCA cycle)
                                       Reductive carboxylate cycle (CO2 fixation)
InterPro domain   [187-352]   IPR016102   5.2e-20   Succinyl-CoA synthetase-like
                  [115-166]   IPR013816   6.2e-11   ATP-grasp fold, subdomain 2
Orthology group   MCL11576    Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209347-TA
ATGAAAACTGTTACGTCATCGTATGCAGTGACCCAGTACGTGAAGCAACTAGTGGTGAAGCCCGACCAGCTGATCAAGAGACGAGGCAAGTTGGGACTGGTTGGTGTGAATAAAACGGCGGCGGAGGTCAGGCGCTGGCTGGCCGAGCATGACTCCAAGGAGCAGAAGGTCGGAGCGGCCGCGGGGAAGCTCAGGAGATTCGTGGTGGAACCGTTCGTGAAGCACGATCCGAGCGAGGAGATGTACCTGTGCATCCAATCAGGCCGACGAGCGGACACCATCATGTTCCATCACCAGGGCGGAGTGGACGTGGGTGATGTGGACGCGCTGGCTCTCAGGATGGATTTCATCGTGAGCCTGTACCGCGTGTTCGTGAACCTGTACTTCACGTACATGGAGATCAACCCGGTGGTGGTGACCAACGAGCGAGTCTACCTCCTTGACCTGGCCGCCAAGTTGGATCAGACGGCGGATTTCATATGCGCCAAGAACTGGGGGGAGATCACCTTCCCCCCGCCCTTCGGCAGGGACGCGTACCCCGAGGAAGCTCATATAGCTGATCTTGACGCTAAGAGCGGGGCTAGCTTGAAGCTGACGGTGCTGAACAAGTCCGGGCGTATCTGGACGATGGTGGCGGGCGGCGGGGCGTCCGTGGTGTACACGGACACGGTCTGCGCCCTGGGCGGGGCGGCCGAGCTCGCCAACTACGGCGAGTACTCCGGGGCGCCCACCGAGAGCCAGACAGCCGACTACGCCAAGACCATATTCAGTCTCATGTGCAGAGAGAAGCATCCCAAGGGCAAGGTCCTGATTATCGGCGGCGGCATCGCGAACTTCACGAACGTGGCGGACACCTTCCGCGGCATCATCACCGCCATCGAGACGTACCGGGACGCTCTGCTTCAGTACAACGTCACCATCTTCGTGAGGCGGGGCGGCCCCAACTACCAGGAAGGGCTGAGACAAATGCGTGAAGTGGGGCAGCGTCTCCGTATCCCCATGTACGTGTTCGGTCCGGAGAGCAACATGACCGCCATCGTGAGGCTGGCTCTGGGACACGCGGTCATACCCAGCGACCACCAGCTCGACTACGCCCCGAAACAGCTGCCCAAGCCGGACACCGCTCCGTCCCCGCAGATCGAGCTTCCAGAGCTGAGCCCGTCCCTGCTGGAGCTGGTGTGCTCGCAGGCGCCCACCAGGAGCGACCTCGGCCAGCAGCTGTCCACAGCCAGGCCGCTGTTCAGTGACAGGACCAAGGCTATAGTGTGGGGAATGCAGAACAGAGCTATACAGGGTATGCTCGACTTCGACTACGTGTGTCGCCGCAGTGAGCCGTCAGTGGTGGCCATAGTGTACCCCTTCACCGCTGACCACAAACAGAAGTACTACTTCGGCACTAAGGAACACAAATATGGGCCACTCTAA

Protein sequence:

>DPOGS209347-PA
MKTVTSSYAVTQYVKQLVVKPDQLIKRRGKLGLVGVNKTAAEVRRWLAEHDSKEQKVGAAAGKLRRFVVEPFVKHDPSEEMYLCIQSGRRADTIMFHHQGGVDVGDVDALALRMDFIVSLYRVFVNLYFTYMEINPVVVTNERVYLLDLAAKLDQTADFICAKNWGEITFPPPFGRDAYPEEAHIADLDAKSGASLKLTVLNKSGRIWTMVAGGGASVVYTDTVCALGGAAELANYGEYSGAPTESQTADYAKTIFSLMCREKHPKGKVLIIGGGIANFTNVADTFRGIITAIETYRDALLQYNVTIFVRRGGPNYQEGLRQMREVGQRLRIPMYVFGPESNMTAIVRLALGHAVIPSDHQLDYAPKQLPKPDTAPSPQIELPELSPSLLELVCSQAPTRSDLGQQLSTARPLFSDRTKAIVWGMQNRAIQGMLDFDYVCRRSEPSVVAIVYPFTADHKQKYYFGTKEHKYGPL-