Monarch geneset OGS2.0

DPOGS206130
TranscriptDPOGS206130-TA1920 bp
ProteinDPOGS206130-PA639 aa
Genomic positionDPSCF300028 + 1053711-1063972
RNAseq coverage433x (Rank: top 28%)
Annotation
HeliconiusHMEL0028239e-16183.23% 
BombyxBGIBMGA000721-TA0.067.35% 
DrosophilaAcCoAS-PA0.070.16% 
EBI UniRef50UniRef50_Q9VP610.070.31%Acetyl-coenzyme A synthetase n=63 Tax=Eukaryota RepID=ACSA_DROME
NCBI RefSeqXP_316594.20.070.74%AGAP006569-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583888620.070.74%AGAP006569-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|583888620.070.85%AGAP006569-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00039871.4e-269acetate-CoA ligase activity
GO:00162081.4e-269AMP binding
GO:00081528.5e-104metabolic process
GO:00038248.5e-104catalytic activity
KEGG pathwayaga:AgaP_AGAP0065690.0 
 K01895 (ACSS, acs)maps-> Glycolysis / Gluconeogenesis
    Propanoate metabolism
    Reductive carboxylate cycle (CO2 fixation)
    Pyruvate metabolism
InterPro domain[1-635] IPR0119041.4e-269Acetate-CoA ligase
[88-541] IPR0008738.5e-104AMP-dependent synthetase/ligase
Orthology groupMCL12656 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206130-TA
ATGCATAAAAAATCCCTTGAAGACCCAGAGGCGTTTTGGTCGGAAATAGCCAAAGAATTTCACTGGCAAACCCCCTGCCAGCCGGGAAAATTTCTTTCGTACAACTTCGATATCGACAAGGGGGATATTTTTGTTAAATGGATGGAAGGGGCTACGACAAATGTTTGCTTTAACGTCCTAGATCGTAACATAAGAAATGGTCATGGGGATAAAATCGCTTATTACTGGGAAGGTAACCATCCTGATGACTACAGTCGAATCACATACAAGAAGCTATTGGATTCTGTTTGTATGTTTGCGAACGCTCTACGAGAGCTGGGGGTTCGCAAAGGAGACAGAGTAGCCATCTACATGCCCATGATTATGGAGACGGTGATTTGTATGTTGGGCTGTGCAAGAATCGGCGCAGTACATTCTGTCGTATTCGCTGGGTTTTCCTCGGATTCACTAGCGGAGCGAATGTCCGACTGCAAGGCGAAGGTCATTGTGACCTCTGACGGGGCGTGGAGAGGAGAAAAAAAGTTGTTCTTGAAGAATACTTGCGACGAAGCTATCGAGAAAGCTAGAACTAAGCACAACCATGAAGTCAACTTGTGCATCGTCGTATCCCATTTGGGGAGAGTGAAGCCGGGTGCAAGAATGAATGTTTTAAAAAAACCGTACACTTGGAATGACAACGTGGATATATGGTGGCACGAGATCATGGAAGGTCAATCACCCATCTGCGCTCCCGAGTGGATGAATGCTGAGGACCCCTTGTTCATGCTATACACTAGCGGTTCCACGGGCAAGCCGAAGGGCGTTCTACACACCATCGCTGGTTACATGCTCTACGCGGCGACAACCTTCCGATATGTATTCGATTATCGCGAGAAGGACATCTATTGGTGCACCGCTGACGTAGGCTGGATCACGGGGCACACTTACGTCGTGTACGCGCCCCTTGCGAATGCCGCTACGTCGCTTATGTTCGAAGGTACACCTTTCTACCCAGATAACGATCGCTACTGGTTGTTGGTTAAGAAGTACAAGGTTACTCAATTCTACACAGCACCCACCGCCATTAGAGCTCTCATGAAATTTGGCGACGAGCTCGTCACCAAGAACAATTTAAAAACTTTGCGTGTGTTGGGGAGCGTTGGGGAGCCTATCAACCCGGAAGCGTGGTTGTGGTTCTACAACCTAGTTGGTAATAAACGTTGCTCCATCGTGGATACTTTCTGGCAGACTGAAACCGGTGGCCACGTACTCACCGGCCTGCCAGGCGCCTCGCCTATGAAGCCTGGAGCTGCTGGGTTTCCATTCTTCGGCGTGGAACCGACACTGCTTGACGAAAGCGGCAAAGTGATCGAAGGGCCCGGCGAGGGCTACCTGGTCTTCTCGCGACCATGGCCAGGCATCATGAGGACCCTCTTCGGTGACCACGCTCGTTACCAAAAGGTCTACTTCTCTAAATTCAAAGGATATTATTGCACAGGCGATGGCGCCAGGCGTGACGAGGATGGGTTCCTGTGGGTGACGGGACGTATCGATGACATGCTGAATGTGTCCGGTCATCTGCTGTCTACTTCCGAGGTGGAAGGTGTCCTCACTGAAGAGCCCTCCGTGTCCGAAGCTGCTGTAGTCTCCAAGCCACATCCCGTCAAAGGCGAGTCTCTGTACTGCTTCGTCATCCTCAACGAGGGCGTCCAGTTCGGCCCCGAATTGGTGGACGCTCTGAAGAAACGCGTGAGGAATAGGATCGGAGCCTTTGCAGCTCCGGATGTCATTCAATACGCTCCCGGTCTGCCGAAAACCAGGTCCGGGAAGATCATGAGGAGAATCCTAAGGAAGATAGCGCTCGGTGACACCGACATCGGCGACACGTCGACTTTGGCCGACCCGTCCGTCGTCGACGAGCTCTTTAAATGCAGACCCTAG

Protein sequence:

>DPOGS206130-PA
MHKKSLEDPEAFWSEIAKEFHWQTPCQPGKFLSYNFDIDKGDIFVKWMEGATTNVCFNVLDRNIRNGHGDKIAYYWEGNHPDDYSRITYKKLLDSVCMFANALRELGVRKGDRVAIYMPMIMETVICMLGCARIGAVHSVVFAGFSSDSLAERMSDCKAKVIVTSDGAWRGEKKLFLKNTCDEAIEKARTKHNHEVNLCIVVSHLGRVKPGARMNVLKKPYTWNDNVDIWWHEIMEGQSPICAPEWMNAEDPLFMLYTSGSTGKPKGVLHTIAGYMLYAATTFRYVFDYREKDIYWCTADVGWITGHTYVVYAPLANAATSLMFEGTPFYPDNDRYWLLVKKYKVTQFYTAPTAIRALMKFGDELVTKNNLKTLRVLGSVGEPINPEAWLWFYNLVGNKRCSIVDTFWQTETGGHVLTGLPGASPMKPGAAGFPFFGVEPTLLDESGKVIEGPGEGYLVFSRPWPGIMRTLFGDHARYQKVYFSKFKGYYCTGDGARRDEDGFLWVTGRIDDMLNVSGHLLSTSEVEGVLTEEPSVSEAAVVSKPHPVKGESLYCFVILNEGVQFGPELVDALKKRVRNRIGAFAAPDVIQYAPGLPKTRSGKIMRRILRKIALGDTDIGDTSTLADPSVVDELFKCRP-