Monarch geneset OGS2.0

DPOGS208650
TranscriptDPOGS208650-TA1371 bp
ProteinDPOGS208650-PA456 aa
Genomic positionDPSCF300281 - 8382-13036
RNAseq coverage7x (Rank: top 86%)
Annotation
HeliconiusHMEL0033380.093.86% 
BombyxBGIBMGA007783-TA0.091.06% 
Drosophilaskap-PD6e-17165.93% 
EBI UniRef50UniRef50_Q95U389e-16965.93%GH10480p n=43 Tax=cellular organisms RepID=Q95U38_DROME
NCBI RefSeqXP_970725.10.071.33%PREDICTED: similar to AGAP004744-PA [Tribolium castaneum]
NCBI nr blastpgi|910820370.071.33%PREDICTED: similar to AGAP004744-PA [Tribolium castaneum]
NCBI nr blastxgi|910820378e-17571.33%PREDICTED: similar to AGAP004744-PA [Tribolium castaneum]
Group
Gene OntologyGO:00081521.5e-261metabolic process
GO:00038241.5e-261catalytic activity
GO:00055242.1e-49ATP binding
GO:00168742.1e-49ligase activity
KEGG pathwaytca:6593150.0 
 K01900 (LSC2)maps-> Citrate cycle (TCA cycle)
    Propanoate metabolism
InterPro domain[35-438] IPR0058091.5e-261Succinyl-CoA synthetase, beta subunit
[287-431] IPR0161022.7e-70Succinyl-CoA synthetase-like
[42-248] IPR0136501.5e-65ATP-grasp fold, succinyl-CoA synthetase-type
[141-286] IPR0138162.1e-49ATP-grasp fold, subdomain 2
[309-428] IPR0058112.1e-26ATP-citrate lyase/succinyl-CoA ligase
[63-140] IPR0138153.6e-23ATP-grasp fold, subdomain 1
Orthology groupMCL13761 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208650-TA
ATGGCGAATTTACTACCTCGTTCCTTTGGTCTTGCCGAAACAATTTTTGTTAAAAATGGGACGAAGTTGTTGGCAGCAACAACATCCCACAATCTACCAAATAATCAACAAGTTAGGCATTTGAATGTCCATGAATATGTGAGCTACACACTTCTTAGGGACAATGGCATTCCTGTGCCTAAATTTAACATTGCCAAGACCCAGGAAGAGGCAGTCAAGTATGCACAAGAACTGAACACCAAAGATATTGTACTAAAGGCTCAGGTTCTGGCTGGAGGTAGAGGAAAAGGTTCATTCAAAAATGGGCTGAAAGGTGGTGTTAGAATGGTTGACAAACCTGAGGTGGCTGGAGATATTGTCGGCAAGATGCTCAAGCAGTACTTGGTGACGAAGCAGACAGGTGCAGCAGGTCGTATTTGTAACATGGTCATGGTTACCGAGAGGAAGTTCCCACGCAGAGAATTCTATGTAGCCATCATGATGGAAAGGAGTTTTAACGGTCCGGTAATCATTGCATCATCTCAAGGAGGTGTCAACATTGAGGATGTGGCTGCCGAGAATCCCGATGCCATCACATATGAACCCATAGATATTGTCACCGGCATCACTGATGAACAAATATCCCGTGTTATTAACAAGATAGGACTTCAAGAACATGCTCCAGCGGCTAGTGATATGATGAAGAAAATGTATGACTTGTTCTGTAAGAAAGATGCTTTACTGATTGAAGTCAACCCTTATGCCGAGGACGCTCTAACAGGACAATTCTTCTGCCTGGATGCTAAGTTTAGATTTGACGACAACGCTCAGTTTAGACAGAAGGAACTCTTCAAGCTTAGAGATATTTCCCAAGAAGACCCCAAAGAGATTGAGGCAGCCAAGTTTGACTTGAACTACATCGCTCTTGACGGTAGTATCGGTTGTATGGTGAATGGAGCTGGTCTTGCTATGGCCACTATGGATATCATCAAACTATACGGAGGGGACCCAGCCAACTTCCTCGACGTTGGAGGAGGCGCCACCGCTCAGGCCGTCTCGGAGGCATTCAAGATTATCTTATCAGACCCAAAGGTGTCTGCTATCTTAGTGAACATCTTCGGAGGAATTATGCGCTGTGACGTCATCGCCGAGGGTATCATCAACGCTGCGAAGAACCTCAACATCCAAATACCAGTGATTGTACGACTTCAGGGTACCAAAGTCAATGAGGCTCGCAAGTTGATCGCGGATTCGGGTCTGCGCATTGTACCCAGAGACGACCTGGACGAGGCGGCCCAGCTCGCCGTACAACTCTCCGAGATCGTGTCCCTCGCCAAGAAAGCCGGAGTCGAGGTCAAATTCGATATCCCTAAATACATGTTAGAAAAGTAA

Protein sequence:

>DPOGS208650-PA
MANLLPRSFGLAETIFVKNGTKLLAATTSHNLPNNQQVRHLNVHEYVSYTLLRDNGIPVPKFNIAKTQEEAVKYAQELNTKDIVLKAQVLAGGRGKGSFKNGLKGGVRMVDKPEVAGDIVGKMLKQYLVTKQTGAAGRICNMVMVTERKFPRREFYVAIMMERSFNGPVIIASSQGGVNIEDVAAENPDAITYEPIDIVTGITDEQISRVINKIGLQEHAPAASDMMKKMYDLFCKKDALLIEVNPYAEDALTGQFFCLDAKFRFDDNAQFRQKELFKLRDISQEDPKEIEAAKFDLNYIALDGSIGCMVNGAGLAMATMDIIKLYGGDPANFLDVGGGATAQAVSEAFKIILSDPKVSAILVNIFGGIMRCDVIAEGIINAAKNLNIQIPVIVRLQGTKVNEARKLIADSGLRIVPRDDLDEAAQLAVQLSEIVSLAKKAGVEVKFDIPKYMLEK-