Monarch geneset OGS2.0

DPOGS200904
TranscriptDPOGS200904-TA1398 bp
ProteinDPOGS200904-PA465 aa
Genomic positionDPSCF300066 + 100873-113750
RNAseq coverage5270x (Rank: top 2%)
Annotation
HeliconiusHMEL0134000.097.20% 
BombyxBGIBMGA000672-TA0.094.62% 
Drosophilakdn-PB0.077.26% 
EBI UniRef50UniRef50_O753900.075.00%Citrate synthase, mitochondrial n=251 Tax=cellular organisms RepID=CISY_HUMAN
NCBI RefSeqXP_970124.10.080.56%PREDICTED: similar to citrate synthase [Tribolium castaneum]
NCBI nr blastpgi|3323738600.081.43%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323738600.081.43%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00469129.9e-277transferase activity, transferring acyl groups, acyl groups converted into alkyl on transfer
GO:00442629.9e-277cellular carbohydrate metabolic process
GO:00041085.9e-245citrate (Si)-synthase activity
KEGG pathwaytca:6586670.0 
 K01647 (CS, gltA)maps-> Citrate cycle (TCA cycle)
    Glyoxylate and dicarboxylate metabolism
InterPro domain[14-461] IPR0020209.9e-277Citrate synthase-like
[33-458] IPR0101095.9e-245Citrate synthase, eukaryotic
[30-462] IPR0161412.2e-150Citrate synthase-like, core
[33-353] IPR0161422.3e-132Citrate synthase-like, large alpha subdomain
Orthology groupMCL11457 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200904-TA
ATGGCTCTATTCAGGATCACATCATCAAAACTTATTGATATACAGAAAACATGCCCAACAGCGACGATACTGCTGCGAAACTTAAGCGCGGAACAGACGAATCTTAAAAGTATCCTACAGGAGAAGATCCCCAAAGAACAAGAGAAGATCAAAGAATTCCGTAAGAAGCATGGGGCCACCAAAGTTGGCGAAGTCACCGTCGACATGATGTACGGTGGCATGAGAGGCATCAAGGGTCTTGTTTGGGAGACGTCTGTATTGGATCCTGATGAGGGCATCCGTTTCCGTGGCCTCTCAATCCCCGAGTGCCAACAGCAGCTGCCAAAAGCCAAGGGCGGAGAGGAACCCTTACCCGAAGGTCTGTTCTGGCTTCTGGTGACCGGTGAAATACCAACCGAGGCTCAAGTGAAGGCTATTTCCAAGGAATGGGCTCAAAGAGCTGAACTTCCCGCTCACGTGGTGACCATGTTGAACAACATGCCCAGCAAGTTGCATCCGATGTCACAGTTCTCAGCAGCCGTTACAGCCCTCAACAGCGAGTCGAAGTTCGCGCAGGCCTACTCCGAGGGTGTTCACAAGTCCAAGTATTGGGAGTATGTGTACGAGGACTCTATGAATCTGATCGCTAAGCTGCCGGTTATAGCCGCTACTATCTACCGCAACACCTATCGCGATGGTAAGGGCATTGGAGCGATCGATGACAACAAGGACTGGTCAGCCAACTACTGCACCATGTTGGGCTTCGACGATCCCCAGTTCACAGAGCTCATGAGGCTATACCTCACCATCCACAGCGACCACGAAGGCGGTAACGTGTCTGCTCACACCACCCACCTCGTGGGCTCGGCTCTAAGTGACCCCTACCTTTCATTCGCTGCTGGTCTCAACGGTCTCGCGGGACCTCTCCACGGACTCGCCAACCAGGAGGTATTGATCTGGTTGGAGAAACTGCGCAAGCAGGTCGGTGATAACTTCACGGAGGAAGGCCTCAAGGAGTTCATTTGGAAGACACTTAAATCCGGTCAAGTCGTGCCCGGATACGGTCACGCCGTGCTTAGGAAGACTGATCCCAGATACACCTGCCAACGCGAATTCGCCCTCAAGCACCTTCCCAACGACCCCCTGTTCAAGCTTGTGGCAGCGGTGTACAAAGTCGTTCCTCCAATCCTAACCGAGCTCGGCAAGGTCAAGAACCCGTGGCCAAATGTCGACTCACACTCTGGTGTGCTCTTACAGTATTACGGTCTTAAGGAGATGAATTACTACACGGTGATGTTCGGTGTGTCTCGTGCTTTGGGTGTGTTGGCTCAGCTGATCTGGTCCCGTGCTCTCGGTCTGCCCATAGAACGCCCCAAATCCCTCAGCACCGACCTCCTCATCAAACAGATCGGCAAGTAA

Protein sequence:

>DPOGS200904-PA
MALFRITSSKLIDIQKTCPTATILLRNLSAEQTNLKSILQEKIPKEQEKIKEFRKKHGATKVGEVTVDMMYGGMRGIKGLVWETSVLDPDEGIRFRGLSIPECQQQLPKAKGGEEPLPEGLFWLLVTGEIPTEAQVKAISKEWAQRAELPAHVVTMLNNMPSKLHPMSQFSAAVTALNSESKFAQAYSEGVHKSKYWEYVYEDSMNLIAKLPVIAATIYRNTYRDGKGIGAIDDNKDWSANYCTMLGFDDPQFTELMRLYLTIHSDHEGGNVSAHTTHLVGSALSDPYLSFAAGLNGLAGPLHGLANQEVLIWLEKLRKQVGDNFTEEGLKEFIWKTLKSGQVVPGYGHAVLRKTDPRYTCQREFALKHLPNDPLFKLVAAVYKVVPPILTELGKVKNPWPNVDSHSGVLLQYYGLKEMNYYTVMFGVSRALGVLAQLIWSRALGLPIERPKSLSTDLLIKQIGK-