Monarch geneset OGS2.0

DPOGS200795
Transcript: DPOGS200795-TA, 1173 bp
Protein: DPOGS200795-PA, 390 aa
Genomic position: DPSCF300454 - 42913-54804
RNAseq coverage: 760x (Rank: top 17%)
Annotation
Heliconius: HMEL016955; 0.0; 79.29%
Bombyx: BGIBMGA014217-TA; 8e-133; 89.96%
Drosophila: CG7920-PA; 1e-117; 62.39%
EBI UniRef50: UniRef50_Q9VAC1; 2e-115; 62.39%; CG7920, isoform A n=63 Tax=cellular organisms RepID=Q9VAC1_DROME
NCBI RefSeq: XP_313680.3; 7e-126; 69.23%; AGAP004396-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|118784349; 1e-124; 69.23%; AGAP004396-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|118784349; 1e-120; 69.23%; AGAP004396-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0006084; 3.6e-188; acetyl-CoA metabolic process
  GO:0003824; 3.6e-188; catalytic activity
KEGG pathway: cch:Cag_0060; 3e-79
  K01067 (E3.1.2.1, ACH1) maps-> Pyruvate metabolism
InterPro domain: [14-390] IPR003702; 3.6e-188; Acetyl-CoA hydrolase/transferase
Orthology group: MCL12583; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200795-TA
ATGTCTGTTATGGGCAAAATTATCGTGCCTAATGTATGCCGTCGTAATTTTATTAATATAAGCGCCTCATTGCAATCTGTGAGATCAAATCGAAGTTACTTTACATACACTCAAGAATTGTCTCAGCCCTTAGACCGAAAACCCGAATTTGTTAGTGCCAAGGAAGCTTTTGAAAAATGTTTGAAGTCTGGTCACACTGTTTTCGCTCAGGGCGCAGCTGCGACCCCTGTCCCTCTTCTGAACGCCATGACCGATGTTGGCAAGGCCGGTTCACTGCGGGACATAAAGGTTGTGCATATGCATACTGAGAGGGATGCACCATATGTTGCTCCGGAATGCAAGGATATTTTCAGGTCCGTTTCCCTGTTCATGGCGGCAAATGTTCGCCAATCAGTCGCCGAGGGTCGTTCGGACGCCATTCCCATCTTCCTGCAAGACATACCGAAGCTGTTCCACAGGAAGATCATAAGGCCAGACATCGCTGTCATACAGGTTTCCCCCCCAGACCAGCACGGATACTGTAGCCTTGGTACATCCGTGGATTGTGTGAGATCTGCTCTCGTTAACTCAAAGATTATTATAGCTCAAATTAACGTGAACATGCCGCGTACGTTCGGCGACGCGATCATCCACGTGTCGCACGTGGACTACGCCGTAGAGGACAACACGCCGCTGCCGGAACACGGAGGGAAGGCCGCCACGCCCGAGGAGACGAAGATAGGCCAGCTGATCGGGGATAACCTGGTCGAGGACGGAGCTACACTACAGATGGGTATCGGTAACATACCTGATGCTGTGCTCTCCGCTCTCAAGAATCACAAAGATCTCGGGATACATTCGGAGATGTTCAGTGTGGGTGTCATTGACCTCGTGAGGAGGGGATGTGTCACCAATAACAAGAAGAAAAATCACAAAGGTCGTATCGTTGGCAGTTTCCTTGTTGGTAACAAGGAGCTTTACGACTTTGTGGATAACAATCCGTTCATAGGTGCTGGAGTAGTGACTACTAGGGCGCACGTCCATTACGTCGTAACTGAACAGGGCATAGCTTACTTGTTCGGCAAGACATTAAGGCAACGCGCATACGAGCTCATTAAAATAGCTCACCCGGATCATCGCGAAGCGCTAGAGAAGGCGGCCTTCGAGCGACTGAAATGCATGCCTGCGCCTTAA

Protein sequence:

>DPOGS200795-PA
MSVMGKIIVPNVCRRNFINISASLQSVRSNRSYFTYTQELSQPLDRKPEFVSAKEAFEKCLKSGHTVFAQGAAATPVPLLNAMTDVGKAGSLRDIKVVHMHTERDAPYVAPECKDIFRSVSLFMAANVRQSVAEGRSDAIPIFLQDIPKLFHRKIIRPDIAVIQVSPPDQHGYCSLGTSVDCVRSALVNSKIIIAQINVNMPRTFGDAIIHVSHVDYAVEDNTPLPEHGGKAATPEETKIGQLIGDNLVEDGATLQMGIGNIPDAVLSALKNHKDLGIHSEMFSVGVIDLVRRGCVTNNKKKNHKGRIVGSFLVGNKELYDFVDNNPFIGAGVVTTRAHVHYVVTEQGIAYLFGKTLRQRAYELIKIAHPDHREALEKAAFERLKCMPAP-
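
The lengths listed in this record are consistent with each other: 1173 bp is 391 codons, that is, 390 amino acids plus one stop codon, which the protein sequence shows as a trailing "-". The following is a minimal Python sketch of that check, assuming the standard genetic code applies to this transcript. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of the geneset distribution.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: this table is the right
# one for the transcript in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; the record above uses "-".
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Amino-acid count for a CDS that ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # First 15 bases of DPOGS200795-TA, copied from the record above.
    print(translate("ATGTCTGTTATGGGC"))   # MSVMG
    # Last 12 bases of DPOGS200795-TA, ending in the TAA stop codon.
    print(translate("CCTGCGCCTTAA"))      # PAP-
    # Transcript length from the record.
    print(expected_protein_length(1173))  # 390
```

Passing the full DPOGS200795-TA sequence to `translate` and comparing the result with the DPOGS200795-PA entry would extend the same check to every codon, under the same standard-code assumption.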