Monarch geneset OGS2.0

DPOGS210502
TranscriptDPOGS210502-TA948 bp
ProteinDPOGS210502-PA315 aa
Genomic positionDPSCF300186 - 31533-36226
RNAseq coverage732x (Rank: top 18%)
Annotation
HeliconiusHMEL0046283e-6490.70% 
BombyxBGIBMGA012588-TA1e-11868.67% 
DrosophilaTaz-PA1e-8252.11% 
EBI UniRef50UniRef50_D6WPU96e-8454.96%Putative uncharacterized protein n=7 Tax=Coelomata RepID=D6WPU9_TRICA
NCBI RefSeqXP_623345.22e-9158.30%PREDICTED: similar to tafazzin CG8766-PA, isoform A isoform 2 [Apis mellifera]
NCBI nr blastpgi|3407155974e-8854.36%PREDICTED: tafazzin homolog [Bombus terrestris]
NCBI nr blastxgi|3454975323e-8758.78%PREDICTED: tafazzin homolog [Nasonia vitripennis]
Group
Gene OntologyGO:00084154.3e-27acyltransferase activity
GO:00081524.3e-27metabolic process
KEGG pathwayame:5509489e-90 
 K13511 (TAZ)maps-> Glycerophospholipid metabolism
InterPro domain[1-252] IPR0008729.8e-117Tafazzin
[45-188] IPR0021234.3e-27Phospholipid/glycerol acyltransferase
Orthology groupMCL12556 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210502-TA
ATGGCGTACGATATCGGGTGGATCATCCCGCGGCTGCGGAACCCCGGCGTCCTGTGGAACTGCGCCAGCTCCATAACCGTGGCTGTGGTCGGTCTGTTCAGCAAGATCATCGTAGATTTCCTGAACAAGACGACGGTGTACAACCGGGAGGCGCTCCAGCGAGCCGTCCGACGGCCGCGGGACGTGCCCCTTCTCACTGTCTCCAACCACCATTCGTGTTTCGACGATCCCGGCCTCTGGGGTGTGTTGGACGTCGGCACGTTGACGCGTTACTCCCGCATGCGCTGGTCGCTGGCGGCTCACGACATCTGCTTCACAAACGCGCTACACTCCGCCTTCTTCGCGCTCGGCAAGTGTGTCCCCGTTGTGAGAGGGGCCGGAGTCTATCAGACGGCGATGGACTTCTGCGTGGACCGTCTGTGCGGCGGAGAGTGGGTGCACATCTTTCCCGAGGGTCGCGTGAACGTAGACAAACAACGTATCCGGTTCAAGTGGGGAGTGGGCCGACTGGTGATGGACAGCGCTGCCGCGGGCCGCGCGCCGCTCGTGTTGCCCGTGTGGCACGAGGGCATGGACCGCGTGCTGCCCAACGTCGAGCCCTATCGCTTGCGCTTCCGGAACCACCTGTACCTCGCCGTCGGGGAGCCGCTGCCACTCAGCCCGCTGCTCGACAAGCTCCGCAGCGCGAACGCGTCCGAGGAGGAGACACGGCGTGTCATCACGGAGCGGATCCAGGAGGAGCTGATGAAACTCCGCGACCACACGCACGCGCTCATCCGTCGCACGTGTCCCCCGGGCGCGGACCGGCTTCTGGAGCCGCCCGTCCCCGACCCCGGCAGCTCGGCCGCGCCCCGGGCCCCGGCCGCGCCTCTACACAACGGCAAAGAGCACACGCACGGCGAGGCCCCCGCCCGTCGGCCCGCCCTAACCAAGGAGAAGGAACTCTAA

Protein sequence:

>DPOGS210502-PA
MAYDIGWIIPRLRNPGVLWNCASSITVAVVGLFSKIIVDFLNKTTVYNREALQRAVRRPRDVPLLTVSNHHSCFDDPGLWGVLDVGTLTRYSRMRWSLAAHDICFTNALHSAFFALGKCVPVVRGAGVYQTAMDFCVDRLCGGEWVHIFPEGRVNVDKQRIRFKWGVGRLVMDSAAAGRAPLVLPVWHEGMDRVLPNVEPYRLRFRNHLYLAVGEPLPLSPLLDKLRSANASEEETRRVITERIQEELMKLRDHTHALIRRTCPPGADRLLEPPVPDPGSSAAPRAPAAPLHNGKEHTHGEAPARRPALTKEKEL-