Monarch geneset OGS2.0

DPOGS210775
TranscriptDPOGS210775-TA1827 bp
ProteinDPOGS210775-PA608 aa
Genomic positionDPSCF300312 + 162943-178731
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0138911e-7735.71% 
BombyxBGIBMGA014047-TA3e-11242.11% 
DrosophilaCG1443-PA1e-9035.06% 
EBI UniRef50UniRef50_E2AKQ31e-10038.81%Fatty acyl-CoA reductase 1 n=11 Tax=Endopterygota RepID=E2AKQ3_CAMFO
NCBI RefSeqXP_001602734.11e-10037.40%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3071764194e-10038.81%Fatty acyl-CoA reductase 1 [Camponotus floridanus]
NCBI nr blastxgi|3320307386e-9939.53%Putative fatty acyl-CoA reductase [Acromyrmex echinatior]
Group
Gene OntologyGO:00054883.6e-27binding
GO:00166201.8e-19oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551141.8e-19oxidation-reduction process
KEGG pathwaytca:6622262e-91 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[112-386] IPR0131202e-67Male sterility, NAD-binding
[374-430] IPR0160403.6e-27NAD(P)-binding domain
[459-552] IPR0042621.8e-19Male sterility
Orthology groupMCL19846 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210775-TA
ATGGCTGAAAGTTTTGCCGGCGTACCATCATGGCACCACTTCCGCCTGGCTCGGGTGTCGGCCTTTCTTCAGCTGGACTCGGCTTGTGGTATTGTCGTGGACGGTGACGAGGACGCCGAATTATACGGACCCCCGCAGTACTCGCCCTCCAGGATAGCTCCAGAGGCTGACATCCCCGAAGCCGAGGATCCGAGGGAGAGAGATTCCGACAATAAAATGGAAATAGCAAGAAGGATAGAAGAGAAAGGCTTAGAGAAGTCTTTAAGAGTGAAACAGGCTCTAGAGAGCGGGAATTCTGAAGTCGTGACGGTCTACGATGGGGCTGTCCTCTTCATCACCGGTGGTTCAGGTTTCATTGGGAAGCAACTCGTGGAGAAGATACTGAGAACATGCAACGTGAAAAAGATCTATCTATTGCTACGACCGAAGAAGGGCAAAACAGCTATCCAGAGGTTAAATCAAATTTTGGAAGATCCGGTATATGGAATTCTTCGCTCAGAACAACCAGACTTCGCGTCCAAGCTGATCCCGGTCGAGGGTGACGTGGTTGATCTAAATCTTGGAATTGAGGAAGAGAGTCGGAAGAAAATTATCGAAGAGGTAAACATTATTTTTCACGGCGCTGCAACAATTAATTTCGAAGAAACTATCAAAGTAGCCGCGCTCACGAACATCAGGGGCACCAGGGAGATACTGAATCTGGCGAAAAGCTGCAAGCAACTGAAATCCCTGGTTCATATTTCAACCGCTTACGCCCATGCCACCAGATCAAGAATTAAGACTGAGATAAAGGAAGACTTCTATGACAGCCCTCTACCGCCAGACGCTCTCATACAACTCGCGGAAGATCTGGAGAATGAACAGCTGGAAAAAGTTATTGAACCATTGAGGAGAGACTGGCCCAACACTTACACGTTCACGAAAGCGATAACCGAAGAGCTTGTGAGACAGACAGCCACAGATCTTCCCGTCTGCATAGTGCGGCCAGCTATCGTAATATCCGCTTACAAGGAGCCAGTGCCTGGTTGGGTGGATATCAAGAACGCGTATGGGCCCAGCGGCATGGTCTTAGGGGTGTCCCTGGGCGTCACCCACACGGTGCATGCTGATGAAGACATAATGCTGGACTTCGTGCCAGTGGACATTGTGAACAATGCCTTAATAGTAGCCGCTTGGACAACCCATCAGAGTTACATCGCCGGGGAGAAACAGATAAAAATATATTTCGTCACCGGACACCGAAATCCAATTTATTACCGCGATGTAGTCAACGTAGTCAAGGAACAAGCCAGGCCGCTGGTGTCGCCTAAGGCAATATGGCATAGCTTCGCCGTGGTGACCAAGTACAAGCTGATCTACCTCCTCCTCACCTGGCTCCTGCACTACATACCCGGCTACATCATCGACGGCGTCTGTGTGATGATAGGCGAGAAACCGCAATTCATAAAAGTGTATAAGAAAGTGTATTCCGTGTCGTCCGTGTTTGTGTATTTCACAAACAACGACTGGGTGTTCCTGGACGACAACGCTTTGAGGCTTTATGACCAGCTGAACAGCGCCGACAAGGAGCTGTTCACGTGCGACATGCAACAAGTGGACATGCCAGCCATGCTGATGACGTGGTTCTACGGCGTGAGCAAGTTCATTATCAAGGACGACGTCACGCAGTACGAGTACGCTGTGAGGAAACAGTGGTGGCTGAGGATAGCTAACGTCATGTTCCTGACCCTCTATTTCTACGCGCTGTACAAACTAGTTGCAGTAACTTTCGCCTGCTTGTTTTATTTCTTCAACCTGTCCGCCGGTGTTTATCAAACTTCTGTGTGA

Protein sequence:

>DPOGS210775-PA
MAESFAGVPSWHHFRLARVSAFLQLDSACGIVVDGDEDAELYGPPQYSPSRIAPEADIPEAEDPRERDSDNKMEIARRIEEKGLEKSLRVKQALESGNSEVVTVYDGAVLFITGGSGFIGKQLVEKILRTCNVKKIYLLLRPKKGKTAIQRLNQILEDPVYGILRSEQPDFASKLIPVEGDVVDLNLGIEEESRKKIIEEVNIIFHGAATINFEETIKVAALTNIRGTREILNLAKSCKQLKSLVHISTAYAHATRSRIKTEIKEDFYDSPLPPDALIQLAEDLENEQLEKVIEPLRRDWPNTYTFTKAITEELVRQTATDLPVCIVRPAIVISAYKEPVPGWVDIKNAYGPSGMVLGVSLGVTHTVHADEDIMLDFVPVDIVNNALIVAAWTTHQSYIAGEKQIKIYFVTGHRNPIYYRDVVNVVKEQARPLVSPKAIWHSFAVVTKYKLIYLLLTWLLHYIPGYIIDGVCVMIGEKPQFIKVYKKVYSVSSVFVYFTNNDWVFLDDNALRLYDQLNSADKELFTCDMQQVDMPAMLMTWFYGVSKFIIKDDVTQYEYAVRKQWWLRIANVMFLTLYFYALYKLVAVTFACLFYFFNLSAGVYQTSV-