Monarch geneset OGS2.0

DPOGS210763
TranscriptDPOGS210763-TA1347 bp
ProteinDPOGS210763-PA448 aa
Genomic positionDPSCF300312 - 112300-117182
RNAseq coverage2837x (Rank: top 4%)
Annotation
HeliconiusHMEL0138861e-13646.56% 
BombyxBGIBMGA011149-TA3e-13755.26% 
DrosophilaCG12268-PA9e-12143.90% 
EBI UniRef50UniRef50_D9D5H75e-17861.20%Fatty-acyl CoA reductase 4 n=2 Tax=Obtectomera RepID=D9D5H7_OSTNU
NCBI RefSeqXP_968755.11e-12648.33%PREDICTED: similar to GA11521-PA [Tribolium castaneum]
NCBI nr blastpgi|2984029152e-17761.20%fatty-acyl CoA reductase 4 [Ostrinia nubilalis]
NCBI nr blastxgi|2984029157e-17361.20%fatty-acyl CoA reductase 4 [Ostrinia nubilalis]
Group
Gene OntologyGO:00166201.5e-28oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551141.5e-28oxidation-reduction process
GO:00054883e-20binding
KEGG pathwayaga:AgaP_AGAP0022791e-109 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[18-260] IPR0131208.5e-64Male sterility, NAD-binding
[311-404] IPR0042621.5e-28Male sterility
[14-246] IPR0160403e-20NAD(P)-binding domain
Orthology groupMCL14546 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210763-TA
ATGGCGCCATCTATCGGCGTAGCTGAGTTTTACAAGCACAAAACAATATTTCTGACGGGGGGTACAGGGTTTATGGGTAAAGTGTTGGTGGAGAAATTGTTACGGTGCTGTCCCGATTTAAACAAAATTTATTTATTAATGCGGCCAAAGAAAGGGCAGAGTACGAAAGAAAGGCTCGACGATTACTTCAATTGTAGGGTTTTCGACAATCTCCAAGAAAGATCACCTAAGTGCTTCGACAAGCTAGCAGTCATACCAGGAGATATTCTACAAGAAGATCTAGGCATCTCCATAGAAGATTGGGATAAATTACAGAGAGAAACTGAGATTGTGTTCCATTGTGCTGCTTGTGTTAGATTTGATATGCCCATCCGTGACGCTGTTAATATGAATACCTTGGGCACGAACAAGGTTCTCAAACTAGCTGACGGGATGGTCAATCTTGAGGTTTTCGTGCACGTATCTACGTCATATTGTCGCTGCGAAGTCCACACTTTAGAAGAACGTCTGTATCCGGCTAAGCATAGACCACAAGACGTGATGGAGTGTGTCAAGTGGATGGATGACGAGCTCCTGACATACCTGCAGACCAAGTTAATCGAGCCGCAACCAAACACGTACGCTTACACCAAATCATTAACTGAGGATCTCGTTTCTCAAAAAGCGGGGAAGTACCCCATTGTGATAGCAAGGCCATCTATTGTGACTGCTGCCCACAAGGAGCCTTTACCGGGATGGGTTGATAACCTCAATGGACCCACAGAAGTTAGGATCTGCAATGTGACTCAATCCGGCCACAATCCCATTACGTGGGGAGAAGCTTTAGATATGGGTCGCGTCCACGTGCAGGAGTTTCCTTTCTCCGTCTGTCTGTGGTACCCCGGCGGATCAGCGAAGAGTTCCAAAGTCCAGCATTTATTAGCATTATTTTTCACCCATCTCTTGCCAGCGTATTTCGTTGATCTACTCATGTTTCTGATGGGAAAGAAGACATTTATGGTGAAAATACAGAAACGAGTCAGCTATGGATTGAACGTTCTTCAATATTACACCACGAAGGAGTGGTTTTTTGACAACGATTACTACAGATCGCTGTCCAAACGAATATCAAAGGATGATAATGATGTCTTCTATACTAATTTGAAGACGATAAACTGGAGCGCCTATATTCGGAACTACATAAAAGGAGCGAGGGAATTCTGTTGTAAAGAAGATCCCAGCACATTACCACAGGCGAGGAAGCTACAGAGACAATTGTTCTGGTTGGATAAGGCTGTGCAAGTCGTTATATACTTCCTCATCGCGTACTTTATATATTATTACATTATTCGTTTCTTTGTTATGTAG

Protein sequence:

>DPOGS210763-PA
MAPSIGVAEFYKHKTIFLTGGTGFMGKVLVEKLLRCCPDLNKIYLLMRPKKGQSTKERLDDYFNCRVFDNLQERSPKCFDKLAVIPGDILQEDLGISIEDWDKLQRETEIVFHCAACVRFDMPIRDAVNMNTLGTNKVLKLADGMVNLEVFVHVSTSYCRCEVHTLEERLYPAKHRPQDVMECVKWMDDELLTYLQTKLIEPQPNTYAYTKSLTEDLVSQKAGKYPIVIARPSIVTAAHKEPLPGWVDNLNGPTEVRICNVTQSGHNPITWGEALDMGRVHVQEFPFSVCLWYPGGSAKSSKVQHLLALFFTHLLPAYFVDLLMFLMGKKTFMVKIQKRVSYGLNVLQYYTTKEWFFDNDYYRSLSKRISKDDNDVFYTNLKTINWSAYIRNYIKGAREFCCKEDPSTLPQARKLQRQLFWLDKAVQVVIYFLIAYFIYYYIIRFFVM-