Monarch geneset OGS2.0

DPOGS211702
Transcript: DPOGS211702-TA  1494 bp
Protein: DPOGS211702-PA  497 aa
Genomic position: DPSCF300374 + 140037-148813
RNAseq coverage: 643x (Rank: top 20%)
Annotation
Heliconius: HMEL012267  2e-93  35.10%
Bombyx: BGIBMGA011148-TA  0.0  78.44%
Drosophila: CG1441-PA  1e-144  49.90%
EBI UniRef50: UniRef50_E2AK99  8e-158  53.74%  Fatty acyl-CoA reductase 1 n=31 Tax=Neoptera RepID=E2AK99_CAMFO
NCBI RefSeq: XP_001601970.1  2e-164  55.56%  PREDICTED: similar to ENSANGP00000021191 [Nasonia vitripennis]
NCBI nr blastp: gi|298402913  0.0  75.05%  fatty-acyl CoA reductase 3 [Ostrinia nubilalis]
NCBI nr blastx: gi|298402913  0.0  75.05%  fatty-acyl CoA reductase 3 [Ostrinia nubilalis]
Group
Gene Ontology: GO:0005488  1.7e-26  binding
  GO:0016620  1.3e-20  oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
  GO:0055114  1.3e-20  oxidation-reduction process
KEGG pathway: aga:AgaP_AGAP002279  2e-98
  K13356 (FAR)  maps-> Peroxisome
InterPro domain: [26-245]  IPR013120  7.9e-59  Male sterility, NAD-binding
  [214-314]  IPR016040  1.7e-26  NAD(P)-binding domain
  [346-439]  IPR004262  1.3e-20  Male sterility
Orthology group: MCL15487  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211702-TA
ATGTCTCTAGATGACAATGACTTGGACTCCTTGCCGGATAGGATTGCTGAGACCTTCGCTGGGTTGACCCTTCTCGTGACTGGAGGGACTGGTTTCATGGGGAAGGTTCTAGTGGAAAAGTTATTGCGAAAATGTCCGGACATAGCGAAAATCATGCTGTTAGTACGTCCGAAGAAAGGAAAGTCACCGAAACAGCGTCTGGAGGAAATGTTGAACGATGAGTTGTTTGCTAAGTTACGCAGTCTCCGCGGTGGTGTTGAACCGTTGTTGGAGAAGCTGCAAATAGTAACCGGAGACGTGAGTGCTCCAGATTTGGCTATCAGCGACACCGACAGGCTAGATGTCATTGAAAATGTACACATAGTGGTGCACGCTGCGGCCACTATAAGATTCGACGAAGAATTGAAAAAGGCTGTGTTTTTGAACGTGCGCGGAACTAAATTAATTTTAGATTTGGCGAAGCAGTGTAAGAAACTTAAGCTGTTTATCCACATTTCAACGGCGTACTGCCATCTTCACGAGAAACTCCTGGAGGAGAAACCTTATCCACCACCAGCTAATCCGCACAAGATCATTGAGGCTATGGAGTGGATGACTGATGAAGCTGTAGCCACTATCACACCCAATTTGCTGAGTAAACTGCCGAATTCTTATGCTTTCACGAAAGCTTTGGGAGAAGCTCTGGCAGTCGAAGCGATGGAACACATACCGGTGATCGTATTACGACCTTCCATCGGTGCTGGTAAGGGTGTCATCAGAACCATGTACTGTAAAAGTAACAGCTACGCAGACTACTTGCCGGTCGACGTTTTCATTAATGGCATCATGATCTGCGTTTGGAATTACATCAAATTGGGTGACACGAAGTCCAATGTAATAAATTTCACGTCATCCGCTGAGATCAAGGTCACGTGGTTGGAGATGATTGATGCTGGGCGGGAGATAATAATGAACAGAGTGCCCTTGAATAACGTTGTTTGGTACCCGGGTGGGTCCATGAAGCACTCCAGGCTATATCACAACATATGCGCATTATTCTTTCATTGGATACCAGCCATTATAATAGACACTTTACTGTTTTGTTTAGGATATAAACCTGTTCTGATGAGGGTTCATCGCCGCATCAGCAAGGGCTTCGAAGTCTTCGAGTACTACACGAACAATCAATGGGACTTCAAATCTGACATCGCTCAGACAGTCCGACAGAAACTGAATCCGCGGGAGAGAAGAGATTACAAGGTTGACGCTATAGGCTTGGATATATCAAAATATTTCGAGGATTGCATCCGAGCTGCTCGCATATTCATCCTTAAGGAGTCTGACGACACCTTACCGAGCGCCAGGCGGCACATGAAGATCATGTGGTTCGTCGATATTGTCGCGCAGTTCCTGTTCTGGGCTCTCATGATGTACTGGATCTCCGGATGGATGGCCAGCTTTTATTCGTTTGTTGTTGGTTCCAGTAATATTACGCCTAAGGCTATAGTTGAATAA

Protein sequence:

>DPOGS211702-PA
MSLDDNDLDSLPDRIAETFAGLTLLVTGGTGFMGKVLVEKLLRKCPDIAKIMLLVRPKKGKSPKQRLEEMLNDELFAKLRSLRGGVEPLLEKLQIVTGDVSAPDLAISDTDRLDVIENVHIVVHAAATIRFDEELKKAVFLNVRGTKLILDLAKQCKKLKLFIHISTAYCHLHEKLLEEKPYPPPANPHKIIEAMEWMTDEAVATITPNLLSKLPNSYAFTKALGEALAVEAMEHIPVIVLRPSIGAGKGVIRTMYCKSNSYADYLPVDVFINGIMICVWNYIKLGDTKSNVINFTSSAEIKVTWLEMIDAGREIIMNRVPLNNVVWYPGGSMKHSRLYHNICALFFHWIPAIIIDTLLFCLGYKPVLMRVHRRISKGFEVFEYYTNNQWDFKSDIAQTVRQKLNPRERRDYKVDAIGLDISKYFEDCIRAARIFILKESDDTLPSARRHMKIMWFVDIVAQFLFWALMMYWISGWMASFYSFVVGSSNITPKAIVE-
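
The record lists a 1494 bp transcript and a 497 aa protein, which is consistent with a coding sequence of 498 codons whose last codon is a stop (written as a trailing "-" in the protein sequence). As a minimal sketch of how that relationship could be checked for any transcript/protein pair in this FASTA layout, the Python below parses FASTA text and compares lengths. The function names `parse_fasta` and `lengths_consistent` are hypothetical helpers introduced only for illustration, and the sketch assumes the transcript is coding sequence only (no UTRs), which holds for the lengths reported here but may not hold for other records.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of {record_id: sequence}.

    The record ID is the first whitespace-delimited token after '>'.
    Sequence lines are concatenated; blank lines are ignored.
    """
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def lengths_consistent(cds, protein):
    """Return True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumption: the nucleotide sequence is CDS only and ends with a stop
    codon; a trailing '-' or '*' in the protein marks that stop.
    """
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) // 3 == len(residues) + 1


if __name__ == "__main__":
    # Toy input built from the first two codons and the stop codon of the
    # record above (ATG TCT ... TAA), not the full sequences.
    toy = ">toy-TA\nATGTCT\nTAA\n>toy-PA\nMS-\n"
    seqs = parse_fasta(toy)
    print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))
    # Lengths reported in the record: 1494 bp and 497 aa.
    print(1494 == 3 * (497 + 1))
```

To apply the check to this record, the full DPOGS211702-TA and DPOGS211702-PA entries would be passed through `parse_fasta` in place of the toy text.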