Monarch geneset OGS2.0

DPOGS202810
TranscriptDPOGS202810-TA1362 bp
ProteinDPOGS202810-PA453 aa
Genomic positionDPSCF300018 - 216309-220985
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0037562e-12253.33% 
BombyxBGIBMGA010457-TA2e-8447.00% 
DrosophilaCG5065-PC9e-8635.76% 
EBI UniRef50UniRef50_D2SNU92e-11342.99%Fatty-acyl reductase n=1 Tax=Heliothis virescens RepID=D2SNU9_HELVI
NCBI RefSeqNP_001036967.19e-9838.36%fatty-acyl reductase [Bombyx mori]
NCBI nr blastpgi|2609079827e-11342.99%fatty-acyl reductase [Heliothis virescens]
NCBI nr blastxgi|2609079828e-11142.99%fatty-acyl reductase [Heliothis virescens]
Group
Gene OntologyGO:00054881.6e-25binding
GO:00166204.4e-20oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551144.4e-20oxidation-reduction process
KEGG pathwayphu:Phum_PHUM2886006e-91 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[21-290] IPR0131201.2e-68Male sterility, NAD-binding
[278-327] IPR0160401.6e-25NAD(P)-binding domain
[358-450] IPR0042624.4e-20Male sterility
Orthology groupMCL34464 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202810-TA
ATGGTACCGACGTCGAACTACACACCCGTTTCTGATTTTTACAACGGCAAAACAATCTTCGTTACGGGCGGGACGGGTTTCCTGGGCAAGGTGTTAATAGAAAAACTTTTATACAGCTGTCCGGGCATTAGTAAAATTTATATGCTTGTTCGTGAAAAGAGAAACTTAACCGCAACAGATAGGATGAAGAAGTTCTTCGATTGTCCAGTTTTTGATAGACTCAAAAAAGAGAAGCCAGAAAATTTAAAAAAGCTGAAACCTGTTAACGGTGACTTATCTCAACCCTGCCTTGGTATTGACCCGAAGGAATTACAAGTAATTAAAGATGAGGTTACCATTGTATTCCATTTCGCTGCAACTGTGAAGTTCAATGAACCACTGTCCATAGCCATGAAGATCAACGTCGAGGGAACTGAACAAGTAATAGACTTATGTCACAGTTTAAAGAAAATTGAGGTTTTTGTATACATGTCAACAATATTTTCAAACACAGACGATAAATTGAATTCCGTAGAAGAAAGACTATACCGCAGTCCAAAAGAGGTCGATGAAATTTACAAAATGATAAAAGAAAATGATCCCAGAGAAGTTTTCAATCCGGAAGTCTTAGATGGTCGCCCAAACACATATACCTTTACTAAGGCAGTAGCGGAAAATATCGTGGCACAAAAACGTGGCAATCTTCCAACAATCATGATTCGACCCTCAGTTGTTACACCAGCTAAAGAGGAGCCAGTGAGAGGTTGGGTAGCAAATTGGATGGGGCCAACCGCAACGCTTTTATATCTATCCAGGGGATGGATAAGATGCCACTACGGAGAAGACGACTTCACATTCGAAATCATTCCAGTCGACTACGTCGTCAATCTTACTATAATTTCTGCTGCTAAATTCAAAAGGTCAGATGATATACCAATATATCATTCATGTACAAGTGCTCTTCATCCCGTGACTTTTAAAGAAACTGCTTCTTATTTAATTAAAGAAAGTTGCAAGAAGAAATTAGTTGACATACCTTTTCCTTGGATAACATTCTCAAGATCACTATGGTTTCTCGGTATCATATCGTTCTTTATGCAGCTGCTACCAGCTTATATAGCAGATTTATATTTATTTATTTGCGGGCATCCAACAAGGTACGTGAAAATGTTGAAAAAGTACAGTCAAAATATGTCTGCACTTCATTACTTCTCGTCAAGAACATGGTACATGACGGCGAACCGTTCACAAGAGCTGATAGACGGATTAAGCGCTGAAGACCAGAAGATATTTCCTTGCAGTCCGGCTAGCATTGACTGGAGTGAATACATGGCAACATACTTACATGGAGTATATAGATTTTTATTAAAATCACATTATTAA

Protein sequence:

>DPOGS202810-PA
MVPTSNYTPVSDFYNGKTIFVTGGTGFLGKVLIEKLLYSCPGISKIYMLVREKRNLTATDRMKKFFDCPVFDRLKKEKPENLKKLKPVNGDLSQPCLGIDPKELQVIKDEVTIVFHFAATVKFNEPLSIAMKINVEGTEQVIDLCHSLKKIEVFVYMSTIFSNTDDKLNSVEERLYRSPKEVDEIYKMIKENDPREVFNPEVLDGRPNTYTFTKAVAENIVAQKRGNLPTIMIRPSVVTPAKEEPVRGWVANWMGPTATLLYLSRGWIRCHYGEDDFTFEIIPVDYVVNLTIISAAKFKRSDDIPIYHSCTSALHPVTFKETASYLIKESCKKKLVDIPFPWITFSRSLWFLGIISFFMQLLPAYIADLYLFICGHPTRYVKMLKKYSQNMSALHYFSSRTWYMTANRSQELIDGLSAEDQKIFPCSPASIDWSEYMATYLHGVYRFLLKSHY-