Monarch geneset OGS2.0

DPOGS200094
TranscriptDPOGS200094-TA1488 bp
ProteinDPOGS200094-PA495 aa
Genomic positionDPSCF300044 + 56970-59768
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0098110.075.56% 
BombyxBGIBMGA000659-TA6e-15970.17% 
DrosophilaCG30427-PC8e-12445.14% 
EBI UniRef50UniRef50_E9ID652e-12945.21%Putative uncharacterized protein (Fragment) n=9 Tax=Endopterygota RepID=E9ID65_SOLIN
NCBI RefSeqNP_001177850.19e-13350.34%hypothetical protein LOC412986 [Apis mellifera]
NCBI nr blastpgi|3001164072e-13150.34%uncharacterized protein LOC412986 [Apis mellifera]
NCBI nr blastxgi|3800170382e-12947.05%PREDICTED: putative fatty acyl-CoA reductase CG5065-like [Apis florea]
Group
Gene OntologyGO:00166203.3e-21oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551143.3e-21oxidation-reduction process
GO:00054884.3e-16binding
KEGG pathwaydsi:Dsim_GD249082e-125 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[1-264] IPR0131201.2e-57Male sterility, NAD-binding
[336-429] IPR0042623.3e-21Male sterility
[73-301] IPR0160404.3e-16NAD(P)-binding domain
Orthology groupMCL26451 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200094-TA
ATGGGAAAGGTTTTGATAGAGAAGCTTCTATTTAGTGTACCTGATATAGGGAATATTTATGTTTTAATGAGACCAAAGAAAGGGAAGTCCGTCAACCAAAGATATGAGGACATGCAAAGATTACCAATTTTCGATCGTCTGAGAAACACAAAACCATCCTCTTTGAAAAAAATAGTACCACTAACTGGAGATGTTTTATTTGATGATTTTGGACTATCGGAGAGTGACATGCAAAAAATATCTGAAGATGTTTCGATTGTTTTTCATTTTGCTGCAACCCTAAAATTGGAGGCACCTCTCTATGAAAACGTGAATATGAATACATGCGGCACGCAAAGAGCGCTTAATGTAGCTAAAAAGTTGAAGAACTTACGTTTATTTATACATCTATCAACCGCTTTCTGCTATCCCGATTACGCCGTTCTAGAAGAAAAGATGCATGCACCACCGGTAAAGCCTTCAGATATAATGCATCTTCTAGAGTGGCTTGATGAAAAGAAAGTTGCTATTCTGACTCCATCTTTACTCGGACCTCATCCTAATTGTTACACGTTTTCCAAAAGACTAGCTGAGAATATTGTAGAAAATGAATATGAAAACCTTCCTGCGGTGGTCGTAAGGCCGAGCATAGTATGTCCCTCTATAAAGGAACCTGTACCAGGGTGGGTGGACAGCCTCAACGGACCTGTAGGACTTATGCTCGGTGCTGGTAAAGGCGTTATAAGAAGTATGCTTTGCGATGGAAGTCTCATCGCACAAGTTGTTCCCGTTGATACTTGCATAAATGCACTCATTGCTATCGGCATGATAGAAGGCAAGAGAGAAGATAAAGCAGAGCTTATGCCAGTGTATAACGTAAACATTGGCCATCAAAAACCAACAACGTGGGGAGAAGTTCTCCAAATTGGTAAAGATTATGGACGCAAGTACCCACTGGCTTGGCCACTGTGGTACCCTAATGGTGATATCACGACGAACTACGTATTGCATGAATTTAAAAGACTATTTTATCACCTATTACCGGCATATTGCATAGACTTACTTTTATTCTTGTTGAGACAACCACGTTTCATGGTACGTGTTCAAGACCGAATAAGCCAGGGTTTGCAAGTTTTACAATACTTTACAATGAGACCATGGACTTTCCCGTGTCCAAATTTTGATAGTATACAATCCAAATTGGACAAGGAGGAACAAGTTATTTTTAATACAGATTTGACAACAGCTGACCGCGATGCTTATTTGCAGCAATGTATTGAAGGAGGTCGAATTTTCTGCCTTAAAGAAGATCCTAGCAAAATCCGCATTAACAGAGCCTACCACAACTTCCTCTTCGTACTGGATTGTTTAGTGAAAATACTCTTCTGGCTACTGCTTCTAAGTTTTTTCGCTGCTTATTTTACACCACTACGAGACGTATTGAGTTATGGAGAACCGGTCGTAAAATATCTGCCATTTGTAGGACGGGTAGTATGCGATAAGGAATAG

Protein sequence:

>DPOGS200094-PA
MGKVLIEKLLFSVPDIGNIYVLMRPKKGKSVNQRYEDMQRLPIFDRLRNTKPSSLKKIVPLTGDVLFDDFGLSESDMQKISEDVSIVFHFAATLKLEAPLYENVNMNTCGTQRALNVAKKLKNLRLFIHLSTAFCYPDYAVLEEKMHAPPVKPSDIMHLLEWLDEKKVAILTPSLLGPHPNCYTFSKRLAENIVENEYENLPAVVVRPSIVCPSIKEPVPGWVDSLNGPVGLMLGAGKGVIRSMLCDGSLIAQVVPVDTCINALIAIGMIEGKREDKAELMPVYNVNIGHQKPTTWGEVLQIGKDYGRKYPLAWPLWYPNGDITTNYVLHEFKRLFYHLLPAYCIDLLLFLLRQPRFMVRVQDRISQGLQVLQYFTMRPWTFPCPNFDSIQSKLDKEEQVIFNTDLTTADRDAYLQQCIEGGRIFCLKEDPSKIRINRAYHNFLFVLDCLVKILFWLLLLSFFAAYFTPLRDVLSYGEPVVKYLPFVGRVVCDKE-