Monarch geneset OGS2.0

DPOGS208129
Transcript    DPOGS208129-TA    1554 bp
Protein    DPOGS208129-PA    517 aa
Genomic position    DPSCF300154 + 152410-156543
RNAseq coverage    102x (Rank: top 61%)
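The genomic position field can be split into scaffold, strand, and coordinates. The following is a minimal sketch, not part of the original record: the function names `parse_position` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its four parts."""
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, strand, start, end = parse_position("DPSCF300154 + 152410-156543")
print(scaffold, strand, span_length(start, end))
```

Under that assumption the locus spans 4134 bp on the plus strand, which is longer than the 1554 bp transcript.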
Annotation
Heliconius    HMEL012267    0.0    82.75%
Bombyx    BGIBMGA006569-TA    0.0    78.39%
Drosophila    CG30427-PC    2e-143    47.83%
EBI UniRef50    UniRef50_E9ID65    9e-155    50.49%    Putative uncharacterized protein (Fragment) n=9 Tax=Endopterygota RepID=E9ID65_SOLIN
NCBI RefSeq    NP_001177850.1    3e-159    55.97%    hypothetical protein LOC412986 [Apis mellifera]
NCBI nr blastp    gi|300116407    5e-158    55.97%    uncharacterized protein LOC412986 [Apis mellifera]
NCBI nr blastx    gi|380017038    2e-157    52.25%    PREDICTED: putative fatty acyl-CoA reductase CG5065-like [Apis florea]
Group
Gene Ontology    GO:0016620    2.3e-22    oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
    GO:0055114    2.3e-22    oxidation-reduction process
    GO:0005488    3.5e-22    binding
KEGG pathway    dsi:Dsim_GD24908    2e-135
    K13356 (FAR)    maps-> Peroxisome
InterPro domain    [16-285]    IPR013120    1.5e-62    Male sterility, NAD-binding
    [358-451]    IPR004262    2.3e-22    Male sterility
    [91-320]    IPR016040    3.5e-22    NAD(P)-binding domain
Orthology group    MCL16470    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208129-TA
ATGGCGTCCGAAGTAAACGAATGGTACAAGGGAAGAAAGGTGTTTGTGACGGGGGCTCTAGGTCTCATGGGGAAGGTTTTGATAGAGAAATTGTTATACAGCGTTCCGGATATAGGATGCGTGTACGCGCTAGTGCGAAGTAAACGTGGCAAGTCACCGGAAACAAGGATCGAGGAAATGTGGCAATTGCCTTTATTTGCAAGGATACGGGAAGAGAAGCCTCATGTTATGAAGAAACTGATCGCTGTGGCTGGTGATATTCAGTACGATGATTTGGGTATTAATAATAAACAGACAGAGGAAATATATAACGAGGTATCAGTAGTTTTTCATTTCGCTGCTACTCTACGTCTAGAGGCACCCCTAAAAGAAGGTCTCGAATTGAATACAAAGGGAACCCTCCGTGTTTTGGAGATGGCGAAGAAAATGAGAAAACTGGCAGGCTTCGTTCACCTCTCAACTGCCTTCTGTTACCCGGATTACGACAGAATGGCAGAAGCGGTTCATCCACCACCAACCGATCCTCACGAGGTTCTGCGTGCTGCGAGCTGGATGACAAACGAACAGCTAAACGTCCTAGCACCAACCTTGATGCAAAAGCATCCCAATTCCTACACATACTCCAAGCGACTGGCCGAGGCACTTGTCAAAGAATGCTACCCAGAAATACCGGCAGTGGTCGTCCGCCCTAGCATTGTCACGCCATCGTTCAAAGAACCAAATCCAGGTTGGGTAGATAACCTCAATGGCCCAATAGGATTAATGATCGGGGCCGGGAAGGGTGTCATCAGATCAATGCATTGCTATGGACACTATCACGCGGAAGTCATACCAGTGGATATCGCTATCAATGCAACTATTGTCATTCCTTACTATATCAACACCCAAATGGAAAGGTCACAAGAAATACCTGTGTTCAATCTAACTACAGGTGACGATAGGAACAATACGTGGAAGGAGGTTCTCGACGTAGGAAAGGCGACGGTCCGAAAGTATCCTTTCGAAATGCCATTATGGTATCCAGACGGGAACATACGGCATAGCAAACTGTTGCACGAGCTCTGTGTCTTCTTCTATCATATTGTTCCCGCGTATCTAATAGACTTCTTGATGTTTATCTTTGGCCAGCAGAGATTCATGGTCCGAATACAAAAGCGAATTTCTGTCGGACTTGAAGTTCTACAGTATTTCACAACAAGAGAATGGTGGTTCGACACAGACAACTTCAAAGATTTGGCGAAAAAGTTACACGGAGCAGACTTCACAACTTTTCCGATGGATTTGAAAATTATCGAAATCGGTCCGTACATAGAAAGTTGCATGATAGGCGGCAAACTGTACTGTTTGAAGGAAAAGTTAGAAAATCTCCCGAAAGCAAAACTGCATAATAATATATTATTCGTTTTGGATTTTCTTGCTAGTGTATTCTTCTATCTGTTATTGGTTTACTGGATGGTTCTACTATTTGAACCTGTTCGTGAAATTCTAAGCTACTGCGGGCCATTCGTGAAGTATCTGCCCCTCGTCGGAAAGGCCGTCTTCCGAGATGATTGA

Protein sequence:

>DPOGS208129-PA
MASEVNEWYKGRKVFVTGALGLMGKVLIEKLLYSVPDIGCVYALVRSKRGKSPETRIEEMWQLPLFARIREEKPHVMKKLIAVAGDIQYDDLGINNKQTEEIYNEVSVVFHFAATLRLEAPLKEGLELNTKGTLRVLEMAKKMRKLAGFVHLSTAFCYPDYDRMAEAVHPPPTDPHEVLRAASWMTNEQLNVLAPTLMQKHPNSYTYSKRLAEALVKECYPEIPAVVVRPSIVTPSFKEPNPGWVDNLNGPIGLMIGAGKGVIRSMHCYGHYHAEVIPVDIAINATIVIPYYINTQMERSQEIPVFNLTTGDDRNNTWKEVLDVGKATVRKYPFEMPLWYPDGNIRHSKLLHELCVFFYHIVPAYLIDFLMFIFGQQRFMVRIQKRISVGLEVLQYFTTREWWFDTDNFKDLAKKLHGADFTTFPMDLKIIEIGPYIESCMIGGKLYCLKEKLENLPKAKLHNNILFVLDFLASVFFYLLLVYWMVLLFEPVREILSYCGPFVKYLPLVGKAVFRDD-
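The transcript and protein lengths above are consistent with each other: 1554 bp is 518 codons, which is 517 residues plus one stop codon. The sketch below illustrates that check by translating the first and last codons of the transcript with the standard genetic code. It is an illustration only: the `translate` function is hypothetical, and it assumes the standard code applies and that the trailing `-` in the protein record marks the stop codon.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (ASSUMED to match the
    trailing '-' used in the protein record above).
    """
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight and last four codons of DPOGS208129-TA, copied from the record.
print(translate("ATGGCGTCCGAAGTAAACGAATGG"))
print(translate("CGAGATGATTGA"))
```

Passing the full DPOGS208129-TA sequence to `translate` allows the same comparison against the complete DPOGS208129-PA record.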