Monarch geneset OGS2.0

DPOGS211698
TranscriptDPOGS211698-TA1575 bp
ProteinDPOGS211698-PA524 aa
Genomic positionDPSCF300374 + 33520-41065
RNAseq coverage196x (Rank: top 48%)
Annotation
HeliconiusHMEL0167158e-13977.96% 
BombyxBGIBMGA011395-TA0.077.10% 
DrosophilaCG5065-PC0.060.39% 
EBI UniRef50UniRef50_A1ZAI50.060.39%Putative fatty acyl-CoA reductase CG5065 n=14 Tax=Endopterygota RepID=FACR1_DROME
NCBI RefSeqNP_001163168.10.060.39%CG5065, isoform B [Drosophila melanogaster]
NCBI nr blastpgi|246542090.060.39%CG5065, isoform A [Drosophila melanogaster]
NCBI nr blastxgi|1953805210.060.36%GJ20993 [Drosophila virilis]
Group
Gene OntologyGO:00054881.3e-30binding
GO:00166209.3e-28oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551149.3e-28oxidation-reduction process
KEGG pathwaydme:Dmel_CG50650.0 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[29-299] IPR0131209e-78Male sterility, NAD-binding
[214-340] IPR0160401.3e-30NAD(P)-binding domain
[369-462] IPR0042629.3e-28Male sterility
Orthology groupMCL12050 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211698-TA
ATGGCGTCGTGCCTTGGTGGCCATTTTGGTTTGAACCAGGACTATATACCGGTTGCCGATTTTTATGCTGATAAATCGATTTTTGTCACCGGTGGAACGGGATTCATGGGGAAGGTCCTGGTCGAGAAGCTACTGCGGAGTTGTCCAAAAATCAAGAAAATCTATCTTCTAATGAGGCCGAAGCGAGGTCAGGACGTGGCCTCCCGTCTGACAGAACTAACACAGTCGCCGTTGTTTGAATCGTTAAGAAAAGAAAGACCTCAGGAATTAAATAAGATAGTACCAATAGTAGGGGACATAACGGAACCTGAGCTGGGTATCAGCCCGGCTGACCAGGAGATGCTATGCCAAAAGGTGTCCGTGGTGTTCCATTCAGCTGCAACAGTTAAATTCGACGAGAAATTGAAGCTGTCTGTAACAATCAACATGCTGGGGACGCAACAACTAGTACAGTTGTGTCATCGTATGCTGGGATTGGAGGCCTTAGTCCACGTGTCCACCGCGTACTGTAACTGTGAACGTGAGCGTGTGGAGGAGACGGTCTACTCGCCGCCGGCTCAGCCTGAACACGTGGTGACCCTCGTCCAGACGCTGAACGACGAGCTGGTCGACAGGATCACGCCTGACCTAGTCGGTGACCGACCCAACACATACACCTTCACTAAGGCCCTGGCCGAGGACATGTTGATAAAGGAGTGTGGGAACCTGCCCGTGGCGATCGTTAGACCATCTATAGTGCTCTCGTCAGTTCGCGAACCTGTTAAGGGTTGGGTAGACAATTGGAATGGACCCAATGGCATAATAGCGGCGGTTGGCAAGGGAGTGTTCCGTACAATGCTGGGGAACGGCACGAGGGTAGCTGATCTGGTGCCAGTGGATACTGTCATTAATCTTATGATAGTATGCGCTTGGAGGACTCATCTCAGAAGAGGCGACGGCGTTGTCGTCTACAACTGCTGCACCGGCCAGCAGAACCCCATAACGTGGCAACGCTTTGTTAAGACGAGTTTCAAGTACATGAGAAAACATCCGTTCAGTGAAGTGGTGTGGTATCCAGGTGGTGATATCACCAGCAGTCGTTTCCAGCACGGGATCCTCTCCCTCCTCCAGCACAGGCTGCCGGCTGTCCTCATAGACCTGGTAGCCAGAATCACCGGCAGTAAGCCTGTGATGGTTAGAGTTCAGAACAAACTGGAGAAGGCATCGGCCTGTCTGGAGTACTTCACAACAAGACAGTGGGCGTTCGCTGACAACAACGTGCAGGCGCTGTGCCGGAGCCTGTCGCCCGAGGACAGGGACACCTTCGACTTTGACGTCACCAACATCAACTGGGATGGATATATTGAGTCCTATGTCCTTGGAATAAGGAGGTTTCTATTCAAGGAGAGCCCGCACACCCTGCCCAAGTCCAGGACGATAATGCGGAGGTTACACATAGTCCACGTCCTGGGCCAGGTGCTGGCCGTGTTATTCCTCTGGCGGTTCGTGTTCCTCCGCTCAGCCAGTCTAAGAAGTGTTTGGCGCTCCATCGTGGAGCTGCTGACCCGCGCCGCCAGGATGCTCGCCGTCGCTTGA

Protein sequence:

>DPOGS211698-PA
MASCLGGHFGLNQDYIPVADFYADKSIFVTGGTGFMGKVLVEKLLRSCPKIKKIYLLMRPKRGQDVASRLTELTQSPLFESLRKERPQELNKIVPIVGDITEPELGISPADQEMLCQKVSVVFHSAATVKFDEKLKLSVTINMLGTQQLVQLCHRMLGLEALVHVSTAYCNCERERVEETVYSPPAQPEHVVTLVQTLNDELVDRITPDLVGDRPNTYTFTKALAEDMLIKECGNLPVAIVRPSIVLSSVREPVKGWVDNWNGPNGIIAAVGKGVFRTMLGNGTRVADLVPVDTVINLMIVCAWRTHLRRGDGVVVYNCCTGQQNPITWQRFVKTSFKYMRKHPFSEVVWYPGGDITSSRFQHGILSLLQHRLPAVLIDLVARITGSKPVMVRVQNKLEKASACLEYFTTRQWAFADNNVQALCRSLSPEDRDTFDFDVTNINWDGYIESYVLGIRRFLFKESPHTLPKSRTIMRRLHIVHVLGQVLAVLFLWRFVFLRSASLRSVWRSIVELLTRAARMLAVA-