Monarch geneset OGS2.0

DPOGS210602
Transcript: DPOGS210602-TA, 1872 bp
Protein: DPOGS210602-PA, 623 aa
Genomic position: DPSCF300168, 16014-19837
RNAseq coverage: 2967x (Rank: top 4%)
Annotation
Heliconius      HMEL005901        0.0     83.50%
Bombyx          BGIBMGA014417-TA  5e-119  78.29%
Drosophila      CG7461-PB         0.0     61.94%
EBI UniRef50    UniRef50_E3WRA6   0.0     57.50%  Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WRA6_ANODA
NCBI RefSeq     XP_001949624.1    0.0     66.23%  PREDICTED: similar to acyl-coa dehydrogenase [Acyrthosiphon pisum]
NCBI nr blastp  gi|328713184      0.0     65.90%  PREDICTED: very long-chain specific acyl-CoA dehydrogenase, mitochondrial-like isoform 1 [Acyrthosiphon pisum]
NCBI nr blastx  gi|91076006       0.0     68.55%  PREDICTED: similar to acyl-coa dehydrogenase isoform 1 [Tribolium castaneum]
Group
Gene Ontology
  GO:0016627  1.6e-66  oxidoreductase activity, acting on the CH-CH group of donors
  GO:0008152  1.6e-66  metabolic process
  GO:0055114  1.6e-66  oxidation-reduction process
  GO:0003995  6.2e-37  acyl-CoA dehydrogenase activity
  GO:0050660  4.8e-28  flavin adenine dinucleotide binding
KEGG pathway
  api:100162051  0.0
  K09479 (ACADVL) maps-> Fatty acid metabolism
InterPro domain
  [53-310]   IPR009100  1.6e-66  Acyl-CoA dehydrogenase/oxidase
  [290-444]  IPR009075  2.5e-58  Acyl-CoA dehydrogenase/oxidase C-terminal
  [297-443]  IPR006090  1.1e-39  Acyl-CoA oxidase/dehydrogenase, type 1
  [185-289]  IPR006091  6.2e-37  Acyl-CoA oxidase/dehydrogenase, central domain
  [51-183]   IPR013786  4.8e-28  Acyl-CoA dehydrogenase/oxidase, N-terminal
  [77-182]   IPR006092  3.1e-19  Acyl-CoA dehydrogenase, N-terminal
Orthology group
  MCL11772  Single-copy universal gene
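
Each InterPro domain row in this record carries a bracketed range (presumably protein residue coordinates), an IPR accession, an e-value and a description. The following is a minimal sketch of how such a row could be split into fields. It assumes the layout "[start-end] IPRnnnnnn e-value description" and tolerates missing whitespace between the fields; the names DomainHit and parse_hit are hypothetical and not part of any database tooling.

```python
import re
from typing import NamedTuple


class DomainHit(NamedTuple):
    start: int
    end: int
    accession: str
    evalue: float
    description: str


# Assumed layout: "[start-end] IPRnnnnnn e-value description".
# Whitespace between accession, e-value and description is optional,
# so rows with fused fields are parsed as well.
HIT_RE = re.compile(
    r"\[(\d+)-(\d+)\]\s*(IPR\d{6})\s*(\d+(?:\.\d+)?(?:e-?\d+)?)\s*(.+)"
)


def parse_hit(line: str) -> DomainHit:
    """Split one InterPro domain row into typed fields."""
    m = HIT_RE.search(line)
    if m is None:
        raise ValueError(f"not a domain line: {line!r}")
    start, end, accession, evalue, description = m.groups()
    return DomainHit(int(start), int(end), accession, float(evalue),
                     description.strip())


hit = parse_hit("[53-310] IPR009100 1.6e-66 Acyl-CoA dehydrogenase/oxidase")
print(hit.accession, hit.end - hit.start + 1)  # IPR009100 258
```

The accession pattern relies on InterPro identifiers being "IPR" followed by six digits, which is what allows the e-value to be separated even when no space precedes it.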

Nucleotide sequence:

>DPOGS210602-TA
ATGAAAGGATCCAAGATGGTAACATGCGCAAGTCGCTGCTTCGGTGGCAGCAAGCGACTCCTTCCGATGCGGAGCCGCTGCATGGCGACGGCCGCGGCCGCTGAAGCTCCCCGCGGAGCGCGTCAGAGTACATCCTTCACACTGAACCTGTTCCGAGGTCAGTTCGAGCCGACGCAGGTGTTTCCATTCCCGGAGCCACTCTCCGAAGACCAGCGTCAGACTCTCAGCGAACTCGTTCCACCGGTCGAGAAGTTCTTTCAGGAGGTGAACGATCCGGCCAAGAACGACGCGGACGCACAGATCGAGCAGGCTACTCTGTCGGGACTGTGGGATTTGGGTGCCTTCGGTCTCCAGGTGCCGACGGACTTGGGCGGACTGGGACTATCCAATACGCAGTACGCGCGGCTAGTGGAGGTGGTCGGCGCTCACGACCTGGGCGTGGGCATCACGTTGGGTGCTCACCAGTCCATCGGTTTCAAGGGCATATTGCTTTTCGGCACGCCGGAACAGAAGCAGCATTACTTACCGCGCGTTACGGGCGGAGAATATGCGGCTTTCTGCTTGACAGAGCCGTCGTCGGGCTCCGACGCCGGCTCCATTAAAACGAGAGCAGAACTGTCTCCTGACGGGAAGCACTTCATTCTAAATGGATCCAAAATTTGGATCAGCAATGGTGGAATCGCTGAGATCATGACCGTGTTTGCTCAAACACCCGTCGAAAAAGACGGAAAAAAAGTTGATAAAGTGACGGCTTTCATTGTGGAACGATCGTTTGGCGGCGTATCGTCGGGTCCTCCCGAGAACAAGATGGGTATCAAGTGTTCAAATACCACGGAAGTGTACTACGAGGACGTGCGGGTCCCGGTAGAGAACGTGCTGGGCGGAGTCGGGAACGGGTTCAAGGTGGCCATGAATATCCTCAACAACGGCCGCTTCGGCATGGCGGCGGCGCTGGGCGGCACGCAGCGCGCCGCCCTGCGGCAGGCCGCGGAACATGCCGCCACTAGGGTTCAGTTCGGTAAACGGCTCTGTGAGTTCGGTTCCGTCCAAGAGAAGCTGGCGCGTATGGCGATGCTGCAGTACGTGACGGAGTCGCTCGCCTACATGGTGAGCGGCAACATGGACGCCGGCCACCAGGACTACCACCTTGAGGCGGCGGTGTCCAAGGTATTCGCGTCGGACTCCGCTTGGAAGGTTGTGGACGAGGCGATCCAGATCCTCGGAGGTATGGGCTTCATGAAGGCCACGGGGCTGGAGCGCGTGCTGAGGGACCTGCGAATCTTCCGCATCTTCGAGGGAACCAACGATATCTTGCGGCTCTTCGTCGCTCTCACGGGAATTCAGTTTGCGGGCTCGCACCTCCAGGAGGTGATGCGTGCCTTCAAAAACCCGACAGCGCATCTCGGACTCATTTTTAGCGAAGCCGGCAAGCGCGCCGGACGGGCCGTGGGGTTCGCGCGGGGCGCGGATCTCGACCCGCTGGTGGCGCCGGCGCTCCGCCCCGCGGCGAGGGAGCTGGCGCGCCGGGTGTTGGAGTACGGGGCGTGTGTCGAGGCCGCGCTCCGTAAATACGGCCGCGGCGTGGTCGACGAGCAGCTCGTACTGAACAGTCTGGCGGCCGGCGCCATCGACGCCTACACGGCGGCCGCCGTGTTATCTCGTGCTTCGAGAGCGCATCGCCTCGGGCTCCCGGCCGCCGACCACGAGCTGGCCATGGCCGAGACCTGGACAGAGGAAGCCACGGACCGCATGGGAGCTTTGGCGGGCGCTCTGGCCCCGCGGGCCTTGAGACACGGCGCGCGGCTGACAGCTCTGGGGCGCGATGTGGCCGCCGCAGCCGGACAACCTTCGCGCTCCCCTCTCGGCCTGTGA

Protein sequence:

>DPOGS210602-PA
MKGSKMVTCASRCFGGSKRLLPMRSRCMATAAAAEAPRGARQSTSFTLNLFRGQFEPTQVFPFPEPLSEDQRQTLSELVPPVEKFFQEVNDPAKNDADAQIEQATLSGLWDLGAFGLQVPTDLGGLGLSNTQYARLVEVVGAHDLGVGITLGAHQSIGFKGILLFGTPEQKQHYLPRVTGGEYAAFCLTEPSSGSDAGSIKTRAELSPDGKHFILNGSKIWISNGGIAEIMTVFAQTPVEKDGKKVDKVTAFIVERSFGGVSSGPPENKMGIKCSNTTEVYYEDVRVPVENVLGGVGNGFKVAMNILNNGRFGMAAALGGTQRAALRQAAEHAATRVQFGKRLCEFGSVQEKLARMAMLQYVTESLAYMVSGNMDAGHQDYHLEAAVSKVFASDSAWKVVDEAIQILGGMGFMKATGLERVLRDLRIFRIFEGTNDILRLFVALTGIQFAGSHLQEVMRAFKNPTAHLGLIFSEAGKRAGRAVGFARGADLDPLVAPALRPAARELARRVLEYGACVEAALRKYGRGVVDEQLVLNSLAAGAIDAYTAAAVLSRASRAHRLGLPAADHELAMAETWTEEATDRMGALAGALAPRALRHGARLTALGRDVAAAAGQPSRSPLGL-
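
The transcript and protein lengths in this record are consistent with each other: 1872 bp is 624 codons, which is 623 amino acids plus one stop codon, written as "-" at the end of the protein sequence. The following is a minimal sketch of how the two sequences could be cross-checked, assuming the standard genetic code; the function name translate is hypothetical, and only short fragments of the record are used here for illustration.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons are written as '-',
    matching the convention used in this record."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", "-")


# First six codons and last three codons of DPOGS210602-TA.
print(translate("ATGAAAGGATCCAAGATG"))  # MKGSKM
print(translate("GGCCTGTGA"))           # GL-

# Length check: 1872 bp -> 624 codons -> 623 residues plus the stop.
print(1872 // 3 - 1)                    # 623
```

Passing the full DPOGS210602-TA sequence to translate and comparing the result with DPOGS210602-PA would extend the same check to the whole record.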