Monarch geneset OGS2.0

DPOGS210521
TranscriptDPOGS210521-TA1608 bp
ProteinDPOGS210521-PA535 aa
Genomic positionDPSCF300186 + 241643-243623
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0163390.083.90% 
BombyxBGIBMGA012624-TA0.078.03% 
Drosophila% 
EBI UniRef50UniRef50_Q9XVZ20.059.74%Protein Y7A5A.1 n=8 Tax=Bilateria RepID=Q9XVZ2_CAEEL
NCBI RefSeqXP_966520.10.067.81%PREDICTED: similar to Y7A5A.1 [Tribolium castaneum]
NCBI nr blastpgi|910941330.067.81%PREDICTED: similar to Y7A5A.1 [Tribolium castaneum]
NCBI nr blastxgi|910941330.067.81%PREDICTED: similar to Y7A5A.1 [Tribolium castaneum]
Group
Gene OntologyGO:00166141.1e-24oxidoreductase activity, acting on CH-OH group of donors
GO:00506601.1e-24flavin adenine dinucleotide binding
GO:00038241.1e-24catalytic activity
GO:00551141.1e-24oxidation-reduction process
GO:00087622.9e-14UDP-N-acetylmuramate dehydrogenase activity
GO:00164912.9e-14oxidoreductase activity
KEGG pathwaygga:4246611e-109 
 K09828 (DHCR24)maps-> Steroid biosynthesis
InterPro domain[105-222] IPR0161661.1e-24FAD-binding, type 2
[107-189] IPR0060942.9e-14FAD linked oxidase, N-terminal
[108-220] IPR0161682.4e-13FAD-linked oxidase, FAD-binding, subdomain 2
Orthology groupMCL22301 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210521-TA
ATGCTACCGAGCAGTATGAAGAGCTACATCATCAGATGGCTCGAGGATCACAGAGCGCTGGTGGTGTGCGCGTTCTGCCTCCCCGCCAGTTTCCTGTTCACGTTACTGTTGAGACTGAGAGCCTTCGCCCGCAGGCTGACCAGCGACCCTCAGAGACACGACAGCGCCGTCCGGCGTATACAGTCTCAGGTCTTAGAATGGAACAAACTCCCGTCGAAAAACAAACGGCTGCTGTGTACGTCCCGGCCGAACTGGCTGTCCCTCTCGATAACCTTCTTCCAGAAACACCTGCACCATCAAGTCCCCATCCCTCTGTATGATATCCTGGAACTAGACGAGGAGGCCGGGACTGTGAGGGTGGAGCCCATGGTCACCATCGGAGACATTACCCGATATCTCATACCCAAAGGATATTCATTGGCCGTCACTATTGAACTGGACGACGCGACCTTGGGTGGTCTTGCCCTCGGGACCGGGATGTCCACACACTCCCATAAGGCCGGCCTCTATCACGAGACTATAACCAGCTACGAGGTGGTGTTAGGAGACGGCTCGCTCGTCACCGCCACGGCCACCAACGAGTATTCTGACCTCTACAAAGCCCTGCCCTGGTCGCACGGAAGTCTTGGCTTCCTTGTGGCGCTGACGCTGAAGATCGTCAAAGTGAAGCCCTACATCAGAATCAAGTACACGCCCGTACGAGGACAGAACAATTACTGCGACTTGATAAGAAAGTTATCCGGAACCCATGAAGCCGAACCCACAAGGCACCCGGATTATATAGAGGGGACGATATTCAGCAAGGACGAAGCGGTCGTCATGACGGGGGAGTACGCCGACTATGATGGGAGACTCGCAGTCAATCACTGCTCCAGGTGGTATAAGCCGTGGTTTTACAAACATGTCGAGTCTTTCCTCGAAGAAGGCGAAAAAGAAGAATTGATCCCTTTGAGAGACTACCTGCTGCGCCACAACAGACCTATCTTCTGGGTCGTGGAAGATATGATTCCGTTCGGCAACAACGCGCTATTCAGGCTCTTCTTCGGGTGGCTCTTACCGCCGAAACCGGCCTTCCTCAAATTCACGACGACACCAGGCGTTAGAGCTTATACGTTTACGAGACAGGTGTTCCAAGACATCGTCCTACCCATTCAAGAGCTGGAAAAGCAAATCGAGCTCGCCATCCAGCTGTTCGAGAAATTTCCTCTGCTGGTGTACCCTTGCAGAATAATAGACCACGGGCCGCTGTCGGGGCAACTGCGGAGACCGCACGCTAAATATCTAGTACCCGGGACTAACTACGCCATGTACAACGACCTGGGAGTGTACGGCGTTCCCGGGAAAGTGAAGCACAAGAAACCTTACAACGCCGTGGCCGCCATGAGAGAGATGGAGCGGTTCACGCGCGACGTCGGAGGATACTCCTTCCTGTATGCAGACATATTCATGGACAGAGAAGAGTTCGGCCAGATGTTCGATCTGAGCCTGTACGACGCGGTGCGCACCAAATACATGGCCCAGGGAGCCTTCCCGCACCTCTACGATAAAGTCAAGCCCGAAATCGACGTGTTCTCTATCGGCGAACGAAATGTTATACGCGGTCAATAA

Protein sequence:

>DPOGS210521-PA
MLPSSMKSYIIRWLEDHRALVVCAFCLPASFLFTLLLRLRAFARRLTSDPQRHDSAVRRIQSQVLEWNKLPSKNKRLLCTSRPNWLSLSITFFQKHLHHQVPIPLYDILELDEEAGTVRVEPMVTIGDITRYLIPKGYSLAVTIELDDATLGGLALGTGMSTHSHKAGLYHETITSYEVVLGDGSLVTATATNEYSDLYKALPWSHGSLGFLVALTLKIVKVKPYIRIKYTPVRGQNNYCDLIRKLSGTHEAEPTRHPDYIEGTIFSKDEAVVMTGEYADYDGRLAVNHCSRWYKPWFYKHVESFLEEGEKEELIPLRDYLLRHNRPIFWVVEDMIPFGNNALFRLFFGWLLPPKPAFLKFTTTPGVRAYTFTRQVFQDIVLPIQELEKQIELAIQLFEKFPLLVYPCRIIDHGPLSGQLRRPHAKYLVPGTNYAMYNDLGVYGVPGKVKHKKPYNAVAAMREMERFTRDVGGYSFLYADIFMDREEFGQMFDLSLYDAVRTKYMAQGAFPHLYDKVKPEIDVFSIGERNVIRGQ-