Monarch geneset OGS2.0

DPOGS202444
Transcript: DPOGS202444-TA (1530 bp)
Protein: DPOGS202444-PA (509 aa)
Genomic position: DPSCF300174 - 303986-308915
RNAseq coverage: 170x (Rank: top 51%)
Annotation
Heliconius      HMEL016404        0.0     71.63%
Bombyx          BGIBMGA009966-TA  0.0     73.05%
Drosophila      CG3835-PB         3e-151  52.62%
EBI UniRef50    UniRef50_A7SBJ2   7e-156  56.67%  Predicted protein n=14 Tax=Eukaryota RepID=A7SBJ2_NEMVE
NCBI RefSeq     XP_002732875.1    1e-171  58.02%  PREDICTED: CG3835-like [Saccoglossus kowalevskii]
NCBI nr blastp  gi|291225777      2e-170  58.02%  PREDICTED: CG3835-like [Saccoglossus kowalevskii]
NCBI nr blastx  gi|291225777      1e-166  58.02%  PREDICTED: CG3835-like [Saccoglossus kowalevskii]
Group
Gene Ontology
  GO:0016614  2.2e-68  oxidoreductase activity, acting on CH-OH group of donors
  GO:0050660  2.2e-68  flavin adenine dinucleotide binding
  GO:0003824  2.2e-68  catalytic activity
  GO:0055114  2.2e-68  oxidation-reduction process
  GO:0016491  2.6e-35  oxidoreductase activity
  GO:0008762  2.2e-34  UDP-N-acetylmuramate dehydrogenase activity
KEGG pathway
  mcc:702028  3e-153
  K00100 (E1.1.1.-) maps->
    Linoleic acid metabolism
    Bisphenol A degradation
    Fructose and mannose metabolism
    Butanoate metabolism
    Tetrachloroethene degradation
InterPro domain
  [47-261]   IPR016166  2.2e-68  FAD-binding, type 2
  [261-501]  IPR004113  2.2e-53  FAD-linked oxidase, C-terminal
  [264-504]  IPR016164  5.1e-48  FAD-linked oxidase-like, C-terminal
  [140-258]  IPR016168  2.6e-35  FAD-linked oxidase, FAD-binding, subdomain 2
  [86-223]   IPR006094  2.2e-34  FAD linked oxidase, N-terminal
  [37-139]   IPR016167  2.4e-29  FAD-binding, type 2, subdomain 1
  [462-502]  IPR016171  1.2e-09  Vanillyl-alcohol oxidase, C-terminal subdomain 2
Orthology group: MCL11540, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202444-TA
ATGTTTAAACCGTTCACCCTATTTGTGTTTAATCTAAACAAGACAAATGTTTTTATACAAGCCAAATACGCTACGTCAGTGCTACCACAGTTTTCAGCAAATAAATATGCCGTTGAAAGAAAAAAATTCGGCACAGTTAGCCCATCAGATATCGACTATTTCAAAACTATATTAAGTAAGGAAAGAATTTTGACTAATGAAGAGGACGTGCTACCATATAACATAGATTGGATAAAGAACTGCAGAGGTCAATCGCAAGTTGTGCTAAAGCCAAGAACAACCGAGGAAGTATCGAAAATATTACACTATTGTAACAATAAGAGGCTGGCGGTATGTCCTCAAGGTGGCAACACAGGCCTGGTTGGAGGTTCAGTGCCAGTTTTCGACGAAATTATCCTAAACTTATCATTAATGAACAAAATTATCAGCCTGGATGAAATATCAGGTGCCTTAGTTTGTGAGGCGGGATGTATTTTGGAAAACCTGGACAACTACGTCAGGGAACATGATCTGATTATGCCCCTTGATCTCGGAGCCAAAGGATCCTGTCACATAGGAGGTAACATCAGTACAAACGCTGGAGGATTGAGATTACTGAGATACGGAAATTTACACGGATCTGTGCTTGGTATAGAGGCTGTAAAGGCAGATGGGACAGTCATTGATTGTCTTCGAACACTGAAGAAAGACAACACTGGTTATCCTTTGAAACACATATTTATTGGCTGTGAAGGCACCCTCGGTGTTATCACCAAAGTCAGCATACATTGCCCCGCGCAACCCAGATCCGTATCGCTCGGATATTTTGGCGTAGAAAAGTTTGAAAATGTTCTCAAGTTATATAAAAGCGCGAAATCGTCACTCGGCGAAATCCTCTCAGCGTTCGAGATGGCGGACAACGAATCGATTACTTCCACCGTAAACAATCTAAAACTATCGAATCCAATAGGGCACTATCCTATGTACGTTCTGGTTGAATCACATGGTAGTGATGAAAAACATGACAGCGAGAAACTAAATCGATTCCTCAGCGAAGGCATGGAGACTGGATTAATATTAGACGGCACGGTCACCTCCGAACCTAATAAAATGAAGGCGATCTGGAACATTCGGGAGAGCATCGCGGGTTCTGGTTTGATTGACGGCTATGTATTCAAATACGACGTGTCCCTCCCGCTGAGAAACTACTACGATCTCGTGGAAGACCTTCGAAAGAAGCTAGGAGACCGCGTCACCAGAGTTTACGGATACGGACATGTTGGTGACGGCAATATTCATATAAATGTAACAGTGCCGCAATATTCAAAGGAAATTAATTCTGAATTGGAGCCGTATATTTTCGAACAAGTATCCAAACTGAAGGGTTCTATCAGCGCCGAGCACGGCGTTGGTTTTAGGAAACCACAGTTCATACATTACAGTCAAGGAGACACTTCCTTGCAGCTAATGAGGGATTTAAAAAGAACCATGGATCCCAACGGCATACTGAACCCATACAAAGTACTGCCGGATGTTGCCAGTCATGAATAA

Protein sequence:

>DPOGS202444-PA
MFKPFTLFVFNLNKTNVFIQAKYATSVLPQFSANKYAVERKKFGTVSPSDIDYFKTILSKERILTNEEDVLPYNIDWIKNCRGQSQVVLKPRTTEEVSKILHYCNNKRLAVCPQGGNTGLVGGSVPVFDEIILNLSLMNKIISLDEISGALVCEAGCILENLDNYVREHDLIMPLDLGAKGSCHIGGNISTNAGGLRLLRYGNLHGSVLGIEAVKADGTVIDCLRTLKKDNTGYPLKHIFIGCEGTLGVITKVSIHCPAQPRSVSLGYFGVEKFENVLKLYKSAKSSLGEILSAFEMADNESITSTVNNLKLSNPIGHYPMYVLVESHGSDEKHDSEKLNRFLSEGMETGLILDGTVTSEPNKMKAIWNIRESIAGSGLIDGYVFKYDVSLPLRNYYDLVEDLRKKLGDRVTRVYGYGHVGDGNIHINVTVPQYSKEINSELEPYIFEQVSKLKGSISAEHGVGFRKPQFIHYSQGDTSLQLMRDLKRTMDPNGILNPYKVLPDVASHE-
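
Length check (illustrative):

The record lists a 1530 bp transcript and a 509 aa protein, which agree if the transcript is a complete coding sequence: 3 x (509 residues + 1 stop codon) = 1530. The sketch below shows one way such a check could be run on FASTA text like the two records above. The function names `read_fasta` and `cds_matches_protein` are hypothetical helpers written for this example, not part of any database tooling, and the check assumes a trailing `-` or `*` in the protein marks the stop codon.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {record_id: sequence}."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'.
            current = line[1:].split()[0]
            records[current] = ""
        elif current is not None:
            records[current] += line
    return records


def cds_matches_protein(cds, protein):
    """Return True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein is a stop marker
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(residues) + 1)


if __name__ == "__main__":
    # Toy records (not the real sequences): ATG TTT AAA TAA -> M F K stop.
    toy = ">toy-TA\nATGTTTAAATAA\n>toy-PA\nMFK-\n"
    recs = read_fasta(toy)
    print(cds_matches_protein(recs["toy-TA"], recs["toy-PA"]))
```

Applied to the records above, the same call would take `recs["DPOGS202444-TA"]` and `recs["DPOGS202444-PA"]`. The check compares lengths only and does not translate the nucleotide sequence.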