Monarch geneset OGS2.0

DPOGS210809
TranscriptDPOGS210809-TA2049 bp
ProteinDPOGS210809-PA682 aa
Genomic positionDPSCF300027 - 716163-727958
RNAseq coverage111x (Rank: top 59%)
Annotation
HeliconiusHMEL0091173e-14771.71% 
BombyxBGIBMGA007125-TA0.069.24% 
DrosophilaCG10253-PA0.052.03% 
EBI UniRef50UniRef50_Q7QA930.055.50%AGAP004358-PA n=9 Tax=Culicidae RepID=Q7QA93_ANOGA
NCBI RefSeqXP_313642.20.055.50%AGAP004358-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2700122180.055.78%hypothetical protein TcasGA2_TC006332 [Tribolium castaneum]
NCBI nr blastxgi|2700122180.055.78%hypothetical protein TcasGA2_TC006332 [Tribolium castaneum]
Group
Gene OntologyGO:00506601.6e-49flavin adenine dinucleotide binding
GO:00038241.6e-49catalytic activity
GO:00166145.7e-44oxidoreductase activity, acting on CH-OH group of donors
GO:00551145.7e-44oxidation-reduction process
GO:00087626e-38UDP-N-acetylmuramate dehydrogenase activity
GO:00164916e-38oxidoreductase activity
KEGG pathwayaga:AgaP_AGAP0043580.0 
 K00803 (E2.5.1.26, AGPS)maps-> Peroxisome
    Ether lipid metabolism
InterPro domain[396-664] IPR0041131.6e-49FAD-linked oxidase, C-terminal
[147-348] IPR0161665.7e-44FAD-binding, type 2
[399-680] IPR0161644.6e-38FAD-linked oxidase-like, C-terminal
[178-318] IPR0060946e-38FAD linked oxidase, N-terminal
[238-350] IPR0161684.8e-20FAD-linked oxidase, FAD-binding, subdomain 2
[132-236] IPR0161674.4e-16FAD-binding, type 2, subdomain 1
Orthology groupMCL11035 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210809-TA
ATGTCACCCGGCTCGGTCGCTAGTGCAAAAAACATGTCAGAAAATACAGAAGAACAGAATAAAGTTAAAGATAATTTAGTGTTTAATGATAAAATAAATACTTTCAGTGAACAACGTGAAATTAAAAGTGTAGAAAATAAGGATATTGATACTGTTATCAAAGTTAAAAGTGCTATCCCTAGAAGACGACAGGATCTACTGAAATGGTACGGATGGGGCTATCAGGACTCGACCTTTAAAATTTCTGATGGTGTAGCAATTTTCACTGGAAACAGATACTTGCTGGGTGGAAAAAAATTACCGCACGCTCTACAATGGATAAAGGAGTATTTCAACGTGGATATAACCAAAGTAATGGTGCCGCCACCACAACCCACAAGCTTTCAAGAGAGCAAGCTGCCGGTGAACATAAAGGAGGAGTTGGAGAAGATAGCGCCTGTGAGCACGGACGGTATGGATCGGCTGATCAGAGCCCACGGACAGACCCTGAAAGACGTCTACTGTCTGAGGAAGAATAACTTCCAGAAGATACCTGATGCCGTGATATGGCCGGAGACTCACCAACAGGTTGAAGATATTGTAAAGTGTGCGTCCAAACACCAGTTCGTGATAATTCCATTCGGCGGCGGGACCTCGGTGTCCGGCTCAGTGACGTGTCCGGCGAACGAGGAGAGGCCCATACTACTATTGGACACCACCGCCATGAATGCCATCCTGTGGCTGGACAAGGAACAGCTCCTGGCCAGGGTCCAGGCGGGTATATTAGGCCAGGACTTGGAGCGCGAGCTGCGTTCCCGGGGGTTCACGGTGGGCCACGAACCGGACTCGTTCGAGTTCTCCACCCTCGGCGGCTGGGTCGCCACACGAGCCAGCGGCATGAAAAAAAACACGTACGGGAATATAGAGGATCTGGTCGTGCAGACCAAGATCGTGACTCCTAAAGGCGTCATCGAGAGGAATTGTCGTGTCCCTCGTATATCGTGCGGCCCGGACTGGGAACACGTGGTGCTGGGGTCAGAGGGTTGTCTTGGAGTCGTCACTGAGATCGTGACTCCTAAAGGCGTCATCGAGAGGAATTGTCGTGTCCCGCGTATATCGTGCGGCCCGGACTGGGAACACGTGGTGCTGGGGTCAGAGGGTTGTCTTGGAGTCGTCACTGAGGTGACATTGAAGATCCGTCCGCTACCGCCGTGTGTCCGCTACGGCTCGCTCGTCTTCCCCGACTGGGAGGCTGGCTTCCACTTCGAGAGGGAGGTCGCCCGCCAGCGAGCACAGCCTTCCAGTATACGACTCATGGACAATGAACAGTTCCGAATGGGCCACGCGCTGAAAGTGGAACAGTCCTGGGGCGGAGTCGTCCTGGACGGGTTGAAGAAGTTCTACATAACACGGATCAAAGGCTTCGACCCGCTGAAGATGTGTGTCGTCACTCTCCTGATGGAGGGCAGCTCGGAGCATGTCGCGCGCAGCGAGAAGAGACTGAACGCTATAGCGGCGGAGTACGGGGGCGTGCCGGGAGGCGCCCGGAACGGGGAGATAGGATACACGCTCACCTTCGTCATCGCCTACATAAGGGACCTGGCCCTCGACTACGACATAGTGGCAGAGTCGTTCGAGACCTCGGTGTCGTGGGAGCGGACCCTGGCCCTGTGCAGGAACACCAAGGAGCGAGTGCGACGAGAGTGTCGCGACAGGAACATTAAAGAGCATATCATATCGTGCAGACTCACACAGACTTACGACGCCGGCTGTTGTATTTATTTCTACTTCGCTTTCAAGACCGACCTGAGCGCGGATTCGGTTCGCGTCTACGAGGACATCGAGGAGGCGGCTCGGGACGAGATCATTGCTAACGGAGGTTCGATATCTCATCATCACGGTGTTGGGAAGCTGCGCAAGAAGTGGTACACGCAGACGGTGAGCGAACCCGGGAGGCGGCTACTACTCGCCACCAAACAGGCTCTAGACCCCGACAACATATTCGCTTTGGGAAATATGGCCTACGACCAATACAAAGCCGACGGGTTGGGCGTCAGGAGCAAGCTATAA

Protein sequence:

>DPOGS210809-PA
MSPGSVASAKNMSENTEEQNKVKDNLVFNDKINTFSEQREIKSVENKDIDTVIKVKSAIPRRRQDLLKWYGWGYQDSTFKISDGVAIFTGNRYLLGGKKLPHALQWIKEYFNVDITKVMVPPPQPTSFQESKLPVNIKEELEKIAPVSTDGMDRLIRAHGQTLKDVYCLRKNNFQKIPDAVIWPETHQQVEDIVKCASKHQFVIIPFGGGTSVSGSVTCPANEERPILLLDTTAMNAILWLDKEQLLARVQAGILGQDLERELRSRGFTVGHEPDSFEFSTLGGWVATRASGMKKNTYGNIEDLVVQTKIVTPKGVIERNCRVPRISCGPDWEHVVLGSEGCLGVVTEIVTPKGVIERNCRVPRISCGPDWEHVVLGSEGCLGVVTEVTLKIRPLPPCVRYGSLVFPDWEAGFHFEREVARQRAQPSSIRLMDNEQFRMGHALKVEQSWGGVVLDGLKKFYITRIKGFDPLKMCVVTLLMEGSSEHVARSEKRLNAIAAEYGGVPGGARNGEIGYTLTFVIAYIRDLALDYDIVAESFETSVSWERTLALCRNTKERVRRECRDRNIKEHIISCRLTQTYDAGCCIYFYFAFKTDLSADSVRVYEDIEEAARDEIIANGGSISHHHGVGKLRKKWYTQTVSEPGRRLLLATKQALDPDNIFALGNMAYDQYKADGLGVRSKL-