Monarch geneset OGS2.0

DPOGS205582
TranscriptDPOGS205582-TA1611 bp
ProteinDPOGS205582-PA536 aa
Genomic positionDPSCF300237 - 194560-199693
RNAseq coverage354x (Rank: top 33%)
Annotation
HeliconiusHMEL0023946e-12645.94% 
BombyxBGIBMGA009675-TA9e-16051.19% 
DrosophilaCG6178-PA4e-9637.55% 
EBI UniRef50UniRef50_G8GE172e-13145.94%Luciferin 4-monooxygenase n=2 Tax=Obtectomera RepID=G8GE17_BOMMO
NCBI RefSeqXP_001845435.11e-10540.95%luciferin 4-monooxygenase [Culex quinquefasciatus]
NCBI nr blastpgi|3580315788e-13145.94%luciferin 4-monooxygenase [Bombyx mori]
NCBI nr blastxgi|3580315782e-12945.94%luciferin 4-monooxygenase [Bombyx mori]
Group
Gene OntologyGO:00081521.5e-91metabolic process
GO:00038241.5e-91catalytic activity
KEGG pathwaytad:TRIADDRAFT_562025e-80 
 K01904 (E6.2.1.12)maps-> Phenylpropanoid biosynthesis
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[49-461] IPR0008731.5e-91AMP-dependent synthetase/ligase
Orthology groupMCL30683 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205582-TA
ATGTTAAAAAATCCGAAGTATATATACGGACCTTGCGACAGATTTGTCCCAGCGTCTCTGAATTATGGGCAATATTTCTTAACAAAACTCAAAGAGAACAGCGGTAAGATAGCCTTGATAAATGGTTTAACGGATGAAAAGTTAACATACGATGATATTGCTCAAGAGGCCATTAATGTGTCCTTCTCTCTCACACGTATGGGTGTTAGAAAGGGAGACGTTATCGCCGTCAGTTCAGAGAACAGAAGAGAGTATTGGAGTACTGTTGTTGGCATCATATGTTCTGGCGCTGTTGTCACAACTATAAACATCGGATACACAAACGACGAACTCAAACACGTCGTCGGTATATCGAAGCCAAAATACTTATTTTGCTCGCCGCTGGCTTACAAAATGCATTCAAAGACCTATAGATCGCTGGGTTTCCTAAAACACATCATATTATATGGCGACGAGGAGTCGCCAGGGGCGATTCCATTCAAATATTTAGCCGTTCCGAGTAACCGACGGACTAGCCACTATCACCTGGAACTGAATGTGAATTTTGAGGATTTCGAGCCAATAGATGTCGAAGGTTATGACACGCTGTTCATATTATACTCGTCCGGAACCACTGGCCTGCCCAAGGGAGTTATGATAACACACCAGAACATCTTGACGTTATCCTGCTCTAACGTAATCTTACCACCTCTCTTGGGACTCACCATAACCCCATGGTATCACACCATGGGTCTCATTGGCACCCTGAACAGTTTCTCTCGTGGTAATACCACGGTCTTTCTGCCGAAATTTAACGTGGAACAATACTTGAGGACCGTTGAAAAATACAAGATAGAACAGTTAGTTTTGGTCCCCGCGGCTCTAGTGGCTCTGGTGAAATCGTCTCTGGATGTAGACACTTCTTCAGTGCATTTAATCTACTGCGGCTCCGCCCCGCTCTACGAAGACACCGCCAAAGCTGTCACTAAGAGGTTTCCGAACGTGACCGCACTCCTCCAGGGTTATGGGATGACGGAGACCACTCTCGCCATCACCATGAACTACAACCCTGACAAGTACGGCAGCGTGGGGACCGTGACCTCGCACACAGTTGTCAAGGTGGTCGATCCAGATACGAAGGAGGTCCTCGGTCCGAATAAACCTGGTGAGATATGCTTGAAGAGTGCGACCATGATGAAAGGATACGTCGGCAGGCCGAGGAGTGAGGGGTACGATGATGAAGGGTTCTTCAGAACCGGAGACATAGGATACTACGACGAGGACGGCTACTTCTATATAGTTGACAGGTTGAAGGAACTCATCAAATACAAGAGCTATCAGGTCCCTCCCGCTGAGATAGAGACGACTCTCCTAAAACACCCCTCAGTGCTAGACGCGGGTGTGGTGGGTGTGCCGCACCCCGTCTCTGGTGAGGTGCCTGTCGCCTTCGTGGTGAAAAGTGGACCCGTCACTGAGGCGGAGCTGGTGAAATTTGTGGCTGACAGGCTCTCAAACCCGAAGCACATCCGCGGCGGAGTCATATTCATAGACGAGATACCGAGGAACCAGACGAGCAAGATACTGAGGAAGGAGCTGAGGAAGATGGCGAAAACAAGGAAAAGTAAACTCTAA

Protein sequence:

>DPOGS205582-PA
MLKNPKYIYGPCDRFVPASLNYGQYFLTKLKENSGKIALINGLTDEKLTYDDIAQEAINVSFSLTRMGVRKGDVIAVSSENRREYWSTVVGIICSGAVVTTINIGYTNDELKHVVGISKPKYLFCSPLAYKMHSKTYRSLGFLKHIILYGDEESPGAIPFKYLAVPSNRRTSHYHLELNVNFEDFEPIDVEGYDTLFILYSSGTTGLPKGVMITHQNILTLSCSNVILPPLLGLTITPWYHTMGLIGTLNSFSRGNTTVFLPKFNVEQYLRTVEKYKIEQLVLVPAALVALVKSSLDVDTSSVHLIYCGSAPLYEDTAKAVTKRFPNVTALLQGYGMTETTLAITMNYNPDKYGSVGTVTSHTVVKVVDPDTKEVLGPNKPGEICLKSATMMKGYVGRPRSEGYDDEGFFRTGDIGYYDEDGYFYIVDRLKELIKYKSYQVPPAEIETTLLKHPSVLDAGVVGVPHPVSGEVPVAFVVKSGPVTEAELVKFVADRLSNPKHIRGGVIFIDEIPRNQTSKILRKELRKMAKTRKSKL-