Monarch geneset OGS2.0

DPOGS203788
TranscriptDPOGS203788-TA1629 bp
ProteinDPOGS203788-PA542 aa
Genomic positionDPSCF300010 + 1376480-1380838
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0023940.058.68% 
BombyxBGIBMGA003693-TA6e-16454.42% 
DrosophilaCG6178-PA2e-9439.84% 
EBI UniRef50UniRef50_G8GE173e-16553.51%Luciferin 4-monooxygenase n=2 Tax=Obtectomera RepID=G8GE17_BOMMO
NCBI RefSeqXP_974050.12e-10640.57%PREDICTED: similar to CG6178 CG6178-PA [Tribolium castaneum]
NCBI nr blastpgi|3580315781e-16453.51%luciferin 4-monooxygenase [Bombyx mori]
NCBI nr blastxgi|3580315783e-16153.51%luciferin 4-monooxygenase [Bombyx mori]
Group
Gene OntologyGO:00081522.1e-85metabolic process
GO:00038242.1e-85catalytic activity
KEGG pathwayath:AT1G205107e-84 
 K10526 (OPCL1)maps-> alpha-Linolenic acid metabolism
InterPro domain[53-459] IPR0008732.1e-85AMP-dependent synthetase/ligase
Orthology groupMCL15406 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203788-TA
ATGTACTCACAGCGAATTAAAAATGGTATTATATACGGAGATGACAATATTCCGCCTACGCCGAATTTGAATTATGGAGCGGTTGCTTTGGAGAAAATTTTATCACACGATCCGAATGGGGTTGCATTGATTGATGGTGCTAAAGGAGAACAAATAACATTCGGTGAGATGGCACGGAAGATAGTCAATATTGCTTCATCTCTAACAAAGCTGGGTGTGAAGGTCGGTGATGTAGTAGCTATATGTAGTGAGAACAGAATCGAATACCTCATAGCTACTATAGCTGTTTTCTGTTGTGGCGGTGTTGTTACTTTTTACAACCCAGCCTACACTAAAGATGATCTTATCCACGGCCTTAACATTTCTAGACCGAAATATGTTTTCCTTTCGGGAGAAATATATGACACACATTTCGCTACTATGAGGCACGCGAGCATCATCTCTAGATTCATATTATTCGATAAAATAAGATCACTGCACAGTCACGTGCTCTTCAAAGATTTAGAGAATAGTAAAATTGATATAAATAACTATCAACCAGTTAAATTTCAAGGTCAACCCAGAACTGCTATGATACTCTATTCATCAGGAACAACGGGTATGGCCAAAGGAGTTAAACTAACACATTTAAATTTGATTGCAAGCTCCTACCAACTACGACCAATAACAAAAAACACAATAAAATTTATGGTTGCACCATGGTCTAGCACAATGGGAATCTTGTGCAGTCTCCGTGAGATCTTATATGGAAGAACACTTGCGTTTTTGGCAAAGTACGAAGAGGATTTATTCCTCCAAACTATACAAAAGTATAAGGTCGGAGTTCTTATTATTGCACCACCCCTCATCGTAATGTTAACTAAATCGGAATTAGCTAATAAATACGATATAAGTTCAGTTGAGTTTATATACTCAGGAGGTGCACCAATCGACAAAGAGTCTATAGAAAAAGTTAAGCAAAGGTATTCAAATATTAAACACGTCCTGCAAGGCTACGGGATGACAGAAGCGACAGGTGCTATAACGGACGACTTAGAAATCGCACCAAAGGAAGGCAGCGTCGGAAGGGCTGCACTGGGAATAATAATTAAGATATCTGATCCTTTTACCAATAAGACACTTGGACCTGGCGAACCAGGCGAAGTCCGTATTAAAGGTTTAACTTTATTTGAAGGTTACGTCAGAAAAGATATGAAAAATGAATTTGACGAGGAAGGTTTTTACAAAACAGGTGATATAGCGTATTACGACGAAGATGGCTACTTCTTTATTGTGGATAGAATAAAAGAACTTATCAAATACAAGGCATGGCAAGTCGCACCCTCAGAACTTGAAGGTCTGATACTGAAGCACCCGGCCGTTAAAGATGTTGGTGTTACTGGCGTTCCCGACGAACTTGCCGGGGAACTACCTACGGCCTTTGTGGTGAAGCAACCAAACTCCACAGTCACGGAACAAGATATTATCAAACACGTAGCAAATAAGGTCGCTCCATGGAAGAGACTGCGAGGAGGTGTAATATTTCTAAATGAGATACCAAAAACTCCGAGCGGCAAAATTCTGAGACGAAAACTACTATCGCTGCTGCCGAAGCGAAGCCCACTAAAGCTACCTGCCAGCAAATTGTGA

Protein sequence:

>DPOGS203788-PA
MYSQRIKNGIIYGDDNIPPTPNLNYGAVALEKILSHDPNGVALIDGAKGEQITFGEMARKIVNIASSLTKLGVKVGDVVAICSENRIEYLIATIAVFCCGGVVTFYNPAYTKDDLIHGLNISRPKYVFLSGEIYDTHFATMRHASIISRFILFDKIRSLHSHVLFKDLENSKIDINNYQPVKFQGQPRTAMILYSSGTTGMAKGVKLTHLNLIASSYQLRPITKNTIKFMVAPWSSTMGILCSLREILYGRTLAFLAKYEEDLFLQTIQKYKVGVLIIAPPLIVMLTKSELANKYDISSVEFIYSGGAPIDKESIEKVKQRYSNIKHVLQGYGMTEATGAITDDLEIAPKEGSVGRAALGIIIKISDPFTNKTLGPGEPGEVRIKGLTLFEGYVRKDMKNEFDEEGFYKTGDIAYYDEDGYFFIVDRIKELIKYKAWQVAPSELEGLILKHPAVKDVGVTGVPDELAGELPTAFVVKQPNSTVTEQDIIKHVANKVAPWKRLRGGVIFLNEIPKTPSGKILRRKLLSLLPKRSPLKLPASKL-