Monarch geneset OGS2.0

DPOGS202321
Transcript:        DPOGS202321-TA (1704 bp)
Protein:           DPOGS202321-PA (567 aa)
Genomic position:  DPSCF300032 + 493508-498803
RNAseq coverage:   268x (Rank: top 40%)
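The genomic position field packs scaffold, strand, and coordinates into one string. A minimal Python sketch of splitting it is given here. The `Locus` type and `parse_locus` helper are hypothetical names introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'scaffold strand start-end' field."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Split a position string such as 'DPSCF300032 + 493508-498803'."""
    match = re.fullmatch(r"\s*(\S+)\s+([+-])\s+(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return Locus(scaffold, strand, int(start), int(end))


locus = parse_locus("DPSCF300032 + 493508-498803")
print(locus.scaffold, locus.strand, locus.span)
```

Under that assumption the locus spans more bases than the 1704 bp transcript, which would be consistent with the gene model containing introns.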
Annotation
Heliconius      HMEL005604        2e-150  57.53%
Bombyx          BGIBMGA003693-TA  3e-125  43.33%
Drosophila      CG6178-PA         4e-126  44.97%
EBI UniRef50    UniRef50_E0VSL5   1e-125  42.78%  Luciferase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VSL5_PEDHC
NCBI RefSeq     XP_001604903.1    3e-128  45.01%  PREDICTED: similar to CG6178-PA [Nasonia vitripennis]
NCBI nr blastp  gi|156551201      6e-127  45.01%  PREDICTED: luciferin 4-monooxygenase-like [Nasonia vitripennis]
NCBI nr blastx  gi|156551201      3e-124  44.35%  PREDICTED: luciferin 4-monooxygenase-like [Nasonia vitripennis]

Group:
Gene Ontology:    GO:0008152  7.8e-91  metabolic process
                  GO:0003824  7.8e-91  catalytic activity
KEGG pathway:     tad:TRIADDRAFT_56202  9e-110
                  K01904 (E6.2.1.12) maps-> Phenylpropanoid biosynthesis
                                            Phenylalanine metabolism
                                            Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain:  [51-444]  IPR000873  7.8e-91  AMP-dependent synthetase/ligase
Orthology group:  MCL14712  Insect specific

Nucleotide sequence:

>DPOGS202321-TA
ATGGCTGTGTCGATTCATAACAATGTGGTCTCAGGTCCTGAGGAAAGACCTATACCTGCCCATTTATCCTATGGTCAATTTTTGTTTGATAAATTAAAGGCGGGGGGAAATAAAATCGCGCAAATAAGTGCGGAGACTGGAGAATCTGTTACTTATCAAAATATTCTCCAGAATAGTGTTAATCTGGCAGTGGCTCTACAGGAGTTGGGCTTACAGAAGGGTGATGTAGTTTCGCTCAGTTGTGAAAATCGTTTTGAATTCACCGTTGCCTCTTTAGCCGTAATCTTTGCTGGAGGAGTTTTATCAACTCTAAATGTTACTTATTCGCCAGGTGAAATTTCCCATGTATTCCAAATCACAAAGCCCAAGTTTATATTCACGTCGCCGATCACTGCACAGAACATGTATGACTGCAGCAAGGATCTGACATTTGTGAAGAATTTGATTTTGTTCGGTGAATATGACATTGTACCCGCCGTGTTCTACAATGATTTAGTCAAGAAACACTGTGATATAGATGATTTCGCATTGGTCGATGTGAATGGAGCAGAGGATACTGTGGCCGTAATGTGTTCATCGGGAACGACTGGCTTACCAAAGGGTGTTATGTTAACCCATGTAAATTTCCTCACACTATCCGCTCATATGAAATATTATTTGGAGACGTCTCAACAGAAAAGGAAACATAATGTAATAACAGCCCTGTCTTTGATCCCTTGGTTTCATGCTTATGGATTCATTACAACATTAGCCGTGATGTGCCTACACGTAGAGGTTGTGTTTCTTGTTAGATTCGAAGAGGAACAATTTCTTGAAACGATACAAAAATATAAGATAAACATGACGACAATAGTGCCACCGCTCGCTGTTTTCCTTGCCAAACATCCGTTGGTCTCCAAGTATGACCTGAGCTCATTGAACGAAATGTGGTGCGGAGCCGCTCCCCTGTCCAAGGAAATACAGACGCTTGTCACTAAACGAACTGGTATTGATTTCATCAAGCAAGGTTACGGCCTGACAGAAGTCACAATGGCATGTTGTGTGGATTTAGTCGGCAGAAGCAAAGCAGGCTCCTGCGGTACACCTGCGCCTGGCATGAAGATCAAGGTGATAGATACTGAGAGTGGTAAGAAATTAGGTCCCAATGAAGAGGGTGAGCTGTGCATTAAGTCGCCTCTCCGCATGAAGGGATATTTGGGTGATAAAGCATCCGGTGATGCCATGATTGATGAGGAAGGTTATGTTAAGACGGGAGATATTGGGTACTATGACAAGGAAGGATACTTCTACATTGTTGATAGACTCAAAGAACTCATCAAATATAAAGGTTTCCAGAGCAACAAGGAAGGATACTTCTACATTGTTGATAGACTCAAAGAACTCATCAAATATAAAGGTTTCCAGGTTGCTCCAGCTGAGTTGGAATCTTTACTGCTGCAGCACAGTGCAGTGGCGGATTGCGGTGTTGTTGGCAGACCTGATGAATTGGCGGGTGAACTACCGGTAGCGTTTGTAGTCAAACAGCCGGAAGCCAATATACAGGAACAGGAAATTATTGACTACGTAGCCAAGAAGGTGTCGCCAGCCAAACGTCTACGAGGTGGCGTTATATTTGTTGACGAAATACCAAAGAATCAATCAGGTAAAATTCTGAGAAGGGAGCTAAGGAAAATGTTGTCCGCCAACATTAAAAGCAAGCTATAA

Protein sequence:

>DPOGS202321-PA
MAVSIHNNVVSGPEERPIPAHLSYGQFLFDKLKAGGNKIAQISAETGESVTYQNILQNSVNLAVALQELGLQKGDVVSLSCENRFEFTVASLAVIFAGGVLSTLNVTYSPGEISHVFQITKPKFIFTSPITAQNMYDCSKDLTFVKNLILFGEYDIVPAVFYNDLVKKHCDIDDFALVDVNGAEDTVAVMCSSGTTGLPKGVMLTHVNFLTLSAHMKYYLETSQQKRKHNVITALSLIPWFHAYGFITTLAVMCLHVEVVFLVRFEEEQFLETIQKYKINMTTIVPPLAVFLAKHPLVSKYDLSSLNEMWCGAAPLSKEIQTLVTKRTGIDFIKQGYGLTEVTMACCVDLVGRSKAGSCGTPAPGMKIKVIDTESGKKLGPNEEGELCIKSPLRMKGYLGDKASGDAMIDEEGYVKTGDIGYYDKEGYFYIVDRLKELIKYKGFQSNKEGYFYIVDRLKELIKYKGFQVAPAELESLLLQHSAVADCGVVGRPDELAGELPVAFVVKQPEANIQEQEIIDYVAKKVSPAKRLRGGVIFVDEIPKNQSGKILRRELRKMLSANIKSKL-
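
The transcript and protein entries above can be cross-checked by translating the coding sequence. The following Python sketch assumes the standard genetic code and writes the stop codon as `-`, as the protein record does. The `translate` helper is a hypothetical name, and only the first and last few codons of DPOGS202321-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, marking stops with stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven and last five codons of the DPOGS202321-TA sequence.
head = translate("ATGGCTGTGTCGATTCATAAC")
tail = translate("AAAAGCAAGCTATAA")
print(head, tail)

# Length check: 567 residues plus one stop codon, three bases each.
expected_bp = (567 + 1) * 3
print(expected_bp)
```

The translated fragments match the start (`MAVSIHN`) and end (`KSKL-`) of DPOGS202321-PA, and the length check agrees with the 1704 bp reported for the transcript.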