Monarch geneset OGS2.0

DPOGS209948
TranscriptDPOGS209948-TA918 bp
ProteinDPOGS209948-PA305 aa
Genomic positionDPSCF300148 - 335564-336784
RNAseq coverage3215x (Rank: top 4%)
Annotation
HeliconiusHMEL0129219e-11057.00% 
BombyxBGIBMGA011337-TA4e-11360.59% 
Drosophilaregucalcin-PD7e-6441.61% 
EBI UniRef50UniRef50_E0XEM83e-7045.45%Luciferin regenerating enzyme n=2 Tax=Polyphaga RepID=E0XEM8_9COLE
NCBI RefSeqXP_967668.15e-6944.84%PREDICTED: similar to luciferin-regenerating enzyme [Tribolium castaneum]
NCBI nr blastpgi|3323768973e-7145.82%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3010684951e-7046.00%luciferin regenerating enzyme [Lampyris turkestanicus]
Group
KEGG pathwaypto:PTO09075e-32 
 K01053 (E3.1.1.17)maps-> Pentose phosphate pathway
    Ascorbate and aldarate metabolism
    Caprolactam degradation
InterPro domain[14-272] IPR0136585.2e-70SMP-30/Gluconolaconase/LRE-like region
[5-296] IPR0110421.5e-65Six-bladed beta-propeller, TolB-like
[15-32] IPR0055112.8e-46Senescence marker protein-30 (SMP-30)
Orthology groupMCL10244 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209948-TA
ATGCCTCTTAAGATAGAGAAGATAACTGATCCGCTGACGCTTGGCGAGGGTCCTCATTGGGATGACAGGCAGCAGGCATTGTTGTTCGTGGACATCCTCGCGTGTACTATACATAAATACGTGTTGTCCTCTAAGAAACACACGAAAACTAAACTCGACGGTCGACCGGGCTTCATAGTACCGGTGGAGGGCGAAACGGACCAATATATAATAGGTTTGGAGTTGTCCTTTGTCATAATACAATGGGACGGCGAGGAAGGCAGCCCGGCGAAGGTTCTCCGTACATTAGCCGACGTCGACCAGGACGTGTCTCCCAAACCCAGGATTAACGATGGGAAGGCCGACCCTCGAGGGCGGATTTTTGCAGGCTCCATAGGCTATGAGAATCCACCTGGTAAGTTTTCGCCAAAACAGTGTTCACTGTACCGTCTGGATAAGAGTGAGGTGAAGAAAGTGTGCGGAGACATAACGGTGTCAAACGGACTGGCCTGGGACCTGGAGAGAAAGGCCTTCTATTATATCGACAGCATGGACTTAAAGATTAGGAGATACGATTACGACGTTGACTCTGGGGATGTTTCGAACATGAAGTATATATTCGACCTGCAGGCCAACGGTGTAGAGGGCTTCCCTGACGGAGCCACGATAGACTCGGACGGTAACCTGTGGGTCGCCGTGTTCTCAGGCTCCTGCGTCCTCCAGGTGGATCCGGTCCGAGGGACATTGATACAGAAGCTGGCGATCCCCGCCAGTCAAGTGACCTCTGTTACCTTCGGAGGTCCCGACTTCGATGTCATGTTCGTAACGTCGGCCAGTGTCGACTACACGGGCCCCCAGGAACCTCCCGGAGGGTGCACCTTCATGATCACAGGCGTCGGAGCCAAGGGTCTTCCTAATGTTTGCTACAAGCTCCAGTAA

Protein sequence:

>DPOGS209948-PA
MPLKIEKITDPLTLGEGPHWDDRQQALLFVDILACTIHKYVLSSKKHTKTKLDGRPGFIVPVEGETDQYIIGLELSFVIIQWDGEEGSPAKVLRTLADVDQDVSPKPRINDGKADPRGRIFAGSIGYENPPGKFSPKQCSLYRLDKSEVKKVCGDITVSNGLAWDLERKAFYYIDSMDLKIRRYDYDVDSGDVSNMKYIFDLQANGVEGFPDGATIDSDGNLWVAVFSGSCVLQVDPVRGTLIQKLAIPASQVTSVTFGGPDFDVMFVTSASVDYTGPQEPPGGCTFMITGVGAKGLPNVCYKLQ-