Monarch geneset OGS2.0

DPOGS209986
TranscriptDPOGS209986-TA1119 bp
ProteinDPOGS209986-PA372 aa
Genomic positionDPSCF300148 + 330496-333318
RNAseq coverage1081x (Rank: top 12%)
Annotation
HeliconiusHMEL0129215e-8149.03% 
BombyxBGIBMGA011271-TA4e-13573.27% 
Drosophilaregucalcin-PD2e-6744.44% 
EBI UniRef50UniRef50_E0XEM81e-7346.58%Luciferin regenerating enzyme n=2 Tax=Polyphaga RepID=E0XEM8_9COLE
NCBI RefSeqXP_967668.13e-6945.95%PREDICTED: similar to luciferin-regenerating enzyme [Tribolium castaneum]
NCBI nr blastpgi|3323768972e-7650.49%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323768971e-7550.49%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00055098.3e-08calcium ion binding
GO:00302348.3e-08enzyme regulator activity
KEGG pathwaycbu:CBU_17894e-36 
 K01053 (E3.1.1.17)maps-> Pentose phosphate pathway
    Ascorbate and aldarate metabolism
    Caprolactam degradation
InterPro domain[68-364] IPR0110421.7e-73Six-bladed beta-propeller, TolB-like
[83-340] IPR0136588e-71SMP-30/Gluconolaconase/LRE-like region
[84-101] IPR0055112.2e-42Senescence marker protein-30 (SMP-30)
[202-215] IPR0083678.3e-08Regucalcin
Orthology groupMCL10244 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209986-TA
ATGCTCGTCCTATGGCTCCTCCTAACTCCAGGGACCTCAGCTCTAGCTCGTATCATTGATGTCTTCATTCACACCACCATGTGTGAGGGTTGTCGTAGCGTGTCGATATTATGTGTTATGATATTCGTTGAGGGTGAAGTTTTGTTCTTCTCTAACTCCATCAATCTGTGCCTTCAATACATCCCAGAAACAACTCATTTCCTTCAGATGGCCCCCCAAGTGCAGGCGGTGACGGAGCCCGTGTGGCTGGGCGAGGGTCCTCACTGGTCCCACGACCACCAGGCCCTCTTCTTCGTCAGCATCTTCGACAAGACCATTCACAAATACAACCCTTCCAACGGAAAACATACTAGAGCCAAACTAGGTGACATGCCCGGCTTTATAATACCGGTGGAAGGGAAGCTGGACCGCTTCGTGGTCGGTCTGAAGAGGAGGGTGGTGGAGGTTCAGTGGGACGGGGAGGGCGGGGACGCCACCGTCATCAGGGAGCTGGCCGAGCTGGACAAACACAGTCCCGATAACAGGATCAACGACGCCAAGGCGGACCCCAGGGGGAGACTGTTTGTTGGTACCATGGGTCACGAGTACGAGCCGGGTAAGTTCCACCTGAAGCAGGGCTCCCTGTACCGCCTGGAGCGGGACGGGAGCGTGTCCCGCGTGGCCCAGGACATCGATATCTCGAACGGTCTGTGCTGGGACCCGAAGAGAAGCGCCTTCTACTACGCTGACTCCTTCGAGTACGCCATCAGAAGATACGACTACGACATCGACACGGGCAACATCTCGAACCCCACGATAATATTCAAGTATTCGGATCACGGGCTGGACGGCATCGTGGACGGCATGTCTATAGACACGGACGGCAACCTGTGGGTCGCCAACTTCGACGGGTCACAGGTGTTGAAGATCGACCCCGTGAAGGGCGCTCTCCTCCAGCGAGTCCCCATCCCGGCGCTGCAGACCACCTCCGTGACGTGGGGAGGTCCTGCGCTGGACGTGCTGTACGTGACCTCCGCCTCCATGAACAGGGGTCAGGAACAGAAGCCACCTTGCGGCTCAACATTCAAGGTGACCGGTCTCGGGGCCCGCGGACATCCCAACAACAATGTTAAACTTTAA

Protein sequence:

>DPOGS209986-PA
MLVLWLLLTPGTSALARIIDVFIHTTMCEGCRSVSILCVMIFVEGEVLFFSNSINLCLQYIPETTHFLQMAPQVQAVTEPVWLGEGPHWSHDHQALFFVSIFDKTIHKYNPSNGKHTRAKLGDMPGFIIPVEGKLDRFVVGLKRRVVEVQWDGEGGDATVIRELAELDKHSPDNRINDAKADPRGRLFVGTMGHEYEPGKFHLKQGSLYRLERDGSVSRVAQDIDISNGLCWDPKRSAFYYADSFEYAIRRYDYDIDTGNISNPTIIFKYSDHGLDGIVDGMSIDTDGNLWVANFDGSQVLKIDPVKGALLQRVPIPALQTTSVTWGGPALDVLYVTSASMNRGQEQKPPCGSTFKVTGLGARGHPNNNVKL-