Monarch geneset OGS2.0

DPOGS204686
TranscriptDPOGS204686-TA1005 bp
ProteinDPOGS204686-PA334 aa
Genomic positionDPSCF300170 + 55513-57625
RNAseq coverage153x (Rank: top 53%)
Annotation
HeliconiusHMEL0176001e-10868.22% 
BombyxBGIBMGA010133-TA8e-10468.20% 
Drosophilasmp-30-PA2e-3532.99% 
EBI UniRef50UniRef50_D6X0J59e-5841.11%Putative uncharacterized protein n=6 Tax=Tribolium castaneum RepID=D6X0J5_TRICA
NCBI RefSeqXP_971420.21e-6037.73%PREDICTED: similar to luciferin-regenerating enzyme [Tribolium castaneum]
NCBI nr blastpgi|1892406532e-5937.73%PREDICTED: similar to luciferin-regenerating enzyme [Tribolium castaneum]
NCBI nr blastxgi|1892406537e-6039.40%PREDICTED: similar to luciferin-regenerating enzyme [Tribolium castaneum]
Group
Gene OntologyGO:00055091.1e-07calcium ion binding
GO:00302341.1e-07enzyme regulator activity
KEGG pathwayhar:HEAR02162e-33 
 K01053 (E3.1.1.17)maps-> Pentose phosphate pathway
    Ascorbate and aldarate metabolism
    Caprolactam degradation
InterPro domain[35-293] IPR0136584.9e-66SMP-30/Gluconolaconase/LRE-like region
[29-320] IPR0110425.4e-66Six-bladed beta-propeller, TolB-like
[36-53] IPR0055114.3e-37Senescence marker protein-30 (SMP-30)
[153-166] IPR0083671.1e-07Regucalcin
Orthology groupMCL19388 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204686-TA
ATGGCATCGATAAAGTGTTTTTTAATCGTGGCACTGCTCTCATTCTGCGTTGAAAGCAAAGTCTCCACACCTCTTATAAAAAATATACATCAAGGAGGAGTTCTTTCTGAAAGTCCACTTTGGGTGAACGAAGAAGGTGCGCTTTACTGGGCCGACATTCTATCGCAGAAAATTTTCAGACTGGAAATAGATACTGGCAATGTCACAAGCAAATACATCGGATATGGCCCCGTAAGCTTAATTATAAGGGTTAAAGATTATCCAAAATTACTTTTGATATCAGTCAGAAGTGAACTTTACTTTCTGAATTGGGATAATTTTGAAGGAGACAAATCTTTGAGACTCCTCACAGCTGTGGACCTCGGCCATCCTGATAATAGATGCAATGATGGCAAGGTCGACGCGAAAGGAAGATTATGGTTTGGTACTATTGGAAAGGAATCAGGTAGCTGGCACGAAAAAGACGCGGCCTCTGTATATATGCTAACGGAAAACAACTTCAAAAATCCGGAAATTAAAATCCGTCCAGTCTCTATTTCAAATGGAATTGCTTGGAGTTCGGATAACAAGTACATGATGTACATAGATTCAAGTGCTCGGGCGATTTCCGTGTATGATTTTGATTTGGAAACGGGAAAAATTGAAAATGGCAGAACATTATTCAGCTTTCCAGCAAATAATCTGACAGGTTCTCCAGATGGAATGACAATTGACAGGGATGATAATCTGTGGGTAGCTTGCTTTAATGACGGCAAGGTGATAAAAGTCGATCCAAGAGCCGGTAAGCTACTCGAGCAGCATAGACTACCTGCAACCAAAATCACTTCTGTGATGTGGGGAGGTTATGATTACTCCACTTTGTACGTTACCAGCGCCAGTAAGGATTTGACAGGCAGCGAATTAGCTCAACAACCGGAAGCTGGTTCCGTTTTCGCTATAACCGGCACAGGTTCTTCCGGTTACCCCATGAACGAGTTTATTTTTAAAGACGCTGACAAATATTGA

Protein sequence:

>DPOGS204686-PA
MASIKCFLIVALLSFCVESKVSTPLIKNIHQGGVLSESPLWVNEEGALYWADILSQKIFRLEIDTGNVTSKYIGYGPVSLIIRVKDYPKLLLISVRSELYFLNWDNFEGDKSLRLLTAVDLGHPDNRCNDGKVDAKGRLWFGTIGKESGSWHEKDAASVYMLTENNFKNPEIKIRPVSISNGIAWSSDNKYMMYIDSSARAISVYDFDLETGKIENGRTLFSFPANNLTGSPDGMTIDRDDNLWVACFNDGKVIKVDPRAGKLLEQHRLPATKITSVMWGGYDYSTLYVTSASKDLTGSELAQQPEAGSVFAITGTGSSGYPMNEFIFKDADKY-