Monarch geneset OGS2.0

DPOGS209513
TranscriptDPOGS209513-TA972 bp
ProteinDPOGS209513-PA323 aa
Genomic positionDPSCF300127 + 199337-203560
RNAseq coverage266x (Rank: top 40%)
Annotation
HeliconiusHMEL0041641e-6869.88% 
BombyxBGIBMGA001967-TA1e-6565.66% 
DrosophilaCG8726-PB5e-3544.97% 
EBI UniRef50UniRef50_B6RQP93e-4150.62%I-type lysozyme n=2 Tax=Cucujiformia RepID=B6RQP9_SITZE
NCBI RefSeqXP_972128.22e-4352.20%PREDICTED: similar to PX domain containing serine/threonine kinase [Tribolium castaneum]
NCBI nr blastpgi|1892393813e-4252.20%PREDICTED: similar to PX domain containing serine/threonine kinase [Tribolium castaneum]
NCBI nr blastxgi|1674442182e-4358.73%i-type lysozyme [Sitophilus zeamais]
Group
Gene OntologyGO:00037966.4e-37lysozyme activity
GO:00167724.2e-16transferase activity, transferring phosphorus-containing groups
GO:00055247.7e-09ATP binding
GO:00046727.7e-09protein kinase activity
GO:00064687.7e-09protein phosphorylation
KEGG pathway 
InterPro domain[11-132] IPR0085976.4e-37Destabilase
[113-323] IPR0110094.2e-16Protein kinase-like domain
[153-253] IPR0174427.7e-09Serine/threonine-protein kinase-like domain
Orthology groupMCL18553 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209513-TA
ATGATATCACCAGCGCAAGTGGTGTTCTTGTCGATCCTGCTCATTGCGTTGCTCTGGACCGGATCTTCAGGCGTATTTATACCAAATCTAACAGACGGTTGCTATAGATGTTTGTGTTACATTTCGACTCTTTGCGACGTATCCAACGATTGTTCCGGAGGCTACTGCGGTCCTTACAACATCTCAAGAGTGTACTGGGTAGACGCCGGCAACGTCACCTTACCTGATGACGAGCCTGAGAGAAATAATGCTTGGAAGGACTGTGCTCGGAACTATCAGTGCGCGAGAAAAATTATAGAGGGCTATCTACAAAGGTTTGGGAAGGACTGCAACGGTGACGGTGTGACAGATTGCTATGACTACATGATGATTAATGGCAACGGAGGTTACGGATGTACTTCGCCTCTTAACAGATCAGAGAATGGAAGGAGATGGCTCAGGAGATACGAGGAATGTCGGAGTTCTCATCTACATCCGTACATAGCAGACATATTAGCGATGAACACGTTAGAATCAGGCGCCTACGTTGTGAGACGCATATATAAGAACGGTAGCTTCAGGGATCTGTTATACGGAACGGAGTATAATAAGAGCCATCTAGCTAAGTACGGCAATCCCAAGACAAGGAAGCCGTTCACGAACGGTCAGATATCACACTACGGGTACCAAATATTGCAGGCGTTGAAGTTTTTGCACAGCAAGGGCTTACCGCACGGTCACTTACACCCCGGAAACATAACCGTGGACAACCAGACCGCTCTGCTATTGGATATAGAGAACCTGCTGATGGGCGTACCGAGTCTGTGCCGGCCGTATGTGTTGGATGTGAGGAGAGCGAATACGATGGAGTCAGTGGACGTGTATTGTTTCGGGAGGACTTTATACGAGATGGCCTTCGCGTCCCCTCTACAACAACACTACTGTGACGTTTACCCTGACAATATTCCGCAAGACTTGGGTAGCTTCATTTGA

Protein sequence:

>DPOGS209513-PA
MISPAQVVFLSILLIALLWTGSSGVFIPNLTDGCYRCLCYISTLCDVSNDCSGGYCGPYNISRVYWVDAGNVTLPDDEPERNNAWKDCARNYQCARKIIEGYLQRFGKDCNGDGVTDCYDYMMINGNGGYGCTSPLNRSENGRRWLRRYEECRSSHLHPYIADILAMNTLESGAYVVRRIYKNGSFRDLLYGTEYNKSHLAKYGNPKTRKPFTNGQISHYGYQILQALKFLHSKGLPHGHLHPGNITVDNQTALLLDIENLLMGVPSLCRPYVLDVRRANTMESVDVYCFGRTLYEMAFASPLQQHYCDVYPDNIPQDLGSFI-