Monarch geneset OGS2.0

DPOGS207148
Transcript        DPOGS207148-TA  1608 bp
Protein           DPOGS207148-PA  535 aa
Genomic position  DPSCF300001 + 4105659-4120979
RNAseq coverage   372x (Rank: top 32%)
Annotation
Heliconius        HMEL010548              2e-99   57.89%
Bombyx            BGIBMGA000584-TA        4e-103  52.60%
Drosophila        PGRP-LF-PA              1e-28   26.78%
EBI UniRef50      UniRef50_UPI000203B05E  4e-27   40.48%  UPI000203B05E related cluster n=1 Tax=unknown RepID=UPI000203B05E
NCBI RefSeq       XP_001601870.1          3e-28   38.15%  PREDICTED: similar to peptidoglycan recognition protein [Nasonia vitripennis]
NCBI nr blastp    gi|354498472            1e-31   27.18%  PREDICTED: peptidoglycan recognition protein 4 [Cricetulus griseus]
NCBI nr blastx    gi|156553125            9e-27   38.15%  PREDICTED: peptidoglycan-recognition protein SA-like [Nasonia vitripennis]
Group
Gene Ontology     GO:0008745  1.3e-41  N-acetylmuramoyl-L-alanine amidase activity
                  GO:0009253  1.3e-41  peptidoglycan catabolic process
                  GO:0008270  1.9e-26  zinc ion binding
KEGG pathway 
InterPro domain   [341-508]  IPR002502  1.3e-41  N-acetylmuramoyl-L-alanine amidase domain
                  [90-278]   IPR015510  6.5e-36  Peptidoglycan recognition protein
                  [130-268]  IPR006619  1.9e-26  Peptidoglycan recognition protein family domain, metazoa/bacteria
Orthology group   MCL21785 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207148-TA
ATGTGGAGAAGCAGGGAGGATACGGGACAGACGGTTGCACAGAGTAGGGTCCCCGCGGAGTTGTCCGTGATAGACGAAAGTGCAATCGCCTCGGCCATACCCTCACCCGCGGTCGCCAACCTCAATATAAGCAAGTCTTCCAAGGTCCACATCGGTCCCAAGTTCGTCTCCGTCACTCAAAATGTACAAAATGCTGAAACTCTTAAAGGCCGTTTCTTGGGCCTCGAATTGGTATCAACCAAACAGGCGCGGAGGTTGCGTTGCAGCGTCGCAGTATTTGTATGTTGGGCGCTTGTAGTAGCCTCCGCGCTTGTCATCTATCTCGTCTATGTGGCGTTGCCAAACCAACAATTTCGTCTTGATATAGGTCTAAATGAAACGTGGTACTTACGTCGGGGTGACTGGCAAGCGATGAATCCTTATAACGTACGTTTCTTGCATCTGCCTGTTCCTAAAGTCATCATTGGTCATTCAGCGGCCAATTATTGTAACCAAAGGTACAGATGCATAGAACAGATGATCATCATCCAACAAGACCATTTAAGACGAGAGCTATCTGACATCGGTCCAAATTTCCTCGTCGGTGGCAACGGCTTCATTTTTGAGGGTAGAGGCGCTAACGTCCACGGGGCTATGGTAGGCTCTCTTAACTCTAGAGCTATATCGATCATGTTTATGGGAAATTACATCCATGACCAGCCAGACTCGAAGCAATTTGAACATTTAAACGTTCTTCTGGACGTTTTGGTTAGAGAGGGCGTTCTACGACAAGATTATACGTTGGTCGGCCATTGTCAGGTTAACTTCGATACGATCAGTCCTGGTCCCCATATAATGACCCAGCTAGAGCTGCCCTTGCCGAAGTACCTATGGCAGGTGATAAAGAATAGTTCGCGCACGGAGCGACTATCCTGTGCTGCAGCGCTTATCGTGTTGATTGTCTGCGTTGCCCTCATCGCCTACTTTTCAGTTATGACGAGCAAAACTAAGGAAGAGGACAGAGCTCCACACGAATGGAGAATCACTCGTGAAATGTGGCTCGCACGGCCGTATAACTACACGTATTATACATATGATTTTGAACCGTTGTTGTTGGTCGTGATACAAAACACAGTCGGCCCACAATGTCATCGCTTCCAAGCCTGTGCAGCCGAACTCCGAAATTTGCAAGGCTGGTTCATCAATGACATGGGCTATGACATCCCTTACAATTTTGCGGTTGGTAACGATGGGCGTGTGTATGAAAATCGTGGCTGGTCAGTTGAAGGCGCACATACACGTGGTTATAATCGATGCTCTATGGGCATCGGGTTCCTTGGTGACTACAGAGGAGAGATGGAAAATCACGCAGTTGTAACTCCCGAACAAGAAAACCGAACTCAATTAATACTGGCAGAGGGTGTGAAGCTCGGTTACTTGCGGCGAGATTTCCTAGTAGTAGGAGCCAAAGATATTTCTGACTCGGCCAGTCCTGGCTCCAACCTCTACAATGCAATCCGTCGGTGGCCCAACTACGACCATCAAAACAGGTTCAAAGGACTTTCATGCGAACAGATTCACGAAAAGTACAAGGACACACCTTTATACGAAGTCCCCAAAGATATATAG

Protein sequence:

>DPOGS207148-PA
MWRSREDTGQTVAQSRVPAELSVIDESAIASAIPSPAVANLNISKSSKVHIGPKFVSVTQNVQNAETLKGRFLGLELVSTKQARRLRCSVAVFVCWALVVASALVIYLVYVALPNQQFRLDIGLNETWYLRRGDWQAMNPYNVRFLHLPVPKVIIGHSAANYCNQRYRCIEQMIIIQQDHLRRELSDIGPNFLVGGNGFIFEGRGANVHGAMVGSLNSRAISIMFMGNYIHDQPDSKQFEHLNVLLDVLVREGVLRQDYTLVGHCQVNFDTISPGPHIMTQLELPLPKYLWQVIKNSSRTERLSCAAALIVLIVCVALIAYFSVMTSKTKEEDRAPHEWRITREMWLARPYNYTYYTYDFEPLLLVVIQNTVGPQCHRFQACAAELRNLQGWFINDMGYDIPYNFAVGNDGRVYENRGWSVEGAHTRGYNRCSMGIGFLGDYRGEMENHAVVTPEQENRTQLILAEGVKLGYLRRDFLVVGAKDISDSASPGSNLYNAIRRWPNYDHQNRFKGLSCEQIHEKYKDTPLYEVPKDI-