Monarch geneset OGS2.0

DPOGS200905
TranscriptDPOGS200905-TA1293 bp
ProteinDPOGS200905-PA430 aa
Genomic positionDPSCF300066 + 119838-123209
RNAseq coverage194x (Rank: top 48%)
Annotation
HeliconiusHMEL0134013e-5872.93% 
BombyxBGIBMGA000673-TA7e-5769.34% 
DrosophilaPGRP-LF-PA3e-4645.51% 
EBI UniRef50UniRef50_E2C4B71e-5457.93%Peptidoglycan-recognition protein-LF n=3 Tax=Formicidae RepID=E2C4B7_HARSA
NCBI RefSeqXP_972267.23e-6137.94%PREDICTED: similar to AGAP005203-PC [Tribolium castaneum]
NCBI nr blastpgi|2700048323e-6037.94%hypothetical protein TcasGA2_TC002790 [Tribolium castaneum]
NCBI nr blastxgi|1892354612e-5838.25%PREDICTED: similar to AGAP005203-PC [Tribolium castaneum]
Group
Gene OntologyGO:00087451.9e-75N-acetylmuramoyl-L-alanine amidase activity
GO:00092531.9e-75peptidoglycan catabolic process
GO:00082702.2e-69zinc ion binding
KEGG pathway 
InterPro domain[252-416] IPR0155101.9e-75Peptidoglycan recognition protein
[254-397] IPR0066192.2e-69Peptidoglycan recognition protein family domain, metazoa/bacteria
[250-421] IPR0025024.9e-69N-acetylmuramoyl-L-alanine amidase domain
Orthology groupMCL11802 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200905-TA
ATGGCTACAAAGGCAAGTAAATCCAATACAAATGGGATGACAATAATAGGAAATATCGATGATATGCAAATTATAAAAGAAATCAGTGATGAAGATAGTATTAATAGACCGAAAAAATCATGTTCTTTGAACACTCCTTTAAATCTTCCCAGTAATTTAGTTCCGAACACTGGAGCACCTGTTTATGAAAGCGTCTCGGTTACAAATTCGGAAAATGTTCAGTTTGGTAACAATACATATTTTAATGGTCCAGTAATCATAAAACAAATAGTCCAACCTAAATCTGGCGTTGATAATGTTTCCTATACTAAAACAGAAGAAACAGAAAACGAGGATTCCCAGAAGCCTCCTTGTGCCACATTGGATGAAACTTATATTAAAAAGAAAGAGATATTATTATGGCACAAGATAACATTCTCCGCAGTATGTATTACAGTTGTTGGCTGTGTTGTGGCTCTCGTGCTTGTTTTACAAAACAAAAGTGATGATCAAATTCATACAAATGCACAGTCAGATTATGACAAAGAAGACAAACATACTCGTTTGGTGGTGGGAGTTTGTGATGAAAATTTTCAACTTGTTATAATATCGGTACTCCTGTTATCATTAACATTTTTGTTAGTTTCTTACTCATTACTAACACCGAGTGTTAAGAAAATAAATAATGGTGTGGACATAAACCATCATTTGAACTGTGTCAATAAAAATTGCATGTTCTGCAAGAAGACAGCAAATTCATTGCTGATAGCACCAAATCATTTGCGGATAGTGTCCAGATCGGACTGGTTGTCACAGACTGTTGAAGGCGAGGTGAACATGCTTCGTCAACCGGTACCATGGGTCATTATATCTCACACTGCCACCGAAAGCTGCTATACACAGAGTGAATGTGTTCTACGTGTAAGATTGATACAAATGTATCATGTTGAAGCTCGAAAGTGGTCAGATATTGGGTACAACTTTCTGGTGGGAGGAGATGGTTCGGTGTACCATGGTCGTGGCTGGAACATAGAGGGGGCTCATACAAAAGGTTACAACAAATATAGTATCGGAATAGCTTTTATTGGTACATTCAACACCACTCCGCCTCCAAAACAGCAAGTTGAAGCCTGCGAGAAACTTCTCAAGCTGGGAGTTGAATTGGGTAAACTCGCCAAAGACTACAAACTGTTTGCCCACAGGCAATTCATGTCAACTCTCAGTCCAGGTGATACAGTTTATGATATCATCAAGGAATGGCCACATTTTGTCAACAATTTGACAGACACCCAGTCATTGATACCCAATTATTAA

Protein sequence:

>DPOGS200905-PA
MATKASKSNTNGMTIIGNIDDMQIIKEISDEDSINRPKKSCSLNTPLNLPSNLVPNTGAPVYESVSVTNSENVQFGNNTYFNGPVIIKQIVQPKSGVDNVSYTKTEETENEDSQKPPCATLDETYIKKKEILLWHKITFSAVCITVVGCVVALVLVLQNKSDDQIHTNAQSDYDKEDKHTRLVVGVCDENFQLVIISVLLLSLTFLLVSYSLLTPSVKKINNGVDINHHLNCVNKNCMFCKKTANSLLIAPNHLRIVSRSDWLSQTVEGEVNMLRQPVPWVIISHTATESCYTQSECVLRVRLIQMYHVEARKWSDIGYNFLVGGDGSVYHGRGWNIEGAHTKGYNKYSIGIAFIGTFNTTPPPKQQVEACEKLLKLGVELGKLAKDYKLFAHRQFMSTLSPGDTVYDIIKEWPHFVNNLTDTQSLIPNY-