Monarch geneset OGS2.0

DPOGS201823
TranscriptDPOGS201823-TA1149 bp
ProteinDPOGS201823-PA382 aa
Genomic positionDPSCF300145 + 395227-410043
RNAseq coverage352x (Rank: top 33%)
Annotation
HeliconiusHMEL0083326e-17286.48% 
BombyxBGIBMGA013119-TA4e-16387.97% 
DrosophilaCG17370-PE4e-14868.85% 
EBI UniRef50UniRef50_Q9CUS93e-12660.89%Signal peptide peptidase-like 3 n=45 Tax=Euteleostomi RepID=PSL4_MOUSE
NCBI RefSeqXP_001861816.14e-15372.15%signal peptide peptidase [Culex quinquefasciatus]
NCBI nr blastpgi|1700515567e-15272.15%signal peptide peptidase [Culex quinquefasciatus]
NCBI nr blastxgi|481164462e-14870.87%PREDICTED: signal peptide peptidase-like 3-like isoform 1 [Apis mellifera]
Group
Gene OntologyGO:00160211.6e-182integral to membrane
GO:00041901.6e-182aspartic-type endopeptidase activity
KEGG pathway 
InterPro domain[2-376] IPR0073691.6e-182Peptidase A22B, signal peptide peptidase
[66-358] IPR0066392.8e-81Peptidase A22, presenilin signal peptide
Orthology groupMCL14166 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201823-TA
ATGGCACAAGTCGGTGCTAACATGGAATACCAAGAATATAAATGGGCTTACTCAATTATGGATTCATCTAAGGTGTCAGCTTGTTTGATCTCAATGTTGCTGATAGTTTACGGCAGTTTCCGGTCTCTTAATATGGAGAGGGAAGCGCGCGAACGAGCCGAGAGGGAGACTTCATCTACATCCTCGGCTAATAATAATGTCCAGACCCTGAGTACTATGCAGGCTCTGTGCTTCCCTCTCGGTTCGTCTGTGGCGTTGCTGATAATGTTCTTCTTCTTCGATTCCATGCAGACCCTCGTCGCTATATGCACAGCCATTATAGCATGCGCGGCGCTGGCGTTTCTATTGACGCCGCTCTGCCAGTACGTAGCGGGCGGCGTGGTGGGGGCGGGGGCTGCGCGCTGCGGACGGTACTCCGCCCCCGAACTGGCCGCAGCCCTACTGGCAGCCGCTATAGTAGCAGTGTGGGTACTCACAGGCCACTGGCTGCTCATGGACGCTATGGGTATGGGCCTGTGTGTGACCTTCATCGCTCTCATCCGTCTGCCGTCTCTGAAGGTGTCCACGTTGCTCCTCACCGGTCTCCTCTTGTACGACGTGTTCTGGGTGTTTTTCTCCTCCTACATATTCACCACCAACGTCATGGTGAAGGTTGCCACTAGACCGGCTGAGAATCCCATGAACGTGGTGGCCAGGCGTCTTCAGCTCGGCGGTGCTATGAGAGACGCTCCAAAACTCTCTCTGCCCGCCAAACTAGTCTTCCCTTCAATGCATCACCAGGGACACTTCTCTATGCTCGGTCTCGGTGACATCGTGATGCCGGGATTGTTGCTGTGCTTCGTCTTACGCTACGATGCTTACAAGAAGGCGACGCTCGTGTGTCAGATGGGACAAGTCCCCGGTCCCAGGTCAATGGGCTCTCGTCTGACGTACTTCCATTGCTCGTTGCTGGGTTATTTCCTCGGCCTGTTGACTGCTACCGTATCCGCGGAGGTTTTCAAGGCAGCTCAGCCGGCGCTACTCTACCTGGTGCCCTTCACGCTGCTCCCGCTCCTCACAATGGCATATGTTAAGGGAGATCTGAGGAGAATGTGGAGCGAGCCTTTCATACCACCGTCCGGGAAGAGCGCTGCCGACTTCGACGTTTGA

Protein sequence:

>DPOGS201823-PA
MAQVGANMEYQEYKWAYSIMDSSKVSACLISMLLIVYGSFRSLNMEREARERAERETSSTSSANNNVQTLSTMQALCFPLGSSVALLIMFFFFDSMQTLVAICTAIIACAALAFLLTPLCQYVAGGVVGAGAARCGRYSAPELAAALLAAAIVAVWVLTGHWLLMDAMGMGLCVTFIALIRLPSLKVSTLLLTGLLLYDVFWVFFSSYIFTTNVMVKVATRPAENPMNVVARRLQLGGAMRDAPKLSLPAKLVFPSMHHQGHFSMLGLGDIVMPGLLLCFVLRYDAYKKATLVCQMGQVPGPRSMGSRLTYFHCSLLGYFLGLLTATVSAEVFKAAQPALLYLVPFTLLPLLTMAYVKGDLRRMWSEPFIPPSGKSAADFDV-