Monarch geneset OGS2.0

DPOGS202915
Transcript: DPOGS202915-TA | 1107 bp
Protein: DPOGS202915-PA | 368 aa
Genomic position: DPSCF300126 + 393699-395563
RNAseq coverage: 1407x (Rank: top 9%)
Annotation
Heliconius: HMEL014589 | 0.0 | 79.33%
Bombyx: BGIBMGA004196-TA | 1e-178 | 85.05%
Drosophila: Spp-PA | 4e-142 | 67.59%
EBI UniRef50: UniRef50_Q16NF3 | 1e-142 | 67.02% | Signal peptide peptidase n=11 Tax=Bilateria RepID=Q16NF3_AEDAE
NCBI RefSeq: NP_001040306.1 | 6e-175 | 84.24% | presenilin-like signal peptide peptidase [Bombyx mori]
NCBI nr blastp: gi|114051566 | 1e-173 | 84.24% | presenilin-like signal peptide peptidase [Bombyx mori]
NCBI nr blastx: gi|114051566 | 1e-176 | 84.51% | presenilin-like signal peptide peptidase [Bombyx mori]
Group
Gene Ontology: GO:0016021 | 1.1e-167 | integral to membrane
               GO:0004190 | 1.1e-167 | aspartic-type endopeptidase activity
KEGG pathway:
InterPro domain: [13-357] | IPR007369 | 1.1e-167 | Peptidase A22B, signal peptide peptidase
                 [69-340] | IPR006639 | 2.1e-104 | Peptidase A22, presenilin signal peptide
Orthology group: MCL12971 | Single-copy universal gene

Nucleotide sequence:

>DPOGS202915-TA
ATGGCAGAATCTGTAACAGAAATTCCTATAAATATAGAAGAAAGTGTCAGGGAAGCAGTACAAAATGTTACCGAAAAGCCTCCATCAAGTATTGAAGGAGTTGCTATTGCCTACTTAAGTCTAGTAATAATGGCTATATTGCCAATATTTTTTGGATCTTTTCGGTCTGTAAAATATTTAACGGATAAAAAGAATTCAGGCGAGAAGGCTGAAACTATGTCAAAAAAGGATGCGTTAATCTTTCCACTGATAGCATCATGTGCTTTGTTTGCTCTGTATATATTCTTCCAGTTTTTTTCAAAGGAATACATTAATCTACTCCTCACTGGCTATTTCTTCTTCCTCGGTGTTCTTGCACTGAGCCATTTATTGAGCCCTATTATATCACTTATTGTGCCAGCATCAGTCCCAAACACACCTTACCACATCCTCTTCACCCGTGGTGAACAGGAGGGACACTCGGACATTGTTAACTACAAATTCACTTCATATGATGTGATCTGTCTTGTCATTTCTTTGATTCTTGGAGCCTGGTATCTATTTAAGAAGCACTGGATCGCCAACAACTTATTTGGCATTGCATTTGCTGTGAATGCTGTTGAAATGTTGCATCTGAACAATGTGGTGACAGGATGCATCTTACTGTGCGGACTCTTCCTTTATGACATATTCTGGGTGTTTGGCACCAATGTCATGGTTACTGTCGCTAAGTCATTTGAGTCTCCTATCAAATTGGTATTCCCCCAAGATTTATTAGTCAATGGATTTAATGCTAGCAACTTTGCTATGTTGGGTCTCGGTGATATTGTTGTTCCTGGCATCTTCATAGCTCTGCTGCTAAGATTTGACAAAAGTTTGAAGCGTGGCTCAGAGTTGTACTTCAGAGCTACCTTCTCCGCTTACATCCTGGGATTGTTGGCTACCATACTGGTGATGCATGTGTTCAAGCACGCCCAGCCTGCCTTATTGTACTTAGTACCCGCCTGCCTCGGCACGCCATTGACTCTTGCGTTATTGAGAGGAGATATCAACGCTTTGTTCAATTATGAAGATCAACCAGCGGTGGTGGAAGCGCCGAGTGACAGTAAAGCGAAGAAATCGGAATAA

Protein sequence:

>DPOGS202915-PA
MAESVTEIPINIEESVREAVQNVTEKPPSSIEGVAIAYLSLVIMAILPIFFGSFRSVKYLTDKKNSGEKAETMSKKDALIFPLIASCALFALYIFFQFFSKEYINLLLTGYFFFLGVLALSHLLSPIISLIVPASVPNTPYHILFTRGEQEGHSDIVNYKFTSYDVICLVISLILGAWYLFKKHWIANNLFGIAFAVNAVEMLHLNNVVTGCILLCGLFLYDIFWVFGTNVMVTVAKSFESPIKLVFPQDLLVNGFNASNFAMLGLGDIVVPGIFIALLLRFDKSLKRGSELYFRATFSAYILGLLATILVMHVFKHAQPALLYLVPACLGTPLTLALLRGDINALFNYEDQPAVVEAPSDSKAKKSE-