Monarch geneset OGS2.0

DPOGS202258
TranscriptDPOGS202258-TA1371 bp
ProteinDPOGS202258-PA456 aa
Genomic positionDPSCF300032 - 552368-556712
RNAseq coverage795x (Rank: top 16%)
Annotation
HeliconiusHMEL0050932e-10471.86% 
BombyxBGIBMGA004909-TA2e-18076.87% 
DrosophilaPsn-PA2e-14053.23% 
EBI UniRef50UniRef50_E2BSL44e-14255.58%Presenilin-like protein n=6 Tax=Formicidae RepID=E2BSL4_HARSA
NCBI RefSeqXP_967139.27e-15563.69%PREDICTED: similar to Presenilin CG18803-PB isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|1892396921e-15363.69%PREDICTED: similar to Presenilin CG18803-PB isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|1892396925e-15763.91%PREDICTED: similar to Presenilin CG18803-PB isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00160211.3e-219integral to membrane
GO:00041901.3e-219aspartic-type endopeptidase activity
KEGG pathwaytca:6626272e-154 
 K06060 (PSENN, PSN)maps-> Notch signaling pathway
InterPro domain[1-456] IPR0011081.3e-219Peptidase A22A, presenilin
[136-442] IPR0066392.8e-124Peptidase A22, presenilin signal peptide
Orthology groupMCL11637 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202258-TA
ATGAGTGACACCGGTAGTGATATAGAAGCGACTGAGCATACTGCACTCATGGACGGACATATAGCTGAGGCCCGCCCAGACAGGGAAGTTGCTGAACGCATCGCGAGAAAACGTAAAACAAAGTCGGATACAACCCGTAACTATGGAACAGTGGAACAGCCAGATAGTAGGCAGACAGGCGCAGGAAGCTCCCGTCGGAATCCCTCGGAGGAAACGGCAGACCAGAGCGAAGAAATGGAGTTGAAATATGGTGCGCGACATGTCATTAAACTGTTTGTGCCAGTAACCTTGTGCATGATGGTAGTCGTCGCCACTATATCATCCATTACTTTTTACTCCGTGAAAGATGTCTACTTGGCATATACTCCGTTCCATGAGGAGAGTCCTCACGCTGCGACGAAAGTGTGGAATGCTCTCGCAAATTCAATGATCCTGCTGTGTGTGATAGCTTTTATGACGGTGTTGCTTATAGTACTGTACAAGAAACGCTGCTACAAGGTCATCCATGGCTGGTTGATACTGTCTTCGCTGATGCTACTTTTCTTGTTCTCATATCTGTATATAGAGGAAGCGCTGCGAGCTTATAACATCCCACTGGATTATATAACGATATTTTTTGTGATGTGGAACTTTGGTGTGATGGGAATGATCACCATTCACTGGAAGGGTCCGCTGCGAATGCAGCAGGCGTACCTCATATTTATAGCGGCATTGATGGCACTGGTTTTTATTAAATATCTGCCGGAATGGACAACATGGGCTGTCCTAGCTGTTATATCAATTTGGGATTTGATAGCCGTCCTCACACCAAAAGGACCTCTGAGGATACTCGTTGAGACGGCTCAGGAGAGAAATGAACCGATATTCCCAGCACTTATTTATTCATCGACGGTGATGTACATGCTGGCAGCGGTCGAAGGATCTTCGAACTCGGAGGGTAACGTTAACTCCGGGGGAGATGGTGACGGTGAGGGAGGTTTTGATAACGCTTGGAGGGAGCGCGCCGCAGCCGGAGCCCCGAGACACTTACGAGTGGAGGGTACAGCATCAGCGAGATACGTCACCAGGGTTGAGGAGCCGTCGCAGGATGCTGATGACGAGAAGGGCGTGAAATTAGGTCTGGGCGATTTCATCTTTTACAGCGTTCTGGTTGGTAAGGCCAGCTCGTACGGCGACTGGAACACCACACTCGCTTGTTTCATGGCCATACTTATTGGTCTGTGTCTCACACTGTTGTTGCTGGCTATATTCAAGAAGGCGTTGCCCGCTCTGCCAATATCAATAACTTTCGGACTCATTTTTTACTTCGCGACCCGATCCATAGCAAAACCGTTCGCGGACGCGCTCGCCGCTGACCAAGTTTTCATTTAG

Protein sequence:

>DPOGS202258-PA
MSDTGSDIEATEHTALMDGHIAEARPDREVAERIARKRKTKSDTTRNYGTVEQPDSRQTGAGSSRRNPSEETADQSEEMELKYGARHVIKLFVPVTLCMMVVVATISSITFYSVKDVYLAYTPFHEESPHAATKVWNALANSMILLCVIAFMTVLLIVLYKKRCYKVIHGWLILSSLMLLFLFSYLYIEEALRAYNIPLDYITIFFVMWNFGVMGMITIHWKGPLRMQQAYLIFIAALMALVFIKYLPEWTTWAVLAVISIWDLIAVLTPKGPLRILVETAQERNEPIFPALIYSSTVMYMLAAVEGSSNSEGNVNSGGDGDGEGGFDNAWRERAAAGAPRHLRVEGTASARYVTRVEEPSQDADDEKGVKLGLGDFIFYSVLVGKASSYGDWNTTLACFMAILIGLCLTLLLLAIFKKALPALPISITFGLIFYFATRSIAKPFADALAADQVFI-