Monarch geneset OGS2.0

DPOGS214034
Transcript  DPOGS214034-TA  2346 bp
Protein  DPOGS214034-PA  781 aa
Genomic position  DPSCF300238 + 95903-101863
RNAseq coverage  962x (Rank: top 13%)
Annotation
Heliconius  HMEL003575  0.0  92.02%
Bombyx  BGIBMGA008317-TA  0.0  89.91%
Drosophila  Appl-PB  4e-98  41.94%
EBI UniRef50  UniRef50_Q4ZHV6  0.0  85.35%  Amyloid protein n=4 Tax=Neoptera RepID=Q4ZHV6_MANSE
NCBI RefSeq  XP_002055698.1  6e-180  43.21%  GJ19504 [Drosophila virilis]
NCBI nr blastp  gi|333830741  0.0  85.35%  amyloid precursor protein [Manduca sexta]
NCBI nr blastx  gi|333830741  0.0  88.28%  amyloid precursor protein [Manduca sexta]
Group
Gene Ontology  GO:0016021  9.6e-99  integral to membrane
  GO:0005488  9.6e-99  binding
KEGG pathway  ame:551733  2e-65
  K04520 (APP)  maps-> Alzheimer's disease
InterPro domain  [6-176]  IPR008154  9.6e-99  Amyloidogenic glycoprotein, extracellular
  [10-111]  IPR015849  3.4e-33  Amyloidogenic glycoprotein, heparin-binding
  [112-177]  IPR011178  8.4e-30  Amyloidogenic glycoprotein, copper-binding
  [725-778]  IPR019543  9.3e-20  Beta-amyloid precursor protein C-terminal
  [165-183]  IPR008155  9.9e-10  Amyloidogenic glycoprotein
Orthology group  MCL12937  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214034-TA
ATGTTTGTTTTATCGCCACAGGCGACGAGCGGTGCTGAGCCGCAAGTTGCTGTGCTATGCGAGGCCGGGTCCACCTACCACCCTCAGTACATGTCAGCTGCTGGAAGATGGACCCCCGACCTCACCACGAAGCCCCACAACTGCTTGAAGGATAAAATGGAAATACTTGACTATTGCAAGAAGGTCTACCCGAGCCACGATATCACCAACATCGTTGAAGCGTCCCATTACGTGAAGGTCAGCAATTGGTGTAAGCTGGGTACCAACAATGCCGCTAAATGCAAGGTCACTAGATGGGTCAAGCCATTCCGCTGTCTCGAAGGTCCATTCCAATCGGATGCGCTGCTCGTCCCCGAGAGCTGCCTGTTCGACCACATCCACAACCAGAGCCGATGCTGGCAGTTCTCTCGCTGGAACGCTACAGCCGGCCGCGCCTGCGCCCAGAGAGGACTCCGCCTCAGGACCTTCGCGATGCTCCTACCCTGCGGAATCAGCCTGTTCTCTGGAGTCGAATTCGTATGCTGCCCTAAACATTTTAAGGAAAACGTGAAAATGCACAAACCAATGGATGTGGGCGTGCCAGTTAGCCCTGGTGGTGAAGAGATGCTGGCTGCCTCCGCTGCTATGGACGAGCGAGATGATGATCTCCTCGACGATGAAGACACCCTCACTGACGACGATGACGACACCCTCAACCTCAGCGACGACGATGACGACGATGACGCTGATGACGATATGGATGAAGATGAAGACGCTGATCTTACCCGCGATGATGACGCCGAAGACGACGATTACACGGACGGTGATGACTCCGGCTGGCCGCGACCTGACTCCTCCGCCGCGCCGTCTACCACCACGCCTACCACCACCACCACGACCACCACCACTACGCCGGCTTCCACCGCTACCTCCGACCCCTACTTCTCTCACTTCGACCCTCGCACCGAACACCAGAGCTACAAGGACGCCCAGCAACGACTCGAGGAGACTCACCGCGAGAAGATTACTAAGGTGATGCGTGACTGGTCTGAACTAGAGGACCGATATCAACAGATGATGACCTCTGACCCCGCTGCCGCCCAAAGCTTCCGCCAACGCATGACCGCCAGGTTCCAAGCCAACATTCAGGCCCTCGAAGAGGATGGTGTATCTGAACGTCGCCGTCTGGCCGCCCTTCATCAGCAACGAGTGTTGGCTCACCTGGCACAAAGACGACGCACCGCCCTCTCGTGTTACACACGGTCCCTGAAAGACGCTCCTCCCAATGCCCACCGCGTCCAGAAGTGTCTTCAGCGTTTGGTCCGCGCCCTCGCCGCCGAGAGATCAGGAGCGCTGGCCGCCTGGCGCCGTGCTGCTTCAGCCGGCCGTGAAGCCGCTGCCGCTGAACGAGCGAGTGCTGTGGACAGACTGCAGGACGCCGACCGCGCTCTCCAACGGGCGTTAACGTCCCTCCGCCGCCGCCCGCACTTGTACGCTAGCATCGGAACTGCTATCGAGGACTACGTTCAGTCCATGCAGTCCAAGGACGACATGGCCGTCTCCCTGATGTCGATGACTCCTGAGGCTGAGGAACTCCTTCTGGACCGTATCGAAGCCGAGGTCCAAAGAGAGCAGGCTGCTCGGGAACAACTGAACGCCAAGAGAGACCAGAGGACCAGGCAGAGGCAGGACATTGAAAACGAACGCGTCAGGACTTCCAACGGTGTGAAAGAGAGCGAAGAGGCTGATGATGAAGCCAGTGAAGTCAGCGACACCGAGGACACCACCACCGTTTCTACGACGCAGAGCTCCCCCTCCCCCTCTCCCTCACCCTCACCGCCGGCCACGCCCACCGTCCGTGACATCACCACCAGATCAGCCTTCACCCAGGAAACCACAACCACGGAGATCGTGACCAGTACGATGACGGACGCCCCCGAGATAGACACGGAGACCAGCGAAGTGGTGGAGACGACCAGCAGGAGACCAACCAGCGAATCCGAGGGTTCGGGACTCCGCGC
GGCGCTGGAGCAGGCCGAGGAGCGTGCGCCCCCGCCGCCCGCACACGCCCTCAAACATGAACTACAACACTCACAGCCCGGCTATACAATCCGCGGTCCGTCACCGTCCCCGGGCTCGGGTGCTCTGTACCCGGCCCTGTGTGTGGGTGGAGCGGCGCTGGCGGCCGCAGCAGCTGTCGCGCTCGCTGTGGCACGACGACGTGACCGCGCTCCATCCGCTCAAGGATTCGTGCAGGTGGAGCAAACCGGCGCCGTGGCTCCGACTCCTGAAGAGCGACACGTCGCCAACATGCAGATCAATGGTTACGAAAACCCAACCTACAAATACTTCGAGGTCAAGGAGTAA

Protein sequence:

>DPOGS214034-PA
MFVLSPQATSGAEPQVAVLCEAGSTYHPQYMSAAGRWTPDLTTKPHNCLKDKMEILDYCKKVYPSHDITNIVEASHYVKVSNWCKLGTNNAAKCKVTRWVKPFRCLEGPFQSDALLVPESCLFDHIHNQSRCWQFSRWNATAGRACAQRGLRLRTFAMLLPCGISLFSGVEFVCCPKHFKENVKMHKPMDVGVPVSPGGEEMLAASAAMDERDDDLLDDEDTLTDDDDDTLNLSDDDDDDDADDDMDEDEDADLTRDDDAEDDDYTDGDDSGWPRPDSSAAPSTTTPTTTTTTTTTTPASTATSDPYFSHFDPRTEHQSYKDAQQRLEETHREKITKVMRDWSELEDRYQQMMTSDPAAAQSFRQRMTARFQANIQALEEDGVSERRRLAALHQQRVLAHLAQRRRTALSCYTRSLKDAPPNAHRVQKCLQRLVRALAAERSGALAAWRRAASAGREAAAAERASAVDRLQDADRALQRALTSLRRRPHLYASIGTAIEDYVQSMQSKDDMAVSLMSMTPEAEELLLDRIEAEVQREQAAREQLNAKRDQRTRQRQDIENERVRTSNGVKESEEADDEASEVSDTEDTTTVSTTQSSPSPSPSPSPPATPTVRDITTRSAFTQETTTTEIVTSTMTDAPEIDTETSEVVETTSRRPTSESEGSGLRAALEQAEERAPPPPAHALKHELQHSQPGYTIRGPSPSPGSGALYPALCVGGAALAAAAAVALAVARRRDRAPSAQGFVQVEQTGAVAPTPEERHVANMQINGYENPTYKYFEVKE-
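
Consistency check (illustrative):

The listed lengths agree with each other: 781 amino acids plus one stop codon is 782 codons, or 3 x 782 = 2346 bp, which is the transcript length given above. The sketch below shows one way to check this and to translate a stretch of the nucleotide sequence. It assumes the standard genetic code, which this record does not state explicitly. It follows the record's convention of writing the stop codon as "-". The function names `translate` and `expected_cds_length` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, stop="-"):
    """Translate an in-frame nucleotide string; stop codons become `stop`."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))
    return "".join(stop if aa == "*" else aa for aa in residues)


def expected_cds_length(protein_aa, has_stop=True):
    """Coding length in bp for a protein of `protein_aa` residues."""
    return 3 * (protein_aa + (1 if has_stop else 0))


if __name__ == "__main__":
    # First 30 nt and last 15 nt of DPOGS214034-TA, copied from the record.
    print(translate("ATGTTTGTTTTATCGCCACAGGCGACGAGC"))
    print(translate("GAGGTCAAGGAGTAA"))
    print(expected_cds_length(781))
```

Running the same `translate` call on the full DPOGS214034-TA sequence, with its two lines joined, should reproduce the DPOGS214034-PA entry if the assumption about the genetic code holds.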