Monarch geneset OGS2.0

DPOGS201589
Transcript: DPOGS201589-TA, 2178 bp
Protein: DPOGS201589-PA, 725 aa
Genomic position: DPSCF300152 - 41452-55861
RNAseq coverage: 179x (Rank: top 49%)
Annotation
Heliconius      HMEL013253        8e-158  46.00%
Bombyx          BGIBMGA012217-TA  7e-67   49.59%
Drosophila      modSP-PA          2e-66   30.02%
EBI UniRef50    UniRef50_Q69BL0   1e-145  41.75%  Pattern recognition serine proteinase n=2 Tax=Obtectomera RepID=Q69BL0_MANSE
NCBI RefSeq     XP_001655952.1    3e-89   33.59%  hypothetical protein AaeL_AAEL002767 [Aedes aegypti]
NCBI nr blastp  gi|39655053       5e-145  41.75%  pattern recognition serine proteinase precursor [Manduca sexta]
NCBI nr blastx  gi|39655053       7e-153  41.65%  pattern recognition serine proteinase precursor [Manduca sexta]
Group
Gene Ontology
  GO:0003824  2.8e-55  catalytic activity
  GO:0004252  2.1e-28  serine-type endopeptidase activity
  GO:0006508  2.1e-28  proteolysis
  GO:0005515  9.8e-10  protein binding
KEGG pathway
  spu:582052  5e-29
  K04550 (LRP1, CD91) maps-> Malaria; Alzheimer's disease
InterPro domain
  [453-725]  IPR009003  2.8e-55  Peptidase cysteine/serine, trypsin-like
  [465-709]  IPR001254  2.1e-28  Peptidase S1/S6, chymotrypsin/Hap
  [238-287]  IPR002172  9.8e-10  Low-density lipoprotein (LDL) receptor class A repeat
  [295-359]  IPR016060  2.1e-08  Complement control module
Orthology group: MCL11532 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201589-TA
ATGTTTTTTGTTGTGAACTTCATTGATTCGGATTTGGATCGTCTCAAACGTGGATCAGATTGCGATGACTCCTTCAACTTCCGGTGCAGCGATGGAACCTGTATCAGCGCTGACAAGAAGTGTGATGGTGTGGGAGACTGCCCCGACGGTTCAGACGAGAGGTTCCACATGTGCAGGAACGTCAGGTGCCCATATTACCACTTTCGATGCACGTACGGAGCTTGCGTCGATGGTACAGCTTCCTGCAATGGTGTCAAGGAGTGTATGGACAACTCGGATGAACTGCAACCGGCCTGCCAGAAGAAAAGCAACATCTTTGGTGAAAAATTTATTTGCAAGAACGGTGAAATGATAGATGTTTATCAGATCTGTGATGGAACCACTGAGTGCAGTGATAATAGTGATGAGATCTTGGATAAGTCGATTTGGATAAAAAAATCGTTGTGTTGTAGCGGGAATGGCTGCGAGTGCCCATATTACCACTTTCGATGCACGTACGGAGCTTGCGTCGATGGTACAGCTTCCTGCAATGGTGTCAAGGAGTGTATGGACAACTCGGATGAACTGCAACCGGCCTGCCAGAAGAAAAGCAACATCTTTGGTGAAAAATTTATTTGCAAGAACGGTGAAATGATAGATGTTTATCAGATCTGTGATGGAACCACTGAGTGCAGTGATAATAGTGATGAGATCTTGGAGACCTGCGCGAGCACTGTTTGCCCATCACATCTGTTCCAATGTGCGTATGGCGCATGCGTTGATGCTGGGGCCGAGTGCAATAATTTGCAGGAATGCGCTGATAATTCTGATGAATGGGATCTTCTGTGCAATAAGACGTCTACCACAACAACAACCACCACGGAGATCACGGAGACAAGTCGGTCGTCCTGCGTCTTGCCAGATCATCCAAAGTTCGGAATATACAGCCTAGCTGACGGTAGCAAATATGTTCCAAGGAGTGTTCAAGAAAATTTGGTGGTTCTGAGCCTCACATGCTACCCAGGATATAAGAGTGTAGGGGAAATAGCCACTTACTGTCATGAAGGGTCATGGTCTACCGATTTACCTTATTGCGCCCATCAATACAAGAATCTACCGCTGGTTTCGTCCAGCTATCTGAAGTTTTCAATCGAGATCCTTAATTTAGAAAGGACGTGTAAACTGGATAAATCACCAAGTATTGAGTACAGATGTATTACTAATGACGAAGGGACCAGGGCGTGCGAGGATTACGAAGTGGAGGGTACCATTGTCCAGCCTCAGTGCAGGGAGCCCAACTACTACAGTCTCAGCGACCTGTACTACATGGTCTGCCGTGATGGACAATGGAATTACCAACCGAAGTGTGAAGCTGAATGTGGTACATTGACCCCACGCGCGACCCCTTTGGTGTTGGGCGGGCGGACTGCGGACGTTGGCGAAGTCCCCTGGCACGCGGGGATCTACAGCAAGCTGACGGAGCCTCCAATACAGATATGCGGGGCTTCCTTGGTCAGCGACACCGTACTGGTCTCTGCCGCTCATTGCTTTTGGTTCAACGAAAATACTGAGCCAGCAGAAAACTATGCAGTGGCGGTTGGCAAGCTGCATAGAGACTGGGATCATCATCTAGACATGGATTATCAGCAGACTTCTGATGTGCAATCCATCTACGTCTCCCATTACTATCGAGGATCGTCCATGAACTACCAGCACGACCTGGCCATCGTAATCGTCACCCAGCCCTTCTCCTACCGTCCGTACATAAGACCTATATGTGTGCATTTCCCTCATGATGCGACAGAAATGGCGATCAAAAACGACGACCTCGGGAAGGTAGCTGGTTGGGGTCTCACGACGGTCCACACTGGCTCCGAGTCCCCCACGCTAAAGGTCCTGGACGTGCCTTTTGTTGATTTTGACACCTGCCTCCAGAACACACCAGAATACTACCGGGAATTCTTCAGCAGCGATAAGATCTGTGGTGGCTACGCTAATGGTACAAGTCTCTGTAAGGGTGA
CAGCGGTGGTGGGTACGCCTTCCCCTTCAAGCTCAACGGCCGCACCAGGTACTACCTCCGCGGTGTCGTGTCCACAAGCCCACCGCTGCCTCTAGGATTGTCATGCAACATATACACGTACACGAGCTTCACGGATATCATGCAACACAAAAGAATCATCATGACGCATATGCATTGA

Protein sequence:

>DPOGS201589-PA
MFFVVNFIDSDLDRLKRGSDCDDSFNFRCSDGTCISADKKCDGVGDCPDGSDERFHMCRNVRCPYYHFRCTYGACVDGTASCNGVKECMDNSDELQPACQKKSNIFGEKFICKNGEMIDVYQICDGTTECSDNSDEILDKSIWIKKSLCCSGNGCECPYYHFRCTYGACVDGTASCNGVKECMDNSDELQPACQKKSNIFGEKFICKNGEMIDVYQICDGTTECSDNSDEILETCASTVCPSHLFQCAYGACVDAGAECNNLQECADNSDEWDLLCNKTSTTTTTTTEITETSRSSCVLPDHPKFGIYSLADGSKYVPRSVQENLVVLSLTCYPGYKSVGEIATYCHEGSWSTDLPYCAHQYKNLPLVSSSYLKFSIEILNLERTCKLDKSPSIEYRCITNDEGTRACEDYEVEGTIVQPQCREPNYYSLSDLYYMVCRDGQWNYQPKCEAECGTLTPRATPLVLGGRTADVGEVPWHAGIYSKLTEPPIQICGASLVSDTVLVSAAHCFWFNENTEPAENYAVAVGKLHRDWDHHLDMDYQQTSDVQSIYVSHYYRGSSMNYQHDLAIVIVTQPFSYRPYIRPICVHFPHDATEMAIKNDDLGKVAGWGLTTVHTGSESPTLKVLDVPFVDFDTCLQNTPEYYREFFSSDKICGGYANGTSLCKGDSGGGYAFPFKLNGRTRYYLRGVVSTSPPLPLGLSCNIYTYTSFTDIMQHKRIIMTHMH-