Monarch geneset OGS2.0

DPOGS208498
TranscriptDPOGS208498-TA1341 bp
ProteinDPOGS208498-PA446 aa
Genomic positionDPSCF300064 - 882252-886103
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0087581e-15156.92% 
BombyxBGIBMGA004955-TA6e-9756.04% 
DrosophilaSpn28D-PA3e-3828.57% 
EBI UniRef50UniRef50_C0J8G27e-14055.96%Serpin-13 n=1 Tax=Bombyx mori RepID=C0J8G2_BOMMO
NCBI RefSeqNP_001139705.11e-14055.96%serine protease inhibitor 13 [Bombyx mori]
NCBI nr blastpgi|2263428863e-13955.96%serine protease inhibitor 13 precursor [Bombyx mori]
NCBI nr blastxgi|2263428865e-13456.06%serine protease inhibitor 13 precursor [Bombyx mori]
Group
Gene OntologyGO:00048671.3e-83serine-type endopeptidase inhibitor activity
KEGG pathwayssc:3969452e-38 
 K03982 (SERPINE1, PAI1)maps-> Chagas disease
    p53 signaling pathway
    Complement and coagulation cascades
InterPro domain[1-444] IPR0237967.7e-89Serpin domain
[38-446] IPR0002151.3e-83Protease inhibitor I4, serpin
Orthology groupMCL11639 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208498-TA
ATGTCCCTCAAACTTTCGAGCTTCGTCTGTTTTTTTGGGCTTTTTATTATAGGAATAGTAAAAAGCGAAGATGAAAGCAGGATATCATTCAACTCTGATGAATTGAAAAAGAATGCTCTGGCCGACGCTGTTAATGATTTTGGTTACAAAATAATGACCAAGATGATAAATGACAATTATGATAAAAACATCGCCTTGTCCCCAACAGGTATTGCCGGTTTGCTAGCCATGAGTCTACTGGGCAGCGTGGGGAGGTCATATGATGAGCTGGCCGAGGCACTCGGATTCTCTCAAGATGTAAGCATCAATCGTCAGAATCACGAGATGTTTGGGGAACTGTTGAACGATTTAAATAACAATGAAACAGCTAGTAAAACTATATTCTCGGACGCTCTTTTCATCGAAGGCAAATCAAATCTCAGAGAGGCATATCGGAGTTACCTGGCCAGGGTTTACCGTGGGGACGCCATTGGTGTCGACTTTGCTGACAAAAATACAGTTAAGGCATTAATCAATGAATGGGTTAGTAATATGACTAAAGGAAAAATTCCGGATTTCCTTAAAGAATCACTACCAGCCGACACAAGGGCGGTGTTACTTAGCGCCTTATATTTTATCGGCCAGTGGGAAACCCCGTTCGTGCCGGAGTACACGTTAAAAATGAATTTTACGACACCCAAGAATGACGTTGAAGTTGACATGATGTTGAACTTAGGAAACTTCAAACACGTCTACTCCATTGAAGATGGTGTTCACATGGTAGCCCTTCCTTATAACGATAGTATCACAACCATGTATGTTCTAAAACCTAGACGACCTGATAAACAGAGTATAAATGATTTATTAAACAGCCTTAATTACACAAGAATAAATAAAATGATCGACGAAATGTGCAACCGAAAGGCCATAATAAGATTTCCCAAAATGGACCTCAAGGTTCACGCAAATTTGGAAGGACCTTTAAAACAATTAGGCATTCAGTCCATTTTCATTCCGAACCAAGCGAATTTCGCTCTTATGATAGACGGTGCAAAGGCTGTAAACAAAACGGAAGAGGAATTATTGACGAGAATTAATGATGGCGACCGGCTGTCAGGAATTAAAGATTTAAAAAGCGTAATTGATGCACTCCCGAATCCCGGCGTTTATGTCGACTCCATATTACACGACGTCCGAATAACTATAGATGAATATGGGACGGAAGCGGTCGCTGCGACAAGTGGTATTTTGGCAAGGACGGCTGAGACGTTTTATGCAAATACGCCATTCTACATGTTCATAAGAAATGAGAAAACTAAACTAGTCACCTTCAGTGCTGTTGTCTATGACCCTACTACATGA

Protein sequence:

>DPOGS208498-PA
MSLKLSSFVCFFGLFIIGIVKSEDESRISFNSDELKKNALADAVNDFGYKIMTKMINDNYDKNIALSPTGIAGLLAMSLLGSVGRSYDELAEALGFSQDVSINRQNHEMFGELLNDLNNNETASKTIFSDALFIEGKSNLREAYRSYLARVYRGDAIGVDFADKNTVKALINEWVSNMTKGKIPDFLKESLPADTRAVLLSALYFIGQWETPFVPEYTLKMNFTTPKNDVEVDMMLNLGNFKHVYSIEDGVHMVALPYNDSITTMYVLKPRRPDKQSINDLLNSLNYTRINKMIDEMCNRKAIIRFPKMDLKVHANLEGPLKQLGIQSIFIPNQANFALMIDGAKAVNKTEEELLTRINDGDRLSGIKDLKSVIDALPNPGVYVDSILHDVRITIDEYGTEAVAATSGILARTAETFYANTPFYMFIRNEKTKLVTFSAVVYDPTT-