Monarch geneset OGS2.0

DPOGS213363
TranscriptDPOGS213363-TA1185 bp
ProteinDPOGS213363-PA394 aa
Genomic positionDPSCF300109 - 99744-100928
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0075958e-3426.15% 
BombyxBGIBMGA013849-TA1e-3428.02% 
DrosophilaSpn77Ba-PB2e-1923.12% 
EBI UniRef50UniRef50_Q6Q2D38e-3126.92%Serpin-5A n=3 Tax=Obtectomera RepID=Q6Q2D3_MANSE
NCBI RefSeqNP_001037205.13e-3928.39%serine protease inhibitor 5 [Bombyx mori]
NCBI nr blastpgi|1129845486e-3828.39%serine protease inhibitor 5 precursor [Bombyx mori]
NCBI nr blastxgi|1129845483e-3728.27%serine protease inhibitor 5 precursor [Bombyx mori]
Group
Gene OntologyGO:00048676.3e-31serine-type endopeptidase inhibitor activity
KEGG pathwayxla:7794331e-20 
 K03982 (SERPINE1, PAI1)maps-> Chagas disease
    p53 signaling pathway
    Complement and coagulation cascades
InterPro domain[6-392] IPR0237962.6e-54Serpin domain
[63-391] IPR0002156.3e-31Protease inhibitor I4, serpin
Orthology groupMCL23359 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213363-TA
ATGAGCTCGCTGCGTCGCAACTTGCTTAAGTTACTGGTTCTCTGCCGATTGTTAGACATATTTTCTCCTGCTTTATGTAAAACCAAAAATGTAATTCCCTACACCGATTTAAGATCATTCGGCGAAAAGGTAAGAAATCTGAGCTTCCACTTGTTTAACTATACGGAGCAGCGAAATGATTGTGGGCTGCTAGCACCTTACACTTTATGGAATCTGGTTTCGCTTGTTGCTTTCATGACGTCGGAAGACAGTTGGGACCAACTGCATAAAACGATGGGTGTGTCAAGACGGAAAGGAAAATATTTCAACTCAATTTACAACCACATAAACGATTTAATGATCACAATAAAACCGGGAGCATCTTTTAAAATAAACAATACTGTTTTCTATGACTCTAGACTGCAGTTAACGAGTAACTTTGAAGACAGCATCATGGCTTCTGGAGTGTCACTAAAAAAGCTCAATTTCCATGATAGCGTTTTGGCGTCTGATTCAGCAAATAACTACATACAATCTAATTACTTTCTTATGCCGAAGCGAATAATTTTTCATTCAAGTGATTTTAAGGACACTTCGATGATCGTGAGCGGTGTTGTGGAATTCGAGGCCGCTTGGGCCAAACCTTTTGATATCTCCCTTAGCGATCGATTCGTGATGCGTCAAAGTGGTGAATTTTTTTACACTGATGTAGACTGGTTGCGTGCTTCGGTATTAGAATTATCTTATGCGAGCGATTTTGATTTTAGCATGCTAGTAATTCGACCCCGTTATGGTGTCGCCCTTAAGGATGTGATAACAAACCTGGCCCTTAAAAAATTGGACGACATATTTCAAAAATTATATACAGCAGGCTCGAAGGAAACTGTGATCGAATTGCCGAAATTTTCCTTAACATCATTGCAAGTGTTAAATGAGCCGTTCATGTCTATGGGAATAATAAATGTTTTTTTACCAGACGAGGCGGACTTTTCTGGTATATCATCAGACAACTTGTACATACAGAGCTTTGAACAAAGGGTGACGGTGACGGTTTCAGAAACAGGGACCACAGCCACTGCGTACACACCAGCAAATGTTGCCAAATCTACTAGCCCCAATGAAATAGGATCTGCTTCGCCATTTATATTTCTAATCGTGCATGGTCCATCATTAAGCATAGTTTTTTGCGGAAAATACGGCGAATAA

Protein sequence:

>DPOGS213363-PA
MSSLRRNLLKLLVLCRLLDIFSPALCKTKNVIPYTDLRSFGEKVRNLSFHLFNYTEQRNDCGLLAPYTLWNLVSLVAFMTSEDSWDQLHKTMGVSRRKGKYFNSIYNHINDLMITIKPGASFKINNTVFYDSRLQLTSNFEDSIMASGVSLKKLNFHDSVLASDSANNYIQSNYFLMPKRIIFHSSDFKDTSMIVSGVVEFEAAWAKPFDISLSDRFVMRQSGEFFYTDVDWLRASVLELSYASDFDFSMLVIRPRYGVALKDVITNLALKKLDDIFQKLYTAGSKETVIELPKFSLTSLQVLNEPFMSMGIINVFLPDEADFSGISSDNLYIQSFEQRVTVTVSETGTTATAYTPANVAKSTSPNEIGSASPFIFLIVHGPSLSIVFCGKYGE-