Monarch geneset OGS2.0

DPOGS206419
Transcript: DPOGS206419-TA, 1032 bp
Protein: DPOGS206419-PA, 343 aa
Genomic position: DPSCF300181 + 32624-33655
RNAseq coverage: 1822x (Rank: top 7%)
Annotation
Heliconius       HMEL007594         4e-131   65.89%
Bombyx           BGIBMGA013849-TA   3e-124   62.10%
Drosophila       Spn77Ba-PB         1e-40    28.53%
EBI UniRef50     UniRef50_Q6Q2D3    9e-116   61.22%   Serpin-5A n=3 Tax=Obtectomera RepID=Q6Q2D3_MANSE
NCBI RefSeq      NP_001037205.1     7e-121   61.52%   serine protease inhibitor 5 [Bombyx mori]
NCBI nr blastp   gi|112984548       1e-119   61.52%   serine protease inhibitor 5 precursor [Bombyx mori]
NCBI nr blastx   gi|45594232        4e-117   61.22%   serpin-5A [Manduca sexta]
Group
Gene Ontology     GO:0004867   2.7e-70   serine-type endopeptidase inhibitor activity
KEGG pathway      mcc:708854   3e-35
                  K03911 (SERPINC1, AT3) maps-> Complement and coagulation cascades
InterPro domain   [1-340] IPR023796   3e-78     Serpin domain
                  [6-344] IPR000215   2.7e-70   Protease inhibitor I4, serpin
Orthology group   MCL26136 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206419-TA
ATGACTGGCATCACTTTAGGCGCTACGGGTGAGAGCAAATCCCAAATAGAAAAAGCGTTTTTCTTACCAAAGAACGAAGATACGCTCGTTAGAGGATATAAAAACTTAACGAAATCAGTTTTGGAGCCTCAAACCTATGGCGTCACTTTGGACATCAAAAACTTCGTCTTCTTGGACAAAGGCTTTGAAATAAACCGGAATTTTCAAAACACTTTGAGCACAGACTTTGGAGCAATGATCCAAACTCTTAATTTCAAAGATCCAAATGCAGCGGCGAATGTAGCCAACAGGCTTATAGGAAAATACGGCGCAACTGTCTCTAACGTGCTTCAATCACAAGACTTCTTAAAGTCCAGGATGATTCTGACGAACGTTATTTCGTTCAAGGGCCTCTGGTCTTCGCCGTTCAACCAAACAGAAACGAACCTTGAGCCGTTCTATGACGAGAACAAAAGGGAAATTGGCAAAGTCAATATGATGTATCAGAGATATCAATTTCCGTTTTCTAACATGAAAGCCATGGGTGCCATGGTGTTGGAACTGCCTTACGGGGTTGATCAACGTTACTGTATGCTTGTGATCCTCCCTTACCCTCGAAATACTGTGAGTTCAGTTTACAACACCTTTGAAAGGGTAACGTTCAAGGATATATTTGCCCAACTGAAGAGCGACGAAGAGGAATTCGGTTTAGTAGACATCGATGTGAAGTTGCCGAGATTCAAAATAAGTACGAACGTAGTCCTAAACAAACCGTTGAACAGTATGGGAGTGTACGATATATTTGAACCGGGTCGTGCAAGCTTCGATAAGGTCACGACAGAGGAGATTTACATCTCAGCGATAGTTCACAAAGCTGATATTGAAGTCACAGAGGCCGGTACGGTAGCGTCGGCAGCAACATCTGCCTATTTCGCTGACCGTATAGCAACACCAAACTTCTCAGCCAACAAACCGTTCCTGTATTTCGTTATGGAGAAGCCAACGGCCACAGTGATCTTCAGTGGTATTTACTCTAAGCCAAGCGTATTCTGA

Protein sequence:

>DPOGS206419-PA
MTGITLGATGESKSQIEKAFFLPKNEDTLVRGYKNLTKSVLEPQTYGVTLDIKNFVFLDKGFEINRNFQNTLSTDFGAMIQTLNFKDPNAAANVANRLIGKYGATVSNVLQSQDFLKSRMILTNVISFKGLWSSPFNQTETNLEPFYDENKREIGKVNMMYQRYQFPFSNMKAMGAMVLELPYGVDQRYCMLVILPYPRNTVSSVYNTFERVTFKDIFAQLKSDEEEFGLVDIDVKLPRFKISTNVVLNKPLNSMGVYDIFEPGRASFDKVTTEEIYISAIVHKADIEVTEAGTVASAATSAYFADRIATPNFSANKPFLYFVMEKPTATVIFSGIYSKPSVF-
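
The lengths and coordinates in this record can be cross-checked against each other, and the protein record can be compared with a translation of the transcript. The Python sketch below is one way to do that. It is not part of the record. It assumes the standard genetic code applies, that the genomic range is inclusive, and that the trailing "-" in the protein record marks the stop codon. The function name translate is hypothetical and chosen only for illustration.

```python
# Illustrative sketch (assumptions: standard genetic code, inclusive
# genomic coordinates, "-" used as the stop symbol as in the record).

BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# Values copied from the record above.
transcript_bp = 1032
protein_aa = 343
genomic_start, genomic_end = 32624, 33655

# The inclusive genomic span equals the transcript length.
assert genomic_end - genomic_start + 1 == transcript_bp
# 343 residues plus one stop codon account for all 1032 bases.
assert transcript_bp == 3 * (protein_aa + 1)

# First 21 and last 12 nucleotides of DPOGS206419-TA.
print(translate("ATGACTGGCATCACTTTAGGC"))
print(translate("AGCGTATTCTGA"))
```

Passing the full DPOGS206419-TA sequence to translate and comparing the result with DPOGS206419-PA extends the same check to the whole record.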