Monarch geneset OGS2.0

DPOGS213372
TranscriptDPOGS213372-TA1080 bp
ProteinDPOGS213372-PA359 aa
Genomic positionDPSCF300109 + 110064-111143
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0075949e-4533.24% 
BombyxBGIBMGA013849-TA1e-4032.76% 
DrosophilaSpn77Ba-PB2e-2025.27% 
EBI UniRef50UniRef50_Q6Q2D32e-4331.98%Serpin-5A n=3 Tax=Obtectomera RepID=Q6Q2D3_MANSE
NCBI RefSeqNP_001037205.13e-4733.42%serine protease inhibitor 5 [Bombyx mori]
NCBI nr blastpgi|1129845486e-4633.42%serine protease inhibitor 5 precursor [Bombyx mori]
NCBI nr blastxgi|1129845481e-4533.16%serine protease inhibitor 5 precursor [Bombyx mori]
Group
Gene OntologyGO:00048672.5e-32serine-type endopeptidase inhibitor activity
KEGG pathwaycpb:Cphamn1_17257e-20 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[1-357] IPR0237963.8e-49Serpin domain
[2-356] IPR0002152.5e-32Protease inhibitor I4, serpin
Orthology groupMCL23359 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213372-TA
ATGAGTTTAGAAATGTTTTACTTTACGCAATTAGAAAACGAAGGTGATCTGGTGATAGCACCCTTTAGTTTATGGAATCTGATCTCCCTCGTCGCATTTAAATCATCCTCAGACACCTGGAGACAGCTTCATTCCATCATAGGAATATCAAAACGGAACGGACAATATTTTAATTCAATTTTCAAATTTACAACAGATGTTATGACGGATGTCTCCAAGTTTCCTGGAGTGTCATTCAAGAATGTACAGGTTGTACTCTGTGACGCAAATCTAGAATTGACAAAATCATTTCAAAGGAGTGTAATTGATTCTGGAATGTCGTTGAAGATGCTAGATTTTGAGGATGGCGTTTCCGCGGCTGAAGAAGCGAATCATTATTATCAAAACCTATGCAATAATACTTTTTCACCGGAATTATTGATTTTCGATAGCACCACTTTCAATGAATCATCTATGATAGTGAGCGGTATGGTCGAATTTGAAGGCAACTGGTCTTTACCCTTCAAAAAATCTGATACAATTCTCAAAAACGGCACAGACATTTATATAATGCAACAAAAAGCGAACATTCAGCAAGTTAATATAGACATTTTAGGGGCTTCCGTCCTAGAACTGAATTACGGAAAGAGTGAGGACTTTAATATGGTAGTACTAACACCCCACGAAGGCGTGTCCATTAAGGACGTCTTAATAAATTTTGATAAAATTTCATTTACTGATCTACTCAGTAAATTGCATAGCGAACCACTGAAAGAAGTAGACGTGAAATTACCAAAATTTTCGGTGATTTCCACAATATCATTAAATGGTCCCTTAACGTCTATGGAAGCGAGGGATATATTCTTGCCCAGTCGAGCTGACTTTTCGGGTATAACAGAAGAAGAAATGTATATATCCTCGATAGAACACAGAGCCTTAATAACAGTGACGGAGACAGGAACTCGGGCAACAGCATTCACTCCGGCAAACTCATCTAAAGACAGGAGAACGACTATCACAGATTCAGTTTCACCTTTCCTTTTCCTTATAGTGTACCGCCCTTCATATAGTATTCTTTTTTGTGGCAAATACGGAGCGTGA

Protein sequence:

>DPOGS213372-PA
MSLEMFYFTQLENEGDLVIAPFSLWNLISLVAFKSSSDTWRQLHSIIGISKRNGQYFNSIFKFTTDVMTDVSKFPGVSFKNVQVVLCDANLELTKSFQRSVIDSGMSLKMLDFEDGVSAAEEANHYYQNLCNNTFSPELLIFDSTTFNESSMIVSGMVEFEGNWSLPFKKSDTILKNGTDIYIMQQKANIQQVNIDILGASVLELNYGKSEDFNMVVLTPHEGVSIKDVLINFDKISFTDLLSKLHSEPLKEVDVKLPKFSVISTISLNGPLTSMEARDIFLPSRADFSGITEEEMYISSIEHRALITVTETGTRATAFTPANSSKDRRTTITDSVSPFLFLIVYRPSYSILFCGKYGA-