Monarch geneset OGS2.0

DPOGS215953
TranscriptDPOGS215953-TA1266 bp
ProteinDPOGS215953-PA421 aa
Genomic positionDPSCF300078 - 1048504-1057387
RNAseq coverage7x (Rank: top 87%)
Annotation
HeliconiusHMEL0164637e-5450.79% 
BombyxBGIBMGA001066-TA5e-2742.24% 
DrosophilaSpn2-PA3e-2527.88% 
EBI UniRef50UniRef50_B6DZ413e-7640.80%Serpin-33 n=1 Tax=Bombyx mori RepID=B6DZ41_BOMMO
NCBI RefSeqNP_001129363.15e-7740.80%serine protease inhibitor 33 [Bombyx mori]
NCBI nr blastpgi|2095714601e-7540.80%serine protease inhibitor 33 precursor [Bombyx mori]
NCBI nr blastxgi|2095714609e-7639.81%serine protease inhibitor 33 precursor [Bombyx mori]
Group
Gene OntologyGO:00048671.2e-42serine-type endopeptidase inhibitor activity
KEGG pathwaytca:6621392e-25 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[55-419] IPR0237963.6e-60Serpin domain
[60-401] IPR0002151.2e-42Protease inhibitor I4, serpin
Orthology groupMCL26839 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215953-TA
ATGTCCAGAGCAAGTGGCGATCAAGAGTGCTGTATACGAGATACAGCCTTAAAACTTTTTGCATCTTTTTTCCAAGATAGTGGTCGCAAATTTGTTGTGTTAATCTTCATTTTCAATACATTTCATACAGCAAAATCCAGCAAATTAGCAAACAGCTTAGCAATGGAACTGCTTCTCAATACTAAGATAAATACAGTTGTCTCGCCACTCTTGGTTCGTTTTCCACTATGCAAGTTGACCTCAGAAGCTGAAGGTACTACAAAAAGTGATTTACTCTCAATATTAGGAATAAAATCTAAAGAGGTTCCAAACTGCTACACCGAAATAAAAGACGTGTTAGAGGGAATGTCACATATGGATTTCTTGAGCTTAAACAAAATCATAGTGAATTATACTGACGACGTGAAACTCCAATTCATATCGAACGGCGCCAACTACGGAATCAAAGTTGATAAAATCGGCTTTAACTATCCAAGTTCTGCAATATCCTTTATAAACAGATGGGTTGATAAAGCCACTTTAAAAAGAGTAACAGATGTTTTGGAGCCGAATGATATCAACAACAAGTCATCCATGCTGATAATAAACGCGGCTTATTTGAAGGTAACCTGGGAGATCCCATTCGATATAACATTGACAAAGAATGCAAAGTTCTATCAAATCGATGGAAAGGTGTCCACTGTATCAATGATGACTAAAATGGATACCTGTTTGTACTTCCAAGACGAAAATATGAATATTCAGTTTGTAACCATGAAATTGGCCAGTTTTGGCATAACAATGTCGATCGCATTACCTGAAACACGAAAAGGTCTTTCTGAACTCCTGCACAAATTATTACAAGAGCCAAACTATTTTGATCAAATACAAAATAATATGAAGTTTGAAACTATTAAAATAAATTTGCCGAGGTTTAAAATAAAGAACTGTTTCGAATTGGATAAATACCTGAAAAAGATTGGCGCAAGTCAACTTTTTAATGAAACATACTCTGGCCTGGATAGAATATTGAAGAAAAACAGCACATCCAAAAATATACATCTTAGTAAAGTGAAACAAAAAATCTTTATTGATATTGATGAAATGGGGATATTCAGGAAACTCCCTGGAGAGATGTACGATGAAACGCATGGACTAGCTTTTGGGTCTACCGAAGTTGTTGGAGATCATCCTTTTTACTTCACCGTCAATTTACAAACTAGTCCTTTGGAAAATGCTCGCAGATATCATTTATTTCAAGGAGTTTATTATGGACCTGAAAATTAA

Protein sequence:

>DPOGS215953-PA
MSRASGDQECCIRDTALKLFASFFQDSGRKFVVLIFIFNTFHTAKSSKLANSLAMELLLNTKINTVVSPLLVRFPLCKLTSEAEGTTKSDLLSILGIKSKEVPNCYTEIKDVLEGMSHMDFLSLNKIIVNYTDDVKLQFISNGANYGIKVDKIGFNYPSSAISFINRWVDKATLKRVTDVLEPNDINNKSSMLIINAAYLKVTWEIPFDITLTKNAKFYQIDGKVSTVSMMTKMDTCLYFQDENMNIQFVTMKLASFGITMSIALPETRKGLSELLHKLLQEPNYFDQIQNNMKFETIKINLPRFKIKNCFELDKYLKKIGASQLFNETYSGLDRILKKNSTSKNIHLSKVKQKIFIDIDEMGIFRKLPGEMYDETHGLAFGSTEVVGDHPFYFTVNLQTSPLENARRYHLFQGVYYGPEN-