Monarch geneset OGS2.0

DPOGS207577
TranscriptDPOGS207577-TA4455 bp
ProteinDPOGS207577-PA1484 aa
Genomic positionDPSCF300072 + 243015-265162
RNAseq coverage133x (Rank: top 56%)
Annotation
HeliconiusHMEL0065282e-12535.82% 
BombyxBGIBMGA004728-TA1e-5436.02% 
DrosophilaSpn4-PB3e-4834.13% 
EBI UniRef50UniRef50_G9F9J31e-6238.75%Seminal fluid protein CSSFP041 n=1 Tax=Chilo suppressalis RepID=G9F9J3_9NEOP
NCBI RefSeqNP_001139719.19e-6338.42%serine protease inhibitor 28 [Bombyx mori]
NCBI nr blastpgi|3640236335e-6238.75%seminal fluid protein CSSFP041 [Chilo suppressalis]
NCBI nr blastxgi|2263429142e-6138.42%serine protease inhibitor 28 [Bombyx mori]
Group
Gene OntologyGO:00048672.9e-96serine-type endopeptidase inhibitor activity
KEGG pathwaydgr:Dgri_GH198622e-55 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[1107-1483] IPR0237963.3e-101Serpin domain
[1112-1483] IPR0002152.9e-96Protease inhibitor I4, serpin
[492-555] IPR0029197.4e-16Protease inhibitor I8, cysteine-rich trypsin inhibitor-like
Orthology groupMCL10132 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207577-TA
ATGGTTAAGTGGGTGTATTTTGTGTTTTTAATATCCTTGGCTGGAATCATATCATGTGAAGCAATATCACCTACAGCTGTTACTCTACCTTCCGAAGATATCGACGGAGAAGATGACGATGGTGATTGTGGTGATGCATATGAATGCGGCTTAAACGAAGAATACTCGTCTTGCGCAAATGGTGGTTGTAGAAGATTTACTTGTGCTATTAACGGTATCCTTTGTGTGGATTTAATAGAAGGTGGCTGTAAAAAGGGTTGTATCTGTAAAGCATCTTATACACGAGCAGATAACGGGACCTGCATTCCGGAAGATGAATGTCCTCCTACATGTGGAGATAATGAAGTATATGATCTTTGCCCCTCACCTTGTCCACCAAAAAGATGTGACGTAGATGAACAACTTATAGATTGTGCACCAAACCCATTACCTGGAGATGCAAATTGTGAACCAGGATGTCGCTGTGCTGATAACTATTATAGAAACGAAAACGGCTTATGCGTGGCAAAAGAGGAATGTCCTCCTTTAATTGAATGTGGACCCAATGAAATAGCATTAAGTTGCGTAAATGGAGGTTGTTGGAAAGAAAGTTGTACTTTTCCACAAACGATTTGTGGAAAGCCTGCGCCTGGTAGCTGCAAGGAAGGATGTAGATGTAAAGAAAATTATTTGCGTTCTAATAACGGCACTTGTATTCCGGAAAGTGACTGTCCCCAGACAGAGCTTTGCGGTCCTAATGAGGAGTATGGATGTGATACTTCTTGCAACAAACAATGTATTACTATTGACGGAAAATGTATTCCTCCTAAGGAATGCAAACCTTGCAAGTTCCAATGCCGTTGTAAGGATGGTTATACCAGAAATGAAAATGGTGAATGCATTTCAACAACTTGTGGGATGAATGAAGTGTATGACGAATGTCCGCCACTTTGTCCACCTCAACGATGTGATGTAGATGACAGACTAATAAGATGCGCAGAAAACCCAAAACCGGGTGATCCTAAATGTAAGCCCGGTTGCAGATGCGCAGACAATTTCTATAGAAATGATAAAGGAATTTGCGTAACTAGAGAGGAATGTCAACTACCAAAAAAATGTGGTTTAAACGAGGAATACTCTTCTTGCGCAAATGGTGGGTGTAGAAGATTTACATGCGCAATGAACGCTATTCTTTGTGTTGACTTAATAGAAGGAGGCTGTGAGGAGGGATGTATTTGTAAGGAATCCTACACCCGTGCCGATAATGGAACCTGTATTCCCGAAAATCAATGTCCTACTGTTTGTGGAGATAATGAAGTATATGATGTCTGCCCCTCACCTTGTCCACCAAAAAGATGTGACGTAGACGAACGACTTATAGATTGTGCTCCAAACCCATTACCTGGAGATCCAAATTGCGAACCAGGATGTCGCTGTGCTGATAACTATTATAGAAACGAAAACGGTTTATGTGTGACGAAAGAGGAATGTCCTCCAATAATTGACTGTGGACCGAATGAAATTGCATTACGTTGTGCCAATGGTGGTTGTTGGAAACGAAGCTGTTCTCGACCACAGACGATATGTGTAAAGCTACGTGAAGGAGCTTGTAAAGAAGGGTGTAGATGTAAAGAAAATTATTTGCGTTCTGATAATGGCACTTGCATTCCAGAAAGTGAATGTCCCCAAAATTGCGGACTAAATGAAATTTACGATGAATGTCCCCCCACTTGTCCACCTCAAAGATGTGATATAGACAACCGAGTAATAAAGTGTAAACGAACACCAATGCCTGGAGATCCTGATTGTAAGCCAGGGTGCCGTTGCAAAGATGACTTTTATAGAAATGATTTAGGAATTTGTGTCACTAGGGATGAATGTCCACCACTAATAAAGTGTGGAGACAATGAGTACGCATCAAAATGTGCTAATGGTGGTTGCAAGAAAATAAGTTGCGCTGAACCAATAACCCGTTGTGTAAAATTGCCGCCCGGCGCCTGTGAAGAAGGTTGTCTTTGTAAAGAAAATTATCTAAGAGCTGACAATGGAACATGCATACCTGAAGAGGAGTGTCCAAATATTTGTGGCATCAATGAGGTTTATGACTACTGTCCTCCTATTTGTCCACCACAGCGATGTGACGTGGATATAGCGACGATATTGTGCAAACCTAATCCACTACCTGGGGATCCGGAATGTAAACCAGGTTGTAGATGTGCTGATAATTATGCTAAAAACAAAGACGAGTATTCTGACTGTGCCAATGGTGGCTGCAGGCGACTTTCATGCAATTCGACTGGTATCGAATGTGTCGATTTAAAAGATGCTGAATGTAAACCAGGATGTATATGCAAAACGAACTATACACGTGCAGACAACGGAACTTGTATTGCTGAAAGCCAATGTCCTGCAACATGTGGAGTGAACGAAGTGTATGACGAATGCCCTCCACTTTGTCCACCACAAAGATGTGATGTAGATGATCGAATAATAAGATGCGCAGCAAACCCCAAGCCGGGTGATCCTAAATGTAAGGCAGGGTGCAGATGTGCAGAAAACTACTATAGAAATGATAACGGTGTTTGCGTAACTAGAGAGGAATGCTCTAAATGCTTTGGAGCCAATGAAGTATATGCATGCAAAGACTCCAACCCTCAAACTTGTGAATCTATTGATCAAGAAATCAAACCAGACAAGACATCAAAAAATTGTAAGCGAGAATGTAGATGTGAAGAAGGGTTTTACAGAAATAAGATTGTGAAATGCAATGGACCTAATGAATTCTTCTCATGTGGCTCAGCTTGTGTCAATGAATGTCGTACTATAAAAACACAAAATCAAACTCATTGCCCTATTGTAAATATTGTTTGCAATAAGCAGTGTTATTGTGAGGATGGATATGCTTATGATGAAAATAAGATATGTATACCTATTTCGCAGTGTCCCCCTCAACCATCTTGCGGTGTAAATGAAGAACCTTCTGACTGTGCTAACGGTGGATGTAGGAGATGGGAATGTTCGGATAATGGTGTTATTTGTAAAGACTTACTGGAAGGTTCATGTGAACAAGGCTGCGTTTGCAAAGACTCTTACACTAGAGACGAAGATGGTAATTGTATTCCTGAAGACCAGTGCCCTGAGCTTTGCGGTCCTAATGAGGAGTATGGATGTGATACTACGTGCAACAAACAATGTATTGTTATGGACGGAAAATGTATTCCTCCTAATAAATCCAAAGTTTGCAAGCTCCAATGCCGTTGTAAGGATGGTTATAGCAGAAACGAAAATGGTCTGTGCGTTTCAACCTGTGATGGTCAAAGTGAAAAAATTATCGATGGCTTGACAGAATTCGAAGAGGGTAATCTTCGTTTTACTGAAAGTTTTCTATCGACGGCTGCCAAAAATAATCCCAACGCGAGCGTTATAGGATCACCTTTTAGTATATTGTTTTTGTTAGCCCAATTGGCTTTATATGCGAGTGGAAATTCCAAAACAGAACTTTTGAAGTTGCTTAATCTAAGCAATGATTGTGAGATTCGATCATTTGTTCCGAAATATCTCCAACTAATTTCGGTGACAAACAACGCTAGTTTCGACTTAGCTCAAAAAATTTATGGTAGTGTCAAATATCCGTTCAGTGAAAACTTTAAAAAGGATACTAGAGAGGTATTCAACGCCCAAGCACAAAATGTTGACTTTTCAAATCCAAAAGAAGCAGCTGACATCATCAACAAATGGGTAGCTGAACGTACTCGGAATCTAATTCCAAATCTCATATCGCCTGATGCGCTAAGTTCTAACACGCGTTTAGTTCTTGCAAACGCTATATATTTCAAGGGAGATTGGAGATATCAATTTAAAGCAAGAAATACCAGATTGCTGCCATTCTATACTGGAAAAACAAAAGATGATTCCGTGCAAGTAAAAATGATGAATCAGATTGGTAATTTCAAATATACAGAATTAAAAAGCCCCGATGTACAGATTCTTCAACTACCGTACAAAGCCGCTGATATTTCATTTGTCATATGCTTGCCAAGATCTAGAACAGGAATCAATGATTTGATTAAAAATCTAAAAGTATCGTCTCTCATAAAGCGTTCATTCACTGAATTACAATTTACTAGGGTAGATGTATCCATGCCAACATTAGATATATCGACAACTACCGACTTGAAACCTCTTTTATTTGATGCGGGTGTTAAAGCAATCTTTGACCCTTCAACTGCTGGAATTAGTGGAATATTGGAAAAGCCAGAAGATATTTTCGTGACATCTGGTATTCAAAAAGCTAAAATACTTCTTAATGAAACTGGAACTGAAGCAGCCGCCGCTACAGCAATCACCGTAGGAATAACGTCAGTAGCCGAACCGGTGGATTCTCCTATTGTATTCAGAGCGGACCATTCATTTATTTATTATATTTTATTCAAAAAACTGCCTATTTTTATCGGCATATTTGCAGGCCCGCAGTAG

Protein sequence:

>DPOGS207577-PA
MVKWVYFVFLISLAGIISCEAISPTAVTLPSEDIDGEDDDGDCGDAYECGLNEEYSSCANGGCRRFTCAINGILCVDLIEGGCKKGCICKASYTRADNGTCIPEDECPPTCGDNEVYDLCPSPCPPKRCDVDEQLIDCAPNPLPGDANCEPGCRCADNYYRNENGLCVAKEECPPLIECGPNEIALSCVNGGCWKESCTFPQTICGKPAPGSCKEGCRCKENYLRSNNGTCIPESDCPQTELCGPNEEYGCDTSCNKQCITIDGKCIPPKECKPCKFQCRCKDGYTRNENGECISTTCGMNEVYDECPPLCPPQRCDVDDRLIRCAENPKPGDPKCKPGCRCADNFYRNDKGICVTREECQLPKKCGLNEEYSSCANGGCRRFTCAMNAILCVDLIEGGCEEGCICKESYTRADNGTCIPENQCPTVCGDNEVYDVCPSPCPPKRCDVDERLIDCAPNPLPGDPNCEPGCRCADNYYRNENGLCVTKEECPPIIDCGPNEIALRCANGGCWKRSCSRPQTICVKLREGACKEGCRCKENYLRSDNGTCIPESECPQNCGLNEIYDECPPTCPPQRCDIDNRVIKCKRTPMPGDPDCKPGCRCKDDFYRNDLGICVTRDECPPLIKCGDNEYASKCANGGCKKISCAEPITRCVKLPPGACEEGCLCKENYLRADNGTCIPEEECPNICGINEVYDYCPPICPPQRCDVDIATILCKPNPLPGDPECKPGCRCADNYAKNKDEYSDCANGGCRRLSCNSTGIECVDLKDAECKPGCICKTNYTRADNGTCIAESQCPATCGVNEVYDECPPLCPPQRCDVDDRIIRCAANPKPGDPKCKAGCRCAENYYRNDNGVCVTREECSKCFGANEVYACKDSNPQTCESIDQEIKPDKTSKNCKRECRCEEGFYRNKIVKCNGPNEFFSCGSACVNECRTIKTQNQTHCPIVNIVCNKQCYCEDGYAYDENKICIPISQCPPQPSCGVNEEPSDCANGGCRRWECSDNGVICKDLLEGSCEQGCVCKDSYTRDEDGNCIPEDQCPELCGPNEEYGCDTTCNKQCIVMDGKCIPPNKSKVCKLQCRCKDGYSRNENGLCVSTCDGQSEKIIDGLTEFEEGNLRFTESFLSTAAKNNPNASVIGSPFSILFLLAQLALYASGNSKTELLKLLNLSNDCEIRSFVPKYLQLISVTNNASFDLAQKIYGSVKYPFSENFKKDTREVFNAQAQNVDFSNPKEAADIINKWVAERTRNLIPNLISPDALSSNTRLVLANAIYFKGDWRYQFKARNTRLLPFYTGKTKDDSVQVKMMNQIGNFKYTELKSPDVQILQLPYKAADISFVICLPRSRTGINDLIKNLKVSSLIKRSFTELQFTRVDVSMPTLDISTTTDLKPLLFDAGVKAIFDPSTAGISGILEKPEDIFVTSGIQKAKILLNETGTEAAAATAITVGITSVAEPVDSPIVFRADHSFIYYILFKKLPIFIGIFAGPQ-