Monarch geneset OGS2.0

DPOGS215240
Transcript: DPOGS215240-TA (3183 bp)
Protein: DPOGS215240-PA (1060 aa)
Genomic position: DPSCF300047 - 703212-729241
RNAseq coverage: 279x (Rank: top 39%)
Annotation
Heliconius: HMEL015481  0.0  68.96%
Bombyx: BGIBMGA001513-TA  0.0  77.49%
Drosophila: Fur1-PB  0.0  69.81%
EBI UniRef50: UniRef50_Q75WU0  0.0  80.17%  Furin-like convetase n=2 Tax=Obtectomera RepID=Q75WU0_BOMMO
NCBI RefSeq: NP_001036904.1  0.0  80.17%  furin-like convetase [Bombyx mori]
NCBI nr blastp: gi|112982745  0.0  80.17%  furin-like convetase precursor [Bombyx mori]
NCBI nr blastx: gi|112982745  0.0  80.17%  furin-like convetase precursor [Bombyx mori]
Group
Gene Ontology: GO:0004252  2.5e-108  serine-type endopeptidase activity
               GO:0006508  2.5e-108  proteolysis
KEGG pathway 
InterPro domain: [5-732]  IPR015500  1.5e-272  Peptidase S8, subtilisin-related
                 [102-430]  IPR000209  2.5e-108  Peptidase S8/S53, subtilisin/kexin/sedolisin
                 [426-568]  IPR008979  3.8e-46  Galactose-binding domain-like
                 [473-559]  IPR002884  4.7e-31  Proprotein convertase, P
                 [844-945]  IPR009030  1.7e-11  Growth factor, receptor
                 [11-50]  IPR009020  1e-09  Proteinase inhibitor, propeptide
Orthology group: MCL10133  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS215240-TA
ATGCCCTACGACATCGAAATATCCGAGATCTTCGATAATCACTATCATTTTCATCATAATTCCGTGGTGAAGAGATCTATATCGCCGGCCCACGAACATCACAATCTATTAGAAATGGACGACAGAGTACTTTGGGCGAAACAGCAGCGTGTGCTTTCAAGAAAGAAGAGGGACTTCATACCTTACCCCACTAGCACAGCTGAAACGAATTCATCGGAAGTGCCATCAATGAAAAGGGAAGCTTCTAAAAGAATGTCGCACTCGAGAGTAAAAGGACGGCCGCGTTCTACTGACTCCAAATTCCTATTAAATGATCCCAAGTGGCCTCATATGTGGTATTTGAATCGTGGTGGTGGTTTGGACATGAACGTCATTCCAGCGTGGCGGGAAGGTATCACCGGTCGGGGAGTCGTAGTCACCATCCTGGACGACGGCCTCGAGACTGATCATCCAGACCTCGTCTCCAATTATGATCCAATGGCTTCATATGATGTGAACTCCCAAGACTCGGATCCTCAACCACGGTACGACATGATAGACTCCAATCGACATGGGACCAGATGTGCTGGCGAGGTCGCTGCTACAGCTAACAACTCACTCTGCGCTGTTGGTGTCGCTTATGAAGCCAGCGTAGGAGGGGTAAGAATGTTAGATGGAGACGTGACAGATGCGGTCGAGGCTCGTTCTTTGAGTCTTAACCCTCAACATGTAGATATATACAGCGCTTCCTGGGGACCTGATGATGATGGAAAAACGGTTGACGGACCTGGAGAATTGGCCACCCGAGCGTTCATTGAAGGTGTTACAAAGGGTCGAAACGGTAAAGGATCGATATTTGTGTGGGCTTCGGGTAATGGCGGCCGGGAGCACGATAATTGCAATTGTGATGGCTACACAAATTCCATATGGACCCTCTCAATATCATCCGCTACAGAGCGAGGGGACGTGCCCTGGTATTCTGAAATGTGTAGTTCCACCCTGGCAGCTACGTACAGCTCCGGGGCTACTGATGAGAAACAGGTGGTGACAACGGATCTGCACCACTCGTGCACAACAGGTCACACTGGTACTTCAGCCTCTGCTCCCCTCGCAGCCGGAATATGCGCCCTCGCTCTAGAAGCCAACAAGGATCTCACATGGAGAGACATGCAGCATATTGTGGTTCGGACTGCCAGACCTGAGCGTTTGAGTCTTGGTGGCGATTGGAAGGTTAACGGCGTTGGTAGGAACGTGTCACATTCTTTCGGATATGGGTTGCTCGATGCAGCTGGCATGGTCCGACTGGCCAGAACATGGAAAACAGTTCCAACTCAAAGACGATGTGAGCTGGCTGCTCCCAGACCACAGAGAGCGGTGCCACCAAGATCATCTATAACACTTCATCTAAATGTTGGTGCATGTCCGGGAGTAAACTATTTGGAACACGTTCAAGCTCGAGTATCGCTGTCCGCTGCACGTAGAGGAGACCTTCGAATTGCATTGACTTCACCTGCCGGAACTAGAGTGACGTTGCTGGCTCCAAGACCCCGTGATTCCTCTCGAGCCGGTTTCAACGCCTGGCCATTCATGTCTGTACATATGTGGGGAGAGTCACCACTCGGAGTCTGGCAACTGGAGGTTTCAAATGAAGGAAGATACATGGGTCGGGCGACTTTACAAGACTGGTCGTTAACATTATATGGAACAAGCACACCGGCTGCTAAAAACGATCCTATACCATTTAGAAATCCAATAATCCGGAATAGGAGCAATGCGACCCGACCTGTTGTAGTTCAAACAGGGCGTAAAAACAACAGAGGAAAGTCAGTGCCTATAGTTCAGACTTATACAGTGGGACTTGTGAAAAGAAAAAAAAATAAGAATTCAAAGAACCCAAACAGAAATTCGCAGAAAAAACAGGGACGACCGGCAACGCGCTCACCCATAGAGAGTACGACCTCACGCCTGACAACATCTACGACGCCGGATTTAAGCCTTAATTTGAGAGGGGATTTGACAGA
TCCTTTGTCTGACGTCAGAGGGCGTAAGATGTCAAATCTGTTCGAGCAATATCCGAAAATACAAAGAATATACCCGGCGCCCCTGCAAGCCAACGCTGGACCTTTGACTCAGTGGGAGCTAATATTCTATGGTACTGAAACACCTGCACAAGAAAGCGATGTGTCCACTGAAAGTAACGTAGTGGGAACTAATCCCGGGACCGTCTGGAGCGCTATACCATCCGATGTGAACCAGAATGTCATAGATGATGACTTGGCTTTAGTCTGGCACGACTCTCATGCGATCCGTGAGGAAGGACAAGCGGTTGAAGGTGGGTTCAAAGCAGGAGCGGAAGCGGGTACAGTCTCCAGCGGTTGCGCCACACTCACACATCAACCACCACATCGTTGTTTAGTGGCGAGATTCTACTTACCAACCCCGCTCCTATCAAATCCTGTTTTCATAAAAACATCTGCAAACTTCGTGCCTCGTGTATGGATCTTCTGCATCATACGATCTACTTTCAAGAAACGCAACTGCATGGACGCTGAATGCGCGAAGGGTTTGCACTTGTACGACGGTCGTTGCTATCAACGTTGTCCGGCCGGGACGTATGCCAGCGAAATTTTAATGGAGCGCAGCTCACGAAGACGTAACCTTACTTATTTGGAAACAGGCGATTCTTCTGTCGTTATGAAACGTCAGGGTGATGTTTTTCGACCCACGGCATTAGAGGCCTTTGATATGGAACCAGTTGTAAATGTTTCAAAAGCTCCCTTAATCTGCCTACCTTGTCATTACACCTGCGCCACTTGTACGGGTCCTAATAATAATCAATGCTCTTCTTGTTTAGAAGACGCTAAACTCTTTAACTTAACTGATGTAGAACCAAAATTTTACTGCTATCCGAAAACTGTTCTGCCTCACATAACCAACGCTGAATGGCATTATAAAATGAACGTTTTGTTGACGATAGCCCTTGTAACAGTCAGTTTAATTAGTATCTATATAGTTCTGGCTTTTGTTCTAAAACGGATGGGAATATGCTTTGGAAATAATTACGACTCAAACATCAAAATTGCATATAACAAATTAGCGGTTGACGATAAACTTCAAAGCGCTATTGAGATTGAAGAAGAAATTCACAAGGCTCTTAATAAATATTCGAGTGAGAGCGAATCCGATGACGATATGAATTTATAG

Protein sequence:

>DPOGS215240-PA
MPYDIEISEIFDNHYHFHHNSVVKRSISPAHEHHNLLEMDDRVLWAKQQRVLSRKKRDFIPYPTSTAETNSSEVPSMKREASKRMSHSRVKGRPRSTDSKFLLNDPKWPHMWYLNRGGGLDMNVIPAWREGITGRGVVVTILDDGLETDHPDLVSNYDPMASYDVNSQDSDPQPRYDMIDSNRHGTRCAGEVAATANNSLCAVGVAYEASVGGVRMLDGDVTDAVEARSLSLNPQHVDIYSASWGPDDDGKTVDGPGELATRAFIEGVTKGRNGKGSIFVWASGNGGREHDNCNCDGYTNSIWTLSISSATERGDVPWYSEMCSSTLAATYSSGATDEKQVVTTDLHHSCTTGHTGTSASAPLAAGICALALEANKDLTWRDMQHIVVRTARPERLSLGGDWKVNGVGRNVSHSFGYGLLDAAGMVRLARTWKTVPTQRRCELAAPRPQRAVPPRSSITLHLNVGACPGVNYLEHVQARVSLSAARRGDLRIALTSPAGTRVTLLAPRPRDSSRAGFNAWPFMSVHMWGESPLGVWQLEVSNEGRYMGRATLQDWSLTLYGTSTPAAKNDPIPFRNPIIRNRSNATRPVVVQTGRKNNRGKSVPIVQTYTVGLVKRKKNKNSKNPNRNSQKKQGRPATRSPIESTTSRLTTSTTPDLSLNLRGDLTDPLSDVRGRKMSNLFEQYPKIQRIYPAPLQANAGPLTQWELIFYGTETPAQESDVSTESNVVGTNPGTVWSAIPSDVNQNVIDDDLALVWHDSHAIREEGQAVEGGFKAGAEAGTVSSGCATLTHQPPHRCLVARFYLPTPLLSNPVFIKTSANFVPRVWIFCIIRSTFKKRNCMDAECAKGLHLYDGRCYQRCPAGTYASEILMERSSRRRNLTYLETGDSSVVMKRQGDVFRPTALEAFDMEPVVNVSKAPLICLPCHYTCATCTGPNNNQCSSCLEDAKLFNLTDVEPKFYCYPKTVLPHITNAEWHYKMNVLLTIALVTVSLISIYIVLAFVLKRMGICFGNNYDSNIKIAYNKLAVDDKLQSAIEIEEEIHKALNKYSSESESDDDMNL-
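
The transcript and protein lengths listed for this record (3183 bp and 1060 aa) can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: the function name `cds_matches_protein` is hypothetical, and it assumes the transcript is a pure coding sequence with no UTRs, running from the ATG start codon through a single terminal stop codon (shown as the trailing `-` in the protein sequence).

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Check whether a coding sequence length agrees with a protein length.

    Assumes the transcript contains only coding sequence (no UTRs).
    If has_stop is True, one extra codon is allowed for the stop codon.
    """
    # A complete coding sequence must be a whole number of codons.
    if transcript_bp % 3 != 0:
        return False
    codons = transcript_bp // 3
    # Each residue uses one codon; the stop codon adds one more.
    return codons == protein_aa + (1 if has_stop else 0)


# Values taken from the record: 3183 bp transcript, 1060 aa protein.
print(cds_matches_protein(3183, 1060))  # True: 3183 / 3 = 1061 codons = 1060 aa + stop
```

A mismatch would suggest that the transcript includes untranslated sequence or that one of the two sequences is truncated.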