Monarch geneset OGS2.0

DPOGS215393
Transcript: DPOGS215393-TA, 3903 bp
Protein: DPOGS215393-PA, 1300 aa
Genomic position: DPSCF300088 - 127076-162195
RNAseq coverage: 1176x (Rank: top 11%)
Annotation
Heliconius      HMEL007947        0.0  74.17%
Bombyx          BGIBMGA002365-TA  0.0  76.49%
Drosophila      Fur2-PC           0.0  61.90%
EBI UniRef50    UniRef50_Q26489   0.0  80.47%  Endoprotease FURIN n=4 Tax=Coelomata RepID=Q26489_SPOFR
NCBI RefSeq     XP_001355065.2    0.0  65.61%  GA15057 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp  gi|1167860        0.0  80.47%  Endoprotease FURIN [Spodoptera frugiperda]
NCBI nr blastx  gi|1167860        0.0  80.47%  Endoprotease FURIN [Spodoptera frugiperda]
Group
Gene Ontology
GO:0004252  5.6e-112  serine-type endopeptidase activity
GO:0006508  5.6e-112  proteolysis
GO:0016020  1.4e-08   membrane
GO:0007169  1.4e-08   transmembrane receptor protein tyrosine kinase signaling pathway
GO:0005524  1.4e-08   ATP binding
GO:0006468  1.4e-08   protein phosphorylation
GO:0004714  1.4e-08   transmembrane receptor protein tyrosine kinase activity
KEGG pathway 
InterPro domain
[23-1180]  IPR015500  0         Peptidase S8, subtilisin-related
[132-471]  IPR000209  5.6e-112  Peptidase S8/S53, subtilisin/kexin/sedolisin
[467-617]  IPR008979  6.1e-44   Galactose-binding domain-like
[693-856]  IPR009030  6e-37     Growth factor, receptor
[520-608]  IPR002884  4.1e-26   Proprotein convertase, P
[25-97]    IPR009020  1.7e-25   Proteinase inhibitor, propeptide
[743-790]  IPR006212  5.8e-15   Furin-like repeat
[699-806]  IPR006211  1.4e-08   Furin-like cysteine-rich domain
Orthology group: MCL10133, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
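
The InterPro domain ranges listed in this record overlap and nest (for example, the Furin-like repeat at 743-790 falls inside the Furin-like cysteine-rich domain at 699-806). The following is a minimal Python sketch for inspecting that layout. It assumes the bracketed ranges are 1-based, inclusive amino acid positions on the 1300 aa protein, which the record does not state explicitly; the names DOMAINS, span_length, contains and nested_pairs are hypothetical helpers, not part of any database API.

```python
# InterPro domain coordinates copied from the annotation in this record.
# Assumption: positions are 1-based and inclusive on the protein sequence.
DOMAINS = [
    ("IPR015500", 23, 1180),   # Peptidase S8, subtilisin-related
    ("IPR000209", 132, 471),   # Peptidase S8/S53, subtilisin/kexin/sedolisin
    ("IPR008979", 467, 617),   # Galactose-binding domain-like
    ("IPR009030", 693, 856),   # Growth factor, receptor
    ("IPR002884", 520, 608),   # Proprotein convertase, P
    ("IPR009020", 25, 97),     # Proteinase inhibitor, propeptide
    ("IPR006212", 743, 790),   # Furin-like repeat
    ("IPR006211", 699, 806),   # Furin-like cysteine-rich domain
]


def span_length(start, end):
    """Length in residues of a 1-based inclusive interval."""
    return end - start + 1


def contains(outer, inner):
    """True if the inner domain lies entirely within the outer domain."""
    return outer[1] <= inner[1] and inner[2] <= outer[2]


def nested_pairs(domains):
    """Return (outer_id, inner_id) for every nested pair of distinct entries."""
    pairs = []
    for outer in domains:
        for inner in domains:
            if outer is not inner and contains(outer, inner):
                pairs.append((outer[0], inner[0]))
    return pairs


if __name__ == "__main__":
    for ipr_id, start, end in DOMAINS:
        print(ipr_id, start, end, span_length(start, end))
    for outer_id, inner_id in nested_pairs(DOMAINS):
        print(inner_id, "lies within", outer_id)
```

Under the stated assumption, the broad IPR015500 match contains every other listed domain, which is why it appears as an outer entry for all of them.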

Nucleotide sequence:

>DPOGS215393-TA
ATGGTGCTGTCATGGCGTGCGCTGGCGCTGCTAGCCGCGCTTCAAATGTGCTCATGCTTACCGAAAGCGCTATATCACAATCACTTCGCTATCCATGTACCAGCTGGCGAAAAACACGCGGATGACATCGCGACAAGACATGGTTTCATCAACCATGGCCAGATTGGAGCCCTCAAGGGATACTACTTGCTGTCACACCACGGGGTTCACAAACGGTCTACAGAACCAAGTCACGAGCATCATCACAAGTTAAATAATGAACCCGAGGTTAAATGGTTCGAGCAACAACGCGAGAGGCGTAGAATGAAACGTGATTACAGCCCTTATGAGAGCACATTATGGTCGCAGTTGTCTCGAAGGCTGCCCTCCCATAGAACCCGTCACCGCGCCATTACACCCTCGCCTTTCTTCTCCGATCCGTTGTTTAAAGAGCAATGGTATTTGAATGGCGGTGCGAAAGATGGACTTGACATGAATGTAATGCCGGCGTGGCAAAGAGGTTACACTGGAAAGGGTGTAGTTGTGTCAATCCTTGATGACGGTATACAAACCAACCATCCCGACCTCGCGCAGAACTATGATCCTCTCGCTTCCACTGACATAAATGGGAACGACGATGATCCAATGCCTCAAGACAACGGCGATAACAAACATGGAACACGTTGTGCTGGGGAGGTTGCCGCTGTAGCATATAATCAGTACTGTGGCGTCGGTATAGCATACAATGCTAGTATAGGAGGAGTCCGTATGTTGGACGGTGTAGTGAATGACGCGGTGGAAGCCAGAGCTCTTGGTCTTAACCCCGATCACATTGACATATACAGTGCCTCGTGGGGTCCTGAAGATGATGGGAAGACGGTAGACGGGCCGGGCCCGCTTGCTAGAAGAGCTTTTATTTATGGAGTTACAAGTGGTAGGCGCGGTAAAGGAAGTATATTCGTGTGGGCTTCGGGAAACGGTGGTCGCCATACAGACTCCTGTAATTGTGATGGATATACAAATAGTATATTTACTTTATCAATATCGAGTGCGACACAAGGGGGATTTAAACCTTGGTATCTAGAAGAATGTTCATCGACTCTAGCCTCCACATACAGCTCGGGTACTCCGGGTCATGATAAGAGTGTTGCTACTGTTGATATGGACGGCAGATTAAGATCAGATCATATTTGTACAGTGGAACATACAGGAACGTCCGCATCTGCACCTTTAGCAGCCGGTATTTGTGCCCTTGCGCTGGAAGCTAATCCAAATTTGACCTGGAGAGATATGCAGTATTTAGTAGTGTTAACATCACGTCCACAACCCCTCGAAAAAGAAACTGGGTGGATTGTGAACGGTGTGAAGAGAAAAGTTAGTCACAAGTTTGGCTATGGTTTAATGGATGCATCGGAAATGGTGAATTTGGCGGAACAATGGGTATCAGTACCACCGCAACATATATGTAAATCGCAGGAAATTAATGAGGACAAAGCTATTGAATCCTCATTTGGTTATACACTAAAAGTACATATGGATGTTAATGGTTGCAGTGGAACAGTTAATGAAGTGAGATATCTAGAACATGTCCAGTGCAAAATATCGTTGAGGTTTTTCCCTAGAGGTAATCTCCGCATACTTCTTACTTCACCGATGGGAACAACGTCCTCTTTATTATTTGAAAGACCTAGAGATGTTATCAGTTCCAACTTTGATGATTGGCCCTTCTTAAGTGTTCATTTCTGGGGTGAGAGAGCCGAAGGTAGATGGACTTTGCAGATCGTCAATGCTGGTAACAGGCATGTTAACCAACCAGGCATTCTTAAAAAATGGCAGTTGATATTTTATGGCACATCAACAGACCCTATACGGCTAAGGTCGAAAAGACCTGCACAAGCAGCGCCAGCCTTTGCTTTTCCAACTGCCGCTGATGGTTACGAAGCTGCCGGGGATTCTTTTTACAATACTGACGCGTTTACAAATTACCAGAACTTTCCTTCATTATTCGCCGCTGGGTCAAA
CCCCGAAAAGGCGATAGCACGTCTCGACGGACACAATGTCCCTTCACCGCATGGGGAAAATGTCCTCGCTGATAGTAATGATAAGCGCGTCATGCACGATTGTGATCCCGAATGCGATTCTCAAGGTTGCTATGAAAAAGGACCCACACAATGTATAGCCTGTAAGCATTACAGACTAGATGATGCCTGTGTATCTCGATGCCCTCCGAGAAGTTTTGCCAATCAAGGTGGTGTTTGTTGGCCCTGTCATGAAACATGTGAAACATGCGTGGGCCCAGGACAAGATTCATGTTTGACATGTTCGCCAGCACATTTATTAGTGGCCGATTTGGGTTTGTGTATACAACAATGTCCTGATGGATATTGGGAAAATAGCGAAGCGTCAGCTTGTCGGCCGTGTGCTGCACACTGTTCCACCTGCTCAGAGAGAGCTGATGCATGTACGTCATGTGAACATCATTTAGTACTATACAACGGAACTTGTGCCACATCCTGCCCACCTTCAACGTATGAAACGGAAGACTATAGCTGTGCTAAATGTCATGAAAGTTGCAACACTTGTCACGGACCTGGAGAGCAACATTGTGTCACATGTCCTGCTTCTAGTTATGTGCTTGATGGCCGTTGTCTGAGCACGTGTCCAAGTGGTTATTACGCAGATAAGAAAAGGAAAGAATGCATGAAATGTCCCATTGGTTGTGCAACTTGTTTGGCTTCTTTGTGCCAATCTTGTAACTCAAATTGGGAATTGAACAGAAAAGGGAAATGTGTGGCTGCTGGAAGTGACAGGTGTAATGCTGGTGAATTTTCGGATGGTAGCCAATGTCAGCTGTGTCACAATGACTGCGATTCGTGTTACGGTGAAACTGAGGGCAACTGTCTAACGTGTCCATCGCCCAACCTTTTACAAAATCACAAATGTGTACCAGAATGTAGTCGTGGGTACTACTCTGAAGCCGGTCGCTGCACTCGTTGTATCCACGGTTGCAGCGAGTGCGCATCGAGACTAAACTGCACTTTCTGCACTGGGTCTCTCAGACTTCAGTCTGGTACTTGTAGAACAGCCTGTGCAGAAGGTTACTACGCTGATCGTGGTACATGTTCCAAGTGCTACTTATCGTGTGCTACTTGCATTGGTCCACGTCGTGATCAGTGCGCCTCGTGTCCCCGTGGCTGGAGGCTGGCAGCTGGTGAATGTCACCCTGAATGTCCACAGGGTTTCTATAAGACCGCCGACGGTTGCCGCCACTGTCACCACTACTGCCGCGAGTGTGACGGCTCCGGGCCGTTACACTGCACGTCGTGTCCTCAACGCTTCATGTTAGACGGCGGGCTGTGTATGGAGTGTTTGAGCTCTCAATACTATGAAAGCAGCAGTGGATTATGTCGATCGTGTCACGAATCGTGTAGGATTTGCTCTGGACCCGGACAGTACAGCTGTACGGCGTGTTCGAGACCATTGCGGTTGGATAGGTTGAACAACCAATGTGTTCAGTGTTGTTCGGAGCGAGCTAACAACGCTACCTCAGACTGTTGTCACTGTGATTCTGACACAGGTGAGTGTATTAACTCGTCGGGCGCTGTTCGTCGTATCGCGGAGTGGGGCGCGCTACACACCGACGAGAACCACCCAGAACTGGCGACCACTGTGATCGTGTTGTGTGCGGCGGCCGGGCTCGTGTTGGTAGCTGTGGCAGTCGTGTTGCATAAGCGGTCACAGAAGCCGCAGGCACGATCTAAAGGACTAACTTACGCGGCCTTATCCTCCGAGGACGCGGATGTGCTGGTGGTCGGGCGTAATCGGTTGGTCGAGCACGTGCTCGAAGACGAGCACGCGCGGCCCGAGCACGTGCTCGGCTCCGACGATCTAGAGCACGCGCCGCTAATGAAACATTCCACATAG

Protein sequence:

>DPOGS215393-PA
MVLSWRALALLAALQMCSCLPKALYHNHFAIHVPAGEKHADDIATRHGFINHGQIGALKGYYLLSHHGVHKRSTEPSHEHHHKLNNEPEVKWFEQQRERRRMKRDYSPYESTLWSQLSRRLPSHRTRHRAITPSPFFSDPLFKEQWYLNGGAKDGLDMNVMPAWQRGYTGKGVVVSILDDGIQTNHPDLAQNYDPLASTDINGNDDDPMPQDNGDNKHGTRCAGEVAAVAYNQYCGVGIAYNASIGGVRMLDGVVNDAVEARALGLNPDHIDIYSASWGPEDDGKTVDGPGPLARRAFIYGVTSGRRGKGSIFVWASGNGGRHTDSCNCDGYTNSIFTLSISSATQGGFKPWYLEECSSTLASTYSSGTPGHDKSVATVDMDGRLRSDHICTVEHTGTSASAPLAAGICALALEANPNLTWRDMQYLVVLTSRPQPLEKETGWIVNGVKRKVSHKFGYGLMDASEMVNLAEQWVSVPPQHICKSQEINEDKAIESSFGYTLKVHMDVNGCSGTVNEVRYLEHVQCKISLRFFPRGNLRILLTSPMGTTSSLLFERPRDVISSNFDDWPFLSVHFWGERAEGRWTLQIVNAGNRHVNQPGILKKWQLIFYGTSTDPIRLRSKRPAQAAPAFAFPTAADGYEAAGDSFYNTDAFTNYQNFPSLFAAGSNPEKAIARLDGHNVPSPHGENVLADSNDKRVMHDCDPECDSQGCYEKGPTQCIACKHYRLDDACVSRCPPRSFANQGGVCWPCHETCETCVGPGQDSCLTCSPAHLLVADLGLCIQQCPDGYWENSEASACRPCAAHCSTCSERADACTSCEHHLVLYNGTCATSCPPSTYETEDYSCAKCHESCNTCHGPGEQHCVTCPASSYVLDGRCLSTCPSGYYADKKRKECMKCPIGCATCLASLCQSCNSNWELNRKGKCVAAGSDRCNAGEFSDGSQCQLCHNDCDSCYGETEGNCLTCPSPNLLQNHKCVPECSRGYYSEAGRCTRCIHGCSECASRLNCTFCTGSLRLQSGTCRTACAEGYYADRGTCSKCYLSCATCIGPRRDQCASCPRGWRLAAGECHPECPQGFYKTADGCRHCHHYCRECDGSGPLHCTSCPQRFMLDGGLCMECLSSQYYESSSGLCRSCHESCRICSGPGQYSCTACSRPLRLDRLNNQCVQCCSERANNATSDCCHCDSDTGECINSSGAVRRIAEWGALHTDENHPELATTVIVLCAAAGLVLVAVAVVLHKRSQKPQARSKGLTYAALSSEDADVLVVGRNRLVEHVLEDEHARPEHVLGSDDLEHAPLMKHST-
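
The lengths given in this record are consistent with a complete coding sequence: 3903 bp is 1301 codons, that is, 1300 amino acids plus one stop codon. Below is a minimal Python sketch of that check for FASTA text like the two records above. It assumes the trailing "-" on the protein sequence marks the stop codon and that the transcript is coding sequence only; parse_fasta and lengths_consistent are hypothetical helper names.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict mapping record name to sequence.

    Sequence lines belonging to one record are joined, so a sequence
    that is wrapped over several lines is returned as a single string.
    """
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def lengths_consistent(transcript, protein):
    """Check that the transcript has one codon per residue plus a stop codon.

    Assumption: a trailing '-' or '*' on the protein denotes the stop codon
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


if __name__ == "__main__":
    # Toy records for illustration only (not the sequences of this gene).
    toy = ">toy-TA\nATGGTG\nTAG\n\n>toy-PA\nMV-\n"
    recs = parse_fasta(toy)
    print(lengths_consistent(recs["toy-TA"], recs["toy-PA"]))
```

To check this gene, the same functions can be applied to the full DPOGS215393-TA and DPOGS215393-PA records pasted into one string.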