Monarch geneset OGS2.0

DPOGS209563
Transcript: DPOGS209563-TA, 3810 bp
Protein: DPOGS209563-PA, 1269 aa
Genomic position: DPSCF300092 + 431497-458421
RNAseq coverage: 37x (Rank: top 73%)
Annotation
Heliconius      HMEL015899        0.0  89.36%
Bombyx          BGIBMGA002365-TA  0.0  62.78%
Drosophila      Fur2-PC           0.0  56.91%
EBI UniRef50    UniRef50_Q26489   0.0  65.35%  Endoprotease FURIN n=4 Tax=Coelomata RepID=Q26489_SPOFR
NCBI RefSeq     XP_002055429.1    0.0  57.24%  GJ18794 [Drosophila virilis]
NCBI nr blastp  gi|1167860        0.0  65.35%  Endoprotease FURIN [Spodoptera frugiperda]
NCBI nr blastx  gi|1167860        0.0  64.66%  Endoprotease FURIN [Spodoptera frugiperda]
Group
Gene Ontology:
  GO:0004252  1.3e-106  serine-type endopeptidase activity
  GO:0006508  1.3e-106  proteolysis
  GO:0016020  3.7e-08   membrane
  GO:0007169  3.7e-08   transmembrane receptor protein tyrosine kinase signaling pathway
  GO:0005524  3.7e-08   ATP binding
  GO:0006468  3.7e-08   protein phosphorylation
  GO:0004714  3.7e-08   transmembrane receptor protein tyrosine kinase activity
KEGG pathway 
InterPro domain:
  [20-1143]  IPR015500  0         Peptidase S8, subtilisin-related
  [108-446]  IPR000209  1.3e-106  Peptidase S8/S53, subtilisin/kexin/sedolisin
  [442-592]  IPR008979  8.8e-45   Galactose-binding domain-like
  [663-782]  IPR009030  1.4e-30   Growth factor, receptor
  [495-583]  IPR002884  8.1e-27   Proprotein convertase, P
  [20-90]    IPR009020  4.6e-19   Proteinase inhibitor, propeptide
  [706-753]  IPR006212  2.2e-13   Furin-like repeat
  [659-769]  IPR006211  3.7e-08   Furin-like cysteine-rich domain
Orthology group: MCL10133, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209563-TA
ATGCCACCTATTCACCTTGTTGTGGTATTGACAGTGTTGGACTTGTGTGTGCCACTGTACCACGATCAGTTCGCGCTGCTCGTGCCAGATGGCCGTGCAGATGAACTGGCTCACAGACATGGGTTCATAAACCATGGCCAGATCGGCAGCCTGAACCATTATTACTTATTTTCACATCCTCACATAAGCAAGAGATCAGTGAACGCAAGCAAAGAATATGAAATTCGTCTCAAGAATGACCCACAGGTGAGATGGGTGATGCAGCAGCGCGAGCTGAAGAGAAGCAAGCGGGATCTGCATCCACGTCACGTCATGAGACAGTCCATGCCGGAGTTCCCAGATCCATTGTTCAAAGAACAGTGGTATCTTAATGGTGGTGCGGCTGAAGGTTTAGATATGAACGTCGGGGTCGCATGGAGAAAAGGCTACACAGGAAAGGGGGTCGTAATAACAATACTCGATGACGGAATCCAGCCTAACCATCCAGACCTGTTGCAGAATTATGATCCCGCGGCATCGACTGACATAAACGGAAATGACACCGATCCAACGCCGCAGGATAATGGCGACAATAAACATGGTACCCGGTGTGCGGGGGAAGTCGCCGCAGTGGCGTATAATAAATATTGTGGCGTAGGAATAGCATACAATGCTAGCATTGGTGGCGTGAGAATGCTAGACGGTCTAGTTAATGACGCAGTGGAAGCTAAAGCACTCGGCTTCAACACTCACCACATAGACATTTATAGCGCTTCCTGGGGACCGGAGGACGACGGAAAAACGGTCGACGGACCTGGCCCCTTAGCTAGAAGAGCTTTTATCAACGGCGTAACAAACGGCAGAGGGGGAAAAGGTTCTATTTTTATATGGGCATCAGGAAATGGAGGGAGGCATACTGATTCCTGTAATTGTGATGGCTACGCAAATAGTATATTCACAATTTCAATTTCTAGTGCTACACAGGGAGGTTACAAGCCGTGGTATTTAGAAGAATGTTCCTCTACTTTAGCATCCACATATAGTTCAGGGACACCAGGTAGGGACAAAAGTGTTGCGACCGTTGATATGGACGTCCAATTGCGGCCTGATCATATTTGTACCGTAGATCATACGGGTACCTCGGCTTCTGCACCCTTAGCAGCGGGAATTTGTGCACTAGCATTGGAAGCAAATTCATTATTAACATGGCGAGATATGCAACATCTAATCGTTATGACCTCCAGGTCACAACCTTTAGATAAAGAAGAAGGATGGATCGTAAATGGTGTTAAAAGAAAAGTAAGTCATAAATTTGGTTACGGTCTTATGGATGCCGGACAAATGGTATCTTTAGCTGAACAATGGATAAATGTTCCACCGCAACACATATGTAAGTCACAAGAAATAAACGAGGACAGGGCAATTGAAACTTCTTTCGGATATACTATATCCGTTCATATGGACGTAAATGGTTGTAGTGGTACAATGAATGAAGTTAGGTTTTTGGAACACGTTCAATGTAAGATTTCGTTAAGCTTTTTTCCAAGAGGTAATTTACGGATATTGCTGACGTCGCCAATGGGCACTACGTCAACTTTATTATTTGAAAGAACTCACGATGCTGCTAGTTCTAATTTTGACGATTGGCCTTTCTTAAGTGTTCATTTTTGGGGTGAAAACGCCGAAGGACGATGGACACTTCAAATAATAAACGCCGGCAATAACCATGTTACTCAACCGGGAGTATTAAAAAAGTGGCAGCTTATTTTCTACGGTACGGCGGCAAATCCGATGCGTTTACGAAATAAAAGTTACTTCAATTCTGATAATATTCGACAAGATGAGAAGACCTATCATATTAACGACGTTTATGATGCTAATGAGTATTCGCAATTTCTCAATGAAATAGAACTTGGGATTTCAGATAGACGTAATTATCCTAAGAATATTCCTTCAGCTCAAAGAAAAAACGTTTTGGCAGATGCTAACGATAAGCAAGTCCAAAGACTGTGCGATCCCGA
ATGTGATTCACAAGGTTGTTATGGTAAGGGTCCCACCCAATGTGTTGCTTGCAAACATTACCGCCTCGATAACTCCTGTGTGTCCAGGTGTCCACCGAGAAGCTTTGTTAACCAAGGAGGTGTTTGTTGGCCTTGCCATGAGTCTTGCGAAACTTGTGCTGGAGCTGGACAGGATTCTTGTCTTACTTGTGCACCAGCACATTTACTTGTTGTCGATTTAGCTGTATGTCTGCAACAGTGTCCAGATGGTTATTATGAAGATCCTGACGCAAACGCTTGTTTTCCGTGTGCAGAACACTGTGACACCTGTTCGGATAAAGCTGATTTGTGTTCTTCATGTGCTCATAATTACGAATTGTATAATGGGTCTTGTTTAGCCACTTGCCCTCCTGGAACATACAAAAAAGAGGATTTTGGTTGTATGCGGTGTCATGAAACGTGTGAGTCCTGTAGCGGCCCGAATGAATCTGAATGTGTTACTTGTAAAATTGGAGAGTACGCGCTAGAAGGTCGCTGTGTATCTAACTGTCTCATCGGGAATTATGCAGATGTTCAAAAAAAAGAATGCATATCGTGTCCCATTGGATGTTCAATTTGCACATATGCAGTTTGTTCCGCTTGTCAAGAGAAATGGGTTCTAACGAAAAAAGGAACATGTCAGCCTGAAGGAAACGACAAGTGTGATACTAATGAGTATTACGAAGGAGGGCGTTGCAAAAATTGCCATTCCACTTGTGAGAAATGTAGTGGTCCTAATGAATGGGACTGTTTATCTTGTTCAAGTCCTCTGTTATTGCAGGGATCAAGGTGCGTTGCGGAATGTGGACAAGGCTTTTACCAGACAGCTGGGAGATGTTCGTTATGCCCGCACACATGCAAAACATGCGTGTCGAGGTTAAATTGTACAACTTGTGCTAATAGTCTTAGATTACAATCCGGTACTTGTCGTTCTACATGCGCAGCTGGTTACTATCCTGATGAAGGAACATGTTCCAAGTGCTACTTATCTTGTGAGACCTGTACTGGCCCGAGAAGAGATCAATGCGCATCATGTCCTCCAGATTGGAGGCTAGCAGCGGGAGAATGTAGACCGGAATGTCCTCAAAACTTCTTTACATGGGGAGACAGTTGTCGTAGATGTCATCATTATTGCCAGGATTGCCATGGAGCTGGTCCCCAAAGGTGTACATCCTGTCCTCAGCATTTTTCCTTAGAAAATGGTTTATGTGTTGAGTGCCTTAGTTCCCAATATTATGAAATTAGGACGAGGACTTGCCGTCCATGCCATGACTCTTGCAGGTCATGTTCTGGACCTGGGCCTACTAGTTGTGTGACGTGTGCACATCCTCTTCGTTTAGATAGGGTGAATCACAAATGTTTGCCATGCTGTACAGAGAATTTAGTTTCGTTTTATTTAAGCACTAATCAATCAACAGATTGCTGCCACTGTGATAAAGATATGGGCGGTTGTCTGAACGGTTCATCGGCGGGTAAGAGACGCATTGCGGAGAACATTGGGGCGCATATGACGCCATCATTTTTTGTCGACGACGCAAAAGAGCAGAATATCTTAGACCGTGATTTATTGCTCCTTTTGAGTGCTGGAGTGGCGGTCCTTGTAATATCCATTGCAATGATTGTACTGAGGTTCAAGTCTAAAAAGTGCAAACATCTTTCACCGTTTCCAAGGACAGGATATTCACAATTGACTTCCATAGACGAGGATTTCACAGCAGTGAGCTTATCGCATACTACATTGAAAGTCATTCAAAGTGACATAAACACAAACCATCTGGAGGAACCAACATAA

Protein sequence:

>DPOGS209563-PA
MPPIHLVVVLTVLDLCVPLYHDQFALLVPDGRADELAHRHGFINHGQIGSLNHYYLFSHPHISKRSVNASKEYEIRLKNDPQVRWVMQQRELKRSKRDLHPRHVMRQSMPEFPDPLFKEQWYLNGGAAEGLDMNVGVAWRKGYTGKGVVITILDDGIQPNHPDLLQNYDPAASTDINGNDTDPTPQDNGDNKHGTRCAGEVAAVAYNKYCGVGIAYNASIGGVRMLDGLVNDAVEAKALGFNTHHIDIYSASWGPEDDGKTVDGPGPLARRAFINGVTNGRGGKGSIFIWASGNGGRHTDSCNCDGYANSIFTISISSATQGGYKPWYLEECSSTLASTYSSGTPGRDKSVATVDMDVQLRPDHICTVDHTGTSASAPLAAGICALALEANSLLTWRDMQHLIVMTSRSQPLDKEEGWIVNGVKRKVSHKFGYGLMDAGQMVSLAEQWINVPPQHICKSQEINEDRAIETSFGYTISVHMDVNGCSGTMNEVRFLEHVQCKISLSFFPRGNLRILLTSPMGTTSTLLFERTHDAASSNFDDWPFLSVHFWGENAEGRWTLQIINAGNNHVTQPGVLKKWQLIFYGTAANPMRLRNKSYFNSDNIRQDEKTYHINDVYDANEYSQFLNEIELGISDRRNYPKNIPSAQRKNVLADANDKQVQRLCDPECDSQGCYGKGPTQCVACKHYRLDNSCVSRCPPRSFVNQGGVCWPCHESCETCAGAGQDSCLTCAPAHLLVVDLAVCLQQCPDGYYEDPDANACFPCAEHCDTCSDKADLCSSCAHNYELYNGSCLATCPPGTYKKEDFGCMRCHETCESCSGPNESECVTCKIGEYALEGRCVSNCLIGNYADVQKKECISCPIGCSICTYAVCSACQEKWVLTKKGTCQPEGNDKCDTNEYYEGGRCKNCHSTCEKCSGPNEWDCLSCSSPLLLQGSRCVAECGQGFYQTAGRCSLCPHTCKTCVSRLNCTTCANSLRLQSGTCRSTCAAGYYPDEGTCSKCYLSCETCTGPRRDQCASCPPDWRLAAGECRPECPQNFFTWGDSCRRCHHYCQDCHGAGPQRCTSCPQHFSLENGLCVECLSSQYYEIRTRTCRPCHDSCRSCSGPGPTSCVTCAHPLRLDRVNHKCLPCCTENLVSFYLSTNQSTDCCHCDKDMGGCLNGSSAGKRRIAENIGAHMTPSFFVDDAKEQNILDRDLLLLLSAGVAVLVISIAMIVLRFKSKKCKHLSPFPRTGYSQLTSIDEDFTAVSLSHTTLKVIQSDINTNHLEEPT-
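
The record lists a 3810 bp transcript and a 1269 aa protein, and the nucleotide sequence starts with ATG and ends with the stop codon TAA. A quick arithmetic check suggests the listed transcript is the coding sequence alone. The sketch below is illustrative only: the helper name `cds_length_from_protein` is hypothetical, and it assumes an uninterrupted reading frame with a single terminal stop codon.

```python
def cds_length_from_protein(aa_length: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp for a protein of aa_length residues.

    Assumes three nucleotides per codon and, optionally, one terminal stop codon.
    """
    codons = aa_length + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Transcript and Protein fields of this record.
transcript_bp = 3810
protein_aa = 1269

expected_bp = cds_length_from_protein(protein_aa)
print(f"expected CDS length: {expected_bp} bp")
print(f"matches listed transcript length: {expected_bp == transcript_bp}")
```

Under these assumptions, 3 x (1269 + 1) = 3810, which agrees with the listed transcript length. The same check can be applied to other records by substituting their lengths.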