Monarch geneset OGS2.0

DPOGS214515
TranscriptDPOGS214515-TA1191 bp
ProteinDPOGS214515-PA396 aa
Genomic positionDPSCF300287 - 401129-402980
RNAseq coverage95x (Rank: top 62%)
Annotation
HeliconiusHMEL0178374e-17675.25% 
Bombyx% 
DrosophilaS2P-PA5e-4832.37% 
EBI UniRef50UniRef50_Q7PZA91e-6338.81%AGAP011819-PA n=4 Tax=Culicidae RepID=Q7PZA9_ANOGA
NCBI RefSeqXP_001870882.13e-6838.95%protease m50 membrane-bound transcription factor site 2 protease [Culex quinquefasciatus]
NCBI nr blastpgi|1700491066e-6738.95%protease m50 membrane-bound transcription factor site 2 protease [Culex quinquefasciatus]
NCBI nr blastxgi|1583008909e-6838.27%AGAP011819-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160205.3e-10membrane
GO:00065085.3e-10proteolysis
GO:00042225.3e-10metalloendopeptidase activity
KEGG pathwaydpo:Dpse_GA214572e-47 
 K07765 (MBTPS2)maps-> Protein processing in endoplasmic reticulum
InterPro domain[115-134] IPR0011935.3e-10Peptidase M50, mammalian sterol-regulatory element binding protein
[131-214] IPR0089157.6e-07Peptidase M50
Orthology groupMCL12667 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214515-TA
ATGTCGTTTACTATTCTCTGCACTTTTGTTTTAACGTTTTATGTTGTTATCTGGTTTTTCGATTCATTCTTTAAGAGTTGTATGCACTACCCCTACTATGCATTTTTGGATGGAACAGGTCTGAAAGTTGGTATTTTCAATTTTTCATGGACAACTACAGCTTGTAACAGGTTTATTTACAGATGGAGTAAAAATTTAAACAAAATTTTGAAGAAATGGTTTGCATTTGGGTACATTTTTACAATTGGAATATTTCTTCCATTTGCTCTATGGACACTGCTTTCATTTATTTTTGAGCACTTTTATGAGACCATACAAATAAACAGTGTTCCTGAAGTAAAGGCTGTATTGCCTGGTGTAAATATACCTGCATCGGACTTCTGGGTATACTTTTTAGCTATTGGATTCTGTTCAATGTTCCATGAAATTGGCCATGCTGCTGCAGCAGCTCAGGAAGACGTCCAATTAATAGCAATCAGTGTATATGTATTTACGATTATACCTGTAGCTTTTGTGCAACTAAATACAGAACATTTGAATAGCTTGACCATAGCTAAGAAACTGAAAATATACTGTGCCGGAGTGTGGCATAATATTGCCCTAGCATTTTTAGCCTTGCTTTTATTTTTCTCCGCCCCTGTGCTGTTTAGTTTGGTATATCAAACAGATGTTGGAGTTAGAGTTACCGGATTCAGTCACGATTCTCCCTTACAAGGTGCAAGAGGCCTAGAAGATAATGATGTCATATTATCCATTAATGACTGCACGGTTAAAAATTCTAATGATTGGTCATATTGCTTACGAGTGGCCCATGATCGCTTCGGAATTTGCACAAGTGCTGAATATATTGCACAGAATGATGAAATTATGATGGAGACAATTAAAGAGAATGATGTTGTGGAATGTTGCAGGAAAGATGATTTGTACGGATTCTGCTTTGAATATATGGAGCCCAAAACCATTGTGGACTCGGCTTTACCCGGTCAATACTCCTGTCTCAAACCAAGAGACATGATCAAAGATGGACACCATATTGCAAGAAGTATTATTCAATGTTTGGCTGTGTATTTAAACAAGAATGGTGATTTTATAACGTTCTTCACTGTGTTCACTGTTGTTGTCGGCACAGGAATCACGGTGCCCATACTTATTTACCTCTTTTACCGAGCCATTTATATTGACGGCTATTGA

Protein sequence:

>DPOGS214515-PA
MSFTILCTFVLTFYVVIWFFDSFFKSCMHYPYYAFLDGTGLKVGIFNFSWTTTACNRFIYRWSKNLNKILKKWFAFGYIFTIGIFLPFALWTLLSFIFEHFYETIQINSVPEVKAVLPGVNIPASDFWVYFLAIGFCSMFHEIGHAAAAAQEDVQLIAISVYVFTIIPVAFVQLNTEHLNSLTIAKKLKIYCAGVWHNIALAFLALLLFFSAPVLFSLVYQTDVGVRVTGFSHDSPLQGARGLEDNDVILSINDCTVKNSNDWSYCLRVAHDRFGICTSAEYIAQNDEIMMETIKENDVVECCRKDDLYGFCFEYMEPKTIVDSALPGQYSCLKPRDMIKDGHHIARSIIQCLAVYLNKNGDFITFFTVFTVVVGTGITVPILIYLFYRAIYIDGY-