Monarch geneset OGS2.0

DPOGS210504
Transcript: DPOGS210504-TA, 2652 bp
Protein: DPOGS210504-PA, 883 aa
Genomic position: DPSCF300186 - 17942-23083
RNAseq coverage: 853x (Rank: top 15%)
Annotation
Heliconius: HMEL006896    3e-24    34.50%
Bombyx: BGIBMGA012590-TA    3e-104    59.32%
Drosophila: CG9416-PA    5e-101    30.76%
EBI UniRef50: UniRef50_F4W998    5e-164    37.66%    Endoplasmic reticulum metallopeptidase 1 n=6 Tax=Endopterygota RepID=F4W998_ACREC
NCBI RefSeq: XP_001606695.1    7e-161    38.91%    PREDICTED: similar to FXNA [Nasonia vitripennis]
NCBI nr blastp: gi|332029297    2e-163    37.66%    Endoplasmic reticulum metallopeptidase 1 [Acromyrmex echinatior]
NCBI nr blastx: gi|350407744    7e-155    37.56%    PREDICTED: endoplasmic reticulum metallopeptidase 1-like [Bombus impatiens]
Group
Gene Ontology: GO:0008233    7.9e-35    peptidase activity
Gene Ontology: GO:0006508    7.9e-35    proteolysis
KEGG pathway: ani:AN4200.2    9e-24
    K03360 (GRR1) maps-> Ubiquitin mediated proteolysis
    K03360 (GRR1) maps-> Cell cycle - yeast
InterPro domain: [170-339]    IPR007484    7.9e-35    Peptidase M28
Orthology group: MCL10086    Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210504-TA
ATGAATAAGGATACGGAGCCTCGCGGGGCGCCGGCCGGCGAGGAGGAGCTGCTACTGAAGGAGAACGTACTGGAGAGAGGCGTAGTGCCCGTTTGGGTCGTGTGCATGGCGGCGTGCCTCACCGCCGGGACTCTGTTGGCCGCGGGGGCCGTCGACCGTCGTCTGCCGGAGCCTTTGCCGCGAGATGCGCCGGCGCAGCTGTTCAGCGCCGAGAGGGCCTACGACCACCTCATTAACCTGACGTCCATCGGCCCTCGGGTGGCGGGCAGCTATGAGAACGAGGTATTGGCCGTCCGGGAGCTGGTGTCGGCAGCCCGCTCTGTGGCCGCCGCCGCCAGCCCACACAACCTCGTTGACTACGACGTGTTCACCGCCAGCGGTGCCTTCTCGCTCACCTTCCTCGACGGCATGACTAACATCTATCGAGACGTTCAGAGCGTCGTGATTCGGATCAGAGGCGCGGGGGAGGCGAGCGGCCCGGGGAGGGGGTCTGCGCGAGCACCCGCCGCTCTACTCATCAACTGCCACTTCGACACCGTGCCTGACAGTCCTGGGGCCAGCGACGACGGCGCGGGCTGTGCAGTGGCGCTGGAGACGGCCCGGGCCCTCGCCGCCGCGCCGAGGCCGCTGAGACATCGCGTGCTGGTGTTGTTGAATGGTGCGGAGGAGAACATCCTGCAAGCGAGCCACGCCTTCGTCACCAGCCACGCCTGGGCGCGAGGAGCGCGGGCCTTCATCAACATCGAGGCGTGTGGCGCCGGGGGCCGCGAGGTGCTGTTTCAGGCCGGCCCGCACGACCCCTGGATAGTGGAGGTGTACGCGGGGGCGGTGCCGCACCCCTTCGCCTCTTCGCTGGCGCAGGAGCTGTTCGAGAGCGGCCTCATCCCCGCAGACACCGACTTCCGTATATTCCGAGATTTCGGGAACATGTCCGGCGTGGATCTCGCGTGGAGCAGCAACGGGTACGTGTACCACACGCGGCTGGACACGGCCGACCGTGTGCCGCTTCCCGCCCTCCAGCGCACTGGAGACAACGTGCTCGCCCTCGCTCACGGGTTGCTGAGCAGCGAGCGACTGGAGCAAGAGACGGAGCGTGAACGCCAGCCCGTGTTCTTCGACGTGGTGGGTGTGGTGGTGGTGGCGGCCCGCGCCACGCTCGCCGCATCCGTCGCCGTGGCCGTGTTGCTGCTCACCGTGCTGGCATTAGTGCTATCGGCCAGGGACGCCGCCAGGGAACTGTACATGCCGGCGCGCTTGTGGCTCAAGTTGGTGTTCTTGATGGCGTGGCGGGCCGTTCTGTGTACGGCGGCCGGGGTCGCCGCGTCGGCGGGGGTGGCTCTGGTGCTGCACGTGCTCGGCGCGAGGATGTCTTTTTACGCACAGCCGGCGCTACTTGTGCCGCTGTATGCTTTGCCCGCGCTGGCGGGGTCGTGGGCGGATGCTCGTCTGGCCGGTGGATCGCGCCGCGGGCCGGCGGGGTTGCTCCGTGGCTGGGTGGCGTGGCGCGCCTGGAGAGACGCGCTGGCGCTGCTGACCGCGTCGTCGCTGGCGCTGCTGGGAGCGCTCGGCCTGCGCTCCGCCTTCTTGCCGGCTCTCTGGACTCTCCCCGCGCTCTCGTCGCTACCGTTCCGTCTGGCAGCGGGCACGTCTCCACCGCCGCAGACCGCCGCCGCACTCCACGCCGCGGGCTCGACCCTGCCGGCCCTGCAGACGGCGTATCTGGCGCTCAACTCCATCAACATGTTCGTGCCCATCATGGGTCGCGCCGGCACGGCCTTCTTGCCGGCAGACGTGATGATGTCCGTGGTGGTGTCGTCCCTGACCCTGCTCACCTTCAGTTGGATGCTGCCACTCGTGGTCGCCGCGAAGAGACTCAACGTTCTGCTGTACGCGGTGCTGGCCGCCAGCTGTGTGGGCGCCTTGTACTCGCTCAGTCCCTTGGGCGCGCCGTACAGTGAGACGCGCCCCCAGCGTCTGATGGTGTTCCACACTCGTCGCTC
GTACACTCCTCTGGGCGCCGGCGACCCCGTCTCCCTGGAGGACTTCTTCTGGATGCCGGAGTTGGACGTCAACACGCCACACTCAATGGACAAATACATCGAGGGCGTGTCCGCGGCGCGCGTCACGGCGGCCGAGGAGTGTTCGCGCTGGGCCTACTGCGGCGCACCTTACTTTCTGCCCGTGCTGTCGCGCGTATCGCGAGGATACTCCATGCCGGCGCCGGAGCCCCCCCTGCCGCGGCTGCGGGTGGCTTCGCGCCTGCTGGTCGCGGACGGTGACCCCGGCAGCCGCACGTTGCAGCTGGACCTGTCGGGCACACAACACGCCGTGCTGGTGTTGGCACCGGCGGAGGGGGTGAGGGTCACGCGCTGCTCCGAACTGAACGGGCCGCCGCGGGAGGGACCGGCGTGGGGCGCGCGACGCACCTACTTCGTGACCCTGCACCATGCGCGCGACCCGCACACCTGGCGCCTGGAGTGCGTCCTCGAGGGCCGGCCGGTGGCGGAGGGCTGGGTGCAGGTGTCTGCGGCGGGTCACGCCATGTTCGGTCCGCGGCGCCTGTCGGACTCCCACTCCCGGCTCCTGCAGGCCGCGCCGCCGCACGTGGCGGTCACCGGCTGGGGAGTCGACCTCCACATCCTGGACCTGTAG

Protein sequence:

>DPOGS210504-PA
MNKDTEPRGAPAGEEELLLKENVLERGVVPVWVVCMAACLTAGTLLAAGAVDRRLPEPLPRDAPAQLFSAERAYDHLINLTSIGPRVAGSYENEVLAVRELVSAARSVAAAASPHNLVDYDVFTASGAFSLTFLDGMTNIYRDVQSVVIRIRGAGEASGPGRGSARAPAALLINCHFDTVPDSPGASDDGAGCAVALETARALAAAPRPLRHRVLVLLNGAEENILQASHAFVTSHAWARGARAFINIEACGAGGREVLFQAGPHDPWIVEVYAGAVPHPFASSLAQELFESGLIPADTDFRIFRDFGNMSGVDLAWSSNGYVYHTRLDTADRVPLPALQRTGDNVLALAHGLLSSERLEQETERERQPVFFDVVGVVVVAARATLAASVAVAVLLLTVLALVLSARDAARELYMPARLWLKLVFLMAWRAVLCTAAGVAASAGVALVLHVLGARMSFYAQPALLVPLYALPALAGSWADARLAGGSRRGPAGLLRGWVAWRAWRDALALLTASSLALLGALGLRSAFLPALWTLPALSSLPFRLAAGTSPPPQTAAALHAAGSTLPALQTAYLALNSINMFVPIMGRAGTAFLPADVMMSVVVSSLTLLTFSWMLPLVVAAKRLNVLLYAVLAASCVGALYSLSPLGAPYSETRPQRLMVFHTRRSYTPLGAGDPVSLEDFFWMPELDVNTPHSMDKYIEGVSAARVTAAEECSRWAYCGAPYFLPVLSRVSRGYSMPAPEPPLPRLRVASRLLVADGDPGSRTLQLDLSGTQHAVLVLAPAEGVRVTRCSELNGPPREGPAWGARRTYFVTLHHARDPHTWRLECVLEGRPVAEGWVQVSAAGHAMFGPRRLSDSHSRLLQAAPPHVAVTGWGVDLHILDL-
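
The lengths reported for this record can be cross-checked with a little arithmetic. The sketch below assumes that the transcript entry is the coding sequence only (it starts with ATG and ends with the stop codon TAG) and that the genomic coordinates are 1-based and inclusive. Neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def cds_length_bp(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Length in bp of a coding sequence that encodes `protein_aa` residues."""
    # Three bases per residue, plus one extra codon if the stop codon is included.
    return 3 * (protein_aa + (1 if has_stop_codon else 0))


def span_length_bp(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record for DPOGS210504.
transcript_bp = 2652
protein_aa = 883
genomic_start, genomic_end = 17942, 23083

# Does the protein length account for the whole transcript length?
cds_ok = cds_length_bp(protein_aa) == transcript_bp

# Span of the gene model on scaffold DPSCF300186.
genomic_span = span_length_bp(genomic_start, genomic_end)

print(cds_ok, genomic_span)
```

Under these assumptions, 883 codons plus one stop codon give exactly 2652 bp, and the genomic span is longer than the transcript, which is what a gene model containing introns would produce.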