Monarch geneset OGS2.0

DPOGS210444
TranscriptDPOGS210444-TA642 bp
ProteinDPOGS210444-PA213 aa
Genomic positionDPSCF300062 - 163927-165103
RNAseq coverage115x (Rank: top 59%)
Annotation
HeliconiusHMEL0222603e-7569.68% 
BombyxBGIBMGA001960-TA2e-8067.30% 
DrosophilaCG30157-PA2e-4246.81% 
EBI UniRef50UniRef50_F4WAD17e-6255.92%Ufm1-specific protease 1 n=7 Tax=Formicidae RepID=F4WAD1_ACREC
NCBI RefSeqXP_001122104.17e-6554.93%PREDICTED: similar to F38A5.1a [Apis mellifera]
NCBI nr blastpgi|3504197893e-6456.94%PREDICTED: ufm1-specific protease 1-like [Bombus impatiens]
NCBI nr blastxgi|1107597372e-6554.93%PREDICTED: ufm1-specific protease 1-like [Apis mellifera]
Group
KEGG pathway 
InterPro domain[19-204] IPR0124627.5e-47Peptidase C78, ubiquitin fold modifier-specific peptidase 1/ 2
Orthology groupMCL16614 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210444-TA
ATGACATCAACACTTTTGAGTGATATCCATTGCCATCTTCAACACGATTTACAAAATTGTTTTTTAGTGAATGGAAAATATGACTATTATCATTATTTATGCGATGGGTTCGATGATCGGGGCTGGGGATGTGGCTACAGAACATTACAAACAATTTGTTCCTGGATGAATTATAATTTTGATAAACCATCAAAGGTTCCCAGTATCAGAGAAATTCAGAACATTCTAGTTGAGTTAGAAGACAAGCCTAAATCCTTTTTAAATTCTAGGCAATGGATTGGAAGTTTTGAAGTTTGTCTTGTAATAGATAAGTTGTATGGTGTTCCAAGTAAAATAGTACATGTAAAAAAAGAAGATAATTTAGAAATTATAGTGGAAATCTTAAAGAGTCATTTTGAAAAGTTCGGCAGCCCTATTATGATGGGAGGTGATGTGGATTGTTCATCGAAAGGTATAATGGGGGTTCTTGTGGATGGAAATAATTCAAAATTACTGGTTGTGGATCCCCATTACGTTGGTAAACAAAGTTCTAGAACTCTTCTTCAGAATAATGGTTGGGTCAAATGGCAGTCGTTAAATGACTTCTTAAGTTCCTCATTCTATAATTTATGCCTTCCTCAAGCAAAAGCAAAGAATAAATAA

Protein sequence:

>DPOGS210444-PA
MTSTLLSDIHCHLQHDLQNCFLVNGKYDYYHYLCDGFDDRGWGCGYRTLQTICSWMNYNFDKPSKVPSIREIQNILVELEDKPKSFLNSRQWIGSFEVCLVIDKLYGVPSKIVHVKKEDNLEIIVEILKSHFEKFGSPIMMGGDVDCSSKGIMGVLVDGNNSKLLVVDPHYVGKQSSRTLLQNNGWVKWQSLNDFLSSSFYNLCLPQAKAKNK-