Monarch geneset OGS2.0

DPOGS210878
Transcript          DPOGS210878-TA    864 bp
Protein             DPOGS210878-PA    287 aa
Genomic position    DPSCF300027 + 1388817-1393047
RNAseq coverage     679x (Rank: top 19%)
Annotation
Heliconius        HMEL020147          1e-76     64.04%
Bombyx            BGIBMGA007003-TA    1e-123    89.91%
Drosophila        CHIP-PA             2e-122    69.69%
EBI UniRef50      UniRef50_Q9XYW6     2e-120    69.69%    CHIP n=22 Tax=Bilateria RepID=Q9XYW6_DROME
NCBI RefSeq       XP_002426063.1      2e-131    75.61%    STIP1 homology and u box-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|242010626        3e-130    75.61%    STIP1 homology and u box-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastx    gi|242010626        8e-128    75.61%    STIP1 homology and u box-containing protein, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0000151    4.3e-29    ubiquitin ligase complex
                  GO:0016567    4.3e-29    protein ubiquitination
                  GO:0004842    4.3e-29    ubiquitin-protein ligase activity
                  GO:0005488    5.8e-27    binding
KEGG pathway      phu:Phum_PHUM234540    5e-131
                  K09561 (STUB1, CHIP) maps-> Ubiquitin mediated proteolysis
                                              Protein processing in endoplasmic reticulum
InterPro domain   [210-283]    IPR003613    4.3e-29    U box domain
                  [15-114]     IPR011990    5.8e-27    Tetratricopeptide-like helical
                  [212-272]    IPR013083    8.4e-21    Zinc finger, RING/FYVE/PHD-type
Orthology group   MCL12487     Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210878-TA
ATGAGTAAGCATATGTATTCCACCGCTAACTTGACCGATAAAGAATTGAAAGAACAAGGAAACAGGTTGTTTAGTTTGAGAAGATTCGAAGATGCTATGAACTGTTACACGAAGGCTATCATAAAAAATCCATCTGTAGCCACATACTTTACAAATAGAGCGTTATGTCACCTCAAGATGAAGAGATGGGAAGCTACTTGTCAGGATTGCCGAAGAGCCCTGGATATAGACAACAACCAGGTGAAAGGTCATTTCTTTCTTGGCCAGGCCTTGGTTGAGTTGGACTGCTATGATGAAGCTATCAAACACTTACATAGAGCCAATGATCTAGCAAGAGATCAAAAGCTGAATTTCGGCGATGACATAGCAGCTCAGATAAGAATTGCAAGGAAGAAGAGATGGAATGTACAGGAGGAGAAGAGAATATCACAGGAGATTGAACTGCAGACTTATTTAAATAGGCTTATAAATGAGGACATGCAACGTAGAGTAGAATCAATTAAAATAGAAAACATTAATGAAGAAGACACCAATAGCAAAATAGCTAAAGTGGAAGAAGAATGCAACAACTATAGCAGTGAACTTAACAATCTATTTTCAAAAATGGATGAACGGAGGAGGAAACGCGATGTACCAGACTATCTATGCGGCAAAATAAGCTTCGAGATTCTGAACGAGCCTGTCATCACGCCTAGCGGGATAACCTATGAGAAGAAAGACATTGAGGAGCATCTCGAGCGCGTCGGCCATTTCGATCCTGTGACACGTGTGAAGTTAACAGCGGATCAACTCATACCTAACTTCACTATGAAGGAGGTTGTGGACGCCTTCCTCCAGGACAACGAGTGGGCGCTCGATTATTAA

Protein sequence:

>DPOGS210878-PA
MSKHMYSTANLTDKELKEQGNRLFSLRRFEDAMNCYTKAIIKNPSVATYFTNRALCHLKMKRWEATCQDCRRALDIDNNQVKGHFFLGQALVELDCYDEAIKHLHRANDLARDQKLNFGDDIAAQIRIARKKRWNVQEEKRISQEIELQTYLNRLINEDMQRRVESIKIENINEEDTNSKIAKVEEECNNYSSELNNLFSKMDERRRKRDVPDYLCGKISFEILNEPVITPSGITYEKKDIEEHLERVGHFDPVTRVKLTADQLIPNFTMKEVVDAFLQDNEWALDY-
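
Consistency check (illustrative sketch):

The record lists an 864 bp transcript and a 287 aa protein, which agree if the transcript is a complete coding sequence with one stop codon: 3 x (287 + 1) = 864. The Python sketch below shows one way such a check could be done. It assumes the standard genetic code (NCBI translation table 1) and writes the stop codon as "-", as in the protein sequence above. The function names `translate` and `lengths_consistent` are hypothetical helpers, not part of any database tool. Only the first 18 and last 21 nucleotides of DPOGS210878-TA are used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol. Codons containing characters
    other than A, C, G, T (for example N) raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 * (protein length + 1 stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 18 and last 21 nucleotides of DPOGS210878-TA, copied from the record.
first_codons = translate("ATGAGTAAGCATATGTAT")
last_codons = translate("GAGTGGGCGCTCGATTATTAA")

print(first_codons)                   # MSKHMY
print(last_codons)                    # EWALDY-
print(lengths_consistent(864, 287))   # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the listed protein sequence extends the same check to the whole record.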