Monarch geneset OGS2.0

DPOGS202205
TranscriptDPOGS202205-TA1335 bp
ProteinDPOGS202205-PA444 aa
Genomic positionDPSCF300149 - 233889-236555
RNAseq coverage2259x (Rank: top 5%)
Annotation
HeliconiusHMEL0092070.097.52% 
BombyxBGIBMGA013499-TA0.097.52% 
Drosophilaalien-PB0.084.15% 
EBI UniRef50UniRef50_F4X7N40.088.51%COP9 signalosome complex subunit 2 n=9 Tax=Eukaryota RepID=F4X7N4_ACREC
NCBI RefSeqXP_001607475.10.090.32%PREDICTED: similar to cop9 signalosome complex subunit [Nasonia vitripennis]
NCBI nr blastpgi|3454855210.090.32%PREDICTED: COP9 signalosome complex subunit 2-like isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|3454855210.090.32%PREDICTED: COP9 signalosome complex subunit 2-like isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00055156.3e-24protein binding
GO:00054883.5e-07binding
KEGG pathway 
InterPro domain[170-344] IPR0131434.5e-63PCI/PINT associated module
[311-412] IPR0007176.3e-24Proteasome component (PCI) domain
[341-415] IPR0119912.5e-17Winged helix-turn-helix transcription repressor DNA-binding
[40-265] IPR0119903.5e-07Tetratricopeptide-like helical
Orthology groupMCL14196 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202205-TA
ATGTCCGACCATGACGACGACTATATGTGTGAAGAGGAGGAAGACTATGGCTTGGAATATTCGGAGGACAGTAATTCAGAACCCGATGTGGACCTTGAAAACCAGTATTACAATAGCAAAGCCCTAAAAGAAGACATGCCATTAGCAGCTCTCTTGAGTTTTCAAAAAGTTCTGGAGTTGGAGGGGGGAGACAAGGGCGAGTGGGGATTCAAGGCTCTCAAACAAATGATTAAGATAAACTTTAAACTGAGCAACTTCACGGAAATGATGGCAAGATACAAACAGCTTCTCACATACATCAAGAGTGCTGTAACAAGAAATCACTCGGAGAAATCAATCAACTCTATCCTTGATTATATTTCTACTTCTAGAAATATGGAGCTCCTGCAGGATTTTTATGAGACAACATTAGAAGCATTAAAAGATGCCAAAAATGACCGTTTATGGTTCAAAACAAACACCAAACTAGGCAAGCTGTATTATGACCGTGGAGATTTTAATAAATTAGCCAAGATATTGAAGCAATTGCATCAGAGTTGCCAGACCGATGAGGGCGAAGATGATCTCAAGAAAGGCACCCAACTATTAGAAATATATGCACTCGAAATCCAAATGTATACAGCACAAAAAAATAATAAGAAATTAAAGGCACTATATGAACAATCGTTGCATATTAAGTCAGCAATACCACATCCCCTCATAATGGGTGTTATAAGAGAGTGTGGAGGTAAGATGCATCTCAGGGAGGGTGAGTTTGAAAAGGCCCATACAGATTTCTTTGAAGCTTTCAAGAACTATGACGAATCCGGCAGCCCACGACGAACAACCTGCCTTAAATATTTGGTACTAGCCAACATGCTCATGAAGTCCGGCATCAATCCATTTGATTCTCAGGAAGCGAAACCCTACAAGAACGATCCAGAAATATTAGCAATGACCAACCTTGTGATGGCGTACCAGAACAACGATATAAATGACTTCGAGTCCATCCTGAAACACAATAGAAACAACATAATGGATGATCCCTTCATCAGGGAACACATCGAAGATCTCCTAAGAAATATTAGAACTCAAGTCCTCATCAAACTGATCGGTCCGTACACTAGGATCCACATACCGTTCATATCTAAGGAGCTCAATATTGATGAGAAAGAAGTTGAGAATTTATTAGTCACCTGTATTTTGGATAACACCATCAGCGGTCGCATCGACCAAGTGAACAGCGTGTTGGAGTTAGCCCGTGGTGCGAGGGACGCAGCTCGGTACACCGCCCTGGACAAGTGGACCGCGCAGTTGGCAGCCCTGCACCTCGCCCTGGCCAACAAGATGGCCTGA

Protein sequence:

>DPOGS202205-PA
MSDHDDDYMCEEEEDYGLEYSEDSNSEPDVDLENQYYNSKALKEDMPLAALLSFQKVLELEGGDKGEWGFKALKQMIKINFKLSNFTEMMARYKQLLTYIKSAVTRNHSEKSINSILDYISTSRNMELLQDFYETTLEALKDAKNDRLWFKTNTKLGKLYYDRGDFNKLAKILKQLHQSCQTDEGEDDLKKGTQLLEIYALEIQMYTAQKNNKKLKALYEQSLHIKSAIPHPLIMGVIRECGGKMHLREGEFEKAHTDFFEAFKNYDESGSPRRTTCLKYLVLANMLMKSGINPFDSQEAKPYKNDPEILAMTNLVMAYQNNDINDFESILKHNRNNIMDDPFIREHIEDLLRNIRTQVLIKLIGPYTRIHIPFISKELNIDEKEVENLLVTCILDNTISGRIDQVNSVLELARGARDAARYTALDKWTAQLAALHLALANKMA-