Monarch geneset OGS2.0

DPOGS209711
Transcript | DPOGS209711-TA | 1230 bp
Protein | DPOGS209711-PA | 409 aa
Genomic position | DPSCF300105 - 386238-389676
RNAseq coverage | 680x (Rank: top 19%)
Annotation
Heliconius | HMEL011341 | 0.0 | 89.97%
Bombyx | BGIBMGA008922-TA | 6e-117 | 93.30%
Drosophila | CSN4-PB | 2e-176 | 73.75%
EBI UniRef50 | UniRef50_Q9BT78 | 3e-177 | 74.01% | COP9 signalosome complex subunit 4 n=47 Tax=Coelomata RepID=CSN4_HUMAN
NCBI RefSeq | XP_001607868.1 | 0.0 | 81.48% | PREDICTED: similar to cop9 complex subunit [Nasonia vitripennis]
NCBI nr blastp | gi|307172336 | 0.0 | 80.98% | COP9 signalosome complex subunit 4 [Camponotus floridanus]
NCBI nr blastx | gi|307172336 | 0.0 | 80.98% | COP9 signalosome complex subunit 4 [Camponotus floridanus]
Group
Gene Ontology | GO:0005515 | 9.4e-17 | protein binding
KEGG pathway
InterPro domain | [295-368] | IPR011991 | 1.2e-23 | Winged helix-turn-helix transcription repressor DNA-binding
InterPro domain | [270-365] | IPR000717 | 9.4e-17 | Proteasome component (PCI) domain
Orthology group | MCL13846 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209711-TA
ATGCCTCTTAATTTAGCAGGCGTTCGACAATACCTAAGTGATTTGAGAAATTCGGGAGGACTACACAAAGATCAGGCTGAAAAATACCGCAATGTATTACTTGAAATTTTAAAAAATCCTGAAGGAGAACTAGCAGAGTGCTTAAAGGCATTTATAGAAGCGATTGTAAATGAGAATGTAAGCCTGGTGATATCCAGGCAACTGCTTACCGATGTGAGCACTCATCTTGCCTTGCTACCAGATAATGTGTCACAGGAGGTCTCACACTTTGCTTTGGATGTCATTCAACCAAGAGTTATATCTTTTGAGGAACAGGTGGCCAGTATTAGACAACATCTAGCAGATATATATGAAAGAAATCAAAATTGGAAAGAAGCAGCCAATGTTCTTGTTGGTATTCCATTGGAGACTGGACAGAAACAATATTCAGTGGATTATAAGTTGGAGACATACTTAAAGATAGCTCGGCTATATCTTGAAGTGGATGACCCAGTACAGGCAGAGGCTTTTGTAAACAGAGCTTCATTGCTACAAGCTGAAACAACCAATGAGCAGTTGCAGATATATTATAAAGTCTGTTACGCAAGAGTGTTAGATTATAGGAGAAAATTCATTGAAGCAGCTCAGAGGTATAATGAACTGTCTTACCGTAACATAATACATGAGGATGAAAGGATGACATGTCTTAGGAATGCACTTATATGCACGGTGCTAGCATCGGCTGGTCAGCAAAGATCGCGAATGCTGGCAACATTGTTCAAAGACGAACGCTGCCAACAATTGCCCGCATATTCCATATTAGAAAAAATGTACCTGGATCGCATCATTCGACGCTCAGAACTTCATGAGTTCGAAGCTCTAATGCAGACTCACCAAAAAGCGACGATGTCAGACGGGTCCACTATTTTGGACCGAGCTGTGTTCGAACACAACTTACTGTCAGCCAGCAAACTGTACAACAACATAACGTTCGAGGAGCTGGGAGCTCTACTAGAGACACCTCCAGCACGAGCTGAGAGAATCGCCTCACACATGATCAGTGAAGGAAGAATGAATGGATACATTGACCAGATCAGCGCTGTTGTGCACTTTGAAACTCGTGAAATTCTTCCGCAATGGGACAAACAAATCCAAAGTCTCTGCTACCAAGTTAACGGGCTGATTGAACAAATCGCTGCCGCAGAACCAGAGTGGATGGCCAAGCTCATGGAAGAAGAGATGATTCAATGA

Protein sequence:

>DPOGS209711-PA
MPLNLAGVRQYLSDLRNSGGLHKDQAEKYRNVLLEILKNPEGELAECLKAFIEAIVNENVSLVISRQLLTDVSTHLALLPDNVSQEVSHFALDVIQPRVISFEEQVASIRQHLADIYERNQNWKEAANVLVGIPLETGQKQYSVDYKLETYLKIARLYLEVDDPVQAEAFVNRASLLQAETTNEQLQIYYKVCYARVLDYRRKFIEAAQRYNELSYRNIIHEDERMTCLRNALICTVLASAGQQRSRMLATLFKDERCQQLPAYSILEKMYLDRIIRRSELHEFEALMQTHQKATMSDGSTILDRAVFEHNLLSASKLYNNITFEELGALLETPPARAERIASHMISEGRMNGYIDQISAVVHFETREILPQWDKQIQSLCYQVNGLIEQIAAAEPEWMAKLMEEEMIQ-