Monarch geneset OGS2.0

DPOGS215697
Transcript: DPOGS215697-TA (3279 bp)
Protein: DPOGS215697-PA (1092 aa)
Genomic position: DPSCF300041 - 321757-331179
RNAseq coverage: 556x (Rank: top 23%)
Annotation
Heliconius: HMEL009653; 0.0; 96.26%
Bombyx: BGIBMGA005758-TA; 0.0; 81.07%
Drosophila: Cul-4-PA; 0.0; 63.99%
EBI UniRef50: UniRef50_E2AII3; 0.0; 67.96%; Cullin-4B n=2 Tax=Coelomata RepID=E2AII3_CAMFO
NCBI RefSeq: XP_392800.3; 0.0; 71.09%; PREDICTED: similar to cullin 4B [Apis mellifera]
NCBI nr blastp: gi|307199383; 0.0; 70.80%; Cullin-4B [Harpegnathos saltator]
NCBI nr blastx: gi|307199383; 0.0; 70.80%; Cullin-4B [Harpegnathos saltator]
Group
Gene Ontology:
  GO:0006511; 1.6e-123; ubiquitin-dependent protein catabolic process
  GO:0031625; 1.6e-123; ubiquitin protein ligase binding
  GO:0031461; 1.6e-123; cullin-RING ubiquitin ligase complex
KEGG pathway: ame:409279; 0.0
  K10609 (CUL4) maps to:
    Ubiquitin mediated proteolysis
    Nucleotide excision repair
InterPro domain:
  [31-480] IPR001373; 1.6e-123; Cullin, N-terminal
  [27-373] IPR016159; 1.6e-99; Cullin repeat-like-containing domain
  [778-1004] IPR016158; 3.3e-74; Cullin homology
  [994-1092] IPR011991; 4.9e-37; Winged helix-turn-helix transcription repressor DNA-binding
  [1021-1086] IPR019559; 3.7e-35; Cullin protein, neddylation domain
Orthology group: MCL12049; Multiple-copy universal gene
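
The coordinates in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: it assumes the InterPro ranges and the genomic position are 1-based and inclusive, and that the 3279 bp transcript length counts the stop codon. Neither assumption is stated in the record. The helper names (parse_domain, domain_lengths) are hypothetical and not part of any database tooling.

```python
import re

# Values copied from the record above.
PROTEIN_LENGTH_AA = 1092
TRANSCRIPT_LENGTH_BP = 3279
GENOMIC_START, GENOMIC_END = 321757, 331179

# InterPro rows as listed in the record: "[start-end] accession".
INTERPRO_ROWS = [
    "[31-480] IPR001373",
    "[27-373] IPR016159",
    "[778-1004] IPR016158",
    "[994-1092] IPR011991",
    "[1021-1086] IPR019559",
]


def parse_domain(row):
    """Split a '[start-end] accession' row into (accession, start, end)."""
    match = re.match(r"\[(\d+)-(\d+)\]\s+(IPR\d+)", row)
    if match is None:
        raise ValueError(f"unrecognised domain row: {row!r}")
    return match.group(3), int(match.group(1)), int(match.group(2))


def domain_lengths(rows, protein_length):
    """Return {accession: length in aa}, assuming 1-based inclusive ranges."""
    lengths = {}
    for row in rows:
        accession, start, end = parse_domain(row)
        if not (1 <= start <= end <= protein_length):
            raise ValueError(f"{accession} lies outside the protein")
        lengths[accession] = end - start + 1
    return lengths


lengths = domain_lengths(INTERPRO_ROWS, PROTEIN_LENGTH_AA)

# Assumption: the transcript length includes one stop codon.
codons = TRANSCRIPT_LENGTH_BP // 3

# Assumption: genomic coordinates are inclusive on both ends.
genomic_span = GENOMIC_END - GENOMIC_START + 1

for accession, length in lengths.items():
    print(accession, length)
print("codons:", codons, "genomic span (bp):", genomic_span)
```

Under these assumptions, every listed domain falls within the 1092 aa protein, and the codon count is one more than the protein length, which is consistent with a terminal stop codon.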

Nucleotide sequence:

>DPOGS215697-TA
ATGAATAAACCTGGCGCAACAACAAAGAAACTAGTTATTAAAAACTTTAAAAGTAAACCGAACCTTCCGGAAAATTATCAAGAAACAACATGGAGCAAATTACGAGAGGCTGTTATAGCTATACAAACGTCGAAGGCAATCGCCTATTCCTTAGAAGAATTATATCAAGCAGTTGAAAATATGTGTAGCCATAAGATGGCGTCTCAATTGTATGTTAATTTGACAAACTTAGTGGAGGCCCACGTGAAATCAAACATTGAGCAGTTCCTGTCGGAGAGCATGGATCGCCAAGTGTTTCTCAAACGTATGGACGACTGTTGGCGGGCTCACTGTCGACAAATGATCATGATCAGGAGCATCTTCCTGTACCTGGACCGTACTTATGTTCTCCAAAACCCTAGCATACATTCTATATGGGACATGGGTCTAGATCTGTTCCGGCATCATATAGCTATGAACACTCTGATACAGACTCGCACTGTTGATGGACTGTTGACATTGATAGAACGGGAAAGAGGGGGAGATGCCGTGGACATCTCCCTGCTGAAGAGTTTATTGAGGATGCTGTCCGACCTTCAGATATACCAGGATGCCTTTGAACACAAATTCCTGCAGGCCACAGAGCGTCTGTACTGCGCGGAGGGCCAGCGTCTGATGCGAGAGTTAGCAGTGCCGCAGTATCTGGCACACGTGGAGAAGAGACTCAGGGAGGAGAACGAGCGGCTCCTGCACTATCTGGACCCCTGTACCAAATGGCAGCTCATCCATACGGTGGAGCGTCAGTTGTTGAGCGAGCATGTCAGCGGTGTACTCAGCAAGGGACTCGAGTCGCTTATGGACGGGCCGCGCCTCAGAGACCTCGCCACCCTATACTCACTGTTCAGCCGAGTCAAGGACGGACTCACTGAGCTGTGTAATCACTTTAATGCGTACATTAAGAAAAAAGGTCGTACCATAGTCATCGAGCCGGAGCGTGACAAGACGATGGTAGCGGAACTGTTGGAATTCAAAGAGCAGCTGGACAATGTTGTGAGCACGTGCTTCCAGAGGAACGACCGGTTCCTGTACTCCATGAGAGAGGCCTTCGAGCACTTCATCAACCAGAGACAGAATAAACCGGCTGAGCTCATTGCCAAATTCGTCGATCTCAAACTGAGAGCCGGCAACAAAGAGGCGACGGAGGAAGAATTAGAAAGACTGCTGGACAAAATAATGGTTCTGTTCCGTTTTATACACGGGAAGGATGTGTTCGAGGCATTCTACAAGAAGGATCTAGCAAAGAGGTTGTTGGTGGGCAAGTCGGCCTCCGTGGACGCGGAGAAGTCCATGTTAAGCAAACTGAAGCAGGAGTGTGGAGGGGGCTTCACCTGCAAGTTAGAAGGCATGTTCAAAGACATGGAACTGTCAAAGGATATTAATATTACATACAAGCAGATGGCGTCTCAATTGTATGTTAATTTGACAAACTTAGTGGAGGCCCATGTGAAATCAAACATTGAGCAGTTCCTGTCGGAGAGCATGGATCGCCAAGTGTTTCTCAAACGTATGGACGACTGTTGGCGGGCTCACTGTCGACAAATGATCATGATCAGGAGCATCTTCCTGTATCTGGACCGGACTTATGTTCTCCAAAACCCTAGCATACATTCTATATGGGACATGGGTCTAGATCTGTTCCGGCATCATATAGCTATGAACACTCTGATACAGACTCGCACTGTTGATGGACTGTTGACATTGATAGAACGGGAAAGAGGGGGAGATGCCGTGGACATCTCCCTGCTGAAGAGTTTATTGAGGATGCTGTCCGACCTTCAGATATACCAGGATGCCTTTGAACACAAATTCCTGCAGGCCACAGAGCGTCTGTACTGCGCGGAGGGCCAGCGTCTGATGCGAGAGTTAGCAGTGCCGCAGTATCTGGCACACGTGGAGAAGAGACTCAGGGAGGAGAACGAGCGGCTCCTGCACTATCTGGACCCCTGTACCAAATGGCAGCT
CATCCATACGGTGGAGCGTCAGTTGTTGAGCGAGCATGTCAGCGGTGTACTCAGCAAGGGACTCGAGTCGCTTATGGACGGGCCGCGCCTCAGAGACCTCGCCACCCTATACTCACTGTTCAGCCGAGTCAAGGACGGACTCACTGAGCTGTGTAACCACTTTAATGCGTACATTAAGAAAAAAGGTCGAACCATAGTCATCGAGCCGGAGCGTGACAAGACGATGGTAGCGGAACTGTTGGAATTCAAAGAGCAGCTGGACAATGTTGTGAGCACGTGCTTCCAGAGGAACGACCGGTTCCTGTACTCCATGAGAGAGGCCTTCGAGCACTTCATCAACCAGAGACAGAATAAACCGGCTGAGCTCATTGCCAAATTCGTCGATCTCAAACTGAGAGCCGGCAACAAAGAGGCGACGGAGGAAGAATTAGAAAGACTGCTGGACAAAATAATGGTTCTATTCCGTTTTATACACGGGAAGGATGTGTTCGAGGCATTCTACAAGAAGGATCTAGCAAAGAGGTTGTTGCATCTATCAGCGACCAGCGAGGGCGGGGGGCTCGAGCTGTCCGTGTACATCCTGACCATGGGTTTCTGGCCGACGTACGCGGCCGTGGACGTGCGGCTGCCGGGAGAACTCACCCGCCACCAGGAACACTTCGCCAAATTCTACCTCGCCAAGCACTCCGGCAGGAAGCTACAGTGGCAGGCGACGCTGGGACACTGTGTACTGAGAGCGCACTTCACACAGGGTAACAAAGAACTTCAGGTCTCGTTGTTCCAAGCGCTGGTTCTGCTACTCTTCAATGATGGAGACAATCTCTCCTTTGAAGACATTAAGACTGCCACTAACATCGAGGAGGGGGAGCTGCGCCGCACTCTCCAGTCGCTGGCTTGTGGTAAGGCGCGCGTGCTGATGAAGACCCCTCGGGGGAGGGACGTGCAGGACCGGGATCACTTCGCCTTCAACGGGGACTTCACCAACAAGCTGTTCCGCATCAAGATCAACCAGATACAGATGAAGGAGACTAGCGAGGAACAGAAGGCCACCGAGGAGCGAGTGTTCCAAGATCGTCAGTATCAGATAGACGCGGCCATTGTGCGCGTCATGAAGATGAGGAAGGCTCTCTCACACAACCTCCTCATATCCGAACTATACAACCAGCTCAAATTTCCCGTCAAGCCGGGGGACCTCAAGAAGCGGATAGAGTCCCTCATCGACCGCGACTACATGGAGCGAGACAAGGACAACCCCAACCAGTACAACTACGTCGCGTAA

Protein sequence:

>DPOGS215697-PA
MNKPGATTKKLVIKNFKSKPNLPENYQETTWSKLREAVIAIQTSKAIAYSLEELYQAVENMCSHKMASQLYVNLTNLVEAHVKSNIEQFLSESMDRQVFLKRMDDCWRAHCRQMIMIRSIFLYLDRTYVLQNPSIHSIWDMGLDLFRHHIAMNTLIQTRTVDGLLTLIERERGGDAVDISLLKSLLRMLSDLQIYQDAFEHKFLQATERLYCAEGQRLMRELAVPQYLAHVEKRLREENERLLHYLDPCTKWQLIHTVERQLLSEHVSGVLSKGLESLMDGPRLRDLATLYSLFSRVKDGLTELCNHFNAYIKKKGRTIVIEPERDKTMVAELLEFKEQLDNVVSTCFQRNDRFLYSMREAFEHFINQRQNKPAELIAKFVDLKLRAGNKEATEEELERLLDKIMVLFRFIHGKDVFEAFYKKDLAKRLLVGKSASVDAEKSMLSKLKQECGGGFTCKLEGMFKDMELSKDINITYKQMASQLYVNLTNLVEAHVKSNIEQFLSESMDRQVFLKRMDDCWRAHCRQMIMIRSIFLYLDRTYVLQNPSIHSIWDMGLDLFRHHIAMNTLIQTRTVDGLLTLIERERGGDAVDISLLKSLLRMLSDLQIYQDAFEHKFLQATERLYCAEGQRLMRELAVPQYLAHVEKRLREENERLLHYLDPCTKWQLIHTVERQLLSEHVSGVLSKGLESLMDGPRLRDLATLYSLFSRVKDGLTELCNHFNAYIKKKGRTIVIEPERDKTMVAELLEFKEQLDNVVSTCFQRNDRFLYSMREAFEHFINQRQNKPAELIAKFVDLKLRAGNKEATEEELERLLDKIMVLFRFIHGKDVFEAFYKKDLAKRLLHLSATSEGGGLELSVYILTMGFWPTYAAVDVRLPGELTRHQEHFAKFYLAKHSGRKLQWQATLGHCVLRAHFTQGNKELQVSLFQALVLLLFNDGDNLSFEDIKTATNIEEGELRRTLQSLACGKARVLMKTPRGRDVQDRDHFAFNGDFTNKLFRIKINQIQMKETSEEQKATEERVFQDRQYQIDAAIVRVMKMRKALSHNLLISELYNQLKFPVKPGDLKKRIESLIDRDYMERDKDNPNQYNYVA-
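
A transcript and protein FASTA pair like the one above can be sanity-checked for basic consistency. The following Python sketch is a minimal illustration, not the database's own validation: the function names (read_fasta, check_cds) are hypothetical, the toy records are made-up stand-ins (only their first four codons mirror the start of DPOGS215697-TA), and a trailing "-" or "*" in the protein is assumed to mark the stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def read_fasta(text):
    """Return (header, sequence) for a single-record FASTA string.

    Wrapped sequence lines are joined, so a sequence split across
    several lines is read as one string.
    """
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    return lines[0][1:], "".join(lines[1:]).upper()


def check_cds(cds, protein):
    """Run simple consistency checks between a CDS and its protein."""
    # Assumption: a trailing '-' or '*' marks the stop and is not a residue.
    protein = protein.rstrip("-*")
    return {
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        "length_matches_protein": len(cds) == 3 * (len(protein) + 1),
    }


# Toy records for illustration only; the CDS is wrapped over two lines.
toy_cds_fasta = ">toy-TA\nATGAATAAA\nCCTTAA\n"
toy_protein_fasta = ">toy-PA\nMNKP-\n"

_, toy_cds = read_fasta(toy_cds_fasta)
_, toy_protein = read_fasta(toy_protein_fasta)
report = check_cds(toy_cds, toy_protein)

for name, passed in report.items():
    print(name, passed)
```

To apply the same checks to this record, the two FASTA blocks above would be passed to read_fasta in place of the toy strings.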