Monarch geneset OGS2.0

DPOGS215829
TranscriptDPOGS215829-TA3255 bp
ProteinDPOGS215829-PA1084 aa
Genomic positionDPSCF300073 + 287949-301930
RNAseq coverage327x (Rank: top 35%)
Annotation
HeliconiusHMEL0116390.092.68% 
BombyxBGIBMGA013548-TA0.092.68% 
DrosophilaCul-3-PF0.071.81% 
EBI UniRef50UniRef50_Q136180.074.43%Cullin-3 n=123 Tax=Eukaryota RepID=CUL3_HUMAN
NCBI RefSeqXP_625079.20.082.33%PREDICTED: similar to cullin 3 [Apis mellifera]
NCBI nr blastpgi|3071881050.080.89%Cullin-3 [Camponotus floridanus]
NCBI nr blastxgi|3071881050.080.76%Cullin-3 [Camponotus floridanus]
Group
Gene OntologyGO:00065119.1e-196ubiquitin-dependent protein catabolic process
GO:00316259.1e-196ubiquitin protein ligase binding
GO:00314619.1e-196cullin-RING ubiquitin ligase complex
KEGG pathwayame:5527000.0 
 K03869 (CUL3)maps-> Ubiquitin mediated proteolysis
InterPro domain[353-983] IPR0013739.1e-196Cullin, N-terminal
[349-704] IPR0161596.8e-105Cullin repeat-like-containing domain
[696-993] IPR0161581.3e-91Cullin homology
[1011-1078] IPR0195591.4e-37Cullin protein, neddylation domain
[983-1084] IPR0119913e-37Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL11898 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215829-TA
ATGATGAAGAGTACTTTACCTAAAGATAAACCTGGCAAAATGAGGATCCGAGCTTTCCCCATGACTATGGATGAGAAATATGTGGAGCGCATATGGAGTTTACTTAAGAATGCAATACAAGAAATACAAAAGAAAAATAACTCGGGGCTTTCATTTGAGGAGCTTTATAGAAATGCATACACCATGGTTCTTCACAAACATGGAGAAAGACTTTATACAGGATTAAAAGAAGTTGTTACACATCACCTTGAGACAAAGGTTCGTGAAGATGTCCTTCAGGCGCTCCACAATGGGTTTCTCCAGACATTAAACAATGCTTGGACTGATCACCAAACCAGTATGGTCATGATACGCGACATACTCATGTACATGGATAGAGTTTACGTACAGCAGAATGATGTAGATAATGTTTATAACTTGGGACTCATTATATTCAGGGACCAGGTTGCCCGCTACGGTTGCATCCGTGACCACCTGCGCCAGACCCTTCTAGAGCTGGTTGCTCGTGAGCGTCGCGGTGAGGTCGTGGACCGTCTCGCCATAAGGAACGCCTGCCAAATGCTCATGGTGGTCGGCATCAACTCGCGCACTGTCTACGAAGAGGACTTTGAGAAGCCCTTCCTGCATCAGTCCTCTGAATTTTATAGGATGGAATCTCAAAAGTTCCTAGCTGAGAACAGCGCGGCGGTGTACATAGCCCGTGTGGAGGCCCGCATCAGTGAGGAGGCGGAGCGTGCGCGGCACTACCTGGACGAGAGCACCGAGCCGCGGATAGTGGCCGTCCTCGAACACGAGCTCATAGAGAGACATATAAAGACCATTGTTGAGATGGAGAACTCTGGTGTTGTTCATATGTTGATGCACACTCGCACAGTGGAGTTGGCCTGTATGTACAAGTTATTGTCCCGTGTGGACGAAGGCCTGCGCACTGTAGCGGACGCCGTGTCAGCACACCTCAGGGAGCAAGGCCGCGCCCTCGTCACTGACACGCATAGCAACACTAATGCCATAGCATACGTGCAGATGACTATGGATGAGAAATATGTGGAGCGCATATGGAGTTTACTTAAGAATGCAATACAAGAAATACAAAAGAAAAATAACTCGGGGCTTTCATTTGAGGAGCTTTATAGAAATGCATACACCATGGTTCTTCACAAACATGGAGAAAGACTTTATACAGGATTAAAAGAAGTTGTTACACATCACCTTGAGACAAAGGTTCGTGAAGATGTCCTTCAGGCGCTCCACAATGGGTTTCTCCAGACATTAAACAATGCTTGGACTGATCACCAAACCAGTATGGTCATGATACGCGACATACTCATGTACATGGATAGAGTTTACGTACAGCAGAATGATGTAGATAATGTTTATAACTTGGGACTCATTATATTCAGGGACCAGGTTGCCCGCTACGGTTGCATCCGTGACCACCTGCGCCAGACCCTTCTAGAGCTGGTTGCTCGTGAGCGTCGCGGTGAGGTCGTGGACCGTCTCGCCATAAGGAACGCCTGCCAAATGCTCATGGTGGTCGGCATCAACTCGCGCACCGTCTACGAAGAGGACTTTGAGAAGCCCTTCCTGCATCAGTCCTCTGAATTTTATAGGATGGAATCTCAAAAGTTCCTAGCTGAGAACAGCGCGGCGGTGTACATAGCCCGTGTGGAGGCCCGCATCAGTGAGGAGGCGGAGCGTGCGCGGCACTACCTGGACGAGAGCACCGAGCCGCGGATAGTGGCCGTCCTCGAACACGAGCTCATAGAGAGACATATAAAGACCATTGTTGAGATGGAGAACTCTGGTGTTGTTCATATGTTGATGCACACTCGCACAGTGGAGTTGGCCTGTATGTACAAGTTATTGTCCCGTGTGGACGAAGGCCTGCGCACTGTAGCGGACGCCGTGTCAGCACACCTCAGGGAGCAAGGCCGCGCCCTCGTCACTGACACGCATAGCAACACTAATGCCATAGCATACGTGCAGAACCTACTCGACCTTAAAGATAGATTCGACCACTTCCTCCACAACTCGTTTAACAATGATAAGATATTTAAACACATGATTGCTTCAGACTTTGAGTACTTCCTCAACCTGAACAACAAATCCCCGGAGTTCCTATCATTATTCATTGATGGTAAACTCAAGAAAGGCGAAAAAGGCATGAGTGAACAAGAAATAGAGGCAGTCCTGGACAAAACGATGGTGCTTTTCCGTTTCCTTCAAGAGAAAGACGTGTTTGAGCGTTACTACAAACAGCATCTGGCCAAACGTTTGTTGCTCAATAAATCTGTCTCAGATGACAGCGAGAAAAACATGATCTCTAAACTGAAGACTGAGTGCGGATGTCAATTCACATCAAAGCTGGAAGGAATGTTCAAAGACATGACAGTCTCCAATACTATTATGGAAGAGTTCAAAGAGCATGTGCTTCAATCAGGGAACAACTTGCACGGCGTGGATCTGTCCGTGCGTGTGTTAACGACTGGTTTCTGGCCGACGCAGAGCGCGACGCCCAAGTGTAACATACCCACGGCGCCGAGGAGTGCCTTCGATGTGTTCAGATCGTTCTATCTCGCAAAGCACTCCGGTCGCCAGTTGTCCCTACAGCCTCAGCTGGGTAGTGCGGACCTCCACGCGACGTTCCGCGCGCCCTCTACCGGCAGTCCGCCCCGCTCCCCGCCCTCCGCCCCCGCCGCCCCCGCCGCCGTCCGCAGGCGCATCATACAAGTGTCCACCTTCCAGATGTGCGTGCTGCTACTGTTCAATAAACGCGAACGACTCACCTACGAGGAAATCCTGAACGAGACTGACATCCCTGAAAAAGATTTGGTAAGAGCATTACAGTCACTAGCGATGGGGAAACCGACACAGCGCGTTCTGATCAAACATCCCAAGACCAAGGAGATCGAACCGTCGCACCAGTTCTACGTAAACGACGCCTTCACCTCCAAGTTACATAGAGTTAAGATTCAGACAGTAGCAGCGAAAGGTGAATCAGAGCCGGAGCGTCGCGAGACACGCAACAAGGTGGACGAGGACCGGAAACACGAAATAGAGGCAGCCATTGTGAGGATCATGAAGGCCAGGAAGAAAATGGCGCACACGTTACTTGTGGCAGAGGTCACGGAACAGCTCCGCGTGCGGTTCCTCCCGTCTCCCGTCGTGATCAAGAAACGTATTGAGGGTCTCATAGAACGGGAATACCTCGCGCGCACTCCCGACGACCGCAAGGTCTACACATACGTCGCATAG

Protein sequence:

>DPOGS215829-PA
MMKSTLPKDKPGKMRIRAFPMTMDEKYVERIWSLLKNAIQEIQKKNNSGLSFEELYRNAYTMVLHKHGERLYTGLKEVVTHHLETKVREDVLQALHNGFLQTLNNAWTDHQTSMVMIRDILMYMDRVYVQQNDVDNVYNLGLIIFRDQVARYGCIRDHLRQTLLELVARERRGEVVDRLAIRNACQMLMVVGINSRTVYEEDFEKPFLHQSSEFYRMESQKFLAENSAAVYIARVEARISEEAERARHYLDESTEPRIVAVLEHELIERHIKTIVEMENSGVVHMLMHTRTVELACMYKLLSRVDEGLRTVADAVSAHLREQGRALVTDTHSNTNAIAYVQMTMDEKYVERIWSLLKNAIQEIQKKNNSGLSFEELYRNAYTMVLHKHGERLYTGLKEVVTHHLETKVREDVLQALHNGFLQTLNNAWTDHQTSMVMIRDILMYMDRVYVQQNDVDNVYNLGLIIFRDQVARYGCIRDHLRQTLLELVARERRGEVVDRLAIRNACQMLMVVGINSRTVYEEDFEKPFLHQSSEFYRMESQKFLAENSAAVYIARVEARISEEAERARHYLDESTEPRIVAVLEHELIERHIKTIVEMENSGVVHMLMHTRTVELACMYKLLSRVDEGLRTVADAVSAHLREQGRALVTDTHSNTNAIAYVQNLLDLKDRFDHFLHNSFNNDKIFKHMIASDFEYFLNLNNKSPEFLSLFIDGKLKKGEKGMSEQEIEAVLDKTMVLFRFLQEKDVFERYYKQHLAKRLLLNKSVSDDSEKNMISKLKTECGCQFTSKLEGMFKDMTVSNTIMEEFKEHVLQSGNNLHGVDLSVRVLTTGFWPTQSATPKCNIPTAPRSAFDVFRSFYLAKHSGRQLSLQPQLGSADLHATFRAPSTGSPPRSPPSAPAAPAAVRRRIIQVSTFQMCVLLLFNKRERLTYEEILNETDIPEKDLVRALQSLAMGKPTQRVLIKHPKTKEIEPSHQFYVNDAFTSKLHRVKIQTVAAKGESEPERRETRNKVDEDRKHEIEAAIVRIMKARKKMAHTLLVAEVTEQLRVRFLPSPVVIKKRIEGLIEREYLARTPDDRKVYTYVA-