Monarch geneset OGS2.0

DPOGS205820
TranscriptDPOGS205820-TA1251 bp
ProteinDPOGS205820-PA416 aa
Genomic positionDPSCF300081 - 552190-554644
RNAseq coverage1240x (Rank: top 10%)
Annotation
HeliconiusHMEL0099732e-7764.53% 
BombyxBGIBMGA009308-TA1e-13763.01% 
DrosophilaCG3925-PA1e-4531.36% 
EBI UniRef50UniRef50_E2C9262e-6033.15%Protein cereblon n=9 Tax=Formicidae RepID=E2C926_HARSA
NCBI RefSeqXP_395264.22e-6333.67%PREDICTED: similar to cereblon [Apis mellifera]
NCBI nr blastpgi|3287769974e-6233.67%PREDICTED: protein cereblon-like [Apis mellifera]
NCBI nr blastxgi|3071922722e-5832.62%Protein cereblon [Harpegnathos saltator]
Group
Gene OntologyGO:00065089e-08proteolysis
GO:00041769e-08ATP-dependent peptidase activity
KEGG pathway 
InterPro domain[55-279] IPR0159473.9e-11PUA-like domain
[59-278] IPR0031119e-08Peptidase S16, lon N-terminal
Orthology groupMCL11228 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205820-TA
ATGTTAATAGTGGCTTCCGACGGGGAAGAAGAGAGAAATGGTGAAGGTAATGGCGAATCTGAAGAGGAGCAATTTGATATTTCACTTGCAGCCTCACATTCGTACATGGGAAACAGCCTGGTAGCAGTGAGCGGTCGCTCAGTGTTGGAGGCGGGTTGGGTGGGCCGAGTTCCGGTGGCAGCACACCATGGCACAGTATTCCCTGGCGAAACGGTGCCCATGCTCCTCACCCATCCCCACGACGCCGCCATCATCACGCGTGCCATCAAGCATAACAAGCTATTTGGCCTGCTCTGTCCTGATGAAACTGGCAGCCTGGTGTCCGGGTACGGCGTCTTGTGCGAGGTGTTCGAGGCGGGAGTGGGTGCCGACGACGAGCCCCCGCGCACGCTCTCTTTTAAGGCACGCGCCACTCATCGCTTCCGCTGCCTCCACATGCCCAAGCATTCGGTGCCCATACACATGTACTCCAGGATGCGTGCCGTTGACGTGCGCGTGCTACCCGAAGTCCGACTCGGGGAGCCCCTGAGACACGCGCGACTCGCCAGCCTCGACACCCTAAGGCGGTCAACGTCGGACGGTCGCCTACGCTGTATGGATGCAGCCGTGACGCCGTGGCCGCTGTTCGTGTACGACATTTTCGACTACCGCCGCATGAGACGGATCATCGAGGACTACTTCAGGACCATGTCGCTAGAGAACCTGCCGGAGGAGGCGGTGTCTCTATCGTTCTGGACGGCGTCCAACTTGGCGCTGTCGGCGCGCGACCGCCTGGCCCTGTTCGTGGTGGACGACGCTCTACTCCGCCTGCACATGGAGGTGCGACTCATCACCAGGAAGAGCGTGTTGTGCTGCGCATCGTGCGCGACGGTGGTGGCTCGGCGGGAGGACATCTTCGCCATGTCGAGCGAGGGCGTGCATGCCAACTACAGCAACCCGGGCGGCTACATGCACGACATAGTGACGGTGTCCCGCGCCTCCAACACGGCCCCCGGCGGCGCGCCTTCCTCCGAGTTCTCGTGGTTCCCGGGCTACTCGTGGACGGTGGCGCTGTGCTCCTCGTGCACCTCGCACGTGGGCTGGAGGTTCGACGCGCGGCGCCGGACCCTCCGGCCGCAGCACTTCTACGGCCTGTGTCGCAACTTCGTGCGGCCGCGGTGCGACGGAGACTCCTCGCCCGCCTCGTCCCCCACACCCTCGGCCTCCCCGCCCGCCGCCGAGCCGGAGGACGAGTCGCTCCGAGACTCGTGA

Protein sequence:

>DPOGS205820-PA
MLIVASDGEEERNGEGNGESEEEQFDISLAASHSYMGNSLVAVSGRSVLEAGWVGRVPVAAHHGTVFPGETVPMLLTHPHDAAIITRAIKHNKLFGLLCPDETGSLVSGYGVLCEVFEAGVGADDEPPRTLSFKARATHRFRCLHMPKHSVPIHMYSRMRAVDVRVLPEVRLGEPLRHARLASLDTLRRSTSDGRLRCMDAAVTPWPLFVYDIFDYRRMRRIIEDYFRTMSLENLPEEAVSLSFWTASNLALSARDRLALFVVDDALLRLHMEVRLITRKSVLCCASCATVVARREDIFAMSSEGVHANYSNPGGYMHDIVTVSRASNTAPGGAPSSEFSWFPGYSWTVALCSSCTSHVGWRFDARRRTLRPQHFYGLCRNFVRPRCDGDSSPASSPTPSASPPAAEPEDESLRDS-