Monarch geneset OGS2.0

DPOGS208908
TranscriptDPOGS208908-TA2391 bp
ProteinDPOGS208908-PA796 aa
Genomic positionDPSCF300009 - 558389-565057
RNAseq coverage913x (Rank: top 14%)
Annotation
HeliconiusHMEL0146710.086.73% 
BombyxBGIBMGA002481-TA0.078.07% 
DrosophilaCG6905-PB0.066.88% 
EBI UniRef50UniRef50_Q994590.063.83%Cell division cycle 5-like protein n=68 Tax=Coelomata RepID=CDC5L_HUMAN
NCBI RefSeqXP_001660095.10.069.88%cell division control protein [Aedes aegypti]
NCBI nr blastpgi|3123814480.068.63%hypothetical protein AND_06250 [Anopheles darlingi]
NCBI nr blastxgi|910917820.069.69%PREDICTED: similar to cell division control protein [Tribolium castaneum]
Group
Gene OntologyGO:00055151.9e-22protein binding
GO:00036771.8e-15DNA binding
GO:00063551.8e-15regulation of transcription, DNA-dependent
KEGG pathwayaag:AaeL_AAEL0094690.0 
 K12860 (CDC5L, CDC5, CEF1)maps-> Spliceosome
InterPro domain[1-703] IPR0154959.3e-267Myb transcription factor
[325-706] IPR0217865.7e-99Protein of unknown function DUF3351
[33-118] IPR0090571.9e-22Homeodomain-like
[6-60] IPR0122871.8e-15Homeodomain-related
[7-56] IPR0010057.7e-15SANT domain, DNA binding
[9-54] IPR0147783e-12Myb, DNA-binding
Orthology groupMCL12163 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208908-TA
ATGCCGCGTATTATGATAAAAGGTGGCGTTTGGCGCAACACCGAGGATGAAATTCTTAAAGCGGCGGTTATGAAATATGGGAAAAATCAATGGTCACGAATTGCTTCTTTGCTCCATCGTAAATCTGCTAAACAGTGTAAAGCTCGTTGGTATGAATGGCTGGATCCCAGCATCAAGAAGACTGAGTGGTCTCGAGAGGAAGATGAGAAGCTCTTGCACCTGGCCAAGTTGATGCCAACACAATGGAGGACCATTGCTCCCATTATTGGTCGCACTGCAGCACAGTGTCTTGAACGATATGAATATCTACTTGATCAAGCTCAAAAAAAGGAAGAAGGTGAAGATGTTGGTGATGATCCGAGGAAGCTCAAGCCGGGTGAAATTGACCCTAACCCAGAAACAAAACCAGCCAGACCTGACCCCAAGGATATGGATGAAGATGAATTGGAAATGCTGTCAGAAGCTAGGGCTCGTCTCGCCAACACTCAGGGTAAAAAGGCAAAGAGAAAGGCTCGTGAGAAACAATTAGAAGAGGCTCGGCGACTTGCAGCCCTACAAAAGAGGAGGGAGTTGAGTGCTGCTGGCATATCTGTGCCTATAAGGCGTAAAAAGAAACGTGGTGTGAATTACAATTCCGAAATTCCATTTGAAAAGAAACCAGCTGCCGGTTTTTATGATACCTCCACGGAGGTCGTTGATCCTATGGCACCGGATTTCTCTAGACTGAGACAGCAACACCTTGATGGGGAGCTACTGTCTGAGAAAGAAGAGAGGGATCGCCGCAAAGACAAACAGAAGCTGAAACAGCGTAAAGAAAACGATGTGCCACAAGCTATGCTGCAGGGTGATCAGCCAGCGAGGAAACGCAGCAAACTAGTGCTGCCTGAACCACAAGTTACTGATCAGGAGTTACAGCAGGTAGTGAAGCTGGGTCGTGCGTCAGAGGAAGCTCGCGGGAGCGCGGTGGAGGGCGGTGCTACGGACGCCCTCCTGGCGACATATGCTCTAACACCTGCACCCGCCACCGCGCTTAGAACACCTGCACCTGCACAGGACAGAATTTTAATGGAGGCACAGAATGTTATGGCCCTGACACATGTAGACACACCCCTGAAGGGTGGACTGAACACTCCTCTGCATGAATCAGATTTCTCCGGAGCACTGCCACAGACACAGGTCGTCGCTACACCCAATGCGGTGCTCTCCACACCATTCAGGTCATCGCGGACGGACGTATCAACGCCAAACAGCTTTGCAACACCTGGCCCTGGAGGGCAAGCGACGATAATGACTCCAGGACTACGTGACAAACTTAGCATAAACCCAGAAGACAGGTTAATAGGAGACACGCCGCAACAAAACAACCAAATACAGAAACAACTGAAGGCATCTGTTCGTAACGCTCTGCAATCCCTTCCCACTCCACGTAACGACTACGAGATCGTAGTCCCCGAGGCGCGGGACGACAACGACACGGAGCGGGGGGACGACCTCGTGGACGATCAGGCTGACGTCGATGACAGGATCCTCAGGGAACAAGAGGAGAAACGTCTAGCAGCCCTAGCGCTTCGGTCCAGCGCTATAAGACGCGGCTGCGCCCGTCCGGCGGAGGTGGTTGGCGGAGCGGGAAGGACAGGTGGCGCCCTCACCTCACTACAGCGGGCCGAGGAACTGCTCAAGGCGGAAATGTTATCTATGCTGCATTATGACGCATTGCACGACCCTCCTCCCGGTGTGGACAAGAAGCGAGCGGTACAGTTACAGGCGTCACACTTGGCGTATCTGGAACAGCATCCTTACGAGCAGTTCACACGCGAGGAGCTGGACGCGGCGGAACAGGAGTTGAACAAGGAGATGGAAGTAGTGAAGGCGGGTATGGGCCATGGAGATCTGGGACTAGAGGCGTACACCACAGTATGGGAGGAATGTCTCGCACAGGTGCTGTTTCTTCCCGGACAAAACCGATACACTCGCGCTAACTTAGCGAGCAAGAAGGATCGACTAGAGTCGGCCGAGAAGAGATTGGAACAGAACAGGAACCATATGGCGAAAGAAGCTAAGAAATGCTCAAAAATGGAGAAAAAGTTAAGAGTGCTTACAGGTGGTTACCAAAGTCGAACTGCTTCACTAATAAAGCAGTTTCAGGAACTGCAAGATCAAATAGAGCAATCTAATTTAGAATTATCAACATTCAAATTTCTCGCTGAGCAAGAAAAGGCAGCTATACCTAGACGAGTCGAGTCTCTTACTGAAGATGTGAATAGACAAACGGAGAGAGAGAAACAACTCCAGAAGCGCTATGCCGAACTACAAGCGGAATTGGAAGATATTCACAAGGGACGTCTGAATAAGGAAGGACAAACAAAACAAGTGGAACCAGAACTTTCATAA

Protein sequence:

>DPOGS208908-PA
MPRIMIKGGVWRNTEDEILKAAVMKYGKNQWSRIASLLHRKSAKQCKARWYEWLDPSIKKTEWSREEDEKLLHLAKLMPTQWRTIAPIIGRTAAQCLERYEYLLDQAQKKEEGEDVGDDPRKLKPGEIDPNPETKPARPDPKDMDEDELEMLSEARARLANTQGKKAKRKAREKQLEEARRLAALQKRRELSAAGISVPIRRKKKRGVNYNSEIPFEKKPAAGFYDTSTEVVDPMAPDFSRLRQQHLDGELLSEKEERDRRKDKQKLKQRKENDVPQAMLQGDQPARKRSKLVLPEPQVTDQELQQVVKLGRASEEARGSAVEGGATDALLATYALTPAPATALRTPAPAQDRILMEAQNVMALTHVDTPLKGGLNTPLHESDFSGALPQTQVVATPNAVLSTPFRSSRTDVSTPNSFATPGPGGQATIMTPGLRDKLSINPEDRLIGDTPQQNNQIQKQLKASVRNALQSLPTPRNDYEIVVPEARDDNDTERGDDLVDDQADVDDRILREQEEKRLAALALRSSAIRRGCARPAEVVGGAGRTGGALTSLQRAEELLKAEMLSMLHYDALHDPPPGVDKKRAVQLQASHLAYLEQHPYEQFTREELDAAEQELNKEMEVVKAGMGHGDLGLEAYTTVWEECLAQVLFLPGQNRYTRANLASKKDRLESAEKRLEQNRNHMAKEAKKCSKMEKKLRVLTGGYQSRTASLIKQFQELQDQIEQSNLELSTFKFLAEQEKAAIPRRVESLTEDVNRQTEREKQLQKRYAELQAELEDIHKGRLNKEGQTKQVEPELS-