Monarch geneset OGS2.0

DPOGS215464
TranscriptDPOGS215464-TA1296 bp
ProteinDPOGS215464-PA431 aa
Genomic positionDPSCF300098 - 444777-448987
RNAseq coverage1188x (Rank: top 11%)
Annotation
HeliconiusHMEL0083468e-17373.61% 
BombyxBGIBMGA012127-TA2e-2428.57% 
DrosophilaCaf1-PA0.094.08% 
EBI UniRef50UniRef50_Q090280.092.31%Histone-binding protein RBBP4 n=210 Tax=root RepID=RBBP4_HUMAN
NCBI RefSeqXP_624580.10.096.25%PREDICTED: similar to Chromatin assembly factor 1 subunit CG4236-PA [Apis mellifera]
NCBI nr blastpgi|3454835390.096.02%PREDICTED: probable histone-binding protein Caf1 isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|3454835390.096.02%PREDICTED: probable histone-binding protein Caf1 isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00055158.3e-61protein binding
KEGG pathway 
InterPro domain[116-407] IPR0159438.3e-61WD40/YVTN repeat-like-containing domain
[112-404] IPR0110463.5e-54WD40 repeat-like-containing domain
[19-89] IPR0220521e-30Histone-binding protein RBBP4
[263-303] IPR0016804.3e-10WD40 repeat
[265-303] IPR0197812.3e-09WD40 repeat, subgroup
[194-208] IPR0204721.8e-08G-protein beta WD-40 repeat
Orthology groupMCL11414 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215464-TA
ATGGGTGATAAAGATGGAGAAACCTTTGATGATGCTGTGGAAGAGAGGGTTATCAACGAGGAGTACAAAATATGGAAGAAGAATACACCTTTCTTATATGACCTGGTCATGACACACGCTTTAGAGTGGCCCTCGCTGACCGCTCAGTGGCTTCCAGATGTCACAAGACCTGAAGGCAAGGATTACTCCGTACACAGATTGATTCTGGGCACTCACACATCAGATGAACAAAACCACCTCCTCATTGCAAGTGTACAACTTCCTAATGAGGATGCACAGTTTGATGCAAGCCACTATGATAATGATAAGGGTGAATTTGGTGGTTTTGGATCAGTTTCTGGTAAGATAGATATAGAAATTAAAATTAATCATGAGGGTGAAGTCAATAGGGCTCGCTACATGCCCCAAAATCCTTGCGTCATTGCCACAAAGACACCATCTTCTGATGTCCTCGTATTTGACTACACCAAACATCCATCAAAACCTGAACCTTCCGGAGAATGTCATCCCGACCTTAGATTGCGAGGACACCAAAAGGAAGGTTACGGTTTGTCATGGAACCCTAATCTTAATGGATACCTTTTATCAGCCAGTGACGATCATACAATCTGCTTATGGGATATAAACGCCACTCCTAAGGAAGGGCGTGTGATTGAAGCCAAGTCTGTCTTCACGGGACACACAGCGGTGGTTGAGGATGTGGCGTGGCATCTGCTTCATGAATCCTTGTTTGGATCTGTGGCCGACGATCAGAAGCTCATGATATGGGATACGAGATGTAACAACACGTCCAAGCCATCCCACACCGTGGATGCTCACACCGCTGAAGTGAACTGCCTTAGCTTCAACCCATACTCGGAATTTATTCTCGCCACTGGCAGTGCTGACAAAACTGTGGCGTTGTGGGACTTGCGTAACCTTAAACTGAAGTTGCACTCGTTTGAGTCGCACAAGGACGAGATCTTCCAAGTACAGTGGTCGCCACACAACGAGACCATTCTGGCTAGCAGTGGCACAGACAGGAGGCTGCATGTTTGGGATCTATCGAAGATTGGTGAGGAACAGACGGCTGAGGACGCGGAGGACGGGCCCCCGGAACTGTTGTTCATCCACGGAGGTCACACCGCCAAGATATCCGACTTCTCATGGAACCCCAACGAGCCGTGGGTCATCTGCTCCGTCTCCGAGGACAACATCATGCAGGTGTGGCAAATGGCTGAGAACATCTACAACGATGAGGAACCGGAAACGCCGGCTTCGGAGCTGGAATCGGGAGTCAACGTGAACCACGGTTAG

Protein sequence:

>DPOGS215464-PA
MGDKDGETFDDAVEERVINEEYKIWKKNTPFLYDLVMTHALEWPSLTAQWLPDVTRPEGKDYSVHRLILGTHTSDEQNHLLIASVQLPNEDAQFDASHYDNDKGEFGGFGSVSGKIDIEIKINHEGEVNRARYMPQNPCVIATKTPSSDVLVFDYTKHPSKPEPSGECHPDLRLRGHQKEGYGLSWNPNLNGYLLSASDDHTICLWDINATPKEGRVIEAKSVFTGHTAVVEDVAWHLLHESLFGSVADDQKLMIWDTRCNNTSKPSHTVDAHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLKLHSFESHKDEIFQVQWSPHNETILASSGTDRRLHVWDLSKIGEEQTAEDAEDGPPELLFIHGGHTAKISDFSWNPNEPWVICSVSEDNIMQVWQMAENIYNDEEPETPASELESGVNVNHG-