Monarch geneset OGS2.0

DPOGS213538
TranscriptDPOGS213538-TA1800 bp
ProteinDPOGS213538-PA599 aa
Genomic positionDPSCF300033 - 496536-498764
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0054730.070.96% 
BombyxBGIBMGA011818-TA0.065.18% 
Drosophiladup-PA3e-9038.70% 
EBI UniRef50UniRef50_D6WGF93e-9637.17%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WGF9_TRICA
NCBI RefSeqXP_393349.32e-10141.70%PREDICTED: similar to double parked CG8171-PA [Apis mellifera]
NCBI nr blastpgi|3825464570.065.18%DNA replication factor Cdt1 [Bombyx mori]
NCBI nr blastxgi|3825464570.065.29%DNA replication factor Cdt1 [Bombyx mori]
Group
KEGG pathway 
InterPro domain[231-390] IPR0149391.4e-47DNA replication factor CDT1-like
Orthology groupMCL14070 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213538-TA
ATGTCTCAAACGACATTAACAACATTTTTTAACAGTAGAAAAAGACCAGCAACGGAAGACATAGCGAGTACGAAGAATAAAATAGCACACTTGGAACGTTTAGATCCAATCACTAAATCGGGTAGAAAATCTCCATTTCCTAAAAATGATTTACTGCTACATAATGGCAAAGAATTAAACCATGTCACCAAAGTTGAAAATTCCAAGAAACTAGATTCGGAGCAGAAGTTGACCAAAAAACCTGAAAATACATCAGAGCAACAGAAAGCTGTTGCCTTCAGTAAAAAATCAATGTCTCTAATATTGCTAAAAACAGTGAAACCTCCAAAACGGAACACGTCACTGAAGCTCGAAAAGAACTATGTTTGGAGGTTAGCTGGCAGCAGTAGATTATCCGAATTAAGAGCTACAGCGGAACGTCTTAGCAAAGGTATTCAGGAACTAAAAGAAAGTAGTGATAAAAAGAACTTAAAAGAGTTTAAATCTATTAATGTGGATGTACCCCAGAGTCCTAGCAAGAAATCATTAAATCGTCATGAGTTTTTGTCACCTACGAAGCAAGACAGTGTCTCAAGTCAGCAAATACCTTTACTGTCTCCAAGAAAAGTTTTTGTTAGTCCTATCAAAAGTCCAAGCAAAGTACCAGCATACATTAGACATGCCTCTTTGGCTGCTCCATCCAATCTTCAGCTGCCACATCACTATAGATTTCTGGCTGAGCTATTCCGAGGTATGGAAACGGTTGTCGCACTCCTCTATAACAGGAATGAAAAGATCACTTTCAATAAACTGAAGCCTTCTATTCAAGAAATGCTCAAGAGAAGCTTCTGTGAAAAACATTTGGCGCAAATAAAATACTTAGTCCCTGACTTTTATAACTTTGAAGTACAGAAGATAAAGAGTTTTACCTCTACAAATCACAAAGAAACATTTGAACTCATCATATCTCCCAATTTTCCTAATGATATCAAAATCATGAATCCAAGTGTTCTCCTTGAAAGGCGGAGATATTTCTACAATACTTTACTTCAATTGGTGAAAAAGCACCATGCTCAATTTCTCTCAACTCTGGATCCTCCGATTGAAATCCCTGACAATAAGTTAGTGAGATGGCATCCTGAGTTTGAACTTGAAAAGATACCAGACATTGATGGAGCTAAACTGCCCGAATTGCCAAATACCGAAAAATTCTCTTCAGCCCAAGATGTTCTTGCAAAAGCCAGAGAACTTTTTAAATGTAATACTAAAATGGAAAGAGCGCTCGAAAAACTTGCACAAGCTAAAGCAAGAGGTTTAACTGAACAAGAAAAAGCTGTCACCGGTTTAAATGAATCCCCAAAGAAGAATGTATCTACTCAGATAAGCCAACCGTCAACAAGTGGGATTCAAATTTTGAACCCTGCCCTTCGTAATCTACCAGCGGCCTTATTGGAAAAGGTCAAAGCTAAACAAGCAGCTAAAGCATTTGAAGCAATGACTAGATCTTCGGAAACTGAACATAAATACCTGATCTACACTCGCCTTCCAGATTTGGCGAGGACTTTAAGGAATATATTTGTTACAGAGAGGAAAAATGTACTCGCACTCAATATAGTGCTCTCAAAACTTGATAGCAGTTTCAAATCTAATGTCTCTGCTAATGAGTTACAAAAGGACATAAAGCTACTGACCGAGGAAGTCCCAGATTGGATCAAACTTCATGAAATTAGGAACGCCACATATCTGAAACTAGACAAAAATACAGACTTGAAGATAATAACTTCGAAACTGGAAGCGGCTGCGCAAAAATATAAAGATTAA

Protein sequence:

>DPOGS213538-PA
MSQTTLTTFFNSRKRPATEDIASTKNKIAHLERLDPITKSGRKSPFPKNDLLLHNGKELNHVTKVENSKKLDSEQKLTKKPENTSEQQKAVAFSKKSMSLILLKTVKPPKRNTSLKLEKNYVWRLAGSSRLSELRATAERLSKGIQELKESSDKKNLKEFKSINVDVPQSPSKKSLNRHEFLSPTKQDSVSSQQIPLLSPRKVFVSPIKSPSKVPAYIRHASLAAPSNLQLPHHYRFLAELFRGMETVVALLYNRNEKITFNKLKPSIQEMLKRSFCEKHLAQIKYLVPDFYNFEVQKIKSFTSTNHKETFELIISPNFPNDIKIMNPSVLLERRRYFYNTLLQLVKKHHAQFLSTLDPPIEIPDNKLVRWHPEFELEKIPDIDGAKLPELPNTEKFSSAQDVLAKARELFKCNTKMERALEKLAQAKARGLTEQEKAVTGLNESPKKNVSTQISQPSTSGIQILNPALRNLPAALLEKVKAKQAAKAFEAMTRSSETEHKYLIYTRLPDLARTLRNIFVTERKNVLALNIVLSKLDSSFKSNVSANELQKDIKLLTEEVPDWIKLHEIRNATYLKLDKNTDLKIITSKLEAAAQKYKD-