Monarch geneset OGS2.0

DPOGS207360
TranscriptDPOGS207360-TA2619 bp
ProteinDPOGS207360-PA872 aa
Genomic positionDPSCF300188 + 404328-411760
RNAseq coverage343x (Rank: top 34%)
Annotation
HeliconiusHMEL0088630.084.77% 
BombyxBGIBMGA008625-TA0.086.17% 
DrosophilaMcm2-PA0.069.93% 
EBI UniRef50UniRef50_P497360.061.80%DNA replication licensing factor MCM2 n=42 Tax=Amniota RepID=MCM2_HUMAN
NCBI RefSeqXP_395109.20.072.38%PREDICTED: similar to DNA replication licensing factor Mcm2 (Minichromosome maintenance 2 protein) (DmMCM2) [Apis mellifera]
NCBI nr blastpgi|3838659590.073.95%PREDICTED: DNA replication licensing factor Mcm2-like [Megachile rotundata]
NCBI nr blastxgi|3838659590.072.65%PREDICTED: DNA replication licensing factor Mcm2-like [Megachile rotundata]
Group
Gene OntologyGO:00036779.1e-265DNA binding
GO:00055249.1e-265ATP binding
GO:00062609.1e-265DNA replication
GO:00056343.2e-29nucleus
GO:00062703.2e-29DNA-dependent DNA replication initiation
KEGG pathwayame:4116400.0 
 K02540 (MCM2)maps-> Meiosis - yeast
    Cell cycle - yeast
    DNA replication
    Cell cycle
InterPro domain[261-764] IPR0012089.1e-265Mini-chromosome maintenance, DNA-dependent ATPase
[168-413] IPR0160277.2e-67Nucleic acid-binding, OB-fold-like
[274-291] IPR0080453.2e-29Mini-chromosome maintenance complex protein 2
[342-416] IPR0123401.6e-26Nucleic acid-binding, OB-fold
Orthology groupMCL13829 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207360-TA
ATGAGTTCTCCTATTCCTGATACACCGTCTGACAGAGATGGAGCCCGGTCTCGTATGACATCGCCAGCTCGGGAGTACGAAATGTTTGAAGATGAAGGAGCCATCTTGGGTGACAATGCTGATGAAGAAGAAGAAGATGGAGAGGAATTGTTCAATGATAATATGGAAGCTGATTACCGACCGATGCCAGCGTTGGATCGGTATGATGCTGAAGACCTGGACGAGGAGGACTATGACGCCATGTCGGTTGAAGATCGTGTGGCCGCAGAACGGGAGCTACAGAGACGGGACAGAGATGAGGGACGCATACGGAGAGATGATAGGGACTTGTTATATGACAGCAGTGACGCGCCTCGTGCGAAACGACGGAGGGCCGCGGAGAAGGCGGCCGGGATGGAGGAACCCGTGGAGGGGATCGAAAGTATTGAAAACCTCGAAGACACCAAGGGATATTCCACTAAAGAATGGGTTTCCATGTTGGGGCCTCGTACTGAGATCGCTAATAGGTTCAAGAATTTCCTCCGCACCTACACGAACACGAAAGGCCAATACGTGTACAAAGAAAGAATACGTAGGATGTGCGAACACAATCAGGCCTCATTCCACGTGGAGTTCGATGTGTTGGCGAGACGAGAACAGGTGCTAGCCTATTTCCTACCCGAGGCACCGTTTCAGATGCTGCAAATATTCGACGAAGTGGCCAAAGACATAGTACTCCAGATATTCCCGAGCTACGAGCGCGTCACCTCAGAGGTACACGTGCGGATCGCTGATCTTCCTCTCATAGAAGAGTTGAGGACGTTCAGGAAGCTGCACTTGAACCAGCTGGTGCGCACCGTGGGCGTTATAACGGCCACTACCGGGGTGATGCCGCAGCTGTCCGTGGTCAAATACGATTGTAACAGATGTGGCTATATATTGGGACCGTTTGTCCAGTCTCAGAATTCTGAAGTCAAACCGGGGTCTTGTCCTGAGTGTCAGAGCTCGGGACCTTTTATGGTTAATATGGAACAAACCGTGTATAGAAACTATCAGAAGGTTACAATCCAGGAATCACCTGGAAGGATTCCAGCTGGTCGTATACCGAGGAGTAAGGACTGTGTGCTGTTAGCAGATCTCTGTGATAGATGCAAGCCAGGAGATGAGGTAGACCTAACGGGGATCTACACAAATAATTATGATGGATCACTCAATACTGAACAGGGTTTCCCAGTATTCGCCACTGTTATTATAGCGAACTACATAGTAGTTAAGGACTGCAAGCACATTGTTGAATCTCTCACTGATGACGATGTTGCCAGCATCCTAAAACTGTCAAAAGACCCACAAATAGGGGAAAGAATTGTACAGAGTATAGCCCCTTCTATATACGGCTATGATTATATCAAAAGAGGTTTGGCGCTCGCTTTGTTTGGCGGCGAACCTAAAAATCCCGGCGAAAAACATAAACTAAGAGGCGACATTAACGTTTTAATATGCGGAGATCCGGGGACGGCTAAGTCGCAATTCCTCAAATATACGGAGAAGGTAACTTCGCTTGGATCTTTCATTCATAATAAAGTACTAATTAATTTGTTACTTGTACACAACATATCGAAAATTAAACGGACAGATTGGACATTGGAAGCCGGCGCGCTGGTGCTGGCTGACCGCGGAGTCTGTCTCATAGATGAGTTCGATAAGATGAACGATCAAGATCGTACGTCTATACACGAAGCGATGGAACAACAATCCATATCTATATCTAAGGCTGGAATTGTTACATCGTTACATGCTAGATGTTCTATAATAGCAGCGGCCAATCCTATCGGCGGTCGTTACGACGCGTCGCTGACATTCACCGAGAACGTGAATCTCTCTGAGCCGATATTATCTCGTTTCGACGTGTTGTGTGTCGTGAGAGACGAGGCAGATCCTATGCAAGACGCGCATCTGGCTAAGTTCGTAGTGAGCTCTCATATAAGACATCACCCCACGCAACGCGGTACTACCATCGAGGATACTACAGTTGAAAACGATTTCACATTGCCACAGGATCTATTGAAGAAATACATTGTCTACTCGCGGGAGAATATTCATCCTAAGCTCACGAATATGGATCAAGACAAAGTAGCGAAAATGTACAGCCAGCTAAGACAAGAGTCGTTAGCCACAGGCAGTTTACCTATCACTGTGCGACATATTGAATCTGTAATCCGCATGTCCGAGGCGCATGCTAGAATGCATCTAAGAGCAGCTGTAAACGAGCAAGACGTTAACATAGCCATCAGAACTATGTTAGAGAGCTTTGTGGCCACACAAAAGTACAGTGTGATGCGGGCTATGAGACAGACCTTCCAAAAATACCTATCATACAAGAAAGATAACAGCGAACTACTGTATTACATTTTAAGACAACTCACAATGGACCAGCTAGCATATATGCGAGGCTTGCATAATCACTCCCAATCGACTATAGAGATATCGGAGCGAGATTTAACAGAAAGAGCAAGACAAATCAATATCACAGATCTCAAACCTTTCTATGATAGTAGAATATTCAAAATGAATAACTTCAGTTACGATGCTAAACGGAAAGTCATTGTTCACACTCTGCCGGAAGTGCCATCGGTTAATTAA

Protein sequence:

>DPOGS207360-PA
MSSPIPDTPSDRDGARSRMTSPAREYEMFEDEGAILGDNADEEEEDGEELFNDNMEADYRPMPALDRYDAEDLDEEDYDAMSVEDRVAAERELQRRDRDEGRIRRDDRDLLYDSSDAPRAKRRRAAEKAAGMEEPVEGIESIENLEDTKGYSTKEWVSMLGPRTEIANRFKNFLRTYTNTKGQYVYKERIRRMCEHNQASFHVEFDVLARREQVLAYFLPEAPFQMLQIFDEVAKDIVLQIFPSYERVTSEVHVRIADLPLIEELRTFRKLHLNQLVRTVGVITATTGVMPQLSVVKYDCNRCGYILGPFVQSQNSEVKPGSCPECQSSGPFMVNMEQTVYRNYQKVTIQESPGRIPAGRIPRSKDCVLLADLCDRCKPGDEVDLTGIYTNNYDGSLNTEQGFPVFATVIIANYIVVKDCKHIVESLTDDDVASILKLSKDPQIGERIVQSIAPSIYGYDYIKRGLALALFGGEPKNPGEKHKLRGDINVLICGDPGTAKSQFLKYTEKVTSLGSFIHNKVLINLLLVHNISKIKRTDWTLEAGALVLADRGVCLIDEFDKMNDQDRTSIHEAMEQQSISISKAGIVTSLHARCSIIAAANPIGGRYDASLTFTENVNLSEPILSRFDVLCVVRDEADPMQDAHLAKFVVSSHIRHHPTQRGTTIEDTTVENDFTLPQDLLKKYIVYSRENIHPKLTNMDQDKVAKMYSQLRQESLATGSLPITVRHIESVIRMSEAHARMHLRAAVNEQDVNIAIRTMLESFVATQKYSVMRAMRQTFQKYLSYKKDNSELLYYILRQLTMDQLAYMRGLHNHSQSTIEISERDLTERARQINITDLKPFYDSRIFKMNNFSYDAKRKVIVHTLPEVPSVN-