Monarch geneset OGS2.0

DPOGS206444
TranscriptDPOGS206444-TA1797 bp
ProteinDPOGS206444-PA598 aa
Genomic positionDPSCF300070 - 356420-360976
RNAseq coverage605x (Rank: top 21%)
Annotation
HeliconiusHMEL0139990.068.53% 
BombyxBGIBMGA005448-TA0.067.51% 
DrosophilaRpA-70-PA8e-16748.33% 
EBI UniRef50UniRef50_Q5TLD30.067.22%Replication protein A large subunit n=1 Tax=Bombyx mori RepID=Q5TLD3_BOMMO
NCBI RefSeqNP_001036938.10.067.22%replication protein A1 [Bombyx mori]
NCBI nr blastpgi|1129831320.067.22%replication protein A1 [Bombyx mori]
NCBI nr blastxgi|1129831320.067.00%replication protein A1 [Bombyx mori]
Group
Gene OntologyGO:00056341.8e-196nucleus
GO:00036771.8e-196DNA binding
GO:00062601.8e-196DNA replication
GO:00036761.1e-10nucleic acid binding
KEGG pathwaycqu:CpipJ_CPIJ0023810.0 
 K07466 (RFA1, RPA1, rpa)maps-> DNA replication
    Homologous recombination
    Mismatch repair
    Nucleotide excision repair
InterPro domain[4-591] IPR0045911.8e-196Replication factor-a protein 1 Rpa1
[419-593] IPR0160271.2e-54Nucleic acid-binding, OB-fold-like
[454-593] IPR0123401.7e-46Nucleic acid-binding, OB-fold
[441-586] IPR0139555.9e-46Replication factor A, C-terminal
[4-103] IPR0071991.1e-20Replication factor-A protein 1, N-terminal
[177-259] IPR0043651.1e-10Nucleic acid binding, OB-fold, tRNA/helicase-type
Orthology groupMCL12219 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206444-TA
ATGTCATATTCACTTTCGGAAGGTGCTGTGGAGATAATAATGAGTGGAGGCGAGTATGATGGCCCTGTAATACAAGTACTTGGACACAAAAAGATTCAAAGTAGAGGAGAAGATAGAATTCGCCTTATTGTTTCTGATGGAAAACATTCCCATAGCTTAGCCCTGCTCAACTCCCAGCTTAATAATAAAGTATTCAGTGGAGAGCTGTCAAACTATTCTGTAATAAAAGTAGATAAATATTTTATTTCCTCTGTTCAAAAAAAGGAAGAGAAGAGAGTTATGGTGATACTCAATTTGACTATCATCGCTCCAGGAGCAGAAGTTGGAAAGAAACTCGGTGACCCTATACAGTGGTCGGAAGATATGACATCACCGTCATATGCAAGCACCACTAAAGCGGAACCAAAACCAGTTCCTACACCCATGAGCAATTTGTCAAAAACAACTGCTGGAATCAACTTGAATTCAAGTGTACTGAGTTCACAAATGACTCATCCTATTGCAAGTTTAAGCCCATATCAGAATAAGTGGGTTATTAAAGCAAGAGTAATGAACAAAACAGCCATTCGTACTTGGAGCAATGCTAAGGGTGAGGGCAAACTCTTCAGCATGGACCTTTGTGATGAGAGTGGTGAAATCAGAGCCACAGCTTTCAAAAATGAGTGTGATAAATTCTATGATATGATACAGATTGATAAGGTATACTACATCAGCCGCTGTCAATTGAAAACAGCAAACAAACAGTACACAACACTGAAGAACGATTATGAGATGACATTCACAGCTGACACGGTTGTATCAGAATGTATGGAGGAAAGTAACTCAGTGCCATCCATAAAGTACGACTTTATGCCCATCAGTGATATTGCTGACAAGGGTCCCGATACAATTCTGGATGTGATAGGAGTTTGTAAATCCGCATCAGATATTCAGGAGCTTACAGCCAAAAGCACAGGGAAGCTGCTAAAAAAACGAGAGGCCACATTAGTGGATTCGTCTGGAGGGGCGATCACTTTAACACTTTGGGGAGCTGAGGCTGAAAAATTTGATGGCAGCAGCAATCCTGTTGTTGCTGTGAAGGGCGCTCGCCTGGCGGAGTTCAACGGCAGCAAGTCCCTGTCGTGCCTCGCCAGTACTATAGTGAGGGTGCAGCCGGATGTGGAAGAGGCACACCGTCTACGAGGCTGGTATGATAATGGAGGGGATTCCATGGCCATGGTACATATATCAGCCAGAGTCGGTCAAGGCGGTGGGAACGCTGAATGGATGACATTCGCGGAAGCCGAAGAACGGAGACTCGGCACTGGCGATAAGGCCGACTACTTCAGTCTGTTGGGAGTTTTGACCTTCACGTTCGCGGATAACGCTGTGTACAAAGCCTGTCCGCAGGAACAGTGCAACAAGAAACTCGTAGACCAGCAGAACGGGCTCTACAGATGTGAGAAGTGCAATCGAGAGTATCCCAACTATAAATACAGATTGCTGCTAGGAGCTACAGTGTCCGACCCTACGGGCGACCAGCGAGTGACGGCCTTCAACGAGTCGGCGGAGGTGATGCTCGGCCGCAGCGCGGAAGAGGTCGGCCGGCTCTCAGACTACGACAAGGCGGAGTACGGCCAGCTGCTCGACCACGTGAAGTTCAAGACGTTCGTCTTCAAGTTCAGGACCAAGATTGAGACGTACAGCGACGAAGCCAAGCTAAAGACAGTGGTAATGAGCGCCCAGCCGGTCGACTACAGAGACGCTAACGCCAGACTCGTCAAGAGCATCAAGGCTTTGAGCGGTGTCGAAGTTTAA

Protein sequence:

>DPOGS206444-PA
MSYSLSEGAVEIIMSGGEYDGPVIQVLGHKKIQSRGEDRIRLIVSDGKHSHSLALLNSQLNNKVFSGELSNYSVIKVDKYFISSVQKKEEKRVMVILNLTIIAPGAEVGKKLGDPIQWSEDMTSPSYASTTKAEPKPVPTPMSNLSKTTAGINLNSSVLSSQMTHPIASLSPYQNKWVIKARVMNKTAIRTWSNAKGEGKLFSMDLCDESGEIRATAFKNECDKFYDMIQIDKVYYISRCQLKTANKQYTTLKNDYEMTFTADTVVSECMEESNSVPSIKYDFMPISDIADKGPDTILDVIGVCKSASDIQELTAKSTGKLLKKREATLVDSSGGAITLTLWGAEAEKFDGSSNPVVAVKGARLAEFNGSKSLSCLASTIVRVQPDVEEAHRLRGWYDNGGDSMAMVHISARVGQGGGNAEWMTFAEAEERRLGTGDKADYFSLLGVLTFTFADNAVYKACPQEQCNKKLVDQQNGLYRCEKCNREYPNYKYRLLLGATVSDPTGDQRVTAFNESAEVMLGRSAEEVGRLSDYDKAEYGQLLDHVKFKTFVFKFRTKIETYSDEAKLKTVVMSAQPVDYRDANARLVKSIKALSGVEV-