Monarch geneset OGS2.0

DPOGS204200
TranscriptDPOGS204200-TA1761 bp
ProteinDPOGS204200-PA586 aa
Genomic positionDPSCF300034 + 548014-554675
RNAseq coverage74x (Rank: top 65%)
Annotation
HeliconiusHMEL0099003e-14661.89% 
BombyxBGIBMGA005158-TA0.064.07% 
DrosophilaDNApol-alpha73-PC9e-7729.30% 
EBI UniRef50UniRef50_UPI00022B10F63e-8132.63%UPI00022B10F6 related cluster n=2 Tax=unknown RepID=UPI00022B10F6
NCBI RefSeqXP_002731400.16e-8032.84%PREDICTED: MGC80532 protein-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|3485142271e-8233.50%PREDICTED: DNA polymerase alpha subunit B isoform 1 [Oreochromis niloticus]
NCBI nr blastxgi|2089670744e-8432.73%polymerase (DNA directed) alpha 2 [synthetic construct]
Group
Gene OntologyGO:00038874.8e-34DNA-directed DNA polymerase activity
GO:00036774.8e-34DNA binding
GO:00062604.8e-34DNA replication
KEGG pathwayxtr:4486132e-83 
 K02321 (POLA2)maps-> Purine metabolism
    DNA replication
    Pyrimidine metabolism
InterPro domain[1-586] IPR0167221.5e-119DNA polymerase alpha, subunit B
[338-540] IPR0071854.8e-34DNA polymerase alpha/epsilon, subunit B
[18-232] IPR0136271.2e-11DNA polymerase alpha, subunit B N-terminal
Orthology groupMCL12791 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204200-TA
ATGGCTACCGAAGAATTGGTGACCGAGCAATTTAAATTTTTAGGAATAGAAGTATCAAAAGAAGTTTTATCAAAATGTGTGAGTATATGCACAGAATACGATGTAGACGCTGAAACTTTCATCGAACAATGGATGGCGTTTAGTTTGACTACTTTAAACGGTGCTTCGCCGACTTTAGATAACTTAGAATTACTTGAGAAGAGGGAATTTTCCAAACGGTCTGCGAGTCGACTCACCGCTGTCGCTAATGAAGTTACACATTCAAGTACCGGCAGAAGTTTGACTGTTTACGGAGCGCCTGTTGCTAAACAGTCAGAAAATGAACTTCTGTCTAATTACATGACAACTACACCAAAGAGAATTAAAATAGAATGTGAGACGGGGCCGAAACAGAATGAGCTCCATCCTGCTACATATTCACCGAAAGTAGAGCACTCCGTTAAATACACATCGAGAACGAACCAAGGGACAGTGGTGCATTCATTCGGTGAAGATAAGCTGGTGGAAGTGATCACGAATACAACGGCCTTGGATAATTTACTCGAACTAAATATTGTACAAGTCCCCAACGACGACGGCGAACTATACAACAAAGCGAAGTACGGCTTTGAACTGCTGCATGAAAAAGCGAGTGTATTTGATAACCACATCCGATATATATCACAATGTATCATGAAGAAGAACGGCTTTTCCGAGACGGTCTCGGTGAGGCATAAAACACAGACCGAAGTTTTGGTAGCTGGTCGTATTGAATGCGACGCAGATGCTAGACTAAACTCGAAAAGTGTAATCTTACAAGGCACATGGGAGGATTCACTGAGCCAGACCGTCCCTCTAGATTTGGACAGCGTGCAGCAGTATTCTCTGTTCCCGGGCCAGGTGGTGGTGGTGCGTGGTATAAATCCACGCGGCAACAAGTTTGTGGCTCGGGAGTTGTTCTGCGACGCGGCCCGCCCTGTGCCGGATCCAACATCAGATATCACGAACACGCTAAAAGGTACATTGTCAATGGTTGTAGCGGCCGGCCCGTACACCACGTCTAATAACATGTCGTACGAGCCGTTGAAGGACTTCATAGCGTATTTGAACACTCACAAACCACACGTAGTCATAATGACGGGACCATTCGTGGACTGTGAGCACGAGAAAGTCAAAGATAACTCTATGGCTGAAACATATAAATCTTTCTTCGACAAACTTATTGATAGTCTAGCTGATATCGGCAACACAAGTCCTTTTACAAAAATTTACATAGTGTCAAGTCACAAGGACGCTTTTCATGTAAATATCTACCCGACGCCGCCCTATAGCAGTCGAAAGAAATATCCCAACATACAATTTCTACCAGATCCCAGCACATTAAACATCAATGGATATATAGTTGGCATCACCAGTTACGATGTGCTTATGAGCATCAATCAAGAAGAAATATCACATGGTTCAGTCGGCGACAAGTTATCTCGTCTGTCCGGGCACGTGTTGCGGCAACAGTGCTATTATCCAACGGCTGGCTCTCTCGGCTCGCTGGCGGCGGACGGATCGCTGTGGGCGGCGCACGCACAACTACCCGCAACTCCTCACATACTAGTAGTGCCCTCCAACTTCAGATACTTCGTTAAGGAAGTGAACGGCTGTATAGTCATAAACCCTGAGCATCTCAGTAAAGGTGCCGGTGGCGGGACGTTCGCACGACTCGTCGTTCGTCCGCCGACAGAAGATAAAACTAATAGTAATATAGCCGCACAGATAGTACGCATTTAA

Protein sequence:

>DPOGS204200-PA
MATEELVTEQFKFLGIEVSKEVLSKCVSICTEYDVDAETFIEQWMAFSLTTLNGASPTLDNLELLEKREFSKRSASRLTAVANEVTHSSTGRSLTVYGAPVAKQSENELLSNYMTTTPKRIKIECETGPKQNELHPATYSPKVEHSVKYTSRTNQGTVVHSFGEDKLVEVITNTTALDNLLELNIVQVPNDDGELYNKAKYGFELLHEKASVFDNHIRYISQCIMKKNGFSETVSVRHKTQTEVLVAGRIECDADARLNSKSVILQGTWEDSLSQTVPLDLDSVQQYSLFPGQVVVVRGINPRGNKFVARELFCDAARPVPDPTSDITNTLKGTLSMVVAAGPYTTSNNMSYEPLKDFIAYLNTHKPHVVIMTGPFVDCEHEKVKDNSMAETYKSFFDKLIDSLADIGNTSPFTKIYIVSSHKDAFHVNIYPTPPYSSRKKYPNIQFLPDPSTLNINGYIVGITSYDVLMSINQEEISHGSVGDKLSRLSGHVLRQQCYYPTAGSLGSLAADGSLWAAHAQLPATPHILVVPSNFRYFVKEVNGCIVINPEHLSKGAGGGTFARLVVRPPTEDKTNSNIAAQIVRI-