Monarch geneset OGS2.0

DPOGS214594
TranscriptDPOGS214594-TA1710 bp
ProteinDPOGS214594-PA569 aa
Genomic positionDPSCF300050 - 303228-304937
RNAseq coverage74x (Rank: top 66%)
Annotation
HeliconiusHMEL0069700.087.57% 
BombyxBGIBMGA005125-TA0.080.18% 
DrosophilaDNApol-iota-PB2e-11435.69% 
EBI UniRef50UniRef50_Q9VHV13e-11235.69%DNApol-iota, isoform A n=12 Tax=Drosophila RepID=Q9VHV1_DROME
NCBI RefSeqXP_001868410.15e-11741.74%DNA polymerase IV [Culex quinquefasciatus]
NCBI nr blastpgi|1700672561e-11541.74%DNA polymerase IV [Culex quinquefasciatus]
NCBI nr blastxgi|1700672562e-11740.66%DNA polymerase IV [Culex quinquefasciatus]
Group
Gene OntologyGO:00038871.1e-36DNA-directed DNA polymerase activity
GO:00062811.1e-36DNA repair
GO:00036841.1e-36damaged DNA binding
KEGG pathway 
InterPro domain[254-372] IPR0179611.1e-36DNA polymerase, Y-family, little finger domain
[2-164] IPR0011263.9e-35DNA-repair protein, UmuC-like
Orthology groupMCL14989 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214594-TA
ATGGTGCAGAAACCAGAACTGCGCTGTTTTCCGCTAGGAATCCAACAGAAAAACATCGTTGTAACCAGTAATTATGAAGCCAGGAAGTACGGTATACGAAAGTGCATGCTCGTGTCTGATGCTTTAAAAGTATGTCCGAATTTAAAACTTATTAACGGCGAAGACTTACATGATTATAGAGCGGCGTCCAATAAAATATTTACAGTTTTACAAACGTTTAAGTGTCCAGTGGAGAAGTTAGGAATGGATGAAAATTTTATTGATGTCACAAATATAGTACAAGAAAGAATAAAAAATGTTAATTTAAAAAGTATCACAGTATCAGGTCATTTATACACTGAGTCCAACGCAGAGTGTGTGTGTGGTTGCCATGCAAGACTGAAAGTAGCATCACAAATAGCCTCTGAAATGAGGCATAAAATTTACGACGAATTAGGTTTTACTACTTGTGCTGGGATAGCACACAATAAACTGTTGGCTAAGCTTATATGTCCTTTAAATAAGCCTAATGATCAAACAACAATTTACCCCGAGCATGGCGTCAGTTTTATGTCTACCTTGCAAAGTGTCCGCTCAATACCAAGTATAGGATCCAAAACTACTGAAGCCCTCATCTCTCAAAAAATTATCACTGTGAGGGATTTACAAGAAGTTTCTATAGAAGTATTGAAAAAGCATTTTAGTTCTGACATGGCGGTGAGACTTAAGAATTTAAGTGTCGGTGAAGATAATACTCCAATTAAACAAACGGGCAGGCCTCAGAGTATAGGTTTAGAAGATAGCTTTAAGACTGTGAGTGTTAAGAGTGAAGTTGAAGAAAAATTTCAAGCATTGCTTCAAAGGCTGTTGATTCTAGTGAGAGAAGATGGACGCATTCCAGTATCACTAAGAGTGACTCTGAGGAAAAAAGATGTGAAACGATTAAGCAGTCACAGAGAGTCCAGGCAGTGTCAGGTATCTCCTTCCATCTTCACAATTAACAATGGAACACTCACAGTTACAGATTCTGGTAGGCAGAAGCTAATGAGCATAATAATGAGATTATTTAACAAATTAATTGACTTATCGAAACCATTCCATTTAACTTTAGTGGGTTTGGCATTCACAAAGTTCCAAGAGCGTATGACAGGTAGAGGGTCCATTGTCAATTATTTAATGAATGATATATCGGTCCAATCCGTACTCAATATTACAAATGACTGTGATACTTCAGCTTCTTCTATGGATTATTCGGCTGCGTCTCCTAGTAGTAGTACCACCACTGATCTATCTGACGGTGAAGTGGAACCATCACCTAAGAAACCTAAAAAGGGAACCTGGATAGCTAAAAGACGTTGCTTATCAAAGGAGGAAGTTGCATCTCCTAGTAAACTTAAAGTAGGCGAGTTGAGGCTCAATTCTAAAGAACTAGAAAAGGTTTCTGAATTAAGATTAAATTCCAGAGACAGGTCACTAACCCCTAGAGCGAGTCCTGCAAAAGACAATCTCTCTGATACTTCAGACACTACAAAGGATGCAGCTGACAGTAAATGTGACATTTGTCCTAGTTATGTAGATAAAGAAGTCTTTAATGCTCTCCCTGAAGAAATGCAACAAGAACTGAAAGCCATGTGGAAGAATCCCTCCAGTTCAGGGGTCAGAAGTAGCCCCAGAACATTGAACAAAGCTAAACCGAACACTCTTTTAAAATATTTTGTTCCAAACAAATAG

Protein sequence:

>DPOGS214594-PA
MVQKPELRCFPLGIQQKNIVVTSNYEARKYGIRKCMLVSDALKVCPNLKLINGEDLHDYRAASNKIFTVLQTFKCPVEKLGMDENFIDVTNIVQERIKNVNLKSITVSGHLYTESNAECVCGCHARLKVASQIASEMRHKIYDELGFTTCAGIAHNKLLAKLICPLNKPNDQTTIYPEHGVSFMSTLQSVRSIPSIGSKTTEALISQKIITVRDLQEVSIEVLKKHFSSDMAVRLKNLSVGEDNTPIKQTGRPQSIGLEDSFKTVSVKSEVEEKFQALLQRLLILVREDGRIPVSLRVTLRKKDVKRLSSHRESRQCQVSPSIFTINNGTLTVTDSGRQKLMSIIMRLFNKLIDLSKPFHLTLVGLAFTKFQERMTGRGSIVNYLMNDISVQSVLNITNDCDTSASSMDYSAASPSSSTTTDLSDGEVEPSPKKPKKGTWIAKRRCLSKEEVASPSKLKVGELRLNSKELEKVSELRLNSRDRSLTPRASPAKDNLSDTSDTTKDAADSKCDICPSYVDKEVFNALPEEMQQELKAMWKNPSSSGVRSSPRTLNKAKPNTLLKYFVPNK-