Monarch geneset OGS2.0

DPOGS207064
TranscriptDPOGS207064-TA3396 bp
ProteinDPOGS207064-PA1131 aa
Genomic positionDPSCF300001 + 2277434-2283460
RNAseq coverage386x (Rank: top 31%)
Annotation
HeliconiusHMEL0101990.060.85% 
BombyxBGIBMGA013112-TA0.062.96% 
Drosophilal(1)1Bi-PA8e-2621.41% 
EBI UniRef50UniRef50_UPI00022CA8396e-14131.60%UPI00022CA839 related cluster n=1 Tax=unknown RepID=UPI00022CA839
NCBI RefSeqXP_001603225.13e-14434.19%PREDICTED: similar to DNA polymerase v [Nasonia vitripennis]
NCBI nr blastpgi|3287875909e-14531.14%PREDICTED: hypothetical protein LOC725618 [Apis mellifera]
NCBI nr blastxgi|1565371256e-16433.01%PREDICTED: DNA polymerase V-like [Nasonia vitripennis]
Group
Gene OntologyGO:00038871.1e-59DNA-directed DNA polymerase activity
GO:00036771.1e-59DNA binding
GO:00063511.1e-59transcription, DNA-dependent
KEGG pathway 
InterPro domain[46-1110] IPR0070151.1e-59DNA polymerase V
Orthology groupMCL15594 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207064-TA
ATGAAAAACGAGATCAATATGGATAAACCTGAGAAAGCAGTAACTGCTTCATTGCTAGACGCTTTTGATTTATTTAAGTCACCGACAGATGATTTAAAAATCCTTGCTGGCCTTAAGATACATTCGCGTCTACGGGAAAATGAGGGTGGAAAGGAACTACAGTATACATTAAAAAGACTAGTTAGAAGTCTTGGCGCGAACATCGCTGAGTTGCGGATGGGCTATTTTGCTGCACTTGTTACAATCCTGACTAGGTTTCCAGAGATAACAGTAACGCAGCTGCTAGAATTGATCAAAAAAGAACTCCATGCCAGCGGATCATCTAAGAGTGAAGTTGGGGATGTTGCTTTAGGTCACATTCTGGCATGCGGGGCTGTTTTTCGTTCAGGCCTTATGTTAAAATGTACTGAGGAAGAACAAAAGGAAGTATTGCAACTTTTTGAAACAGCTAGCAGTAAGAAGTCATATTTGAGTACTGTGGCAACCCTAGTCTTTATTGATTTTATTAATAATCTCGATGAAGAACAATTTGCTACAATAGTTTGGCCGAACATAAAGCAGAACTACAAAAAAGCTATAAACGAACACAATTTAGACTCTTTATATTTTTTAATGATTGTTAATGAAAAGTTTCCTAAGAAAGTCAAGCTGAGGAAGTTGATTGGTGTGCCAGAACTGTTACACGAAGATCACATTTCAGATATATGTGATAAATTAATGACAGGCGTAGACTTCAATTCATTAAGCCACCCAATATACCAGGAATGTGGTAAACAGATTGCTAACTCACCACATCTTTCAATTTTTTGGAATAAAATTGACAGTCATCTTGTAAAACATAACAGGAACAGAGAATTGGTTTCATTAAATATTTTGAATACTGTTCTCCTGAATTTAAAAGACAATGTTGAAGTCATACCAGATCTGCTGAGTGACAACTTTTTCAAGTTGTTCATGGACTGGTTCAAAGGTTTACAGACAGCCAGCAAAATAAGGAATAAGAGAGACAATGAAGATGATTCCAAAATAATGGTTAAAAAACAAAAAGAAGTTCTCATGTCTCTAGCCAAGGCATTACAATTGCCATCTGTTGATAGCAAAATAAGAGTGAAAACTTTGGACAAGTTATTATTTAGCCCGGGTGAAATTAATTTCACTGAGATAACTGGATCTACAGTTGTTAAATCTATTACTGCTGGTCTAGATGTTGATGGTCTCAAGAAAATGGCAAAGTCATTGAAAAAGGTCTTCCTCAATTCCTCCAAGAAAGTTATTAAAGAGGGTGTGGAAAGGAACTGGTACAATAATGAAAGGGTTAAAGCAGCTGAGTTGATATCATACATGGTCAGTCATGAAGCTGTGAAGGACGACGCTGAATTTAAAATAAAATACATGCAATTGCTTATGTGCTTTGGATTTTTCAAAATCGGCGGTGATGAAAGTGTTGCCGTTAGTAGTAGCCTAGCAGGGTCTATAAAAGCTTGCTTCTACCGATGCTTCACATCCCGCTTTTCAAACGTCGAAGGTTTGGTAACAGTATTGTCATCACTCAGTAGTTTCATTACATCTATGATGACCAAAGAAAAGGTGCGATCAAAGCTCGAGAAACAGTTTGACAAAGAGAACATGGACTGTTGGGAAATGTTAACAAAAGTATGCGGCAAAATAGAAAAGAACCAATCTAAGTCGAAGGTCGAGAATGTCTTTCTTATTTTATTATACCAGCTCGGTTTGTTCCTCTTCTCGGAACCGACACATGTGAAAATTGCCTCCAGCTCTATTATAGAACTCAAAAGTTGCTACGAGCATTATATGAAAGACAGAAAAGCAAAGACAAGTAAGAAAGAAAACTCAATCAAAGATGAACCTGAGTGGATAGAAGTTGTGACTGAGGTGTTGCTGTCGATTTTGTCGATTGAGTCAAGTGTTTTGCGCTCAGTAGTACAATGTGTTTTCAGACTTCTGTGGGAGTATTTGACACCATCCTCTATAGCGCAAATTGTTTCGGTTCTCGACCCAGAGAGCGAAGCTAATCCGTTAGGACAAGAAAGCGATTTGGAAGACGATGAAGGAGAATTTGATGATTCTGATGAAGAAGGAAATGAAAATTGTCAAGAAAATGAAGAAAACGGAGAACACAATGATAGCGAGGAAAGTGAAAGTGAAATGGATGATGACGATGATGATGAGAAGGATTTGAACACGCCAGACCAGTTACGAATGGCCATTCAGAAGGCTCTCGGAAATACTACAGTCGACACTGACGTCGAAAGTATAGACGCTGATATGATAACAGAAGAGGAAGGCAAGAAACTTGATGAGGCTCTTGCTGAAGCATTCAAACAGTTCCATCAAGGCAAAAATAAGAAAACCAAAAAGGAACGCAAGAATAAAAAATCACTTTCAGATTTCAGAATCAAAGTACTAGATTTAATTGACATTTATCTGGAAAAGGATCCGGCTATGGACATATGTTTAAATATGATCGCCCCATTGACTAGATGTCTCGAGTTTTGCATGCAAGATAATCAGTTTAAGGAACTGGAAAATAGAGTACGGAAAACTATTAAGGGTCTATCGAAAATAAAGAAGTTCGCATCCACTGACGACATAACACCTGACATTTTGGCCACTTATTTGAAATCCGTAATAGAAAAGGGAGAACGATCCCACTTCATGTACCAAGCTCTTGGTGACGTTTTAACATATTTTTCAGTTTTTATAATAAACTGTTCACAAAAGATTGAAGCCCAACCGACCCAGACACCTAAAAAAAACAAGATATCTACTTTAAATGACCTTCTGAAAGAGACTGTTGACAATTTTTTCCACAATCGTAGCTGCTTATTACCAATTATTTTCTTCCATAACATTCTTCAATTAGAATGGCCTGGTAAATATAAATTAGCATCGATTGTAGTGAAAAATGTATTTAATCCTAAAGTGAGACAATTTAAACGGAACGAAGGAGTACAACTCCTATCTGGCTTCTATTTATCAATGAAAAGATTTAAGCCTATTTCTGAAAGCTGTTTTGCTGAGTTAGCTAATATAGAGAAAAATTTCAAAGAATCTTTTACAGCCACACTTGAAAGCAATGAGATGGACGTTAAACCTAATTTCATTGATTCCCTCAAGAAATTACTTAATGTTATGAAAAATCTTTATACGCAATGTAATCAGGAATCCCAACTTGATTTTGAATCAATGTTCAATGCATTAACGAACTTCAAAGTAGCTGTTAAGAGTACAAACAATGTTGAAGAATCAAAACAGATCATCGAAAATGGAAATAAGCCGAGTAAAGAGCAAAACAAAAAAAAGAAAAGAAAAGCTCTAACTAATGGGGTTATAGACCCACCGGTTAAGAAATCAAAAAACAAAATAAGTGAATAA

Protein sequence:

>DPOGS207064-PA
MKNEINMDKPEKAVTASLLDAFDLFKSPTDDLKILAGLKIHSRLRENEGGKELQYTLKRLVRSLGANIAELRMGYFAALVTILTRFPEITVTQLLELIKKELHASGSSKSEVGDVALGHILACGAVFRSGLMLKCTEEEQKEVLQLFETASSKKSYLSTVATLVFIDFINNLDEEQFATIVWPNIKQNYKKAINEHNLDSLYFLMIVNEKFPKKVKLRKLIGVPELLHEDHISDICDKLMTGVDFNSLSHPIYQECGKQIANSPHLSIFWNKIDSHLVKHNRNRELVSLNILNTVLLNLKDNVEVIPDLLSDNFFKLFMDWFKGLQTASKIRNKRDNEDDSKIMVKKQKEVLMSLAKALQLPSVDSKIRVKTLDKLLFSPGEINFTEITGSTVVKSITAGLDVDGLKKMAKSLKKVFLNSSKKVIKEGVERNWYNNERVKAAELISYMVSHEAVKDDAEFKIKYMQLLMCFGFFKIGGDESVAVSSSLAGSIKACFYRCFTSRFSNVEGLVTVLSSLSSFITSMMTKEKVRSKLEKQFDKENMDCWEMLTKVCGKIEKNQSKSKVENVFLILLYQLGLFLFSEPTHVKIASSSIIELKSCYEHYMKDRKAKTSKKENSIKDEPEWIEVVTEVLLSILSIESSVLRSVVQCVFRLLWEYLTPSSIAQIVSVLDPESEANPLGQESDLEDDEGEFDDSDEEGNENCQENEENGEHNDSEESESEMDDDDDDEKDLNTPDQLRMAIQKALGNTTVDTDVESIDADMITEEEGKKLDEALAEAFKQFHQGKNKKTKKERKNKKSLSDFRIKVLDLIDIYLEKDPAMDICLNMIAPLTRCLEFCMQDNQFKELENRVRKTIKGLSKIKKFASTDDITPDILATYLKSVIEKGERSHFMYQALGDVLTYFSVFIINCSQKIEAQPTQTPKKNKISTLNDLLKETVDNFFHNRSCLLPIIFFHNILQLEWPGKYKLASIVVKNVFNPKVRQFKRNEGVQLLSGFYLSMKRFKPISESCFAELANIEKNFKESFTATLESNEMDVKPNFIDSLKKLLNVMKNLYTQCNQESQLDFESMFNALTNFKVAVKSTNNVEESKQIIENGNKPSKEQNKKKKRKALTNGVIDPPVKKSKNKISE-