Monarch geneset OGS2.0

DPOGS210759
Transcript        DPOGS210759-TA  2754 bp
Protein           DPOGS210759-PA  917 aa
Genomic position  DPSCF300013 + 1715703-1718911
RNAseq coverage   106x (Rank: top 60%)
Annotation
Source            Hit               E-value  Identity  Description
Heliconius        HMEL022183        0.0      76.30%
Bombyx            BGIBMGA006237-TA  0.0      71.45%
Drosophila        lig3-PA           0.0      43.70%
EBI UniRef50      UniRef50_D6WBL2   0.0      54.04%    DNA ligase n=1 Tax=Tribolium castaneum RepID=D6WBL2_TRICA
NCBI RefSeq       XP_967954.1       0.0      54.04%    PREDICTED: similar to rCG33581 [Tribolium castaneum]
NCBI nr blastp    gi|91076374       0.0      54.04%    PREDICTED: similar to rCG33581 [Tribolium castaneum]
NCBI nr blastx    gi|91076374       0.0      53.71%    PREDICTED: similar to rCG33581 [Tribolium castaneum]
Group
Gene Ontology     GO:0006281  2e-145   DNA repair
                  GO:0005524  2e-145   ATP binding
                  GO:0006260  2e-145   DNA replication
                  GO:0003910  2e-145   DNA ligase (ATP) activity
                  GO:0006310  2e-145   DNA recombination
                  GO:0003677  8.3e-40  DNA binding
                  GO:0008270  4.8e-23  zinc ion binding
                  GO:0005622  1.2e-09  intracellular
KEGG pathway      tca:656322  0.0
                  K10776 (LIG3) maps-> Base excision repair
InterPro domain   [199-712]  IPR000977  2e-145   DNA ligase, ATP-dependent
                  [363-557]  IPR012310  1.1e-57  DNA ligase, ATP-dependent, central
                  [568-713]  IPR012340  3.8e-40  Nucleic acid-binding, OB-fold
                  [142-315]  IPR012308  8.3e-40  DNA ligase, ATP-dependent, N-terminal
                  [567-715]  IPR016027  1.3e-38  Nucleic acid-binding, OB-fold-like
                  [3-108]    IPR001510  4.8e-23  Zinc finger, PARP-type
                  [585-696]  IPR012309  1.4e-18  DNA ligase, ATP-dependent, C-terminal
                  [813-915]  IPR001357  1.2e-09  BRCT
Orthology group   MCL15819   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210759-TA
ATGACAGAATCTACGCCGTTTTACGTTGACCGAGCTAAGGTTGGACGAGCCAACTGTAAAGGGTGCAAGGCACACTGCGAGAGCGGACAGTTACGTATAGCAAAATTGGTCGCTAGTCCATATGGCGAAAACCAACAAATGAAGTCATGGCACCATGTAAACTGTTTGATGAATGTTTTATTAAAACAACGACCTACCACTAAACGGATAGATTCTATCGATGATATTGGAAACTGGGATAATATAAGCAAAGAGGACCAAGAATTTATAATTAAGGAAGTTAATCAGATGGAAAAGACATATGCTGATAAGTATAGTGGAAAATATACAGCTAAAGTCATAAAAAATGAACATTCTCAAACAGAACTTAAGTCGCCTAATATTTCTTTGAATGATGAAATAAAACTGAAGACAGAAGATGACAAATTCTCAATATTCTTTACTTTATGTAAGGAAATATCCAAAGTTGATGCATATACTGAGAAAACTGCTATAGTTAATAATTTTTTTACCAATGGTTGTGATGGTAAAAAGTTTAACGGAGATCTTGTTTTATGGTGTAAGTTCCTTCTACCTCAAGTATCCAAGAGAGTTTATAATTTAAAAAGCAAACAGCTAATTAAATTATTTGCTAGAATATTCAATACAGATCATGATGATATGCTCACACACTTAGAAAATGGTGATATAGCCGATACATTACAAAATTTTTTTCAAAAATCAAACACATACAAACCAGCAACACAAAGTTCTTTAACATTACAAATTGTTGATGACTTCCTCCACGAACTTTCAAAATTGACAAAAGAAGAAGAACAAATATATCACTTTAAAAAAATAATAAAAAACTGTACTTTGGATGACCTTAAAATGCTTATACGCTTGATAAAAGGTGATCTGAGGATAAATGCTGGTCCAAAACATATACTTGAAGGTGTGCACCCTGATGCATATAGCGTTTTCCAAACATCCAGGGATTTGGATATGGTTTTAGAAAGAGTCTTGCCACAAAGCAGTGAAGTCAAACATAAAGATGCAATGCATAGAAATGTTCAGGCCAAACTCAGTATTATGACACCAGTACTGCCAATGTTAGCAGAAGCGTGTAAATCTGTTGAAATGGCTATGAAAAAGTGTCCCAATGGAATGTTCTCTGAAATAAAGTATGACGGAGAAAGAGTACAAGTACATAAAAAAGGAAATGAATTTAAATATTTCTCAAGAGCTCTAAAGCCTGTAATGGCTCATAAAGTCAGTCATTTCAAGGATTACTTGCCTCAGGCGTTTCCTAAAGGGGTTGATTTAATATTAGATGCCGAGGTCTTAATGGTAGACTTAAAAACAGGCAAGCCTTTACCTTTCGGTACTTTGGGCATACATAAACAGTCTGAATTCAAAGATGCTGGAGTGTGCTTATATATTTTCGATTGCCTATATTACAACGGCGAAATTTTAATAGATATGCCAATAAAAAAGAGACGCCAAATATTACATGATAATATGGTTGAAATTAAAAACCATGTGATGTTCTCAGAGCAGGAACTCATATATAAACCATCTGATTTAGCCAATATGATTGCAAAGGTATTGCAGCTGGGCTTGGAAGGTCTAGTTCTTAAAGATTTGGAATCAACATATGATCCTGGAAAGAGACATTGGTTGAAAGTAAAAAAGGATTATTTGTTTGATGGCGCTATGGCAGATACAGCTGATCTGATTGTTTTAGGAGCTTGGTTTGGTACTGGAAAGAAAGGTGGTATGATGTCAGTATTTCTAATGGGATGTCTGGACAAATTTAGAAACAAATGGGTGACCGTTACTAAAGTGCACACCGGTCATGATGATAGTACGTTAGAAAGATTACAAAAGGAATTAAGTCCCTTGATGGTAAAAATATCTCAAGATTCCAATAAAGTGCCTAATTGGTTGGACTGTAAGAAGGGAATGATTCCAGATTTTGTTGCTGCTGATCCTAAGAAACAACCTGTTTGGGAAATTAC
TGGAACCGAGTTGACAAAAGCTAACTTACATACAGCCGATGGTATATCTGTTAGATTTCCTAGAGTGACACGTATAAGGGATGACAAAAATTGGGAAACTGCAACAAATTTGGAAGAACTAAAACATTTATATAAAGCTTCAAAAGAGAAAACCGATGTCAGTCTTTTAAATAAACTAGCTGCTACCGCGGATGACTATGTACCACCTGAGAAAAAACCAAAACAAAGTCCTAAGACAGCTAAAAACAAAACGAGTCCAGTATCAAATACTTTGGACAAATGTTTTGCTAAGTATGCAAAAAGGAATGAAAAGTCCCCAATTAATCAACATAATGACAAAAAGAATGCAGATTCTGATGATGATTCAGACAGAACAGACATTGAAGACACAAGTCCTATGAAAAAAGATATCAAACCTAAACTACTACCTGAGAATCCTCTACCAGATGTCTTTATAAACAAACGGCTTGGTTTCTATCCAGATTTTATAAGCATACCAGAAAGGGAAAGGTTTCATTTCGAAAGACATTGGGTAGCTTACGGTGGCATTGTAATTAAATCATTAAAAAAAATTGATGTAGATTATGTTATACATAACAATAATAAAATTGGTTTCAGTGAAATGATGAAATTAAAGAGAAAACTACCGCCAGATGTCAGACATGTTACTAAAAGTTGGTTGATAAAGTGTATAAATTGTGTTAAATTGTGTGATACTAAGAATTATACTGTCATAGTAAAACCATTCCAATGA

Protein sequence:

>DPOGS210759-PA
MTESTPFYVDRAKVGRANCKGCKAHCESGQLRIAKLVASPYGENQQMKSWHHVNCLMNVLLKQRPTTKRIDSIDDIGNWDNISKEDQEFIIKEVNQMEKTYADKYSGKYTAKVIKNEHSQTELKSPNISLNDEIKLKTEDDKFSIFFTLCKEISKVDAYTEKTAIVNNFFTNGCDGKKFNGDLVLWCKFLLPQVSKRVYNLKSKQLIKLFARIFNTDHDDMLTHLENGDIADTLQNFFQKSNTYKPATQSSLTLQIVDDFLHELSKLTKEEEQIYHFKKIIKNCTLDDLKMLIRLIKGDLRINAGPKHILEGVHPDAYSVFQTSRDLDMVLERVLPQSSEVKHKDAMHRNVQAKLSIMTPVLPMLAEACKSVEMAMKKCPNGMFSEIKYDGERVQVHKKGNEFKYFSRALKPVMAHKVSHFKDYLPQAFPKGVDLILDAEVLMVDLKTGKPLPFGTLGIHKQSEFKDAGVCLYIFDCLYYNGEILIDMPIKKRRQILHDNMVEIKNHVMFSEQELIYKPSDLANMIAKVLQLGLEGLVLKDLESTYDPGKRHWLKVKKDYLFDGAMADTADLIVLGAWFGTGKKGGMMSVFLMGCLDKFRNKWVTVTKVHTGHDDSTLERLQKELSPLMVKISQDSNKVPNWLDCKKGMIPDFVAADPKKQPVWEITGTELTKANLHTADGISVRFPRVTRIRDDKNWETATNLEELKHLYKASKEKTDVSLLNKLAATADDYVPPEKKPKQSPKTAKNKTSPVSNTLDKCFAKYAKRNEKSPINQHNDKKNADSDDDSDRTDIEDTSPMKKDIKPKLLPENPLPDVFINKRLGFYPDFISIPERERFHFERHWVAYGGIVIKSLKKIDVDYVIHNNNKIGFSEMMKLKRKLPPDVRHVTKSWLIKCINCVKLCDTKNYTVIVKPFQ-
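
Consistency check (illustrative sketch):

The transcript and protein lengths quoted in this record (2754 bp and 917 aa) fit a coding sequence of 917 codons plus one stop codon, and the trailing "-" in the protein sequence appears to mark that stop. The short Python sketch below shows one way to check this. It assumes the standard genetic code, which the record does not state explicitly, and the names `expected_cds_length` and `translate` are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the record does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_length + 1)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the "-" that ends
    the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Lengths quoted in the record: 917 aa protein, 2754 bp transcript.
print(expected_cds_length(917))  # 2754

# First 30 nt and last 15 nt of DPOGS210759-TA, copied from the record.
print(translate("ATGACAGAATCTACGCCGTTTTACGTTGAC"))  # MTESTPFYVD
print(translate("AAACCATTCCAATGA"))  # KPFQ-
```

Passing the full DPOGS210759-TA sequence (with the line break removed) to `translate` should, under the same assumption, reproduce the DPOGS210759-PA sequence listed above.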