DPOGS210606 | ||
---|---|---|
Transcript | DPOGS210606-TA | 624 bp |
Protein | DPOGS210606-PA | 207 aa |
Genomic position | DPSCF300168 + 9916-10618 | |
RNAseq coverage | 158x (Rank: top 52%) |
Annotation | ||||
---|---|---|---|---|
Heliconius | HMEL005904 | 3e-75 | 69.57% |   |
Bombyx | BGIBMGA014411-TA | 7e-19 | 61.86% |   |
Drosophila | Xpac-PA | 8e-53 | 46.54% |   |
EBI UniRef50 | UniRef50_P28518 | 1e-50 | 46.54% | DNA repair protein complementing XP-A cells homolog n=14 Tax=Diptera RepID=XPA_DROME |
NCBI RefSeq | XP_001848907.1 | 1e-55 | 50.46% | DNA-repair protein complementing XP-A cells [Culex quinquefasciatus] |
NCBI nr blastp | gi|383856492 | 5e-58 | 51.90% | PREDICTED: DNA repair protein complementing XP-A cells homolog [Megachile rotundata] |
NCBI nr blastx | gi|158285440 | 1e-57 | 54.21% | AGAP007566-PA [Anopheles gambiae str. PEST] |
Group | ||||
---|---|---|---|---|
Gene Ontology | GO:0005634 | 9.4e-76 | nucleus | |
GO:0003684 | 9.4e-76 | damaged DNA binding | ||
GO:0006289 | 9.4e-76 | nucleotide-excision repair | ||
GO:0000166 | 8.8e-27 | nucleotide binding | ||
KEGG pathway | aag:AaeL_AAEL011057 | 4e-54 |   | |
  | K10847 (XPA) | maps-> | Nucleotide excision repair | |
InterPro domain | [4-207] IPR000465 | 9.4e-76 | XPA | |
[68-142] IPR009061 | 8.8e-27 | DNA binding domain, putative | ||
[68-120] IPR022656 | 5.4e-20 | XPA C- terminal | ||
[35-67] IPR022652 | 3.1e-12 | Zinc finger, XPA-type, conserved site | ||
Orthology group | MCL11858 |   | Single-copy universal gene |
Genotypes for resequenced monarchs and outgroup Danaus species |
---|
>DPOGS210606-TA
ATGCGTCCGGTGGACTCCGGCGGCGGCTTCCTGCTGGAGGCAGAGGAGGACGTGGCGACCCCCGCGCCCCGCGCCCCGCCTGCGCCCATCGTGCACCGCCCCGACCAGCCGCGCTGCCTTCACTGCGGCTCGCCCTTTCCGCAGTCCTATCTGTTGGACACCTTCGATTACAACGCCTGCGACGCCTGCAGGGACGACGAGGACAAACATGAGCTGATCACCCGGACGGAGGCCAAGAGCGAGTTCCTGCTGAAGGACTGCGACCTGGACGCTCGGCCGCCGCCGCTGAGGTGTGTAAGGCGCCGAAACCCGCACCGCGCGCGCTTCGCCGAGATGAGACTGTACCTGCGCGTGCAGGTGGAGCAGCGCGCCCTCGAGGTGTGGGGCTCCGAGGAGCAGCTACGGCGGGAACGGGAGGAGCGGGACAGGCGCCGAGAGCGAGCCGCCGACACAGCCGCCCGCCGCCGTCTCCGGGCGCTGCGGATGGACGTGCGCTCCAGCCTGTTCGACCGGACGCGAGCGGCACACGAGCACGTGTACGGCCCGGAGACCTACGACCCAGACGAGGACGTGTACCGCCGCCGATGCGAATGTGGACACGTGCAGAGTTACGAGAAGATGTAG
>DPOGS210606-PA
MRPVDSGGGFLLEAEEDVATPAPRAPPAPIVHRPDQPRCLHCGSPFPQSYLLDTFDYNACDACRDDEDKHELITRTEAKSEFLLKDCDLDARPPPLRCVRRRNPHRARFAEMRLYLRVQVEQRALEVWGSEEQLRREREERDRRRERAADTAARRRLRALRMDVRSSLFDRTRAAHEHVYGPETYDPDEDVYRRRCECGHVQSYEKM-