New model in OGS2.0 | DPOGS207474  |
---|---|
Genomic Position | scaffold264:+ 1443-5897 |
See gene structure | |
CDS Length | 2370 |
Paired RNAseq reads   | 1008 |
Single RNAseq reads   | 2490 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000946 (4e-176) |
Best Drosophila hit   | mutagen-sensitive 210, isoform C (9e-124) |
Best Human hit | xeroderma pigmentosum, complementation group C isoform 2 (9e-92) |
Best NR hit (blastp)   | nucleotide excision repair protein [Bombyx mori] (0.0) |
Best NR hit (blastx)   | nucleotide excision repair protein [Bombyx mori] (0.0) |
GeneOntology terms    | GO:0006289 nucleotide-excision repair GO:0003684 damaged DNA binding GO:0005634 nucleus |
InterPro families    | IPR018328 DNA repair protein Rad4, DNA-binding domain 3 IPR018325 DNA repair protein Rad4, transglutaminase-like domain IPR018326 DNA repair protein Rad4, DNA-binding domain 1 IPR018327 DNA repair protein Rad4, DNA-binding domain 2 IPR004583 DNA repair protein Rad4 |
Orthology group | MCL14141 |
Nucleotide sequence:
ATGTTTAATTTCGTTACATTGCCTATAAGGCCACCCGTTTCGGAATTGTGTTCACTCTCA
ACCAAAGTGAAGGGCACAGAAGATGCAAAGAAAAACGACCAGAAAACAAAATCACCAAGA
AAATCAACTAAGAGTAAAAGCAAGAGAGATGTTATTCCACAGTTGGATGGAAATTATGAC
GTAATTGAAAGTGATGATGGAAATATTATGCAAGTTGATGGTGGTGATGATACAACCACA
GCTAGAACGAGAAGACAACGGCTTTCATTAAGAAAAGTGAAACAAGCAAATGATGTTAAG
AAACCAGAGGAGGTTATCAGCCCAACTAAAATAACAAAAAAGAATCTCAGTTTGAAAATA
GAAAATAAGACTGTGGGTGAAACAAAGCGAAAACCAACGACAAGGACCAAAAGAAACTTG
AGATTGCAATCAAAAAATACAAAAACAACAATACATGAAACTAAACAGTCAGAAAGTCTT
TCTAATAAAAAAGATGATATAAAAGCCACTAAAAGTAAACGAAAAATATTAAGTTTAAAT
CTATCAGTTACCAAAAACGACCAAGATAACAAAACCAAAAGTACATCAAAAACGATAGTT
AATAAATGCTCTGCAAACAGAATTACTAGAGCAAACATTACTTCACTCAACGAATCAAAC
GCAATACTTACTAAGACTTTATCAAAACAATCGTCCTTAGATAAAGTTCCTAAGATCATC
CTTACAGACATAAACGATCAAACTGTATCGAGTAAATTCTTTGAGAAGTCGCCCACCAAA
AGAACGTCAAGAAAACGATCACAAACAACGGAACCAAAGAAATCGCCGAATGAAATGTCA
AATGCACGAACGAGAAGTGCACACGCAACAGAAAGCAAGTATTTTGCTCCTGAAACCGAT
AAAAGTCCCGCCAAAAGATGTAGAACAACTAGAAAAATTGATTCTGATGATTCAAAAAGA
GTAAGTCATAGAGATCTCGCGAAAAAGAATGTCCAAGATCTCCAATCTCCAAAGATCTCC
AAACCCAAAAATGATGTCACGAAAGATCTCGTTCACATTATCAAGGGAAGGGTAAAGGAG
GCCAAAACGGATGCAAAAAAACGTATTGTAAAAGGAAAAGAAAAACATGAATCTGATTCT
GATAGCGATCATCTGGCCGTTGAATCTCCAGCTCCCCGTAAATCTGAGAGCGACGAAGAC
TTTAAAGTGGAAAAAGTTACACCTAAACAAAAGAAGCCAGTTAAGAAAATAGACCGATGT
GTTATATCAGCAGATGATGAGATGCCTTTGAATAAAATTAATGTGTGGTGCGAGATATAT
GTTGAAGAATTAGAAGAGTGGGTTCCCGTTGACGTTGTTAGAGGCATAGTTCATTCTGCC
AATGAATTATATAGTCGTTCGACACACCCTGTATCATACATTGTTGGTTGGGACAACAAT
AATTACTTAAAAGATCTGACAAGACGCTACGTGCCATATTGGAACACAGTTACACGTAAA
CTGAGAGTTGATCCTGGATGGTGGGAAGAAGCGATAAAGCCGTGGTTGGGACCAAAAACC
GCCAGGGACAGGGAAGAGGATGAAAGATTGCACAGAATGCAACTAGAAGCGCCATTGCCC
AAAGTTATATCCGAATACAAAAACCACCCTCTATATGTGTTGAAACGTCATCTCCTCAAG
TTCGAAGCCATATATCCGCCTGATGCTGAAACCCTTGGCTTCGTTCGCGGGGAGCCCGTT
TATCCGAGGGATTGTGTTTACATTTGCAAGTCGAGGGATGTTTGGATCAAGGATGCCAAA
GTAGTTAAACTCGGAGAACAGCCATACAAGATAGTTAGAGCTCGTCCCAAATATATAAGA
GCCACAAACACCTTTATAACTGATCGACCTCTGGAAATCTTTGGGCCATGGCAGACACAG
GATTATGAACCTCCGACTGCAGAAAATGGAATTGTTCCACGGAATCCTTATGGAAATGTT
GAATTGTTCAAAAAATGCATGCTACCGAAAGGCACTGTCCATATCAATTTACCAGGTTTA
CAACGAGTTGCTAAGAAATTGAATATTGATTGTGCTCCAGCGTTAACAGGATTTGACTGC
AATGGTGGCTATGTCCACCCTGTATATGAAGGCTTTGTAGTCTGTGAAGAGTTTGAAAAG
GTTCTCACGGAAGCTTGGCTTCAGGATCAAGAAGAGTTGGAACGTAAAGAACAGGAAAAA
GTAGAAACCCGAGTGTACGGAAACTGGAAGCGGCTTATAAGAGGACTTATCATAAAAGAA
CGACTAAAAGCCAAATATGGATTTGCAGAGCCCAGCACATCTCAGGATAAAAAGAAGAAA
GGCCCAAAACTTGTTGTGAAGAAAAAATAA
Protein sequence:
MFNFVTLPIRPPVSELCSLSTKVKGTEDAKKNDQKTKSPRKSTKSKSKRDVIPQLDGNYD
VIESDDGNIMQVDGGDDTTTARTRRQRLSLRKVKQANDVKKPEEVISPTKITKKNLSLKI
ENKTVGETKRKPTTRTKRNLRLQSKNTKTTIHETKQSESLSNKKDDIKATKSKRKILSLN
LSVTKNDQDNKTKSTSKTIVNKCSANRITRANITSLNESNAILTKTLSKQSSLDKVPKII
LTDINDQTVSSKFFEKSPTKRTSRKRSQTTEPKKSPNEMSNARTRSAHATESKYFAPETD
KSPAKRCRTTRKIDSDDSKRVSHRDLAKKNVQDLQSPKISKPKNDVTKDLVHIIKGRVKE
AKTDAKKRIVKGKEKHESDSDSDHLAVESPAPRKSESDEDFKVEKVTPKQKKPVKKIDRC
VISADDEMPLNKINVWCEIYVEELEEWVPVDVVRGIVHSANELYSRSTHPVSYIVGWDNN
NYLKDLTRRYVPYWNTVTRKLRVDPGWWEEAIKPWLGPKTARDREEDERLHRMQLEAPLP
KVISEYKNHPLYVLKRHLLKFEAIYPPDAETLGFVRGEPVYPRDCVYICKSRDVWIKDAK
VVKLGEQPYKIVRARPKYIRATNTFITDRPLEIFGPWQTQDYEPPTAENGIVPRNPYGNV
ELFKKCMLPKGTVHINLPGLQRVAKKLNIDCAPALTGFDCNGGYVHPVYEGFVVCEEFEK
VLTEAWLQDQEELERKEQEKVETRVYGNWKRLIRGLIIKERLKAKYGFAEPSTSQDKKKK
GPKLVVKKK