New model in OGS2.0 | DPOGS210296  |
---|---|
Genomic Position | scaffold1729:+ 1776-7173 |
See gene structure | |
CDS Length | 1275 |
Paired RNAseq reads   | 416 |
Single RNAseq reads   | 1199 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004227 (5e-148) |
Best Drosophila hit   | flap endonuclease 1 (4e-124) |
Best Human hit | flap endonuclease 1 (2e-106) |
Best NR hit (blastp)   | flap endonuclease-1 [Aedes aegypti] (1e-161) |
Best NR hit (blastx)   | flap endonuclease-1 [Aedes aegypti] (3e-137) |
GeneOntology terms    | GO:0005634 nucleus GO:0048256 flap endonuclease activity GO:0006281 DNA repair GO:0004519 endonuclease activity GO:0003677 DNA binding |
InterPro families    | IPR020045 5'-3' exonuclease, C-terminal subdomain IPR006084 DNA repair protein (XPGC)/yeast Rad IPR006085 XPG N-terminal IPR006086 XPG/RAD2 endonuclease IPR008918 Helix-hairpin-helix motif, class 2 IPR019974 XPG conserved site |
Orthology group | MCL13716 |
Nucleotide sequence:
ATGGGTATTTTAGGATTATCAAAGTTGATTGCAGATATTGCTCCAATGGCTGTAAAAGAA
ACAGAGATAAAAATTATTTCGGTTGGTAGGAAAGTTGCCATCGATGCATCTATGAGCTTG
TATCAATTCTTAATTGCTGTAAGAAGTCAAGGCGCTCAGCTGACGTCCGTTGATGGTGAA
ACAACATCACACCTAATGGGTACATTCTACAGAACGATTCGTCTCATAGAAGATGGTATC
AAGCCTGTGTATGTCTTTGATGGTAAACCGCCTGATATGAAGTCACATCAATTGAACAAG
AGGGCCGAGAGACGAGAGGAAGCTGAGAAAGAACTCCAGAAGGCTACCGAGGCTGGTGAT
ACGGCATCTATTGACAAGTTCAACCGTCGGTTGGTGAAGGTGACTCAGCAACACGGTGCC
GAAGCTCGGCAGTTGTTGAAGCTTATGGGGATACCCGTGGTGGAGGCTCCGTGTGAAGCT
GAGGCACAATGCGCTGAATTAACTTCTGAAGGTAACCTCGTAGACGGTTTGACGAATCCC
TTACTTCGAAGAGGTCCGATCCCTGCAGCGGCTCGAGCTAGACTCCCTGTAACACACACC
GAGGTGATCTCGGGCCCCCCAGTTGGAGGGGTCCCAGTCAAAGGTGGTAAGGTGTATGCT
GTAGCCACTGAGGATATGGATGCTTTGACCTTCGGAGCGAACGTGCTGTTGAGGCACCTC
ACCTTCTCCGAGGCGAGGAAGATGCCAGTACAGGAGTTCCACCTGGACCAGGTGCTGAGA
GGATTGGAATTGGAACAGACAGAGTTCATTGACCTCTGCATTCTGTTGGGTTGTGATTAC
TGCGGCTCCATCAAAGGGATCGGACCGAAACGGGCCATCGAACTCATCAAGCAACACCGC
AGTATAGAACAGGTCCTTCACAATATCGACACAAAGAAGTACAGTCCGCCGGAGAATTGG
GAATATGAAAACGCTCGGAGACTGTTCCAGCAACCAGAAGTTACCGAGGCGAAGGATGTC
GAGTTAAAATGGTCGGATCCTGACGAGGAAGGTCTGGTGAAGTTCCTCTGTGGAGACAAA
CAGTTCAACGAGGAGCGCGTCAGGAACGGGGCCAAGAAACTCATGAAGGCGCGCACCGGA
ACCACGCAGGGCAGGCTGGATGGATTCTTCAAGGTGTTATCAACAACACCAAACCCAAAA
AGGAAAGCGGAGGAAGATAAAAAGAGTGCCAACAAGAAAGTTAAAACAGCTGGAAGGGGG
CGGAAACCGAAATAA
Protein sequence:
MGILGLSKLIADIAPMAVKETEIKIISVGRKVAIDASMSLYQFLIAVRSQGAQLTSVDGE
TTSHLMGTFYRTIRLIEDGIKPVYVFDGKPPDMKSHQLNKRAERREEAEKELQKATEAGD
TASIDKFNRRLVKVTQQHGAEARQLLKLMGIPVVEAPCEAEAQCAELTSEGNLVDGLTNP
LLRRGPIPAAARARLPVTHTEVISGPPVGGVPVKGGKVYAVATEDMDALTFGANVLLRHL
TFSEARKMPVQEFHLDQVLRGLELEQTEFIDLCILLGCDYCGSIKGIGPKRAIELIKQHR
SIEQVLHNIDTKKYSPPENWEYENARRLFQQPEVTEAKDVELKWSDPDEEGLVKFLCGDK
QFNEERVRNGAKKLMKARTGTTQGRLDGFFKVLSTTPNPKRKAEEDKKSANKKVKTAGRG
RKPK