New model in OGS2.0 | DPOGS214973  |
---|---|
Genomic Position | scaffold1043:- 34212-38628 |
See gene structure | |
CDS Length | 1746 |
Paired RNAseq reads   | 424 |
Single RNAseq reads   | 1010 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012145 (0.0) |
Best Drosophila hit   | glaikit (3e-127) |
Best Human hit | tyrosyl-DNA phosphodiesterase 1 (1e-75) |
Best NR hit (blastp)   | GG24923 [Drosophila erecta] (9e-137) |
Best NR hit (blastx)   | GE18215 [Drosophila yakuba] (2e-137) |
GeneOntology terms    | GO:0005634 nucleus GO:0006281 DNA repair GO:0008081 phosphoric diester hydrolase activity GO:0045197 establishment or maintenance of epithelial cell apical/basal polarity GO:0007417 central nervous system development |
InterPro families   | IPR010347 Tyrosyl-DNA phosphodiesterase |
Orthology group | MCL12165 |
Nucleotide sequence:
ATGAATCCGGCACACTTCAGAGAGTTCAGTCATCCGCATTTGGAGAGTATCCTGGATAAT
TACAGTGGCTCGGGTGATTACGACATACCCGAGAGGTTCAGCTTACAACGAAACATGATC
AGGGAACAGCTCGACATGATGATACAAAACAAGCTCTATGAAGGACAACGGCGTGGAGAT
CTAGCGAGGAAAGACAGTGAAGATAGTAAGAAAGACGGAGGTGAGCCTAAAACAGACAAG
GCCGGGACGAGTAGGAGTAGTGATGGAAAGAAAAAGGCACAAACGACCGATGAAGAACAG
ATGTCAAGTAGAACACACGAACAGACAGACACGCGATCAAGCAAGACATGTGGTGACAAA
AAAATACTCGACTCCACGTACCGGCCGATGGTGCCGCCGACCAGGGACCCTCGCTCGTTC
CTGGACGTGGTGGTGAGTCCCGGTGGTATGTTGTCCAAACACGCGGCCGCCGCGCCCTAC
CACGTGTTTTACACCACCATCAAGGATAGCAAGAAGACTCACAACCAGAAGTACTCCATC
ACACTGCTGGAGATCCTCGACAGTAGTTTGGGCGAGCTGAAGTGCTCCCTCCAGATAAAC
TTCATGGTGGACGCGGGCTGGCTCCTGGCGCACTATTACTTCGCGGGTTACAGCGCAAAG
AAGCTAACGATCCTGTACGGGGAGGAGAGCGCGGAGCTGAGGAACATCAGTGCCAAGAAG
CCCAACGTGGAGGCGCACCAGGTCAAGATGGCGACGCCCTTCGGCAAACATCACACGAAG
ATGATGTTGCTGTGCTACGAGGACGGCTCCCTGAGGGTGGTGGTGTCCACCGCCAACCTG
TACATGGACGACTGGGAGAACAGGACGCAGGGCCTCTGGCTGAGTCCGTCCTGCCCGCAG
CTGCCGGCGGAGAGTCCGAGTCACTCGGGCGAGAGTCCCACGGGCTTCAAGCGGAGTCTC
CTGGACTACCTGCATCACTACCGCCTGCCGCAGCTGGCGGTCTACGTGCACCGGGTCCAG
CGCTGCGACTTCAGTCACATCAACGTGTTCCTCGTCTGCTCGGTCCCCGGCACTCATTAC
TCCGCGTCGTGGGGTTTCCTGCGTGTGGGTGCTCTGCTGCGTGCTCACTGCGCCGTCCCG
CCCCAGGAGACTCGCTCATGGCCGCTGATCGCTCAGGCCAGTAGCCTCGGCAGCTACGGG
AAGGACCCCGGGTCGTGGCTGACGGGCGACTTCCTGCATCACTTCACCAAGATAAAGGAC
CAGCCGCAGACCCTCACCCCGCCGCCCGACCTCAAACTCATCTACCCGTCGCTGGAGAAC
GTGAAGTCCTCCCACGACGGTCTGCTCGGCGGCGGCTGCCTGCCTTACTCCGCGGCCGTC
CACGTCAAGCAGCCCTGGCTCAAGGACTTCTTATACCAGTGGCGGGCGCTGCACTCGGAG
CGGGACCGCGCGATGCCTCACATCAAGAGCTACACGCGCGTGTCCCCCGACAACTCGCGC
GCCGCCTTCTATCTGCTGACTTCCGGCAACGTGAGCAAGGCGGCCTGGGGCGTCCGCAAC
AAGGACGGCGGACTCCGCCTCATGAGCTACGAGGCCGGAGTCCTATTCCTGCCGCGGTTT
GTGATAAACTCGGACTTCTTCCCCCTGTGCCCCTCCTCCGCCCTCCGCCTGCCGGTGCCG
TACGACCTCCCCCCCCAGAGGTACTCCCCGGACATGTCACCCTGGGTCTCCGACTACTTG
TACTGA
Protein sequence:
MNPAHFREFSHPHLESILDNYSGSGDYDIPERFSLQRNMIREQLDMMIQNKLYEGQRRGD
LARKDSEDSKKDGGEPKTDKAGTSRSSDGKKKAQTTDEEQMSSRTHEQTDTRSSKTCGDK
KILDSTYRPMVPPTRDPRSFLDVVVSPGGMLSKHAAAAPYHVFYTTIKDSKKTHNQKYSI
TLLEILDSSLGELKCSLQINFMVDAGWLLAHYYFAGYSAKKLTILYGEESAELRNISAKK
PNVEAHQVKMATPFGKHHTKMMLLCYEDGSLRVVVSTANLYMDDWENRTQGLWLSPSCPQ
LPAESPSHSGESPTGFKRSLLDYLHHYRLPQLAVYVHRVQRCDFSHINVFLVCSVPGTHY
SASWGFLRVGALLRAHCAVPPQETRSWPLIAQASSLGSYGKDPGSWLTGDFLHHFTKIKD
QPQTLTPPPDLKLIYPSLENVKSSHDGLLGGGCLPYSAAVHVKQPWLKDFLYQWRALHSE
RDRAMPHIKSYTRVSPDNSRAAFYLLTSGNVSKAAWGVRNKDGGLRLMSYEAGVLFLPRF
VINSDFFPLCPSSALRLPVPYDLPPQRYSPDMSPWVSDYLY