DPGLEAN17628 in OGS1.0

New model in OGS2.0DPOGS214973 
Genomic Positionscaffold1043:- 34212-38628
See gene structure
CDS Length1746
Paired RNAseq reads  424
Single RNAseq reads  1010
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012145 (0.0)
Best Drosophila hit  glaikit (3e-127)
Best Human hittyrosyl-DNA phosphodiesterase 1 (1e-75)
Best NR hit (blastp)  GG24923 [Drosophila erecta] (9e-137)
Best NR hit (blastx)  GE18215 [Drosophila yakuba] (2e-137)
GeneOntology terms



  
GO:0005634 nucleus
GO:0006281 DNA repair
GO:0008081 phosphoric diester hydrolase activity
GO:0045197 establishment or maintenance of epithelial cell apical/basal polarity
GO:0007417 central nervous system development
InterPro families  IPR010347 Tyrosyl-DNA phosphodiesterase
Orthology groupMCL12165

Nucleotide sequence:

ATGAATCCGGCACACTTCAGAGAGTTCAGTCATCCGCATTTGGAGAGTATCCTGGATAAT
TACAGTGGCTCGGGTGATTACGACATACCCGAGAGGTTCAGCTTACAACGAAACATGATC
AGGGAACAGCTCGACATGATGATACAAAACAAGCTCTATGAAGGACAACGGCGTGGAGAT
CTAGCGAGGAAAGACAGTGAAGATAGTAAGAAAGACGGAGGTGAGCCTAAAACAGACAAG
GCCGGGACGAGTAGGAGTAGTGATGGAAAGAAAAAGGCACAAACGACCGATGAAGAACAG
ATGTCAAGTAGAACACACGAACAGACAGACACGCGATCAAGCAAGACATGTGGTGACAAA
AAAATACTCGACTCCACGTACCGGCCGATGGTGCCGCCGACCAGGGACCCTCGCTCGTTC
CTGGACGTGGTGGTGAGTCCCGGTGGTATGTTGTCCAAACACGCGGCCGCCGCGCCCTAC
CACGTGTTTTACACCACCATCAAGGATAGCAAGAAGACTCACAACCAGAAGTACTCCATC
ACACTGCTGGAGATCCTCGACAGTAGTTTGGGCGAGCTGAAGTGCTCCCTCCAGATAAAC
TTCATGGTGGACGCGGGCTGGCTCCTGGCGCACTATTACTTCGCGGGTTACAGCGCAAAG
AAGCTAACGATCCTGTACGGGGAGGAGAGCGCGGAGCTGAGGAACATCAGTGCCAAGAAG
CCCAACGTGGAGGCGCACCAGGTCAAGATGGCGACGCCCTTCGGCAAACATCACACGAAG
ATGATGTTGCTGTGCTACGAGGACGGCTCCCTGAGGGTGGTGGTGTCCACCGCCAACCTG
TACATGGACGACTGGGAGAACAGGACGCAGGGCCTCTGGCTGAGTCCGTCCTGCCCGCAG
CTGCCGGCGGAGAGTCCGAGTCACTCGGGCGAGAGTCCCACGGGCTTCAAGCGGAGTCTC
CTGGACTACCTGCATCACTACCGCCTGCCGCAGCTGGCGGTCTACGTGCACCGGGTCCAG
CGCTGCGACTTCAGTCACATCAACGTGTTCCTCGTCTGCTCGGTCCCCGGCACTCATTAC
TCCGCGTCGTGGGGTTTCCTGCGTGTGGGTGCTCTGCTGCGTGCTCACTGCGCCGTCCCG
CCCCAGGAGACTCGCTCATGGCCGCTGATCGCTCAGGCCAGTAGCCTCGGCAGCTACGGG
AAGGACCCCGGGTCGTGGCTGACGGGCGACTTCCTGCATCACTTCACCAAGATAAAGGAC
CAGCCGCAGACCCTCACCCCGCCGCCCGACCTCAAACTCATCTACCCGTCGCTGGAGAAC
GTGAAGTCCTCCCACGACGGTCTGCTCGGCGGCGGCTGCCTGCCTTACTCCGCGGCCGTC
CACGTCAAGCAGCCCTGGCTCAAGGACTTCTTATACCAGTGGCGGGCGCTGCACTCGGAG
CGGGACCGCGCGATGCCTCACATCAAGAGCTACACGCGCGTGTCCCCCGACAACTCGCGC
GCCGCCTTCTATCTGCTGACTTCCGGCAACGTGAGCAAGGCGGCCTGGGGCGTCCGCAAC
AAGGACGGCGGACTCCGCCTCATGAGCTACGAGGCCGGAGTCCTATTCCTGCCGCGGTTT
GTGATAAACTCGGACTTCTTCCCCCTGTGCCCCTCCTCCGCCCTCCGCCTGCCGGTGCCG
TACGACCTCCCCCCCCAGAGGTACTCCCCGGACATGTCACCCTGGGTCTCCGACTACTTG
TACTGA

Protein sequence:

MNPAHFREFSHPHLESILDNYSGSGDYDIPERFSLQRNMIREQLDMMIQNKLYEGQRRGD
LARKDSEDSKKDGGEPKTDKAGTSRSSDGKKKAQTTDEEQMSSRTHEQTDTRSSKTCGDK
KILDSTYRPMVPPTRDPRSFLDVVVSPGGMLSKHAAAAPYHVFYTTIKDSKKTHNQKYSI
TLLEILDSSLGELKCSLQINFMVDAGWLLAHYYFAGYSAKKLTILYGEESAELRNISAKK
PNVEAHQVKMATPFGKHHTKMMLLCYEDGSLRVVVSTANLYMDDWENRTQGLWLSPSCPQ
LPAESPSHSGESPTGFKRSLLDYLHHYRLPQLAVYVHRVQRCDFSHINVFLVCSVPGTHY
SASWGFLRVGALLRAHCAVPPQETRSWPLIAQASSLGSYGKDPGSWLTGDFLHHFTKIKD
QPQTLTPPPDLKLIYPSLENVKSSHDGLLGGGCLPYSAAVHVKQPWLKDFLYQWRALHSE
RDRAMPHIKSYTRVSPDNSRAAFYLLTSGNVSKAAWGVRNKDGGLRLMSYEAGVLFLPRF
VINSDFFPLCPSSALRLPVPYDLPPQRYSPDMSPWVSDYLY