DPGLEAN14718 in OGS1.0

New model in OGS2.0DPOGS213526 
Genomic Positionscaffold772:- 1696-3683
See gene structure
CDS Length1494
Paired RNAseq reads  362
Single RNAseq reads  875
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011808 (0.0)
Best Drosophila hit  lariat debranching enzyme, isoform B (7e-115)
Best Human hitlariat debranching enzyme (6e-108)
Best NR hit (blastp)  lariat debranching enzyme [Culex quinquefasciatus] (9e-139)
Best NR hit (blastx)  lariat debranching enzyme [Culex quinquefasciatus] (2e-138)
GeneOntology terms

  
GO:0000375 RNA splicing, via transesterification reactions
GO:0005634 nucleus
GO:0008419 RNA lariat debranching enzyme activity
InterPro families
  
IPR007708 Lariat debranching enzyme, C-terminal
IPR004843 Metallo-dependent phosphatase
Orthology groupMCL13573

Nucleotide sequence:

ATGAAAATTGCTATAGAGGGATGTGCACATGGTGAATTGGATAAGATATACGAGTGTGTA
GAAACTTTACAGAGAAGAGAAGGAATAAACGTGGATTTGTTAATATGTTGTGGAGATTTT
CAATCAGTCCGCAATAATGATGACCTTAGGGCTATGGCTGTACCAGAGAAATATCAAAAC
ATATGTACATTCTACAAATATTACAGTGGTGAAAAAATCGCTCCAGTATTAACTTTATTT
ATTGGAGGCAATCATGAAGCTTCAAACTATTTACAAGAGCTGCCATATGGTGGCTGGGTT
GCACCAAACATATACTTCTTGGGCAGAGCCGGTGTTGTACAGTTTGGCAATTTACGAATT
GGAGGACTATCAGGAATATTTAAAGGCCATGATTATTTACAAGGTCTCTGGGAATGTCCT
CCTTACACCCCTGGTTCACTGAGATCAGTTTATCATATAAGATCTCTGGATGTGTTTCGG
TTAAGTCAAATGAAAGAAAACATCCACATCATGTTATCACATGATTGGCCGAGGGGTATC
ACTAGTTATGGGGATAAAGAGAATTTACTAAGAAGGAAACCGTTCTTACGAGATGATATT
GAGTCAAACCAACTAGGTAGTCCCCCAGCGGAGAAGTTGTTACACACATTGAAGCCTCAG
TACTGGTTTGCTGCACATTTGCATTGCCAATTTGCTGCCGTTATTAATCATGACAATAAT
CGGGAAACAAAATTTCTTGCTCTAGATAAATGTTTGCCACGAAGAAGGCATTTGCAAATA
TTAGATTTAGCAACAGAGTATGACGGTGACAAGACTTTAAAGTATGATCCTGAATGGTTG
GCAATTTTGAGAAATACCAATCATCTTTTATCCGTCAAGAACGTAGATTGTCATCTACCT
GGCCCCGGAGGTGATGAACGGTATGATTTCACACCAAGTGAAGAAGAGAAAAATGCAATA
TTAAGTCTATTAGATACATTAATAATAACCAATGATTCATTCGTCAAAACTGCACCGGTT
TATAGGCCTGGTGCACCAAAATGTCAACCCACGGAACCTGTGCTAAACCCCCAAACCGCT
TATTTATGTGAAAAGTTAGGTATTGATGATCCCATCCAGGTGATAATCGCTCGTTCAGGC
AGAACTATAAGGCATGTACAAATTGAAAATAATCAGAATGAAGAGAAAGATGACATTATT
GAACAGACACCATTCAAATGTTCAAAGCTTTCTCTCCCGGCTCCAATAACACCCAGTGGG
AATGACGAGGACGCTTCAAGAGAAACATTAGCTTGTACACCAGAAAATAGTTTTTTATCT
ATCAGTAATACATCAGATTGTATAACACCTCCGAGTGCTACAAAAAAGGTTTTCAAGAGA
CGTAATCTAGCTATATACACTCCTGAGGAAGAGCCGGAGAGTGATTCAAGTAGTTCGTTC
ATGAGTACACAGAGTCCAAGATCGAGTAAAATATTCTGTAAAAATGACTTATAA

Protein sequence:

MKIAIEGCAHGELDKIYECVETLQRREGINVDLLICCGDFQSVRNNDDLRAMAVPEKYQN
ICTFYKYYSGEKIAPVLTLFIGGNHEASNYLQELPYGGWVAPNIYFLGRAGVVQFGNLRI
GGLSGIFKGHDYLQGLWECPPYTPGSLRSVYHIRSLDVFRLSQMKENIHIMLSHDWPRGI
TSYGDKENLLRRKPFLRDDIESNQLGSPPAEKLLHTLKPQYWFAAHLHCQFAAVINHDNN
RETKFLALDKCLPRRRHLQILDLATEYDGDKTLKYDPEWLAILRNTNHLLSVKNVDCHLP
GPGGDERYDFTPSEEEKNAILSLLDTLIITNDSFVKTAPVYRPGAPKCQPTEPVLNPQTA
YLCEKLGIDDPIQVIIARSGRTIRHVQIENNQNEEKDDIIEQTPFKCSKLSLPAPITPSG
NDEDASRETLACTPENSFLSISNTSDCITPPSATKKVFKRRNLAIYTPEEEPESDSSSSF
MSTQSPRSSKIFCKNDL