New model in OGS2.0 | DPOGS213526  |
---|---|
Genomic Position | scaffold772:- 1696-3683 |
CDS Length | 1494 |
Paired RNAseq reads   | 362 |
Single RNAseq reads   | 875 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011808 (0.0) |
Best Drosophila hit   | lariat debranching enzyme, isoform B (7e-115) |
Best Human hit | lariat debranching enzyme (6e-108) |
Best NR hit (blastp)   | lariat debranching enzyme [Culex quinquefasciatus] (9e-139) |
Best NR hit (blastx)   | lariat debranching enzyme [Culex quinquefasciatus] (2e-138) |
GeneOntology terms    | GO:0000375 RNA splicing, via transesterification reactions; GO:0005634 nucleus; GO:0008419 RNA lariat debranching enzyme activity |
InterPro families    | IPR007708 Lariat debranching enzyme, C-terminal; IPR004843 Metallo-dependent phosphatase |
Orthology group | MCL13573 |
Nucleotide sequence:
ATGAAAATTGCTATAGAGGGATGTGCACATGGTGAATTGGATAAGATATACGAGTGTGTA
GAAACTTTACAGAGAAGAGAAGGAATAAACGTGGATTTGTTAATATGTTGTGGAGATTTT
CAATCAGTCCGCAATAATGATGACCTTAGGGCTATGGCTGTACCAGAGAAATATCAAAAC
ATATGTACATTCTACAAATATTACAGTGGTGAAAAAATCGCTCCAGTATTAACTTTATTT
ATTGGAGGCAATCATGAAGCTTCAAACTATTTACAAGAGCTGCCATATGGTGGCTGGGTT
GCACCAAACATATACTTCTTGGGCAGAGCCGGTGTTGTACAGTTTGGCAATTTACGAATT
GGAGGACTATCAGGAATATTTAAAGGCCATGATTATTTACAAGGTCTCTGGGAATGTCCT
CCTTACACCCCTGGTTCACTGAGATCAGTTTATCATATAAGATCTCTGGATGTGTTTCGG
TTAAGTCAAATGAAAGAAAACATCCACATCATGTTATCACATGATTGGCCGAGGGGTATC
ACTAGTTATGGGGATAAAGAGAATTTACTAAGAAGGAAACCGTTCTTACGAGATGATATT
GAGTCAAACCAACTAGGTAGTCCCCCAGCGGAGAAGTTGTTACACACATTGAAGCCTCAG
TACTGGTTTGCTGCACATTTGCATTGCCAATTTGCTGCCGTTATTAATCATGACAATAAT
CGGGAAACAAAATTTCTTGCTCTAGATAAATGTTTGCCACGAAGAAGGCATTTGCAAATA
TTAGATTTAGCAACAGAGTATGACGGTGACAAGACTTTAAAGTATGATCCTGAATGGTTG
GCAATTTTGAGAAATACCAATCATCTTTTATCCGTCAAGAACGTAGATTGTCATCTACCT
GGCCCCGGAGGTGATGAACGGTATGATTTCACACCAAGTGAAGAAGAGAAAAATGCAATA
TTAAGTCTATTAGATACATTAATAATAACCAATGATTCATTCGTCAAAACTGCACCGGTT
TATAGGCCTGGTGCACCAAAATGTCAACCCACGGAACCTGTGCTAAACCCCCAAACCGCT
TATTTATGTGAAAAGTTAGGTATTGATGATCCCATCCAGGTGATAATCGCTCGTTCAGGC
AGAACTATAAGGCATGTACAAATTGAAAATAATCAGAATGAAGAGAAAGATGACATTATT
GAACAGACACCATTCAAATGTTCAAAGCTTTCTCTCCCGGCTCCAATAACACCCAGTGGG
AATGACGAGGACGCTTCAAGAGAAACATTAGCTTGTACACCAGAAAATAGTTTTTTATCT
ATCAGTAATACATCAGATTGTATAACACCTCCGAGTGCTACAAAAAAGGTTTTCAAGAGA
CGTAATCTAGCTATATACACTCCTGAGGAAGAGCCGGAGAGTGATTCAAGTAGTTCGTTC
ATGAGTACACAGAGTCCAAGATCGAGTAAAATATTCTGTAAAAATGACTTATAA
Protein sequence:
MKIAIEGCAHGELDKIYECVETLQRREGINVDLLICCGDFQSVRNNDDLRAMAVPEKYQN
ICTFYKYYSGEKIAPVLTLFIGGNHEASNYLQELPYGGWVAPNIYFLGRAGVVQFGNLRI
GGLSGIFKGHDYLQGLWECPPYTPGSLRSVYHIRSLDVFRLSQMKENIHIMLSHDWPRGI
TSYGDKENLLRRKPFLRDDIESNQLGSPPAEKLLHTLKPQYWFAAHLHCQFAAVINHDNN
RETKFLALDKCLPRRRHLQILDLATEYDGDKTLKYDPEWLAILRNTNHLLSVKNVDCHLP
GPGGDERYDFTPSEEEKNAILSLLDTLIITNDSFVKTAPVYRPGAPKCQPTEPVLNPQTA
YLCEKLGIDDPIQVIIARSGRTIRHVQIENNQNEEKDDIIEQTPFKCSKLSLPAPITPSG
NDEDASRETLACTPENSFLSISNTSDCITPPSATKKVFKRRNLAIYTPEEEPESDSSSSF
MSTQSPRSSKIFCKNDL
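
Consistency check (illustrative sketch): the reported CDS length of 1494 nt corresponds to 498 codons, that is, 497 residues plus a stop codon, which matches the protein sequence above. The Python below is a minimal way to reproduce that check, assuming the standard genetic code (NCBI translation table 1). The names `translate`, `lengths_agree`, `CODON_TABLE`, and `LAST_CDS_LINE` are hypothetical helpers introduced for this example and are not part of the record.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_agree(cds_length: int, protein_length: int) -> bool:
    """True if the CDS encodes protein_length residues plus one stop codon."""
    return cds_length % 3 == 0 and cds_length // 3 - 1 == protein_length


# Final line of the nucleotide sequence above. It starts in frame because
# the 24 preceding lines hold 60 nt each (1440 nt, a multiple of 3).
LAST_CDS_LINE = "ATGAGTACACAGAGTCCAAGATCGAGTAAAATATTCTGTAAAAATGACTTATAA"

print(translate(LAST_CDS_LINE))   # MSTQSPRSSKIFCKNDL*
print(lengths_agree(1494, 497))   # True
```

The translated fragment matches the last line of the protein sequence, and the trailing `*` is the TAA stop codon that the protein listing omits.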