DPGLEAN07140 in OGS1.0

Genomic Positionscaffold2292:+ 7548-20886
See gene structure
CDS Length2331
Paired RNAseq reads  902
Single RNAseq reads  2440
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004963 (8e-161)
Best Drosophila hit  CG31368, isoform C (8e-140)
Best Human hitintron-binding protein aquarius (5e-132)
Best NR hit (blastp)  PREDICTED: similar to aquarius [Nasonia vitripennis] (5e-156)
Best NR hit (blastx)  PREDICTED: similar to aquarius [Nasonia vitripennis] (3e-151)
GeneOntology terms




  
GO:0003676 nucleic acid binding
GO:0008026 ATP-dependent helicase activity
GO:0005524 ATP binding
GO:0071011 precatalytic spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families  ND
Orthology groupMCL14495

Nucleotide sequence:

ATGCTGAAATTTTATGCTCGTTTTGAGATCAGCGACGAAACCGGTGATCCCATGACCGAC
CGCGACATGACCTTACAGCACTATTCCAGAATAACATCACTACAGAAAGCTGCGTTTACA
AAGTTTCCCGATCTAAGATTGTTCTCTCTAGCAAACGTAGCAAGCGTTGATACCAGGGAA
TCTCTTCAGAAACACTTTGGGAATCTCAGTGATAAGGCATTAAGGGCCATTGCCACTTAT
CTGAATTTAGTTCCCACGGAAGGCAAGGAAGATGAAGCGCCTTGGCACAGACTGGACAAA
GATTTCCTCAGGGAACTTTTGATATCAAGACACGAGCGAAGAATTTCTCAGCTGGAAGAA
TTGAATTCAATGCCCCTATATCCCACCGAGAAGGTTGTGTGGGACGAGCACGTGGTGCCG
ACCGAGGTTTACAGCGGGGAGCGTTGCCTCGCCTTACCGAAACTTAATCTCCAATTCCTG
ACACTTCACGATTACCTGTTGAGGAACTTCAACCTTTTCCGCCTAGAGAGTACATATGAG
ATCCGTCAGGACATTGAGGATGCTGTTTATCGCCTGTCACCATGGAAATCTGAAGACGGT
ACTGTGATATTCGGAGGCTGGGCACGTATGGCTCATCCTATCCAAAGCTTCGCTGTGGTT
GAGGTGGCGAAACCGAATATAGGAGAGAAGGCGCCTTCAAGGGTCCGTGCTGACGTCACA
GTGACCCTCAGCGTCAGGAACGAGATCAAGCACGAGTGGGAGAGTCTCAGGAAGCACGAT
GTATGCTTCCTTATAACCGTACGGCCTAGCGAGGGTATAGGGACGAAATACGATTACAAG
AAAAGTATGGTCGACCAGGCTGGTATAGTCTACATCCGAGGTTGTGAGGTCGAGGGGATG
TTGGACGCCGGCGGGAGGGTCATAGAGGACGGGCCAGAACCTCGACCAGAACTAGAGGGA
GATTCCAGAACATTCAGGCTGCTGCTAGACCCTAACCAGTATAGGTTGGACCTTGACGAA
GCCAGCAAAGGAAAAGAGGTAATAATAGTGAGGTCTACAATGTTCGTCCTACCCTCATGC
TCACGCGACGATAAAACGATGACGCCGAGTGGCTTGACGAAACGGTTGTTGATCTTCAAC
CAAGCACTGTGCCAGCATCTGACTTGCAAGATGCTTTATATCCATTACGAAATTTGTCTA
GTTACTAAAAATACACGTGATATAGGAAGGGAGGTGGGGGTTAGTCTTAAAGGTCACTAT
GTATCACCAAGGGGGAGGGAGGGGACGAAATACGATTACAAGAAGAGTATGGTCGACCAG
GCTGGTATAGTCTACATCCGAGGTTGTGAGGTCGAGGGGATGTTGGACGCCGGCGGGAGG
GTCATAGAGGACGGGCCAGAACCTCGACCAGAACTAGAGGGAGATTCCAGAACATTCAGG
CTGCTGCTAGACCCTAACCAGTATAGGTTGGACCTTGACGAAGCCAGCAAAGGAAAAGAG
GATGTGTACGAGACATTCAATATCGTTGTCCGACGGAAGCCTAAAGAGAACAACTTTAAG
GCTGTTCTGGAGACGATACGAGAGCTGATGAACACGGAGTGCGTGGTGCCTGAGTGGCTT
CATGACATAGTGCTGGGCTATGGCGACCCTGGGCAGGCGCACTACACCAGGATGCCCAAC
GAAATCCCTACCCTGGATTTCAACGACACGTTCCTGGATATGGAACATCTACGGAACAGT
TTCCCGGGACACGAGATAAAGGTACAGACGGACGATCCGCGGAAACTCGTCCGACCGTTC
AAATTGACTTTCGAGAACGTTCTACGTAAACAGCGAGGCGAAACGGATATGGATGAAGAG
GAACCCAAGAAGGTTATAGTTGTAGAACCCCACGTGCTGCCCAAGAGAGGGCCGTACCTG
TACAATGAACCTAAAAAGAACAACATACTGTTCACGCCGACCCAGGTGGAAGCGATCCGT
TCAGGAATGCAGCCGGGGCTGACGGTCGTGGTGGGACCTCCCGGCACGGGTAAAACTGAT
GTCGCAGTCCAGATAATATCGAACTTGTACCACAACTTCCCGTCCCAGAGGACGTTAGTT
GTGACGCACAGTAATCAAGCTCTTAACCAGCTGTTCGAGAAGGTTGCTGAGCTGGATGTG
GACGAAAGGCACCTGCTGCGTCTTGGACACGGCGAGGAGGCTTTGCAGACGGACAAGGAC
TTCTCCAGGTATGGACGTGTGAATTACGTGCTGGCAAAGCGTTTGGAACTCCTCGGCCAG
GTGTCGCGTCTTCAGACCACGCTGGGGGCGGGGGGAGAGGCGGGTGGTTGA

Protein sequence:

MLKFYARFEISDETGDPMTDRDMTLQHYSRITSLQKAAFTKFPDLRLFSLANVASVDTRE
SLQKHFGNLSDKALRAIATYLNLVPTEGKEDEAPWHRLDKDFLRELLISRHERRISQLEE
LNSMPLYPTEKVVWDEHVVPTEVYSGERCLALPKLNLQFLTLHDYLLRNFNLFRLESTYE
IRQDIEDAVYRLSPWKSEDGTVIFGGWARMAHPIQSFAVVEVAKPNIGEKAPSRVRADVT
VTLSVRNEIKHEWESLRKHDVCFLITVRPSEGIGTKYDYKKSMVDQAGIVYIRGCEVEGM
LDAGGRVIEDGPEPRPELEGDSRTFRLLLDPNQYRLDLDEASKGKEVIIVRSTMFVLPSC
SRDDKTMTPSGLTKRLLIFNQALCQHLTCKMLYIHYEICLVTKNTRDIGREVGVSLKGHY
VSPRGREGTKYDYKKSMVDQAGIVYIRGCEVEGMLDAGGRVIEDGPEPRPELEGDSRTFR
LLLDPNQYRLDLDEASKGKEDVYETFNIVVRRKPKENNFKAVLETIRELMNTECVVPEWL
HDIVLGYGDPGQAHYTRMPNEIPTLDFNDTFLDMEHLRNSFPGHEIKVQTDDPRKLVRPF
KLTFENVLRKQRGETDMDEEEPKKVIVVEPHVLPKRGPYLYNEPKKNNILFTPTQVEAIR
SGMQPGLTVVVGPPGTGKTDVAVQIISNLYHNFPSQRTLVVTHSNQALNQLFEKVAELDV
DERHLLRLGHGEEALQTDKDFSRYGRVNYVLAKRLELLGQVSRLQTTLGAGGEAGG