DPGLEAN04943 in OGS1.0

New model in OGS2.0  DPOGS210730
Genomic Position  scaffold314:+ 45485-57403
CDS Length  2007
Paired RNAseq reads  5651
Single RNAseq reads  16910
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006260 (2e-44)
Best Drosophila hit  CG17838, isoform D (6e-178)
Best Human hit  heterogeneous nuclear ribonucleoprotein Q isoform 1 (5e-118)
Best NR hit (blastp)  GA27045 [Drosophila pseudoobscura pseudoobscura] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC000185 [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0030530 heterogeneous nuclear ribonucleoprotein complex
GO:0003729 mRNA binding
GO:0000166 nucleotide binding
GO:0003676 nucleic acid binding
GO:0071011 precatalytic spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
InterPro families
IPR006535 HnRNP R/Q splicing factor
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology group  MCL11681

Nucleotide sequence:

ATGGCGGAAGGTAATGGAGAAATATCCATTGAAGAGGTCCCTGGGAAGGACTCCGATGCT
GGCCGGACACCAGATTACATCAAACTCATAGATTATGGCTTGGACCCAAAGGTGGCGACC
AAGCTCGACGACATTTATAAGACTGGAAAGCTAGCGCACGAGGAGCTGGACGAGCGTGCG
TTAGATGCCTTAAAAGAATTTCCATCCGACGGTGCTTTAAGTGTTCTTGGACAATTTTTA
GATTCAAATCTCGAGCATGTATCCAATAAGAGTGCTTTTTTATGTGGAGTCATGAAGACG
TACAGACAGAAAAGCCGTGCGGGCGTGCAGGGCGCGCCGGCGCTGGCGGCAGCTGTCCAG
GTCAAGGGGCCTGACGAGGAAAAGATCCGCCAGATCCTGCAGAGGACCGGTTACACGCTG
GACGTTACCACCGGCCAGCGCAAGTACGGCGGTCCGCCGCCTGGCTGGGAGGGCGGCACG
CCGGGCGCCGGCTGCGAGGTGTTCTGCGGGAAGATACCCAAGGATATGTACGAAGACGAG
CTGATACCTCTGTTCGAGAGTTGCGGCACCATCTGGGACCTGCGGCTCATGATGGACCCC
ATGTCGGGCGCCAACAGGGGGTACGCCTTCGTCACCTTCACCACCCGCGAGGCCACGCAG
CGAGCCGTGCAAGAGCATGTCATTAATATCCCCATCGACTTCGGCCATAAATATGCGGAG
TGTGACGTCGCGACGGCCCGGCGGCCGGCCCCGGGCGCGGCGCCGCGCGTATCTATGGCC
GGAGTCCCCACTGTAATCACTGATCTTGTGTACATGTGCAACGTAACACCCCTCGATAAT
CACGAAATAAAACCGGGGAAGACTCTGAGAATTAAGATTAGCGTACCGAACCTTCGACTT
TTCGTCGGCAACATTCCCAAGTCTAAAGGCAAAGAGGAGATACTGGAAGAGTTTGGTAAA
TTAACAGCCGGACTCGTTGAAGTCATTATATATAGTTCGCCCGATGACAAGAAGAAAAAT
AGAGGATTTTGTTTTTTGGAATACGAGTCTCACAAAGCGGCGTCACTAGCCAAGCGTCGG
CTGGGCACCGGCAGGATTAAAGTTTGGGGCTGTGATATTATAGTGGACTGGGCGGACCCG
CAGGAGGAACCCGACGAGCAGACCATGAGCAAAGTGAAGGTGTTGTACGTTCGGAACCTG
ACCCAAGAAATCACAGAAGAAGCGCTTAAAGAAGAATTCGAACGTTATGGAAATGTAGAA
CGAGTTAAGAAAATTAAGGATTACGCTTTCGTACACTTCGAAGACCGGGATTGTGCCGTT
AAGGCGATGCAGGAGATAGACGGCAAGGAGCTGGGTGGAGCCCGCCTCGAGGTGTCGCTG
GCCAAGCCACCCTCGGACAAGAAGAAGAAGGAGGAGATACTGAGGGCGAGGGAGAGACGC
ATGACGCAGATGATATACGGACGGGGCGGATTTGATTGGTGCAGCTGCTCGCCGGTGCAC
GGGGCGCTCCGGGGCCGCACGCCGCAGCCGCAGCCGCGCCCGCCGCAGGCCCGCGGGGAC
TACGATTATGATTACGACTATTACGGGTACGGGGATTACCGAGGTGGCTACAATGAGCCA
TTTTACCGGTACGATGAGTTCTATTTTGATTACGCGGGGCCACCGCAACCGTCCGCCGTC
CGCCAGCCTCCCAACAGAGCGCAGCCGGGGGCTGGGTCATGTGGGACGGGCGCGCGCTGG
GGGCGGCGCGGGGCCGCGGCTGGGGCCCGTGGTGGTGCGCGCCGCGCCGCTCGTGGCCGC
CGCACGCCCAGCGGCATGCGTGGCAACCCGCGCGCCAAGCCAAGTTTACCAGGTAAACGT
AAACTCGACGGGGGTCAGCAGATCGCTGGGGGGGAGCGGGAGAGCAAGCGGCGACTGGGC
GCGGCGGCGGCGGCGGCGCGCGGCTGGGGGTCGGCGGGGGTAGGATCGATGGGGTCCATG
GGGTCCGAAGGTGCCGCCGCCAGCTAG

Protein sequence:

MAEGNGEISIEEVPGKDSDAGRTPDYIKLIDYGLDPKVATKLDDIYKTGKLAHEELDERA
LDALKEFPSDGALSVLGQFLDSNLEHVSNKSAFLCGVMKTYRQKSRAGVQGAPALAAAVQ
VKGPDEEKIRQILQRTGYTLDVTTGQRKYGGPPPGWEGGTPGAGCEVFCGKIPKDMYEDE
LIPLFESCGTIWDLRLMMDPMSGANRGYAFVTFTTREATQRAVQEHVINIPIDFGHKYAE
CDVATARRPAPGAAPRVSMAGVPTVITDLVYMCNVTPLDNHEIKPGKTLRIKISVPNLRL
FVGNIPKSKGKEEILEEFGKLTAGLVEVIIYSSPDDKKKNRGFCFLEYESHKAASLAKRR
LGTGRIKVWGCDIIVDWADPQEEPDEQTMSKVKVLYVRNLTQEITEEALKEEFERYGNVE
RVKKIKDYAFVHFEDRDCAVKAMQEIDGKELGGARLEVSLAKPPSDKKKKEEILRARERR
MTQMIYGRGGFDWCSCSPVHGALRGRTPQPQPRPPQARGDYDYDYDYYGYGDYRGGYNEP
FYRYDEFYFDYAGPPQPSAVRQPPNRAQPGAGSCGTGARWGRRGAAAGARGGARRAARGR
RTPSGMRGNPRAKPSLPGKRKLDGGQQIAGGERESKRRLGAAAAAARGWGSAGVGSMGSM
GSEGAAAS
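
The coordinate and length fields in this record can be cross-checked with a few lines of Python. The following is a minimal sketch, not part of the database itself: the helper names `parse_position` and `protein_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

# Values copied from the record above.
GENOMIC_POSITION = "scaffold314:+ 45485-57403"
CDS_LENGTH = 2007


def parse_position(text):
    """Split 'scaffold:strand start-end' into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


def protein_length(cds_length):
    """Residue count for a CDS that ends in a single stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


scaffold, strand, start, end = parse_position(GENOMIC_POSITION)
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(scaffold, strand, span, protein_length(CDS_LENGTH))
```

Under these assumptions the CDS length of 2007 corresponds to 668 residues plus a stop codon, which agrees with the protein sequence listed above (eleven lines of 60 residues and a final line of 8).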