New model in OGS2.0 | DPOGS215581  |
---|---|
Genomic Position | scaffold392:- 21708-29833 |
See gene structure | |
CDS Length | 3921 |
Paired RNAseq reads   | 7702 |
Single RNAseq reads   | 21210 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010991 (2e-48) |
Best Drosophila hit   | CG4266, isoform B (1e-59) |
Best Human hit | RNA-binding protein 16 (1e-54) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (9e-122) |
Best NR hit (blastx)   | PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (5e-66) |
GeneOntology terms    | GO:0000398 nuclear mRNA splicing, via spliceosome GO:0003729 mRNA binding GO:0000166 nucleotide binding GO:0003676 nucleic acid binding |
InterPro families    | IPR006903 Domain of unknown function DUF618 IPR000504 RNA recognition motif domain IPR006569 RNA polymerase II, large subunit, CTD IPR008942 ENTH/VHS IPR012677 Nucleotide-binding, alpha-beta plait |
Orthology group | MCL16992 |
Nucleotide sequence:
ATGGCGGAGGTCAAAGCGTTCAACGCGGAGTTATCCGGGTTGTACGAGAATCGGCCTCCG
ATTTCCAAGGCAAAAATGAGTGCTATCACTAGAGGTGCTATCAAAGCCATAAAATTCTAC
AAGCACGTCGTACACAGTGTGGAAAAGTTCATACAGAAGTGTAAACCTGAATACAAAGTT
CCTGGTCTGTACGTGATCGATTCAATAGTAAGACAGTCACGGCACCAGTTTGGCCAGGAC
AAGGACGTGTTCGCGCCAAGGTTCGCCAAAAACATGCAACAGACATTCGCTAACCTGTTC
AGGTGTCCTGATGAAGATAAGCGCAACATAATCAGAGTTCTGAATCTGTGGCAGAAGAAT
AATGTGTTTGGACCTGAAGTGATCCAACCACTGCTAGATCTGGCCGACCCAAGTCATCCT
TTGCATTTGGAAATACAGAACCAGAATAACACAACCAATGGAAGCATAAACATGAGTCAT
AACACATCAGACAGCAAGATCTCCCCCGCTCGGCAGGACTCCCCACAGACATCCTCGCCC
ATGGGAGATGCATTTCAAGATGACTCCTCGCCTGGTCCTCAAGCCAAATTCAACCGCAAG
CTCCTCAATGATTTTGAGTATGAAAGTGAAGATGAACAAGAACCTCCCCCGCAGCCACCA
CATCCGCCACATCCGCCACACGCGCCACATCCGCCACATTCGAGCCACGCCACACACACG
ACACACAACCCCACAGATGCTCTTGGCAGTATACTAACAAATCCAGAAATTATGAGGCAG
CTACAGAGTCTACAAGCCCAGATGCAGCTCATGACGGGCATGCAGATCCCAAATTTGATG
CCGATGATGTCAGACATGCAACTTCAGCAGAACCAAAATTCGAACGCACCATTCTTAAAC
TCTCAGACAGAACAACAGAAACCCGCAGAGCCCAAGGAGGACCTCGCCAACGAGTCCGAC
ATAGAGTTCGTGGAGACCGGACCCCAGGTCATCGAGATACCCGACGCCAACGACTCCCGG
TCGCCCTCGCCGAGGCGACGACATCGCTCCCGGTCGAGGAGTCCACGCCGGAGGAGGAGG
ACGCGCTCCCCGCGGAGAAGGAGGGACAAGGACCGCGATCGAGACCGCGACAAGACGCAC
AAGGAGAGGGAGGCCGAGAAAGAGCGCCAGCGGGAGAGGGAGAAGAGGGGCCTACCACCC
ATCAAGAAAGAGAACCTCAGTGTGTGCAGCACCACGCTGTGGGTCGGCCGGCTGTCCAAA
CAGGCCACGCCCGAGGAGCTGTGGGACCTGTTCGGGGCCGTGGGCGGCGTGGCGGCCGTG
GACGTGGTGGCGCCTCGGGGCTGCGCTTTCGTCGTCATGGAGCGGCGCCGGGACGCCGCG
CGCGCGCTCGCCAAGCTGCACCGACACAAACTGCACTCCAAAGAGATAGACGTCGCCTGG
GCGGCCGGCAAGGGCGTCAAAGGCCGCGAATGGAAGGACTACTGGGAGGCGGAGCTCGGG
GTCGCTTACTTGCCCTGGAGCGCCCTCCACGCGCGCTGGCTGCTGGGCGCGCTGTCGCTG
GACGCGCTCGAGGACGGCGGGGCCGTGGATGAGGACACGCTGCCGCCTTGGCTCCCGCCC
AGGATACTGCCTAAGTCTGTCGGGGAGGCCGTGCCGTTGATGGGCGCGCTGCCCGCTCCG
CTGCCGCTGCCCACCGGGTTGCCGCGTCTGCCGCCACCCGGCCTGGGCGCTCCGCCACCC
GCCGCGGGCTACCCCGGCCTGGGCTCACTGGCGCCGCACCAGCTGCTGAACGAGTCGCCG
GGCTCTTCGGCGCCGGGTCTGCAGCGCGACCCGCTGCTAGCCTTCCCGCCCGCCCTGCCT
CCCCACACCATGCCGCAGCCCGGCTTGGTGGGTGGCTTCCTGGGCGGTCTGATGGGTGTC
GGAGTGGGACACATGAACGTCGGCGGGCTCGTGCTACCCCTTCACCCGGCCCACGCCCAC
GCGGCGCACTCCCACGCGCAGGTCCACACGCATGCACACCCGCACGCCCCGCCACACGTC
CCGCCGCACGCACTCGTCCCTCAGGTGGGTCAACGAGCCGAGGTGGCGGATGACGCCATG
GAGCTGGACAATGACGACCAGACGGACGAGCCCCCAGCCCCCGCGGCCCCTCCGGCCCCA
GCGCCGGCGCCCGCACTCGGTCTTCCGCCTCCCGCCGTGCCCCCGCCGTTGTCTATGGAC
CAGCTTCAGGTCCTGTTGTCGAAGCCGCCGCCGACTTTCAACTCCGCGGAGCCTCCGCCT
GGGTTCAATCCGGAGTCTTTCGAGACGGAGGAGACTCCGGACGAGCGCCGCGAACGGGAC
AAGGAGCGACGGGACAGAGACCGGGACAGGGACCGGGACCGACGGGACCGGCGCGACGAC
CGACCGGACCGACCCGGGGGGCGCAGGGAGAGGGACCGGCCGCGGGACAGGGACAGGGAG
AGAGATGAGCGCCGGGAGAGAGACCGAGGAGGACGGGAGAGAAGAGACCGGGACAGGGAC
AGGGAGAGGGACAGGTTCCCCAGGGAGAACAACAACGAGAAAAGTCAGAAGTCTCCACGG
AGTCAGGCCGGCGAGGCGGGCGGCGCGGAGAAGACGCTGCAGGAGAGGCTGTGGGAGATG
GCCAACGGGAAGACGAGCGACGGAGACGAGCTCGAGCCCCGAGCGGACAGGCCTCCGCTC
ATAGAACGACCGCCTCTCATGGAGCGGCCGCAGACAGCGGACAGCAAGGTTCGTCTCCGC
GGTCCCGGCGGAGGGGGTGGTCCTCGTCCGCCGCCGCGCGCACCGTGGCTGGCTCCGCGC
TTCAATGGTTTGGGTCCGCCTTTCGTACGTCCTCCATTCGAGAGGCCCCCGTTTGAGGGT
CCTCCGATGTTCGAGAGGCCGCCGTTCGGCCGGATGCCGTTCGACGGCGCGCGGCCTCCT
TTCGACGGTCCCCGGCCTCCGGGTCCACGCCTGCCCTTCGATGCGCCGAGGCCTTTCGAC
GGGCCGCGCCCCCCATTCGATGTACCGCGCCCTCCCTTTGACGTTCCGAGGCCACCGTTC
GAAGGTCAGCGACCTCCCTTCGACGGACCGCGACCTCTCTTCGACGGTCCGAGGCCTCCG
TTCGATGGTTTCGAAGGAGATAGATCATTCGACGGACCCAGATTCGATGGGCCCCCCGAG
TTCTTCGACAGAGGCAACAGAAGATTCGATGATAGAGATTTCAACGAGAGAGGCTGGAAC
GGAGATAGAGACTTCGACCGAAGGACAGAATGGGAGGACAGGAGGAGAGAACGCAGAGGG
AGAGATAACGAGGAACGGTTCAGGGAGCGAGGGGGGAGGGGAAGAAACTACGACGAGAGA
GCGAGACCGAGAGACGAGAGGAACACCCGGAGAGACAAGGACAGGAAGTCGAGATGGGGA
GCGGCGGACGAGGCGGGGCAGGGGACAGAAGACGGCAAGGGGAAAGACACCGCGAGTGAG
AGACGAGAGGCAGAGAACGACGACCGGAACGAAACACACGACACGCACACTAGCAGGACC
AGCGGAGAGGAACAGAGGTCAGAAGGTGACGTGGGGCGAGAGGACACCGGGGCAGGGGCA
GAGAAGGAACTGGAGGGCGAGCGGCTGAAGGTCGAGGAAGACGGGGGCAGTGAACACGAA
CAGATTGGACAGGACGGATACCAGCAACAGGACGAGACAGGGGATAAGAAAATAACAGAT
ACGACAGGGGAGGAAGAAAAAATACAAGCAGGGGGGAAACCGGGGAGGTCGGGGAGCGGG
GCGAGGGGTCGGCTTGACATAGGACCAGGAGGAAGATGGGATATTTCCAGTGGAAATGAT
TCGGGTGACGCGGCGCCGGGCCGGGTGACGTCGTGCGACTTGATGCGACGCGGACGAGCG
ACGCCGGGCGACGGCCGCTAG
Protein sequence:
MAEVKAFNAELSGLYENRPPISKAKMSAITRGAIKAIKFYKHVVHSVEKFIQKCKPEYKV
PGLYVIDSIVRQSRHQFGQDKDVFAPRFAKNMQQTFANLFRCPDEDKRNIIRVLNLWQKN
NVFGPEVIQPLLDLADPSHPLHLEIQNQNNTTNGSINMSHNTSDSKISPARQDSPQTSSP
MGDAFQDDSSPGPQAKFNRKLLNDFEYESEDEQEPPPQPPHPPHPPHAPHPPHSSHATHT
THNPTDALGSILTNPEIMRQLQSLQAQMQLMTGMQIPNLMPMMSDMQLQQNQNSNAPFLN
SQTEQQKPAEPKEDLANESDIEFVETGPQVIEIPDANDSRSPSPRRRHRSRSRSPRRRRR
TRSPRRRRDKDRDRDRDKTHKEREAEKERQREREKRGLPPIKKENLSVCSTTLWVGRLSK
QATPEELWDLFGAVGGVAAVDVVAPRGCAFVVMERRRDAARALAKLHRHKLHSKEIDVAW
AAGKGVKGREWKDYWEAELGVAYLPWSALHARWLLGALSLDALEDGGAVDEDTLPPWLPP
RILPKSVGEAVPLMGALPAPLPLPTGLPRLPPPGLGAPPPAAGYPGLGSLAPHQLLNESP
GSSAPGLQRDPLLAFPPALPPHTMPQPGLVGGFLGGLMGVGVGHMNVGGLVLPLHPAHAH
AAHSHAQVHTHAHPHAPPHVPPHALVPQVGQRAEVADDAMELDNDDQTDEPPAPAAPPAP
APAPALGLPPPAVPPPLSMDQLQVLLSKPPPTFNSAEPPPGFNPESFETEETPDERRERD
KERRDRDRDRDRDRRDRRDDRPDRPGGRRERDRPRDRDRERDERRERDRGGRERRDRDRD
RERDRFPRENNNEKSQKSPRSQAGEAGGAEKTLQERLWEMANGKTSDGDELEPRADRPPL
IERPPLMERPQTADSKVRLRGPGGGGGPRPPPRAPWLAPRFNGLGPPFVRPPFERPPFEG
PPMFERPPFGRMPFDGARPPFDGPRPPGPRLPFDAPRPFDGPRPPFDVPRPPFDVPRPPF
EGQRPPFDGPRPLFDGPRPPFDGFEGDRSFDGPRFDGPPEFFDRGNRRFDDRDFNERGWN
GDRDFDRRTEWEDRRRERRGRDNEERFRERGGRGRNYDERARPRDERNTRRDKDRKSRWG
AADEAGQGTEDGKGKDTASERREAENDDRNETHDTHTSRTSGEEQRSEGDVGREDTGAGA
EKELEGERLKVEEDGGSEHEQIGQDGYQQQDETGDKKITDTTGEEEKIQAGGKPGRSGSG
ARGRLDIGPGGRWDISSGNDSGDAAPGRVTSCDLMRRGRATPGDGR