DPGLEAN10301 in OGS1.0

New model in OGS2.0DPOGS212345 
Genomic Positionscaffold101:- 129733-132466
See gene structure
CDS Length1392
Paired RNAseq reads  110
Single RNAseq reads  788
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004663 (7e-137)
Best Drosophila hit  CG7102 (4e-109)
Best Human hitsmall nuclear ribonucleoprotein E (7e-37)
Best NR hit (blastp)  GH11553 [Drosophila grimshawi] (4e-117)
Best NR hit (blastx)  GH11553 [Drosophila grimshawi] (3e-111)
GeneOntology terms  GO:0005515 protein binding
InterPro families



  
IPR001163 Like-Sm ribonucleoprotein (LSM) domain
IPR011705 BTB/Kelch-associated
IPR010920 Like-Sm ribonucleoprotein (LSM)-related domain
IPR006649 Like-Sm ribonucleoprotein (LSM) domain, eukaryotic/archaea-type
IPR006571 TLDc
Orthology groupMCL13958

Nucleotide sequence:

ATGGCATACAAAGGCCCACCTAAGGTACAGAAGGTTATGGTACAACCCATTAACCTTATC
TTCAGATATTTACAAAACAGAAGCCGTGTACAAATATGGCTTTACGAGAATGTGAATTTG
AGAATCGAGGGTCACATTGTTGGTTTCGATGAATACATGAATATTGTTTTGGACGAAGCT
GAAGAAGTTCACATGAAAACCAAAAATCGCAAGCAAATTGGAAGAATTATGATGAAAGGA
GATAACATAACGCTTATACAAAATGTAAACCCGAACGCTACATTTAGTGAAATACATGGA
ACACTTGAGTTTGAGATCTTTGATTCAAAGAAATGTATGAAGCGAGCCCACAAGACAGGA
ATTCATTCCAGTGTAACGGCCAGTAGTGGTAAGAGCGCATCGTCTTTCTTAGAGCGCTGC
ATATCATTCATTGGAGACAATGCTGCGGATTGTGTCAAGACTAATGCGTTCCTCAACCTC
CCAAAAGAAGCACTCATCAAACTCATATCTTCAGACTTTCTGTGTCTGGAGGAGGAGGAG
GTGTGGCGTTGTGCTCTGGCGTGGTCCAAGCAGCGGGCCGGCGTGACGCAGCCCGCCGTG
CATTGGACGGGCGAGGAGCGAGCCCGGGTCTGCCAACACCTGGCCCCGCTCATGCAGCAC
GTGCGACTGCTACTCATCGACAGTACGGTGTTCGCGGAGGAGGTGGAACCCACGGGAGCC
GTGCCCATGGAACTGTCCTTGGAGCGCTACCGCCGCGCCGCACTGCACGCCGCACCGCGA
CACGAGCCCGACAAGAGGACGCAGCCTCGGTCGGCCGTGAACATGTTCGTGGGGTCGGTG
ATCCTGCAGCAGGACCGCGGGGGCCTGCAGTCCCTGGTGAACAGCTGGTGTGGGGCGCCG
GGGGGCCGGCGGGCCTGGCGGCTCGTGTTTCGCGCCTCCAGCCACGGCTACTCTGCCGCC
GCCTTCCACACGCACTGTGACGGAGTGGCGCCCGTGTTACTCTTAGTACAGCTGTCCCGG
GGCGAGGTCATAGGCGGCTACAGTACGGCGGGCTGGTCCCCGGGCGGGGCGGGCGGCTAC
GTGTCTTCGGAGCGCGGCCTGCTGTTCTCCCTGAGCGAGCCGCCGGTCCGCTACCCGCTC
CTCAAGAAACCCTTCGCCCTCTGCTACCATCCAGACTGTGGGCCCATATTCGGCGCGGGT
GCGGACCTGCTGATCTCCAACAACTGTAACATGAACAGTGACAGCTACAGTAACCTCCAC
TCGTACGGGGACGGCTCGCTAGGGTCACTGGGATCCCTGGGGTCTCCGGGGCCCCAGCCC
GCGCCCTCCTCGCTCGCATCTGAGTACAACTTCACCGTCCGTGACTACGAGATCTTCACG
CTCGACCACTAA

Protein sequence:

MAYKGPPKVQKVMVQPINLIFRYLQNRSRVQIWLYENVNLRIEGHIVGFDEYMNIVLDEA
EEVHMKTKNRKQIGRIMMKGDNITLIQNVNPNATFSEIHGTLEFEIFDSKKCMKRAHKTG
IHSSVTASSGKSASSFLERCISFIGDNAADCVKTNAFLNLPKEALIKLISSDFLCLEEEE
VWRCALAWSKQRAGVTQPAVHWTGEERARVCQHLAPLMQHVRLLLIDSTVFAEEVEPTGA
VPMELSLERYRRAALHAAPRHEPDKRTQPRSAVNMFVGSVILQQDRGGLQSLVNSWCGAP
GGRRAWRLVFRASSHGYSAAAFHTHCDGVAPVLLLVQLSRGEVIGGYSTAGWSPGGAGGY
VSSERGLLFSLSEPPVRYPLLKKPFALCYHPDCGPIFGAGADLLISNNCNMNSDSYSNLH
SYGDGSLGSLGSLGSPGPQPAPSSLASEYNFTVRDYEIFTLDH