DPGLEAN06535 in OGS1.0

New model in OGS2.0  DPOGS207892
Genomic Position  scaffold691:+ 95734-101003
CDS Length  3333
Paired RNAseq reads  2093
Single RNAseq reads  5031
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008479 (5e-18)
Best Drosophila hit  CG32685 (3e-67)
Best Human hit  YLP motif-containing protein 1 (9e-68)
Best NR hit (blastp)  PREDICTED: similar to YLP motif containing 1 [Tribolium castaneum] (8e-99)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC007111 [Tribolium castaneum] (1e-90)
Gene Ontology terms
GO:0005515 protein binding
GO:0005634 nucleus
GO:0016607 nuclear speck
GO:0045449 regulation of transcription
InterPro families  ND
Orthology group  MCL13939

Nucleotide sequence:

ATGGCTTGGTCTATGCCTACGGGTCAATGGAATTCCGGCATTTCCATGACGCCAGATATT
AATATAGCTAGTATGGGATCATACACACCAGAACAGTGGGCGCTAATGCAACAACAGAAC
TGGCAGCAGTGGGCACAATGGCAACAACAGTATGCTCAATGGCAAAGCCAGTATGGTGAC
AAGTATACTCAGCATATGCAAGCCCTTCATGCTATGAGCGGGATACCACCTCCAGCTCCT
AATGCGGTTCCACCCCCAGCTCCACCTCCACCGGAAAAGCCTCCTCCACCACCACATGAA
AACAACCAACCTCTGTACGGAAATACACCATCGCAAACACAGTCTGTGTCACACACGCCC
CATCTTCCATACTCTAAAGTTGGTTATAATGTGGTTCCTAAAACAGGTAACAATTTTAAT
CAAACTTCCACAATAGATTCCATGTCCGATTTACCGACCTCTCAGGTTGTAAACACGGAT
GCGCTCATGAAGCTAGCTGAGGAGGAGCGTTTGTTTGATATACAATTTCAAAAATGGGAA
GAGGAAATAAAAAAGTGGAAAAATGAGAATGTTAACCACCCCGATAAAAGAGCTTATATG
GAGTATGAGCAAAAGTTTGCCAGCTGTCGTGCACAATTGCTGGAGCGGCGTCAACAGATG
AAGCTGAAAAGAGATAGTCTAATGGGTGTTAAAGCCACACAGACAGCAAACACTACAATT
AACAGCACAGGTAATATATCAACAAGTATTCCTCCACCTACACAAAATATTAACAAAACC
AACTATAATACAAATGTGCAAAACTCTCAAACAAAAAATAATGTTACCCAGAATTACATA
AACCAAAATCAATCTCAGTATGAGCCAATTGGTCATTTACACCAGACAAGTTTTAATAGG
AGTAATAAAAAGCCAGAACATCAAGACAGGTATGAATATTATGGGGATATGAGCAATGAT
TACACTACTACAAGTGACACTTCTAATTTCTTACCCACAAATGATTCTTTTAACGGCATA
CCGGGATTAGACTTGGTACCAGATGGTGATAAATCTGTACAAAAACAATTAGATGTAATT
GATATAACAGAAGACAGACAGAATCAGCAACGGCAACAAAATATTCAAGCACCTGACTAT
TCAAAAATATCTAAAGGGATTAACAACATTCTTGGGGATGAAAAAATTATGAACATCCTT
TCTATGATGAGCAGTCAGAATACTAGGAACGAAAGCAAAGTAGGTTCAGTTGGTTCTCAC
AGGCAAGAGCTGAATACTCAGTCTGGAAGCTTGCAATATAGTGGAAATAATAGCAGCTAC
CACGGAAATAATTACAATAATATGCAACCTCGGAACACGTCCTATCAACAGTCAAGTGAG
AATTATACTAATCAAGCTTCAAGTTCTAGTTATACCGATCCTGATTACCAGAGATATGGG
GGATCCTCTAACGAACAAAATGAAAATGTAAGAAGTAATGATATGGACAAAAACTATGAA
TACAACAGACTTGGGAATCAAAATATACAACAAACTAGGTCTAACATGCCAAGAGTTATG
CAAAATTTACATTCAAAGCAAAGTGATTATCCGCAAGGGGATTTCGTAAGGAGGATGGAT
CTAGATGTAAAACAGATTCAGCCTTTAAAACCTAAATGGGTCGATGAACCTCTGTTCACA
CCCTCAATAATAGTTGAATATGAGCACAAACCATTAAGATTGAAAGCTCGAGATTTTATT
GAGCCCGTGCACATGTTTGAATACAATCATCAATCTAAAGATGGAGAAAGTTCGAATAAG
AAAAACTTCGATAAAGAATTAGATGATTTATTTTCAAGGAAAAGAAGAGCAGACGATGAC
TGGAGTAGTTCAGACAAATTTTATTCCAGAGATTATGATCGGAGAGGTTTAAAGGATGAT
GCGAGAGATAGAAATCGGCTGCGAGATGATCGTGATATGTATGATAGAAGAATTGATGAC
AGAAGACGTGATGACCGTGATAGATTTAGAAGAGAAGAATATGATAGGAGAGATAGAATA
GAACAAGAACGATCCAGAGATATGGGGAGGGGGCGTGATGAAAGAGACAGAGATAGAGAT
ATGGCTAGAGACATGGGTCGAGAGAGAGAGTTGGGAAGAGATAGAGATTACACTAGAGAT
AGAGATAAAGATTTCAATAGAGATAGGGATACAAGAGATAGAAGTAAAGAATATAGAAAA
GATGAAAGAGATATAAGAAATCGTAGTCGAAGTCGTGATAAAGAAAATCGTAAAAGAGGA
CATAGCAGAGAAACGGAATGTTTTGATAATTATGGATTGAAAAAGAATAGAGATATAAAA
GATGAAACGGTTCCAACGAATAAGCCGAAACATGTGGTGATGATAGATGACCTTCTAGAG
CCTCCGGGGCGCACCATGAGACCGGACAAGATGGTTATAATACTCAGAGGTCCACCGGGA
AGTGGTAAATCTTATTTAGCTAAACTGATAAGAGATAAAGAAGCCGAGCACGGGGGCACA
GCAAGAATAATGTCCATAGACGATTATTTCATGCAGGAAGGTGAAATTGAAGAAAAAGAT
CCCATTACGGGAAAAATTGTGAAGAAGCCGTCACTGAAATACGAATACGACGAGAGCTCC
GAGGAATCATATATGACATCGCTAAAGCGGGCGTTCAAGAGGAGTATCACGGATGGCTAC
TTTACATTCTTAATATATGACGCCGTGAACGATCAGTTGAAGTCCTATGCTGATATTTGG
AATTTCTCAAGGCAGAATGGCTTCCAGGTGTACATATGTACGATGGAAATGGATCCCCAA
GCTTGCTTCAAGAGGAACATACACAATAGATCGTTGCAAGACATAGAAGCTATAGTTTCT
AGTTTTTTCCCAACCCCAGCACATCACATACAGTTGGATCCGACGACCTTACTCCAGAGT
GCGTCCATTCGGGAAGTACAAATGGAAGACGCCGATGACGTCACTATGGAGGAGGTGGAA
AACCCTGAGGTCGATAATAGTTTTACGTCGAAATGGGAAAAAATGGAAGACGCCGCCCAA
CTAGCTCGTCTCGACGGCACTAGTCGGCCGCTGCGCTCGTCCCAGCTCTCCATGGAAGAC
TACCTACAGTTAGACGACTGGAAACCGAATACGGCTAAACCGGGAAAGAAAACTGTACGT
TGGGCTGATATTGAAGAGAGAAGACAGCAAGAGAAAATGCGAGCCATCGGTTTTGTCGTA
GGTCAAACTGATTGGAATAGAATGACTGACCCCACTATGGGGTCTAGTGCGCTCACGCAA
ACTAAATATATCGAGCGAGTCAGGCGGCATTGA

Protein sequence:

MAWSMPTGQWNSGISMTPDINIASMGSYTPEQWALMQQQNWQQWAQWQQQYAQWQSQYGD
KYTQHMQALHAMSGIPPPAPNAVPPPAPPPPEKPPPPPHENNQPLYGNTPSQTQSVSHTP
HLPYSKVGYNVVPKTGNNFNQTSTIDSMSDLPTSQVVNTDALMKLAEEERLFDIQFQKWE
EEIKKWKNENVNHPDKRAYMEYEQKFASCRAQLLERRQQMKLKRDSLMGVKATQTANTTI
NSTGNISTSIPPPTQNINKTNYNTNVQNSQTKNNVTQNYINQNQSQYEPIGHLHQTSFNR
SNKKPEHQDRYEYYGDMSNDYTTTSDTSNFLPTNDSFNGIPGLDLVPDGDKSVQKQLDVI
DITEDRQNQQRQQNIQAPDYSKISKGINNILGDEKIMNILSMMSSQNTRNESKVGSVGSH
RQELNTQSGSLQYSGNNSSYHGNNYNNMQPRNTSYQQSSENYTNQASSSSYTDPDYQRYG
GSSNEQNENVRSNDMDKNYEYNRLGNQNIQQTRSNMPRVMQNLHSKQSDYPQGDFVRRMD
LDVKQIQPLKPKWVDEPLFTPSIIVEYEHKPLRLKARDFIEPVHMFEYNHQSKDGESSNK
KNFDKELDDLFSRKRRADDDWSSSDKFYSRDYDRRGLKDDARDRNRLRDDRDMYDRRIDD
RRRDDRDRFRREEYDRRDRIEQERSRDMGRGRDERDRDRDMARDMGRERELGRDRDYTRD
RDKDFNRDRDTRDRSKEYRKDERDIRNRSRSRDKENRKRGHSRETECFDNYGLKKNRDIK
DETVPTNKPKHVVMIDDLLEPPGRTMRPDKMVIILRGPPGSGKSYLAKLIRDKEAEHGGT
ARIMSIDDYFMQEGEIEEKDPITGKIVKKPSLKYEYDESSEESYMTSLKRAFKRSITDGY
FTFLIYDAVNDQLKSYADIWNFSRQNGFQVYICTMEMDPQACFKRNIHNRSLQDIEAIVS
SFFPTPAHHIQLDPTTLLQSASIREVQMEDADDVTMEEVENPEVDNSFTSKWEKMEDAAQ
LARLDGTSRPLRSSQLSMEDYLQLDDWKPNTAKPGKKTVRWADIEERRQQEKMRAIGFVV
GQTDWNRMTDPTMGSSALTQTKYIERVRRH
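
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked by translating the CDS. The following is a minimal Python sketch, assuming the standard genetic code (the record does not state which translation table was used). The names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (assumption: the record does not name its translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence can be
    pasted exactly as laid out in the record. Ambiguous bases such as
    N are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length):
    """Residue count for a CDS whose final codon is a stop codon."""
    return cds_length // 3 - 1


if __name__ == "__main__":
    first_line = "ATGGCTTGGTCTATGCCTACGGGTCAATGGAATTCCGGCATTTCCATGACGCCAGATATT"
    print(translate(first_line))          # MAWSMPTGQWNSGISMTPDI
    print(expected_protein_length(3333))  # 1110
```

Applied to this record, a CDS length of 3333 corresponds to 1110 residues plus a terminal stop codon, which matches the length of the protein sequence listed above. The genomic span 95734-101003 covers 5270 bp, which is longer than the CDS and therefore consistent with a gene model containing introns.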