New model in OGS2.0 | DPOGS207892  |
---|---|
Genomic Position | scaffold691:+ 95734-101003 |
See gene structure | |
CDS Length | 3333 |
Paired RNAseq reads   | 2093 |
Single RNAseq reads   | 5031 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008479 (5e-18) |
Best Drosophila hit   | CG32685 (3e-67) |
Best Human hit | YLP motif-containing protein 1 (9e-68) |
Best NR hit (blastp)   | PREDICTED: similar to YLP motif containing 1 [Tribolium castaneum] (8e-99) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC007111 [Tribolium castaneum] (1e-90) |
GeneOntology terms    | GO:0005515 protein binding GO:0005634 nucleus GO:0016607 nuclear speck GO:0045449 regulation of transcription |
InterPro families   | ND |
Orthology group | MCL13939 |
Nucleotide sequence:
ATGGCTTGGTCTATGCCTACGGGTCAATGGAATTCCGGCATTTCCATGACGCCAGATATT
AATATAGCTAGTATGGGATCATACACACCAGAACAGTGGGCGCTAATGCAACAACAGAAC
TGGCAGCAGTGGGCACAATGGCAACAACAGTATGCTCAATGGCAAAGCCAGTATGGTGAC
AAGTATACTCAGCATATGCAAGCCCTTCATGCTATGAGCGGGATACCACCTCCAGCTCCT
AATGCGGTTCCACCCCCAGCTCCACCTCCACCGGAAAAGCCTCCTCCACCACCACATGAA
AACAACCAACCTCTGTACGGAAATACACCATCGCAAACACAGTCTGTGTCACACACGCCC
CATCTTCCATACTCTAAAGTTGGTTATAATGTGGTTCCTAAAACAGGTAACAATTTTAAT
CAAACTTCCACAATAGATTCCATGTCCGATTTACCGACCTCTCAGGTTGTAAACACGGAT
GCGCTCATGAAGCTAGCTGAGGAGGAGCGTTTGTTTGATATACAATTTCAAAAATGGGAA
GAGGAAATAAAAAAGTGGAAAAATGAGAATGTTAACCACCCCGATAAAAGAGCTTATATG
GAGTATGAGCAAAAGTTTGCCAGCTGTCGTGCACAATTGCTGGAGCGGCGTCAACAGATG
AAGCTGAAAAGAGATAGTCTAATGGGTGTTAAAGCCACACAGACAGCAAACACTACAATT
AACAGCACAGGTAATATATCAACAAGTATTCCTCCACCTACACAAAATATTAACAAAACC
AACTATAATACAAATGTGCAAAACTCTCAAACAAAAAATAATGTTACCCAGAATTACATA
AACCAAAATCAATCTCAGTATGAGCCAATTGGTCATTTACACCAGACAAGTTTTAATAGG
AGTAATAAAAAGCCAGAACATCAAGACAGGTATGAATATTATGGGGATATGAGCAATGAT
TACACTACTACAAGTGACACTTCTAATTTCTTACCCACAAATGATTCTTTTAACGGCATA
CCGGGATTAGACTTGGTACCAGATGGTGATAAATCTGTACAAAAACAATTAGATGTAATT
GATATAACAGAAGACAGACAGAATCAGCAACGGCAACAAAATATTCAAGCACCTGACTAT
TCAAAAATATCTAAAGGGATTAACAACATTCTTGGGGATGAAAAAATTATGAACATCCTT
TCTATGATGAGCAGTCAGAATACTAGGAACGAAAGCAAAGTAGGTTCAGTTGGTTCTCAC
AGGCAAGAGCTGAATACTCAGTCTGGAAGCTTGCAATATAGTGGAAATAATAGCAGCTAC
CACGGAAATAATTACAATAATATGCAACCTCGGAACACGTCCTATCAACAGTCAAGTGAG
AATTATACTAATCAAGCTTCAAGTTCTAGTTATACCGATCCTGATTACCAGAGATATGGG
GGATCCTCTAACGAACAAAATGAAAATGTAAGAAGTAATGATATGGACAAAAACTATGAA
TACAACAGACTTGGGAATCAAAATATACAACAAACTAGGTCTAACATGCCAAGAGTTATG
CAAAATTTACATTCAAAGCAAAGTGATTATCCGCAAGGGGATTTCGTAAGGAGGATGGAT
CTAGATGTAAAACAGATTCAGCCTTTAAAACCTAAATGGGTCGATGAACCTCTGTTCACA
CCCTCAATAATAGTTGAATATGAGCACAAACCATTAAGATTGAAAGCTCGAGATTTTATT
GAGCCCGTGCACATGTTTGAATACAATCATCAATCTAAAGATGGAGAAAGTTCGAATAAG
AAAAACTTCGATAAAGAATTAGATGATTTATTTTCAAGGAAAAGAAGAGCAGACGATGAC
TGGAGTAGTTCAGACAAATTTTATTCCAGAGATTATGATCGGAGAGGTTTAAAGGATGAT
GCGAGAGATAGAAATCGGCTGCGAGATGATCGTGATATGTATGATAGAAGAATTGATGAC
AGAAGACGTGATGACCGTGATAGATTTAGAAGAGAAGAATATGATAGGAGAGATAGAATA
GAACAAGAACGATCCAGAGATATGGGGAGGGGGCGTGATGAAAGAGACAGAGATAGAGAT
ATGGCTAGAGACATGGGTCGAGAGAGAGAGTTGGGAAGAGATAGAGATTACACTAGAGAT
AGAGATAAAGATTTCAATAGAGATAGGGATACAAGAGATAGAAGTAAAGAATATAGAAAA
GATGAAAGAGATATAAGAAATCGTAGTCGAAGTCGTGATAAAGAAAATCGTAAAAGAGGA
CATAGCAGAGAAACGGAATGTTTTGATAATTATGGATTGAAAAAGAATAGAGATATAAAA
GATGAAACGGTTCCAACGAATAAGCCGAAACATGTGGTGATGATAGATGACCTTCTAGAG
CCTCCGGGGCGCACCATGAGACCGGACAAGATGGTTATAATACTCAGAGGTCCACCGGGA
AGTGGTAAATCTTATTTAGCTAAACTGATAAGAGATAAAGAAGCCGAGCACGGGGGCACA
GCAAGAATAATGTCCATAGACGATTATTTCATGCAGGAAGGTGAAATTGAAGAAAAAGAT
CCCATTACGGGAAAAATTGTGAAGAAGCCGTCACTGAAATACGAATACGACGAGAGCTCC
GAGGAATCATATATGACATCGCTAAAGCGGGCGTTCAAGAGGAGTATCACGGATGGCTAC
TTTACATTCTTAATATATGACGCCGTGAACGATCAGTTGAAGTCCTATGCTGATATTTGG
AATTTCTCAAGGCAGAATGGCTTCCAGGTGTACATATGTACGATGGAAATGGATCCCCAA
GCTTGCTTCAAGAGGAACATACACAATAGATCGTTGCAAGACATAGAAGCTATAGTTTCT
AGTTTTTTCCCAACCCCAGCACATCACATACAGTTGGATCCGACGACCTTACTCCAGAGT
GCGTCCATTCGGGAAGTACAAATGGAAGACGCCGATGACGTCACTATGGAGGAGGTGGAA
AACCCTGAGGTCGATAATAGTTTTACGTCGAAATGGGAAAAAATGGAAGACGCCGCCCAA
CTAGCTCGTCTCGACGGCACTAGTCGGCCGCTGCGCTCGTCCCAGCTCTCCATGGAAGAC
TACCTACAGTTAGACGACTGGAAACCGAATACGGCTAAACCGGGAAAGAAAACTGTACGT
TGGGCTGATATTGAAGAGAGAAGACAGCAAGAGAAAATGCGAGCCATCGGTTTTGTCGTA
GGTCAAACTGATTGGAATAGAATGACTGACCCCACTATGGGGTCTAGTGCGCTCACGCAA
ACTAAATATATCGAGCGAGTCAGGCGGCATTGA
Protein sequence:
MAWSMPTGQWNSGISMTPDINIASMGSYTPEQWALMQQQNWQQWAQWQQQYAQWQSQYGD
KYTQHMQALHAMSGIPPPAPNAVPPPAPPPPEKPPPPPHENNQPLYGNTPSQTQSVSHTP
HLPYSKVGYNVVPKTGNNFNQTSTIDSMSDLPTSQVVNTDALMKLAEEERLFDIQFQKWE
EEIKKWKNENVNHPDKRAYMEYEQKFASCRAQLLERRQQMKLKRDSLMGVKATQTANTTI
NSTGNISTSIPPPTQNINKTNYNTNVQNSQTKNNVTQNYINQNQSQYEPIGHLHQTSFNR
SNKKPEHQDRYEYYGDMSNDYTTTSDTSNFLPTNDSFNGIPGLDLVPDGDKSVQKQLDVI
DITEDRQNQQRQQNIQAPDYSKISKGINNILGDEKIMNILSMMSSQNTRNESKVGSVGSH
RQELNTQSGSLQYSGNNSSYHGNNYNNMQPRNTSYQQSSENYTNQASSSSYTDPDYQRYG
GSSNEQNENVRSNDMDKNYEYNRLGNQNIQQTRSNMPRVMQNLHSKQSDYPQGDFVRRMD
LDVKQIQPLKPKWVDEPLFTPSIIVEYEHKPLRLKARDFIEPVHMFEYNHQSKDGESSNK
KNFDKELDDLFSRKRRADDDWSSSDKFYSRDYDRRGLKDDARDRNRLRDDRDMYDRRIDD
RRRDDRDRFRREEYDRRDRIEQERSRDMGRGRDERDRDRDMARDMGRERELGRDRDYTRD
RDKDFNRDRDTRDRSKEYRKDERDIRNRSRSRDKENRKRGHSRETECFDNYGLKKNRDIK
DETVPTNKPKHVVMIDDLLEPPGRTMRPDKMVIILRGPPGSGKSYLAKLIRDKEAEHGGT
ARIMSIDDYFMQEGEIEEKDPITGKIVKKPSLKYEYDESSEESYMTSLKRAFKRSITDGY
FTFLIYDAVNDQLKSYADIWNFSRQNGFQVYICTMEMDPQACFKRNIHNRSLQDIEAIVS
SFFPTPAHHIQLDPTTLLQSASIREVQMEDADDVTMEEVENPEVDNSFTSKWEKMEDAAQ
LARLDGTSRPLRSSQLSMEDYLQLDDWKPNTAKPGKKTVRWADIEERRQQEKMRAIGFVV
GQTDWNRMTDPTMGSSALTQTKYIERVRRH