New model in OGS2.0 | DPOGS209572  |
---|---|
Genomic Position | scaffold154:+ 80511-81907 |
CDS Length | 978 |
Paired RNAseq reads   | 270 |
Single RNAseq reads   | 883 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006632 (4e-89) |
Best Drosophila hit   | thoc6 (2e-53) |
Best Human hit | THO complex subunit 6 homolog isoform 1 (5e-44) |
Best NR hit (blastp)   | PREDICTED: similar to thoc6 CG5632-PA [Apis mellifera] (2e-89) |
Best NR hit (blastx)   | PREDICTED: similar to thoc6 CG5632-PA [Apis mellifera] (8e-91) |
GeneOntology terms    | GO:0005575 cellular_component; GO:0051028 mRNA transport; GO:0008380 RNA splicing; GO:0006397 mRNA processing; GO:0003723 RNA binding; GO:0006810 transport |
InterPro families    | IPR019775 WD40 repeat, conserved site; IPR011046 WD40 repeat-like-containing domain; IPR015943 WD40/YVTN repeat-like-containing domain; IPR019782 WD40 repeat 2; IPR017986 WD40-repeat-containing domain; IPR001680 WD40 repeat; IPR019781 WD40 repeat, subgroup |
Orthology group | MCL12091 |
Nucleotide sequence:
ATGCTTGATAAGATATTTTATAACACGGTTTTATGCCAAACATATTCTCCGTGTGGTAAA
TATTTGGTAGCCGGTAATATTTATGGACAACTAGCAGTGTTTGATCTCGATAACATATTT
AATCCTGTGATAGAACTACTTACACCAGATTATAACAAGCCAAAACACATTCATACTTTA
GAGTCAGAGAATCAAGTATGTAGTTTAGTGAGCACAGAGAATTTTTTAATTGTTGGCTCA
GTAAATGAAATATTGGGATGGAACTGGAAATCAGTAATTCATCCAAAATTAGGTAAACCT
GCATGGACAATAAGAATACAGCCAAAGTCGTTCATTGAGAAATGTGATATTAATTATCTG
TGGTATTGTGAAGAAGAAGGAAAACTATATGTAGGATGTGGTGATAATAATATATATATA
TACAATTTAGAAGATGGTAAGCTTGTGTCTACCTTGGAAGGCCACTCTGATTATATACAC
TGTTTACATGGCAATGGACATCAACTTATTTCTGCTGGTGAAGATGGCAAGGTCCTTCTC
TGGGACACAAGAATGAAAAAAAGTCATAACAAAATCGAACCATACAATAACAGTAAAGTT
GCGAGACCAGATATTGGTAAATGGATGGGAGCCGCTGCTTTGGGAGATGATTGGATTGTA
TGTGGAGGTGGTCCCAGATTGGCTCTTTGGCATCTACGCTCCTTGGATGTTGTGACGGTG
TTTGATATTCCTGATCATGGAATTCATGTGTCCTTCTTTCATGATGACTGCGTGTTTGCC
GGCGGGGCTGCCAAGCACTTGTATCAGCTCAGTTACTCGGGAGATATAAGAGTTGAGCTG
CCAGTGTCATCAACCACAGTATACTCCGCGGTGCTGAGAACAAGCCCACATAAAGTCCTA
ACAATAGCTGGTTCCAGCCCCGAAATTGACTTGTGTACCACATTCAACTATAGAGACCAA
GTCTTGCATTTCAGATGA
Protein sequence:
MLDKIFYNTVLCQTYSPCGKYLVAGNIYGQLAVFDLDNIFNPVIELLTPDYNKPKHIHTL
ESENQVCSLVSTENFLIVGSVNEILGWNWKSVIHPKLGKPAWTIRIQPKSFIEKCDINYL
WYCEEEGKLYVGCGDNNIYIYNLEDGKLVSTLEGHSDYIHCLHGNGHQLISAGEDGKVLL
WDTRMKKSHNKIEPYNNSKVARPDIGKWMGAAALGDDWIVCGGGPRLALWHLRSLDVVTV
FDIPDHGIHVSFFHDDCVFAGGAAKHLYQLSYSGDIRVELPVSSTTVYSAVLRTSPHKVL
TIAGSSPEIDLCTTFNYRDQVLHFR
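
Consistency check (sketch):
The record lists a CDS length, a nucleotide sequence, and a protein sequence, and these can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model. The function names `translate` and `check_record` are hypothetical helpers introduced here for illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the gene model uses the standard nuclear code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def _clean(seq):
    """Remove whitespace and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon.

    Codons containing characters other than T, C, A, G are reported as 'X'.
    """
    cds = _clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def check_record(cds, protein, cds_length):
    """Cross-check the CDS length, start/stop codons, and protein sequence."""
    cds = _clean(cds)
    protein = _clean(protein)
    translated = translate(cds)
    # Drop a single terminal stop before comparing with the listed protein.
    body = translated[:-1] if translated.endswith("*") else translated
    return {
        "length_matches": len(cds) == cds_length,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": translated.endswith("*"),
        "protein_matches": body == protein,
    }


# Toy input: the first three codons of the CDS above followed by its stop codon.
print(check_record("ATGCTTGATTGA", "MLD", 12))
```

To check the full record, pass the joined nucleotide lines, the joined protein lines, and the listed CDS length of 978 to `check_record`; line breaks in the pasted sequences are stripped automatically.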