New model in OGS2.0 | DPOGS207766  |
---|---|
Genomic Position | scaffold360:+ 129335-132228 |
CDS Length | 2070 |
Paired RNAseq reads   | 1079 |
Single RNAseq reads   | 2560 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005317 (0.0) |
Best Drosophila hit   | thoc5 (2e-40) |
Best Human hit | THO complex subunit 5 homolog (4e-82) |
Best NR hit (blastp)   | fms interacting protein [Aedes aegypti] (2e-142) |
Best NR hit (blastx)   | fms interacting protein [Aedes aegypti] (2e-115) |
GeneOntology terms    | GO:0003723 RNA binding; GO:0005737 cytoplasm; GO:0008380 RNA splicing; GO:0051028 mRNA transport; GO:0006397 mRNA processing; GO:0030154 cell differentiation; GO:0005634 nucleus; GO:0006810 transport |
InterPro families   | IPR019163 THO complex, subunit 5 |
Orthology group | MCL14086 |
Nucleotide sequence:
ATGGGTAAGGACGATACCTCAACGAAAAAACGACGTAAACTGACTACTACTTCATCGAGT
GATAATAACACTAAGCAGACCCCGGTCGATATTTATAAGAAAGTCGTCGAATTCGAAGAA
GCTGAGGCGCAGTTACGTTCAGCCGATAAGGATGCAGCGTTGTTTAAAAAGATATGTCAA
GATGTTCGCCAATTATTTGCCGAAATAGCAGAATTAAAAGAAAAAGGCACTGATGAGGCA
AAAGAAAAAATCAATGTAAAAAGAGTAGAGGCATCATTGCATTTAGTAGCATTGAAAAAG
TTAAACAGATTGGAAAAGGTTCGTACAAGAGCTGGAAGAGAGGCTCTGCACAAAGAAAAG
CAGAGAGTTGATTCAACACATCTCCTCTTGCAAAATCTTCTCTATGAAGCTGATCATCTT
AATAAAGAAGTGACAAAATGCTTACAATTTAAATCAAAAGATGAAGAAATAGAATTAATA
CCACTAGAAGAATTTTACAAGGAAGCACCAAGTGAAATCTCTCGGCCGGAAGTAACAAAA
GCAGATGAGCATCAACTTCAATTGGCAAGGCTTGAGTGGGAATTACGTCAGCGAAGGGAA
CTTGCTGGGGCCTGTAGTGAGTTGGTGGCTTCAAAGGAATGTGTGGCAGCAGCTATAGCT
GCAGCACGGTCGAGGCTGAATGCACTCTCACCGCATTTGAAAGATGTTCTGAAGGCTACA
AAACCACTTCAAGAATGTCTAGCTCTTAGATTGGATGAAAAGAGAGATGAGACGAGAGCA
GCATCACTTCTCCCACCTCCTTTATTTTTGCTCTACGCCAATGCCAGTGCATATTCTGAT
GCTCTTGGTGCTAGCAATGTTGTTGTTGGAATATCTGGAGATGAAGATGAAGCGAAAAGA
TTGGATCAGTTAAGCAATGTTGAAAGTGAACTTGTAGTATCAAACGATTCAGACTCTGAC
CAGGAAAATAACTATGAAGAACCAAGAGATAAGAAAAAGAGACACCACCGAGGTACAAAA
ATATCAAGAGAAGAGAAAGCCGAGGCCAAAAAGAAAGAAGTACTTAAAAGACATCCTCTT
AATGTTAAAGTTACTGTGAAAATACCAGACGGGACTGCATTGAATCTTATATTCTCATAC
ATGGTTCATTTAAAAATTATTGTAGTCAAAAACACTCTGGACCTGTTTAAACCTATAACA
GGAGTTTCAGCTGCCGATGTATTGAATGGAGACTGTATACTTAACGAACTTTACATTGGT
GACAACGGCAATGACTCTCCACATCCAGCCACCACCTATTTACTTAATGCAGCTGGCATT
GTGGAAGATTTTCACTATTTTATTCCTGAAGTTGGTAGACCTTACATATGGGCTCAGAGA
ATGTGCGGATTGGATTTCATGGCAGTGACGGGTGAAGAAAAAAAGTCCAATATTATTCAG
CCGAGTCAAAGTCTCAGTGTTGTCAGTGTTGAAAATTTTATTTTTACTCTGAAGAAAAGA
TTGAAATCGAGAGTGGAACTTATGAAAGAATTGCAAGATTTGGAAAGTGGTAAAATTATA
CCGGAAAAAGGCGTGGGATGTCCCTTGAGACTATCAGGTTCGTTGACCCAGTGGCAGTCA
GTGGGATGGAATGAATATAGCCAATCGACTTCAACATCATTCCTGATATCGGAAGGCCAA
GTGAATCCAGAGAATATGTTGTACCGCGCTATAATCACAAGACAATCAGCTAAGCTCGTA
GCATTGGTTGCCGTGAGTAGTGATTATCCGAAAAAGGCACCGCTGTTTTCATTGACATTA
CATTGGAACGGTACACACACCGCAGGAACAAACGATGACATAAGAGACATCGAGAGAATC
ATCAATACGAACTGGACTAATGATGGCAATAAGTCTACTCTCACCGCACAGATGACGAAG
TTACTCACTTGTCTAGATATTCTCCTGGAGACCACAGGGTCATCAGAATTTCCTCCCGAC
AAAGTAATGTTCCAGTCAGTGAGAGGAAGAAATCGGATGAAACCTTACCGTTTTATAAAA
CAAGGTACAGGCGTGTTTGTACAATATTGA
Protein sequence:
MGKDDTSTKKRRKLTTTSSSDNNTKQTPVDIYKKVVEFEEAEAQLRSADKDAALFKKICQ
DVRQLFAEIAELKEKGTDEAKEKINVKRVEASLHLVALKKLNRLEKVRTRAGREALHKEK
QRVDSTHLLLQNLLYEADHLNKEVTKCLQFKSKDEEIELIPLEEFYKEAPSEISRPEVTK
ADEHQLQLARLEWELRQRRELAGACSELVASKECVAAAIAAARSRLNALSPHLKDVLKAT
KPLQECLALRLDEKRDETRAASLLPPPLFLLYANASAYSDALGASNVVVGISGDEDEAKR
LDQLSNVESELVVSNDSDSDQENNYEEPRDKKKRHHRGTKISREEKAEAKKKEVLKRHPL
NVKVTVKIPDGTALNLIFSYMVHLKIIVVKNTLDLFKPITGVSAADVLNGDCILNELYIG
DNGNDSPHPATTYLLNAAGIVEDFHYFIPEVGRPYIWAQRMCGLDFMAVTGEEKKSNIIQ
PSQSLSVVSVENFIFTLKKRLKSRVELMKELQDLESGKIIPEKGVGCPLRLSGSLTQWQS
VGWNEYSQSTSTSFLISEGQVNPENMLYRAIITRQSAKLVALVAVSSDYPKKAPLFSLTL
HWNGTHTAGTNDDIRDIERIINTNWTNDGNKSTLTAQMTKLLTCLDILLETTGSSEFPPD
KVMFQSVRGRNRMKPYRFIKQGTGVFVQY
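
The nucleotide and protein listings above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the record. It translates the first and last lines of the nucleotide sequence and prints the corresponding peptide fragments.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order:
# the first base varies slowest, the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases (e.g. N)
    are not handled in this sketch and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError(f"CDS length {len(cds)} is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGGTAAGGACGATACCTCAACGAAAAAACGACGTAAACTGACTACTACTTCATCGAGT"
last_line = "CAAGGTACAGGCGTGTTTGTACAATATTGA"

print(translate(first_line))  # MGKDDTSTKKRRKLTTTSSS
print(translate(last_line))   # QGTGVFVQY*
```

Both fragments match the start and end of the protein sequence. The stated CDS length of 2070 nt corresponds to 690 codons, that is, 689 residues plus the terminal stop codon (TGA).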