DPGLEAN10370 in OGS1.0

New model in OGS2.0  DPOGS207766
Genomic Position  scaffold360:+ 129335-132228
CDS Length  2070
Paired RNAseq reads  1079
Single RNAseq reads  2560
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005317 (0.0)
Best Drosophila hit  thoc5 (2e-40)
Best Human hit  THO complex subunit 5 homolog (4e-82)
Best NR hit (blastp)  fms interacting protein [Aedes aegypti] (2e-142)
Best NR hit (blastx)  fms interacting protein [Aedes aegypti] (2e-115)
GeneOntology terms
GO:0003723 RNA binding
GO:0005737 cytoplasm
GO:0008380 RNA splicing
GO:0051028 mRNA transport
GO:0006397 mRNA processing
GO:0030154 cell differentiation
GO:0005634 nucleus
GO:0006810 transport
InterPro families  IPR019163 THO complex, subunit 5
Orthology group  MCL14086

Nucleotide sequence:

ATGGGTAAGGACGATACCTCAACGAAAAAACGACGTAAACTGACTACTACTTCATCGAGT
GATAATAACACTAAGCAGACCCCGGTCGATATTTATAAGAAAGTCGTCGAATTCGAAGAA
GCTGAGGCGCAGTTACGTTCAGCCGATAAGGATGCAGCGTTGTTTAAAAAGATATGTCAA
GATGTTCGCCAATTATTTGCCGAAATAGCAGAATTAAAAGAAAAAGGCACTGATGAGGCA
AAAGAAAAAATCAATGTAAAAAGAGTAGAGGCATCATTGCATTTAGTAGCATTGAAAAAG
TTAAACAGATTGGAAAAGGTTCGTACAAGAGCTGGAAGAGAGGCTCTGCACAAAGAAAAG
CAGAGAGTTGATTCAACACATCTCCTCTTGCAAAATCTTCTCTATGAAGCTGATCATCTT
AATAAAGAAGTGACAAAATGCTTACAATTTAAATCAAAAGATGAAGAAATAGAATTAATA
CCACTAGAAGAATTTTACAAGGAAGCACCAAGTGAAATCTCTCGGCCGGAAGTAACAAAA
GCAGATGAGCATCAACTTCAATTGGCAAGGCTTGAGTGGGAATTACGTCAGCGAAGGGAA
CTTGCTGGGGCCTGTAGTGAGTTGGTGGCTTCAAAGGAATGTGTGGCAGCAGCTATAGCT
GCAGCACGGTCGAGGCTGAATGCACTCTCACCGCATTTGAAAGATGTTCTGAAGGCTACA
AAACCACTTCAAGAATGTCTAGCTCTTAGATTGGATGAAAAGAGAGATGAGACGAGAGCA
GCATCACTTCTCCCACCTCCTTTATTTTTGCTCTACGCCAATGCCAGTGCATATTCTGAT
GCTCTTGGTGCTAGCAATGTTGTTGTTGGAATATCTGGAGATGAAGATGAAGCGAAAAGA
TTGGATCAGTTAAGCAATGTTGAAAGTGAACTTGTAGTATCAAACGATTCAGACTCTGAC
CAGGAAAATAACTATGAAGAACCAAGAGATAAGAAAAAGAGACACCACCGAGGTACAAAA
ATATCAAGAGAAGAGAAAGCCGAGGCCAAAAAGAAAGAAGTACTTAAAAGACATCCTCTT
AATGTTAAAGTTACTGTGAAAATACCAGACGGGACTGCATTGAATCTTATATTCTCATAC
ATGGTTCATTTAAAAATTATTGTAGTCAAAAACACTCTGGACCTGTTTAAACCTATAACA
GGAGTTTCAGCTGCCGATGTATTGAATGGAGACTGTATACTTAACGAACTTTACATTGGT
GACAACGGCAATGACTCTCCACATCCAGCCACCACCTATTTACTTAATGCAGCTGGCATT
GTGGAAGATTTTCACTATTTTATTCCTGAAGTTGGTAGACCTTACATATGGGCTCAGAGA
ATGTGCGGATTGGATTTCATGGCAGTGACGGGTGAAGAAAAAAAGTCCAATATTATTCAG
CCGAGTCAAAGTCTCAGTGTTGTCAGTGTTGAAAATTTTATTTTTACTCTGAAGAAAAGA
TTGAAATCGAGAGTGGAACTTATGAAAGAATTGCAAGATTTGGAAAGTGGTAAAATTATA
CCGGAAAAAGGCGTGGGATGTCCCTTGAGACTATCAGGTTCGTTGACCCAGTGGCAGTCA
GTGGGATGGAATGAATATAGCCAATCGACTTCAACATCATTCCTGATATCGGAAGGCCAA
GTGAATCCAGAGAATATGTTGTACCGCGCTATAATCACAAGACAATCAGCTAAGCTCGTA
GCATTGGTTGCCGTGAGTAGTGATTATCCGAAAAAGGCACCGCTGTTTTCATTGACATTA
CATTGGAACGGTACACACACCGCAGGAACAAACGATGACATAAGAGACATCGAGAGAATC
ATCAATACGAACTGGACTAATGATGGCAATAAGTCTACTCTCACCGCACAGATGACGAAG
TTACTCACTTGTCTAGATATTCTCCTGGAGACCACAGGGTCATCAGAATTTCCTCCCGAC
AAAGTAATGTTCCAGTCAGTGAGAGGAAGAAATCGGATGAAACCTTACCGTTTTATAAAA
CAAGGTACAGGCGTGTTTGTACAATATTGA

Protein sequence:

MGKDDTSTKKRRKLTTTSSSDNNTKQTPVDIYKKVVEFEEAEAQLRSADKDAALFKKICQ
DVRQLFAEIAELKEKGTDEAKEKINVKRVEASLHLVALKKLNRLEKVRTRAGREALHKEK
QRVDSTHLLLQNLLYEADHLNKEVTKCLQFKSKDEEIELIPLEEFYKEAPSEISRPEVTK
ADEHQLQLARLEWELRQRRELAGACSELVASKECVAAAIAAARSRLNALSPHLKDVLKAT
KPLQECLALRLDEKRDETRAASLLPPPLFLLYANASAYSDALGASNVVVGISGDEDEAKR
LDQLSNVESELVVSNDSDSDQENNYEEPRDKKKRHHRGTKISREEKAEAKKKEVLKRHPL
NVKVTVKIPDGTALNLIFSYMVHLKIIVVKNTLDLFKPITGVSAADVLNGDCILNELYIG
DNGNDSPHPATTYLLNAAGIVEDFHYFIPEVGRPYIWAQRMCGLDFMAVTGEEKKSNIIQ
PSQSLSVVSVENFIFTLKKRLKSRVELMKELQDLESGKIIPEKGVGCPLRLSGSLTQWQS
VGWNEYSQSTSTSFLISEGQVNPENMLYRAIITRQSAKLVALVAVSSDYPKKAPLFSLTL
HWNGTHTAGTNDDIRDIERIINTNWTNDGNKSTLTAQMTKLLTCLDILLETTGSSEFPPD
KVMFQSVRGRNRMKPYRFIKQGTGVFVQY
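
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal Python sketch, assuming the CDS is translated with the standard nuclear genetic code (NCBI translation table 1). The function name translate_cds and the variable names are hypothetical and not part of the record. Only the first and last rows of the nucleotide sequence are used here.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases taken in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the nucleotide sequence listed in this record.
first_row = "ATGGGTAAGGACGATACCTCAACGAAAAAACGACGTAAACTGACTACTACTTCATCGAGT"
last_row = "CAAGGTACAGGCGTGTTTGTACAATATTGA"

print(translate_cds(first_row))  # start of the protein
print(translate_cds(last_row))   # end of the protein, before the TGA stop
```

A CDS length of 2070 corresponds to 690 codons, i.e. 689 amino acids plus one stop codon, which agrees with the length of the protein sequence listed above.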