DPGLEAN07180 in OGS1.0

New model in OGS2.0  DPOGS214796
Genomic Position  scaffold7337:+ 501-4815
CDS Length  1698
Paired RNAseq reads  576
Single RNAseq reads  1828
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012107 (0.0)
Best Drosophila hit  Hpr1 (5e-124)
Best Human hit  THO complex subunit 1 (3e-105)
Best NR hit (blastp)  PREDICTED: similar to Hpr1 CG2031-PA isoform 1 [Apis mellifera] (1e-176)
Best NR hit (blastx)  PREDICTED: similar to Hpr1 CG2031-PA isoform 1 [Apis mellifera] (2e-157)
GeneOntology terms
GO:0005654 nucleoplasm
GO:0007165 signal transduction
GO:0005515 protein binding
GO:0006406 mRNA export from nucleus
InterPro families  IPR021861 THO complex, subunit THOC1
Orthology group  MCL14035

Nucleotide sequence:

ATGTCCGAAAAAGTTGGATTTGATGAGTTGCGGCTTAAGTATAAGGATGTATTATCAAAA
GCATTTACTACAAATAATATCGATTTGCTCGATTCCTTTTCAAAAAATGCGAATGAGAGC
GACAGAAAATCAGCCATGGACCAAGCCTTTAGAGATAAACTACTGGATTTGCTATTAGAG
GAGCCAAATACCTTGGAAAGCTATGTAAATTTTTGTATAGACTCATGTCGAAGGCAAATG
GTGACTGCAACTATGCCTGTAGTTCTTTTGGGTGATATTTTCGATGCACTAACACTGAAC
AAGTGTGAAAAAATGTTTATGTATGTTGAAAATGGAGTAAATATATGGAGGGAAGAATTA
TTCTTTGTGGCATGTAAGAACCATTTACTAAGAATGTGTAATGACTTACTGAGAAGATTA
TCAAGATCTCAAAATACGGTATTCTGTGGCAGAATATTGTTATTCTTAGCCAAATTCTTT
CCATTTTCTGAGCGCTCTGGGCTTAATATTGTGTCCGAATTCAACTTGGAAAATATTACG
GAGTTTGGTGGTGATAATACAAGTACCTTAAAGGATGTATTGGATGAAGAAATGGTTGTT
GAAGATGATAAAAATAAATTGGTCATAGATTACAATCTTTACTGTAGATTTTGGAGCTTA
CAGGACTTCTTTAGGAACCCTAACACATGTTATAATAAATTACAATGGAAAACCTTTGTT
GCGCATTCAGGCAGCGTCCTATCAGCTTTCTCCTCATACAAGCTAGAGGCTGTGGAGTTG
CAGAAATCGAAATTAAATATACTTAAATCGGTAAACTCGGATGTTGAAATGCAAGAGAAC
AAAGAGCAACATTACTTTGCAAAGTTTCTAACAAACCAGAAACTTTTGGAGTTACAGTTG
TCTGACTCAAATTTCAGGAGATGTGTCCTGATACAGTATCTGATACTCTTTCAATATTTA
ATGTCGACGGTAAAGTTTAAAATGGAATCTCAGGAATTAAAATCAGATCAGATAGACTGG
GTGAAAGACACCACAGCCCTTGTTTATAAGCTCCTCGGTGAAACTCCGCCCGACGGCAAA
CAGTTTGCTGAATGCGTGAAGAGAATATTGAAGAGGGAAGAACATTGGAACAGTTGGAAG
AACGATGGATGTCCAGAATTTCAAAAGCCAAAACCTCCGGTGCAAGCGGAGAACGAGGAG
GTGAAGCGTTCGAGGAAACGACGTCGGCCTGTTGGAGATATTATTAAAGAATATTCTGGA
ACCGACAAGTTCTTCATGGGGAATAATGACCTGACAAAGTTGTGGAACCTCTGTCCGGAC
AATCTAGCTGCTTGTAGAACAAAGGAAAGAGATTTCATGCCATCACTAGAATCATATATG
TTATCTGGTGATGGTGGTGAGGGTGCTGGTGGGTGGGGGTGGCGAGCTCTGCGTCTGCTA
GCAAGAAGGTCACCACACTTCTTCGTCCACACAAACAATCCCATCGGACGGCTGCCGGAT
TATCTTGATGATATGGTTAAAAAAATCACTCGTGAAGTGGCCGCCAATAATGTGTCCAAT
AACACAAACGGTGACTCCAACGTTAACAACGATAAAGCTAAGCTGGAGCAGACTGAAGAA
GAACTGACAGAAGAGCAAATAGAAACTGATATTATAAAGGAAGAAGACACGACCGACCTT
GAACAGGTGACTGTGTGA

Protein sequence:

MSEKVGFDELRLKYKDVLSKAFTTNNIDLLDSFSKNANESDRKSAMDQAFRDKLLDLLLE
EPNTLESYVNFCIDSCRRQMVTATMPVVLLGDIFDALTLNKCEKMFMYVENGVNIWREEL
FFVACKNHLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENIT
EFGGDNTSTLKDVLDEEMVVEDDKNKLVIDYNLYCRFWSLQDFFRNPNTCYNKLQWKTFV
AHSGSVLSAFSSYKLEAVELQKSKLNILKSVNSDVEMQENKEQHYFAKFLTNQKLLELQL
SDSNFRRCVLIQYLILFQYLMSTVKFKMESQELKSDQIDWVKDTTALVYKLLGETPPDGK
QFAECVKRILKREEHWNSWKNDGCPEFQKPKPPVQAENEEVKRSRKRRRPVGDIIKEYSG
TDKFFMGNNDLTKLWNLCPDNLAACRTKERDFMPSLESYMLSGDGGEGAGGWGWRALRLL
ARRSPHFFVHTNNPIGRLPDYLDDMVKKITREVAANNVSNNTNGDSNVNNDKAKLEQTEE
ELTEEQIETDIIKEEDTTDLEQVTV
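
The CDS length of 1698 nt corresponds to 566 codons, that is 565 residues plus a stop codon, which matches the protein sequence listed above. A minimal sketch of how that consistency could be checked, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the short test fragments are illustrative and not part of the record:

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence can be
    pasted as the 60-column block shown in the record.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 18 nt of the nucleotide sequence above.
print(translate("ATGTCCGAAAAAGTTGGATTT"))  # MSEKVGF
print(translate("GAACAGGTGACTGTGTGA"))     # EQVTV
print(1698 // 3 - 1)                       # 565 residues expected
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block, after removing line breaks, would confirm the two sequences agree end to end.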