DPGLEAN19964 in OGS1.0

New model in OGS2.0DPOGS208159 
Genomic Positionscaffold3108:- 7439-17401
See gene structure
CDS Length2673
Paired RNAseq reads  958
Single RNAseq reads  2609
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014067 (3e-58)
Best Drosophila hit  CG15160 (2e-62)
Best Human hitregulation of nuclear pre-mRNA domain-containing protein 2 (7e-30)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (3e-94)
Best NR hit (blastx)  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (9e-82)
GeneOntology terms  GO:0005515 protein binding
InterPro families

  
IPR006903 Domain of unknown function DUF618
IPR008942 ENTH/VHS
IPR006569 RNA polymerase II, large subunit, CTD
Orthology groupMCL16088

Nucleotide sequence:

ATGGGTGAAACCGAAGAATTCAATACTTTAGCGTTCGAAAAGAAACTCACACAGCTGAAG
GATACACAGGAAAGTATCCAGTCTCTATCTAGCTGGTGTTTGAAACAAAGAACTCACCAT
AAAAAGATAGTTTCCAGTTGGTTGAATGTGTTGAAGAGAGTGAAGGTGGAACAGCGACTT
GTGTTGTTTTATTTAGCTAACGATGTTATTCAATATAGCAAAAGGAAAAATTACGAGTTT
GTTGAAAGCTGGGGTCTTAATTTGCAAAAAGCTACACCACTAGTCAGAGATGAGAAAGTT
AGACCGAAGATATTGAGGATATTCAAAATCTGGGAACAAAGATCAGTTTATGATGATGAG
TTCTTATCAGATCTCACAGGTCTACTGAGTGCGGGGGCTGTTAAAAAGACTGATGACGAT
CCATTGGATTTTCAACCTCAACAGTTGGTCAACAAGATAAGACAGTGTACTGTGCTAGAA
GCGGATACTCAGGTGAAATTGAAATGTATTTGTAACAGTCCATTAGAATTGTCGGATACG
GATGCATTATGTTCAAATTTAAAAGAGAGAAGTAGCAAGGATGACGTAGAAAAGGAACTC
AATGAAGGCATACAATGTGTGGAACGATACACTCAAGCATTACAGAGGGAAATAGTTGCT
AGGGAGGCATTACTAGCATTGTTGAGTTCAGCAAACCAATACTACTCTACGCAGAGGGGA
GAAGTGAAAGTTGTGGCATATGCATACAAAAACTTCGGTTCCCGAGTGCGCGCTCTCAAA
CGCAAGTTAGATGAATTGTTGCCAACACTACCGAGTGCACCGTCGCCGCCGACTAGAGAT
GAAGACGTACCCTCACCTGGACCGGACGAGGATTTGGAACTACCCACCAATGAAAATGAA
GTATCATACAACATCGACCAAACCTTCAATACATCGGTGTCAGCTGATGGGTCCTTGTAT
AACTTGGGACTATCGTCGTTCCTCAACGAAAACTCCATGGCTATATTCAATGAAAGCCAA
GCGGATTTAAATATTGTTAATAGCAGCATACAGCCAGATACGCTTCCGGGATTGGACCTC
CTTAAGGAATCCAACCCTCCACCACCGACATCATTCTACGGGACCATGGAAACTATCACT
AAGAGAAGTAGCAAGGATGACGTAGAAAAGGAACTCAATGAAGGCATACAATGTGTGGAA
CGATACACTCAAGCATTACAGAGGGAAATAGTTGCTAGGGAGGCATTACTAGCATTGTTG
AGTTCAGCAAACCAATACTACTCTACGCAGAGGGGAGAAGTGAAAGTTGTGGCATATGCA
TACAAAAACTTCGGTTCCCGAGTGCGCGCTCTCAAACGCAAGTTAGATGAATTGTTGCCA
ACACTACCGAGTGCACCGTCGCCGCCGACTAGAGATGAAGACGTACCCTCACCTGGACCG
GACGAGGATTTGGAACTACCCACCAATGAAAATGAAGTATCATACAACATCGACCAAACC
TTCAATACATCGGTGTCAGCTGATGGGTCCTTGTATAACTTGGGACTATCGTCGTTCCTC
AACGAAAACTCCATGGCTATATTCAATGAAAGCCAAGCGGATTTAAATATTGTTAATAGC
AGCATACAGCCAGATACGCTTCCGGGATTGGACCTCCTTAAGGAATCCAACCCTCCACCA
CCGACATCATTCTACGGGACCATGGAAACTATCACTATTCCTGATGAACCCGATCAACCT
TATCTACCAGAAGCTGTGGTCACAAACAGTCAATGGGCAAATAATACTTGGAATGTGCCT
CTTCCAGTGGCACGTAACGTGTTCGCGGAGCCCCCAGCCTCACCGCCCGTGCCGATACGA
ACAGACACACAGGTGGATATTTCCGCGTCTGATCATGAGCTTCGCAGCCGCCTGCCACCA
CCGCCACCGCCTCCTGTACTGCCAGGTCTTACACATATCGAGGATGTAGATCACAGGCTA
CTGCCAAGTCTACCTCCGACGCCCGTACCTCCGCCTATACGTCATTCACATCAAGATGTG
GATCACAGGAATCTTATATCACTGACACAACTACCTCCCAGACATGTTAATGTAGATCAA
GACTACCGCCTACCTCCACTGTCTCAGCCCCTGGGTGTGCCTCTCCTGCCACCTCCGCCT
TCAGACATTGTTGAGAGTGTCGATATGGACCTATCAGAGGACGAAGAGCAAGGTATGTAC
CAGACACAGAATCAAAGCGATCACAGACACAACAGCTTTAACAACAATAAAGTACTGGTC
GGTGGTGAGAAGAAGGACAACAGTAATCTCATACAGATAAACGCTAATATAGACATCGAG
GCCCCCCACGGAACCCCCGTGGCCCCCATGACGAACCCCTTCGACAACATGCCCCCCCAA
CTACGATACAACTTCAATACCAACTATCAAAAGAATCCAGATAACGCCGAATATAATGAT
GATATAAGAAATCGTAACCTCGATAGAAGAAATCAATCGCCTGAGTACGAAGATTATAAG
AGTAACGATTTCCAGGCCCCCCGGCCTTATATGAACCGGTTCCCGAGGAATTGGGGGCCT
CGCAATAACTTCAGGGCCCCATATAATCAGTTCAATCAGCGCAACGGCGGCCCTCGACAA
CGATGGGGCGGGCCCAGGCAAAGGTTTTGGTGA

Protein sequence:

MGETEEFNTLAFEKKLTQLKDTQESIQSLSSWCLKQRTHHKKIVSSWLNVLKRVKVEQRL
VLFYLANDVIQYSKRKNYEFVESWGLNLQKATPLVRDEKVRPKILRIFKIWEQRSVYDDE
FLSDLTGLLSAGAVKKTDDDPLDFQPQQLVNKIRQCTVLEADTQVKLKCICNSPLELSDT
DALCSNLKERSSKDDVEKELNEGIQCVERYTQALQREIVAREALLALLSSANQYYSTQRG
EVKVVAYAYKNFGSRVRALKRKLDELLPTLPSAPSPPTRDEDVPSPGPDEDLELPTNENE
VSYNIDQTFNTSVSADGSLYNLGLSSFLNENSMAIFNESQADLNIVNSSIQPDTLPGLDL
LKESNPPPPTSFYGTMETITKRSSKDDVEKELNEGIQCVERYTQALQREIVAREALLALL
SSANQYYSTQRGEVKVVAYAYKNFGSRVRALKRKLDELLPTLPSAPSPPTRDEDVPSPGP
DEDLELPTNENEVSYNIDQTFNTSVSADGSLYNLGLSSFLNENSMAIFNESQADLNIVNS
SIQPDTLPGLDLLKESNPPPPTSFYGTMETITIPDEPDQPYLPEAVVTNSQWANNTWNVP
LPVARNVFAEPPASPPVPIRTDTQVDISASDHELRSRLPPPPPPPVLPGLTHIEDVDHRL
LPSLPPTPVPPPIRHSHQDVDHRNLISLTQLPPRHVNVDQDYRLPPLSQPLGVPLLPPPP
SDIVESVDMDLSEDEEQGMYQTQNQSDHRHNSFNNNKVLVGGEKKDNSNLIQINANIDIE
APHGTPVAPMTNPFDNMPPQLRYNFNTNYQKNPDNAEYNDDIRNRNLDRRNQSPEYEDYK
SNDFQAPRPYMNRFPRNWGPRNNFRAPYNQFNQRNGGPRQRWGGPRQRFW