DPGLEAN22175 in OGS1.0

New model in OGS2.0DPOGS208771 
Genomic Positionscaffold2102:+ 35531-37237
See gene structure
CDS Length1707
Paired RNAseq reads  180
Single RNAseq reads  499
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007632 (4e-168)
Best Drosophila hit  CG34183 (4e-27)
Best Human hitputative RNA polymerase II subunit B1 CTD phosphatase RPAP2 (4e-30)
Best NR hit (blastp)  PREDICTED: RNA polymerase II associated protein 2 [Oryctolagus cuniculus] (3e-53)
Best NR hit (blastx)  PREDICTED: similar to RNA polymerase II-associated protein 2 [Tribolium castaneum] (4e-77)
GeneOntology terms




  
GO:0005634 nucleus
GO:0016787 hydrolase activity
GO:0016020 membrane
GO:0004721 phosphoprotein phosphatase activity
GO:0016021 integral to membrane
GO:0046872 metal ion binding
InterPro families  IPR007308 Protein of unknown function DUF408
Orthology groupMCL16441

Nucleotide sequence:

ATGGAATTTAAAAAAAGTACAAAGCGACCACCTAAAATCGAAGAAATGTCCAAGGAACAA
ATACGAAAAGCTATCATTAAAAAACGCGAGTGTAACGCCAAAGCACAAAACATAGTTGAA
AAATTACTTGAAAAGTGTGTGAATGAGGAATATTTTTTAAAATGCTTACTCGATATTAAC
CAAAGTCATTTCGATGATGTGATTGAGGAACGGTCTATATTACAACTATGTGGTTATCCA
TTATGTCAGAGAACATTGTTAGAGAAGGACATTCCTAAACAAAAATACAGGATATCGTTG
AAAACAAATAAGGTTTACGATATAACAACAAGGAAATGCTTCTGCAGTAACATCTGTTAC
AAATCGGCAATGCATGTTAAAAAACAAATGTTGACGAGTCCTTTATGGTTTAGGGAATAT
GAAGAAATACCGACAGGACAAGAGGTAGACTTGGGTGGACCCGCAAAAATAGAAATAAAT
AAAGATGATTTTATCACCACTTCACAATTCACTAAATCCAGCTTTCAGCATGCATCTGAT
ATTGTAGATTCAAATAAGATTGATGTCCATAAAATTGAAACTAAATTATTAACCAATAGT
ACGGATGTAATAGGAAGCAACATAGTCAATACTAATGATGAAAATATTACACCCAGTAAT
CATTTAGAATCTTCTAAACCGTCATACACACAGGAACCTGATGTGAGGAAGAAACCTAAT
AAGAAAACAACAAACCCATTAAATATTGTTGGTGATATAGTGGAGAAACCAGAGAAACAG
ATTGACCCTATACTCATTAATAGACCTTCAAGTAAAGACAAAGAACCTGAGAAACCAACC
ACCAGCTTAACCAGAACTATACATCAGAAGAAACCGCCTTCAATCACAGCCATAACAATA
AATGTAGAAAAATGTTTAGCGGAATGGTGTACGATTGACACACTGCTGTTTATATATGGG
GAGGAAAACGTAAAGAAAATGTTATCCAACAAAGGACAATTTATAACAGACTACTTAAAC
AATTACTCCAAAAGCATTTTCTACACTTCAAACACATACGACCAGTACCAAGCATTGTGC
CGCAAACTTAACTTGTTAGAATTAGAATCGAGACGACAAGATGCTCAGATATTAAATAAA
GAAACTAGACCATTACCAGATTATTCAATTCTGAAGGAAGAGAGTAAGAAAATACAGTTT
AAAGTCAGAGCCTTTTTTGCGGGAGAAATTGAAATCCCCGAACCAGAGGAGCCCACGGAG
GTTGATGCATCCAATGAACATGATAACTCAACTGTGTTACCACTAGTTGATAAGAATTCA
CAAAATGCACTGCGAAGGAAAATTGTTTGCCAGCATTTAAACAAGGTGTTGCCAGATTTA
TTACGATCACTTGGCTTATTAAATCTAACAATCAGTTCTGACATACGACTGCTTGTAAAT
ACATTCAAATTGAAAGCAGACAATATTATGTTCAAACCTATACAGTGGACATTGATCGCT
TTAGTGTTTATAAAATTGTTATCTATAAGGGACGAACAATTAAAAGGTTTACTGGAGCAT
GAAACGGCATTCAAACACATGCAGCTCTTGTTGCTCAGTTATAATCAAGACGGAGGTTAT
TTAGACAGGCTCATTTCTTGGTTGACCGATGTAAATAGGTTACTCGACGTAAATGATAAT
CAAATGACTATTGAAAAAAATATGTAG

Protein sequence:

MEFKKSTKRPPKIEEMSKEQIRKAIIKKRECNAKAQNIVEKLLEKCVNEEYFLKCLLDIN
QSHFDDVIEERSILQLCGYPLCQRTLLEKDIPKQKYRISLKTNKVYDITTRKCFCSNICY
KSAMHVKKQMLTSPLWFREYEEIPTGQEVDLGGPAKIEINKDDFITTSQFTKSSFQHASD
IVDSNKIDVHKIETKLLTNSTDVIGSNIVNTNDENITPSNHLESSKPSYTQEPDVRKKPN
KKTTNPLNIVGDIVEKPEKQIDPILINRPSSKDKEPEKPTTSLTRTIHQKKPPSITAITI
NVEKCLAEWCTIDTLLFIYGEENVKKMLSNKGQFITDYLNNYSKSIFYTSNTYDQYQALC
RKLNLLELESRRQDAQILNKETRPLPDYSILKEESKKIQFKVRAFFAGEIEIPEPEEPTE
VDASNEHDNSTVLPLVDKNSQNALRRKIVCQHLNKVLPDLLRSLGLLNLTISSDIRLLVN
TFKLKADNIMFKPIQWTLIALVFIKLLSIRDEQLKGLLEHETAFKHMQLLLLSYNQDGGY
LDRLISWLTDVNRLLDVNDNQMTIEKNM