New model in OGS2.0 | DPOGS208771  |
---|---|
Genomic Position | scaffold2102:+ 35531-37237 |
See gene structure | |
CDS Length | 1707 |
Paired RNAseq reads   | 180 |
Single RNAseq reads   | 499 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007632 (4e-168) |
Best Drosophila hit   | CG34183 (4e-27) |
Best Human hit | putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 (4e-30) |
Best NR hit (blastp)   | PREDICTED: RNA polymerase II associated protein 2 [Oryctolagus cuniculus] (3e-53) |
Best NR hit (blastx)   | PREDICTED: similar to RNA polymerase II-associated protein 2 [Tribolium castaneum] (4e-77) |
GeneOntology terms    | GO:0005634 nucleus GO:0016787 hydrolase activity GO:0016020 membrane GO:0004721 phosphoprotein phosphatase activity GO:0016021 integral to membrane GO:0046872 metal ion binding |
InterPro families   | IPR007308 Protein of unknown function DUF408 |
Orthology group | MCL16441 |
Nucleotide sequence:
ATGGAATTTAAAAAAAGTACAAAGCGACCACCTAAAATCGAAGAAATGTCCAAGGAACAA
ATACGAAAAGCTATCATTAAAAAACGCGAGTGTAACGCCAAAGCACAAAACATAGTTGAA
AAATTACTTGAAAAGTGTGTGAATGAGGAATATTTTTTAAAATGCTTACTCGATATTAAC
CAAAGTCATTTCGATGATGTGATTGAGGAACGGTCTATATTACAACTATGTGGTTATCCA
TTATGTCAGAGAACATTGTTAGAGAAGGACATTCCTAAACAAAAATACAGGATATCGTTG
AAAACAAATAAGGTTTACGATATAACAACAAGGAAATGCTTCTGCAGTAACATCTGTTAC
AAATCGGCAATGCATGTTAAAAAACAAATGTTGACGAGTCCTTTATGGTTTAGGGAATAT
GAAGAAATACCGACAGGACAAGAGGTAGACTTGGGTGGACCCGCAAAAATAGAAATAAAT
AAAGATGATTTTATCACCACTTCACAATTCACTAAATCCAGCTTTCAGCATGCATCTGAT
ATTGTAGATTCAAATAAGATTGATGTCCATAAAATTGAAACTAAATTATTAACCAATAGT
ACGGATGTAATAGGAAGCAACATAGTCAATACTAATGATGAAAATATTACACCCAGTAAT
CATTTAGAATCTTCTAAACCGTCATACACACAGGAACCTGATGTGAGGAAGAAACCTAAT
AAGAAAACAACAAACCCATTAAATATTGTTGGTGATATAGTGGAGAAACCAGAGAAACAG
ATTGACCCTATACTCATTAATAGACCTTCAAGTAAAGACAAAGAACCTGAGAAACCAACC
ACCAGCTTAACCAGAACTATACATCAGAAGAAACCGCCTTCAATCACAGCCATAACAATA
AATGTAGAAAAATGTTTAGCGGAATGGTGTACGATTGACACACTGCTGTTTATATATGGG
GAGGAAAACGTAAAGAAAATGTTATCCAACAAAGGACAATTTATAACAGACTACTTAAAC
AATTACTCCAAAAGCATTTTCTACACTTCAAACACATACGACCAGTACCAAGCATTGTGC
CGCAAACTTAACTTGTTAGAATTAGAATCGAGACGACAAGATGCTCAGATATTAAATAAA
GAAACTAGACCATTACCAGATTATTCAATTCTGAAGGAAGAGAGTAAGAAAATACAGTTT
AAAGTCAGAGCCTTTTTTGCGGGAGAAATTGAAATCCCCGAACCAGAGGAGCCCACGGAG
GTTGATGCATCCAATGAACATGATAACTCAACTGTGTTACCACTAGTTGATAAGAATTCA
CAAAATGCACTGCGAAGGAAAATTGTTTGCCAGCATTTAAACAAGGTGTTGCCAGATTTA
TTACGATCACTTGGCTTATTAAATCTAACAATCAGTTCTGACATACGACTGCTTGTAAAT
ACATTCAAATTGAAAGCAGACAATATTATGTTCAAACCTATACAGTGGACATTGATCGCT
TTAGTGTTTATAAAATTGTTATCTATAAGGGACGAACAATTAAAAGGTTTACTGGAGCAT
GAAACGGCATTCAAACACATGCAGCTCTTGTTGCTCAGTTATAATCAAGACGGAGGTTAT
TTAGACAGGCTCATTTCTTGGTTGACCGATGTAAATAGGTTACTCGACGTAAATGATAAT
CAAATGACTATTGAAAAAAATATGTAG
Protein sequence:
MEFKKSTKRPPKIEEMSKEQIRKAIIKKRECNAKAQNIVEKLLEKCVNEEYFLKCLLDIN
QSHFDDVIEERSILQLCGYPLCQRTLLEKDIPKQKYRISLKTNKVYDITTRKCFCSNICY
KSAMHVKKQMLTSPLWFREYEEIPTGQEVDLGGPAKIEINKDDFITTSQFTKSSFQHASD
IVDSNKIDVHKIETKLLTNSTDVIGSNIVNTNDENITPSNHLESSKPSYTQEPDVRKKPN
KKTTNPLNIVGDIVEKPEKQIDPILINRPSSKDKEPEKPTTSLTRTIHQKKPPSITAITI
NVEKCLAEWCTIDTLLFIYGEENVKKMLSNKGQFITDYLNNYSKSIFYTSNTYDQYQALC
RKLNLLELESRRQDAQILNKETRPLPDYSILKEESKKIQFKVRAFFAGEIEIPEPEEPTE
VDASNEHDNSTVLPLVDKNSQNALRRKIVCQHLNKVLPDLLRSLGLLNLTISSDIRLLVN
TFKLKADNIMFKPIQWTLIALVFIKLLSIRDEQLKGLLEHETAFKHMQLLLLSYNQDGGY
LDRLISWLTDVNRLLDVNDNQMTIEKNM