New model in OGS2.0 | DPOGS207627  |
---|---|
Genomic Position | scaffold432:- 26637-31771 |
See gene structure | |
CDS Length | 1686 |
Paired RNAseq reads   | 1593 |
Single RNAseq reads   | 4173 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006011 (1e-110) |
Best Drosophila hit   | mRNA-capping-enzyme (2e-127) |
Best Human hit | mRNA-capping enzyme (2e-123) |
Best NR hit (blastp)   | PREDICTED: similar to mRNA capping enzyme [Tribolium castaneum] (2e-144) |
Best NR hit (blastx)   | PREDICTED: similar to mRNA capping enzyme [Tribolium castaneum] (7e-142) |
GeneOntology terms    | GO:0004484 mRNA guanylyltransferase activity GO:0006370 mRNA capping GO:0006470 protein amino acid dephosphorylation GO:0008138 protein tyrosine/serine/threonine phosphatase activity GO:0004651 polynucleotide 5'-phosphatase activity GO:0005634 nucleus GO:0004725 protein tyrosine phosphatase activity |
InterPro families    | IPR012340 Nucleic acid-binding, OB-fold IPR016130 Protein-tyrosine phosphatase, active site IPR016027 Nucleic acid-binding, OB-fold-like IPR000387 Protein-tyrosine/Dual-specificity phosphatase IPR001339 mRNA capping enzyme IPR013846 mRNA capping enzyme, C-terminal IPR000340 Dual specificity phosphatase, catalytic domain |
Orthology group | MCL16390 |
Nucleotide sequence:
ATGTATCGATTTACACCATCAATGGTTTTTGACTACGTTAAGAAGTATAAAAAAAGATTA
GGTTTATGGATAGATTTAACGAATACAACAAGATTTTACGACAGAACAGAAGTGGAGAAC
AGAGGGTGTATATATAAGAAATTATCATGTCGTGGTCATGGGCAAACACCTTCAGAACAA
CAAACAAAACAGTTTATAGATATTGTGAGTGATTATATTGCACAAAATCCAAACAATTTA
ATTGGTGTCCATTGTACTCATGGATTTAACAGGACTGGTTTTCTTCTCTGCGCCTATATG
ATAATACAGGAGGATTGTAGTGTAGATTTTGCAATTTTTAATTTTGCTCAAGAGAGACCG
CCAGGTATTTACAAGCAAGACTATATTGATGAACTAATAAAAAGATTCAAAGGTGATTGC
GCGTTGGAGGCTCCCACGCTACCCGATTGGTGTGACGAAGAACAAATTGATTATGATGAC
AATGATAGAGATGGCTCTAGTCACTCACAGAGTAATTCTTCAAGAAAGAGGGAGGGGAAA
TACATAAACAAAAAGTTCATGATAGAACACGAAAAAGTAACATTATTGACTGACACGAAG
AAAATCGATGCAATACGTGAAACGGCGGCCTCATATTTGAAGTGGAAAGTGAATGATTTC
CCTGGAGCACAGCCGGTTTCAATGACTAGGAAAAATATAGAAAATTTGCAAAAGTACCCC
TATCAAGTGTCTTGGAAAGCTGATGGTGTTAGATACATGATGCTTATTGTAGACGATGAC
GAAGTTTACATGATAGATAGAGATAATTGTATATTTAAAGTGGACAATTTAAAATTTCCT
CATAACACAAAACCGAGGCATCTGCGGAAAACTTTACTAGACGGAGAAATGGTTATAGAC
AAAGTTGATGGTAGAGAAAAACCGAGATATTTAATTTATGATATAATAAGGTTTGAAGAT
ACGAATGTAGGCAGAGAACACTTTTATCCGGTTAGGCTTCATTGTATAGAAGTGGAAATC
GTTAATCCCAGAAATCGAGCTATAGTGAGCGGTCATATAAGAAAAGAATTGGAACCATTC
AGTGTTATCATAAAACGTTTCTGGGATGTAAGGATGGCACACAGTTTACTGGAGGATAAG
TTTATAAGGACACTGCATCATGAACCTGACGGACTCATTTTTCAACCATCAGAGATGAGA
GAGGAGCCCAGGCACTGGAGGCGAGCGTTTAACGCGCGTTATGGACCCGGCACTTGCAAA
GTGAATCCATGGAATGCTCCCTACTCAGGTGGTCCCTGCGAGTTCATATTAAAATGGAAA
CCAAGTGATCAAAACAGCATTGATTTTAAACTTGTTCTGGAAAAGGAGACTGGACTAGGA
CTTGTTTCCGAAACGAAAGGCAACTTATATGTCGGCGGATCGAACGTCCCCTTTGGATGG
ACAGCATATAATAAGAAAATCAAGCATTTAAACAACAAGATAATTGAATGCAAGCTAGTC
AACCGCTGCTGGGTCTTCATGAGGGAACGAACGGATAAGTCGTTTCCAAACTCCTACACA
ACAGCTAAAGCTGTAATGGAGAGCATCGTTAATCCGGTCACAAAGGAATATCTGTTGGAC
TTCATTAAATACAACTCTTACAGAAAACCAGACATAAATCAATCAAAACGACCACGGCTC
GAATAA
Protein sequence:
MYRFTPSMVFDYVKKYKKRLGLWIDLTNTTRFYDRTEVENRGCIYKKLSCRGHGQTPSEQ
QTKQFIDIVSDYIAQNPNNLIGVHCTHGFNRTGFLLCAYMIIQEDCSVDFAIFNFAQERP
PGIYKQDYIDELIKRFKGDCALEAPTLPDWCDEEQIDYDDNDRDGSSHSQSNSSRKREGK
YINKKFMIEHEKVTLLTDTKKIDAIRETAASYLKWKVNDFPGAQPVSMTRKNIENLQKYP
YQVSWKADGVRYMMLIVDDDEVYMIDRDNCIFKVDNLKFPHNTKPRHLRKTLLDGEMVID
KVDGREKPRYLIYDIIRFEDTNVGREHFYPVRLHCIEVEIVNPRNRAIVSGHIRKELEPF
SVIIKRFWDVRMAHSLLEDKFIRTLHHEPDGLIFQPSEMREEPRHWRRAFNARYGPGTCK
VNPWNAPYSGGPCEFILKWKPSDQNSIDFKLVLEKETGLGLVSETKGNLYVGGSNVPFGW
TAYNKKIKHLNNKIIECKLVNRCWVFMRERTDKSFPNSYTTAKAVMESIVNPVTKEYLLD
FIKYNSYRKPDINQSKRPRLE