New model in OGS2.0 | DPOGS203876  |
---|---|
Genomic Position | scaffold697:- 22078-22926 |
See gene structure | |
CDS Length | 849 |
Paired RNAseq reads   | 936 |
Single RNAseq reads   | 2452 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003833 (4e-120) |
Best Drosophila hit   | nucleolar protein at 60B, isoform D (2e-92) |
Best Human hit | H/ACA ribonucleoprotein complex subunit 4 isoform 1 (2e-78) |
Best NR hit (blastp)   | AGAP004739-PA [Anopheles gambiae str. PEST] (9e-98) |
Best NR hit (blastx)   | AGAP004739-PA [Anopheles gambiae str. PEST] (1e-93) |
GeneOntology terms    | GO:0005730 nucleolus GO:0042254 ribosome biogenesis GO:0006364 rRNA processing GO:0001522 pseudouridine synthesis GO:0005634 nucleus GO:0004730 pseudouridylate synthase activity GO:0007281 germ cell development GO:0003723 RNA binding GO:0009982 pseudouridine synthase activity |
InterPro families    | IPR015947 Pseudouridine synthase/archaeosine transglycosylase-like IPR020103 Pseudouridine synthase, catalytic domain IPR002478 Pseudouridine synthase/archaeosine transglycosylase IPR004521 Uncharacterised conserved domain CHP00451 |
Orthology group | MCL14901 |
Nucleotide sequence:
ATGTGCGTACATTTGGGTCTCATGCTCGGAGTTGGAGGACAAATGATTGAATTACGCAGA
GTCCGCTCTGGTATACAGGGGGAGAAGGAGGGCATGGTTACCATGCACGACATATTGGAC
GCTCAATGGGCGTATGAGAACCATAAGGATGAAACCTATTTGAGAAGAGTCATTAAGCCA
TTAGAAGGTCTGCTGGTAGCTCACAAGAGGATCTTTATCAAGGACAGTGCGGTTAACGCA
GTATGTTACGGAGCCAAAGTACTTTTGCCTGGTATCCTAAGATACGAGGATGGTATTGAA
GTCGACCAAGAAATTGTTATAGTAACAACAAAGGGAGAAGCTGTGGCATTGGCTATAGCC
CTTATGACCACGTCCACTATGGCATCCTGTGATCATGGGGTAGCGGCCAAACTGAAACGT
GTTATCATGGAAAGAGACACATACCCTCGCAAATGGGGCTTAGGTCCGAAAGCATCTCAA
AAGAAAATGCTTATCCAGCAAGGGAAATTAGATAAATTTGGAAAACCCAACGAAAACACA
CCGTCCGAATGGTTGAATAGCTATGTAGACTACAAAGCTAAGAAGGACACAGAGAACGGT
GATGCACAGGAAGATGCAGGTAGAAAGAGAACCGCTAGCACAGCGAACGCTGACAACCCG
AATAACTCGACAGAAATCAAGTCGGAGAAAAAGAAGAAAAAGAAAAAACGTGACACCGAC
GTAGAAATGGATAATGAAGCTGATACAACGGTAGACCCGGATCAGACAATAGAAGGGGAT
GAGTCGGTGCGCAAAGAAAAGAAAAAAAAGAAAAAGAAGGATAAAGATCAAGAGAGACAG
GACGAGTAA
Protein sequence:
MCVHLGLMLGVGGQMIELRRVRSGIQGEKEGMVTMHDILDAQWAYENHKDETYLRRVIKP
LEGLLVAHKRIFIKDSAVNAVCYGAKVLLPGILRYEDGIEVDQEIVIVTTKGEAVALAIA
LMTTSTMASCDHGVAAKLKRVIMERDTYPRKWGLGPKASQKKMLIQQGKLDKFGKPNENT
PSEWLNSYVDYKAKKDTENGDAQEDAGRKRTASTANADNPNNSTEIKSEKKKKKKKRDTD
VEMDNEADTTVDPDQTIEGDESVRKEKKKKKKKDKDQERQDE