New model in OGS2.0 | DPOGS200680  |
---|---|
Genomic Position | scaffold364: 60220-62326 (- strand) |
CDS Length | 1785 |
Paired RNAseq reads   | 683 |
Single RNAseq reads   | 1498 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005552 (2e-110) |
Best Drosophila hit   | CG12301 (1e-12) |
Best Human hit | U3 small nucleolar RNA-associated protein 14 homolog A isoform 1 (1e-14) |
Best NR hit (blastp)   | PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum] (8e-106) |
Best NR hit (blastx)   | PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum] (1e-69) |
GeneOntology terms    | GO:0005730 nucleolus; GO:0005634 nucleus; GO:0006364 rRNA processing; GO:0032040 small-subunit processome |
InterPro families   | IPR006709 Small-subunit processome, Utp14 |
Orthology group | MCL11683 |

Nucleotide sequence:
ATGAGTTATGAAGAAATGGTTGAACACAGACAGCATCTTGCTAAGTTTAGAGCTCAACAG
TCTTATAGGGCTGCCAAGGCTAAGAGACAAAGCAAAATTAAAAGTAAAAAATATCATCGT
ATACTTAAGAAAGAAAAATTAAAACAACAGTTAAAGGAATTTGAGGAATTACAAGCAACT
AATCCAGAAGAGGCATTGAAAAAACTTGAGGAGTTAGAAAAAGCACGAGCTTTAGAAAGA
CATACCTTGAGACATAAAAACACTGGAAAATGGGCTAAGAACAAATTAGTTAGAGCAAAA
TATGATAAAGAGGTGAGGCAGCAATTAGCTGAACAACTGTCCGTAAGCAGAGGTCTTACA
CAGAAAACACAAAATGTTGAAAGTTCAGATGATGAGCCAGATGAAAGTGAAAACATCTGT
GACATTCAAATATCACAGGATCCCATGAATCCATGGATGTTAAAGAAATCCGACAAGAGT
AATATAGATGCCGAATTCAATTTCGGCTATAAAAAATATTTAAAAGACAAAATGTATAAA
TGCAAAGAGCAAAGTGACTCGGAAGAAGATGAAGCTGACCAAAATAGAGACAACGACATG
AGTTCTCTGAAAATGCTGGCAGAGAGCTTGAAAAAATTAAACAATGGTGAAAGTACAAAT
GTGCTTGAAAACAAAACTCAAGACGATGTCACTTTAATGGATGTTACCAGTGAAGAAAAT
AAATCACTAACAGTTCTTAATAAAAATATTACACAGAAACAGAAAAGTAATAATACTGGC
AGAAAAAACAAAAGAAAGATAGTCTCAACCTCAGACTGGCTTGTGGAAGAAATAAATCCC
AAAAATGCAACAACTGAAGAAGATATAAATACTGCATTTGATGATTATGAAGACAAAGTA
GCATTAAAGGTTGCTAAAAAACTTAAGGGTTTGAAACATGAGTTAAAAAATTTAGAATCA
TCATCTATCAAGCCAAATAAAAAGACAAATGAAACTAGTAAAGAAATTGACAACCTTGAA
TATTTAAAAATTAAAAAGCAGAAACAGATGCCTATAATTGATGAACCTCTTATAGAATCA
AATAAAAATATTGATGACATACCCGAGCAGACAAAACACTTATTAGACACATTAAAAGAT
ACAATAACAAGCACAAGCCAAAACGTAAACACAGATATTGATCCAAGTAGGTTTATTGAA
GTTAAACCAAAGTATTTGAATACAGCTGTGACAAATTCCGAGAATAATTTCGACGACTTA
GATGATGAAGAACAAGTGGTACCCAAAGTGGATATTGAAGAAGTTTTTGAAGAAGACGAT
GTGGTGACTAGTTTCAGACAAGAGAAGGAGGACGAAATTAATAAGAATAATCCAGAAGAA
CTAAGTTTAACACTCCCTGGATGGGGAGGCTGGGCCGGTAAAGGTGTGAAAGCACCTAAA
CGAAAGAAAAATAGATTTATTACAAAGAAACCACCAAAAACACTCAGAAGAGATGAAAAC
AAAGGTGATGTAATCATTAACGAGTCTAAAAATCCCAAGCTTGCTATACATAAAGTTTCA
GATTTACCACATCCATTCAACAGTGTGAAAGAATATGAGGAAAGTATAAGAACGCCTCTA
GGTAACACATTTGTGCCTGAAACAGCTCATAAGAAACTTATAAAACCTAATGTTATCACA
AGATCTGGAACAATCATTGAACCGATGGATGAAGAAGAACTGCTTGTGCCAAGAAATCGT
AACTTTAAAAATAAGTCTGTTATTAAGATTCTAGGCAAGCAATAA
Protein sequence:
MSYEEMVEHRQHLAKFRAQQSYRAAKAKRQSKIKSKKYHRILKKEKLKQQLKEFEELQAT
NPEEALKKLEELEKARALERHTLRHKNTGKWAKNKLVRAKYDKEVRQQLAEQLSVSRGLT
QKTQNVESSDDEPDESENICDIQISQDPMNPWMLKKSDKSNIDAEFNFGYKKYLKDKMYK
CKEQSDSEEDEADQNRDNDMSSLKMLAESLKKLNNGESTNVLENKTQDDVTLMDVTSEEN
KSLTVLNKNITQKQKSNNTGRKNKRKIVSTSDWLVEEINPKNATTEEDINTAFDDYEDKV
ALKVAKKLKGLKHELKNLESSSIKPNKKTNETSKEIDNLEYLKIKKQKQMPIIDEPLIES
NKNIDDIPEQTKHLLDTLKDTITSTSQNVNTDIDPSRFIEVKPKYLNTAVTNSENNFDDL
DDEEQVVPKVDIEEVFEEDDVVTSFRQEKEDEINKNNPEELSLTLPGWGGWAGKGVKAPK
RKKNRFITKKPPKTLRRDENKGDVIINESKNPKLAIHKVSDLPHPFNSVKEYEESIRTPL
GNTFVPETAHKKLIKPNVITRSGTIIEPMDEEELLVPRNRNFKNKSVIKILGKQ