New model in OGS2.0 | DPOGS214221  |
---|---|
Genomic Position | scaffold323:- 96592-98959 |
See gene structure | |
CDS Length | 1179 |
Paired RNAseq reads   | 150 |
Single RNAseq reads   | 463 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006197 (3e-37) |
Best Drosophila hit   | CG3294, isoform A (7e-37) |
Best Human hit | U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 2 (2e-43) |
Best NR hit (blastp)   | PREDICTED: hypothetical protein [Nasonia vitripennis] (5e-68) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC010403 [Tribolium castaneum] (3e-60) |
GeneOntology terms    | GO:0000166 nucleotide binding GO:0003723 RNA binding GO:0005634 nucleus GO:0005689 U12-type spliceosomal complex GO:0005730 nucleolus GO:0008270 zinc ion binding GO:0046872 metal ion binding |
InterPro families    | IPR012677 Nucleotide-binding, alpha-beta plait IPR009145 U2 auxiliary factor small subunit IPR000504 RNA recognition motif domain IPR000571 Zinc finger, CCCH-type IPR003954 RNA recognition motif domain, eukaryote |
Orthology group | MCL11452 |
Nucleotide sequence:
ATGGGAAAGCATAAAGAATGGAGAAGAATTGTGAAACGGGAAAGAAGAAGGAGGATACGC
AAACAGTTGGCTAAACAAAGAGATTGCTTACCTAATGGAAATGATAATAAAGATTGGATA
AAAGCACAAGAAGAATTAGAAACATATATTTTTGAACAAGTTGAAAATTCTAACAAGGTT
GAAAATGAAAAGTGGTTGGCCGCCGAAGCAGTAGCTTTAAAACACTGGAAGGAACTTCAA
ATCAAAAAAGAAATGTTGCTTAAGAAGCAGCTAGAACTACAAGCTAAGCTTCATCAGGAA
TGGGAGATGGAAAAAGAGAGAAAAGAAAAGGAGGCTCAACGTCTTAAAGAATTAGAAGAA
GAAAATGTCAAGAGACAGGAAGAGTTCATGAAAAACTTAGAAGAATTCCTAAATGGTGAT
TGTAAGGATCCTCCACAGGAGCTGCTTACATTGTATGAAAGTCGGCCAAATTGTGACCCA
TGTCCATTTTATGCTAAAACTGCATGTTGCCGGTTTGGAGATGAATGTTCTAGAAACCAC
AAGTATCCAGGTATCAGCAAGATACTCTTAGCTCCAAATCTTTTTGGACATTTTGGCTTG
GAGAATTCAAATTTTAATGAATATGACACAGATATTATGTTGGAATATGAAGATAGTGAT
ACTTACAAGGATTTCAAAGAATTTTTTTTTGACATATTGCCAGAATTTCAAAAATTTGGC
CAGGTTGTTGAGATTAAAGTTTGCAATAACTTTGAAAAGCATCTCCGAGGCAACACATAT
ATAGAGTACTCCGACGTTCGTAGTGCAGTTTCAGCTTACAGAGCCCTTCATACAAGATGG
TATGGCGGAAAACAACTCTCATTGCAATTTTGTAGACTGTTATCATGGAGTAGTGCAATA
TGTGGTTTACAAGTTACTGGACGATGTCCGAAAGGCAGGGCATGTAATTTTCTTCATGTT
TTTAAAAATCCTATAGATTTACACATAGCCTATGAAAAACGTTATTCAAAAAGGCAACAA
CACACGTCGTCACGTTCATGGAGGTGGTCCGAATCGCCTGAAAGAGAAAGTCCAACTTCA
AGAAGTAAAAGAAAAGACGATGGTCATTCCAAAAGTAGAGAAAGAAGACACTATCAACAT
CGATCACCTAGATCAAGATCACATCGATATAGGGATTAA
Protein sequence:
MGKHKEWRRIVKRERRRRIRKQLAKQRDCLPNGNDNKDWIKAQEELETYIFEQVENSNKV
ENEKWLAAEAVALKHWKELQIKKEMLLKKQLELQAKLHQEWEMEKERKEKEAQRLKELEE
ENVKRQEEFMKNLEEFLNGDCKDPPQELLTLYESRPNCDPCPFYAKTACCRFGDECSRNH
KYPGISKILLAPNLFGHFGLENSNFNEYDTDIMLEYEDSDTYKDFKEFFFDILPEFQKFG
QVVEIKVCNNFEKHLRGNTYIEYSDVRSAVSAYRALHTRWYGGKQLSLQFCRLLSWSSAI
CGLQVTGRCPKGRACNFLHVFKNPIDLHIAYEKRYSKRQQHTSSRSWRWSESPERESPTS
RSKRKDDGHSKSRERRHYQHRSPRSRSHRYRD