New model in OGS2.0 | DPOGS212853  |
---|---|
Genomic Position | scaffold96:+ 72785-99556 |
See gene structure | |
CDS Length | 1641 |
Paired RNAseq reads   | 8781 |
Single RNAseq reads   | 20535 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000798 (1e-06) |
Best Drosophila hit   | pasilla, isoform G (8e-99) |
Best Human hit | RNA-binding protein Nova-1 isoform 2 (9e-64) |
Best NR hit (blastp)   | PREDICTED: similar to pasilla CG16765-PK [Tribolium castaneum] (3e-167) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC008998 [Tribolium castaneum] (1e-104) |
GeneOntology terms    | GO:0003729 mRNA binding GO:0005634 nucleus GO:0000398 nuclear mRNA splicing, via spliceosome |
InterPro families    | IPR004088 K Homology, type 1 IPR004087 K Homology IPR018111 K Homology, type 1, subgroup |
Orthology group | MCL11949 |
Nucleotide sequence:
ATGGCCGCTGACACAGGAATGGATACTTGCCCTAGTCCGGAAATCACGGACTCTAGAAAA
CGACCCCTGGATGGCGATTCAGAAAATGGGGATGTCAAGAGGTCACATTTTAGCTCGGTT
CAAGACTTGGTGACGGCCTTGCCACTGGCTAACGGCCATGGCAATATAACGTCTCATTTC
GCGGTGGAGCCGACGTACCACTTCAAGGTGCTGGTGCCGTCGATGGTGGCCGGCGCCATC
ATCGGCAAGGGTGGTGAGACCATAGCGCAACTGCAGAAGGACACGGGGGCCAGGGTCAAG
ATGTCCAAATCGCATGATTTCTATCCAGGTACTACAGAACGAGCGTGTCTCATAACGGGG
TCGGTGGAAGGCATCATGGTGGTGCTAGACTTCATCATGGAAAAGATCAAAGAGAAACCG
GAGCTGGTGAAACCCTTCCCGGAGGGCGTGGATGCCAAGATGCCGCAGGATAGAGACAAG
CAGGTGAAGATCTTGGTGCCGAACTCCACAGCTGGTATGATAATAGGAAAGGGGGGCAAC
TACATTAAACAAATCAAGGAACAGAGCGGCAGCTACGTACAGATATCTCAGAAGGCGAAG
GAACTGTCTCTCCAGGAGCGCTGCATCACTGTTGTCGGTGAGAAGGAGAACAACAAGAAG
GCCTGCCTGATGATCCTTCAGAAGGTGGTGGACGACCCTCAGTCCGGGTCCTGTCCCAAC
GTGTCGTACGCGGACGTGGCCGGGCCGGTCGCCAACTACAACCCCACCGGCTCGCCGTAC
GCCGTGCCCACCACTGAGAGTCACGCTCTGGTGGGTGGCGGGTCTGTGGGCGGCGTGGGC
GGTGCTGGCGCGCTAGGCGGCGTGGGCGGTGTGGGCGGCGTGCTCGTGAACGGTTCCGGT
CTGGGCTCGCTGTCCCTGTCTCTGTCGCTGGCGCCGCCCGGCACCCCGCCGCCGTCCCCG
CTCACACAGCACACGCTCGACCACATTAAGGCGGCGCTGCGTCAGGCGGGCTACTCGGAG
GCCGGGCTGAGCGAGATCGGCGCGGCGCTGGCTCTGCTGGTGAAGCACGGCGTGCTGGGC
CTGGCGCTGCCGGCCGCCCTGCCCGCGCCGCTGTCCGCCGCCTACTTCCCCCTGCAGCCC
AGCGACTCGCCCGCTGTCTTCGGACCGCTGGCGCAGGTCCAGCTCGGCGGCGCGCGCGGC
GGCTCGTTAGAGCGTTTCGCGGAGGTTGCATTCGAGGCGCTTCGCCCCCCAGCCGTGGCT
CCCATCTCGCTGTCGGGCGGCGTGGGGGGAGTGGGGGGCGTACCCGGCTTCCCCTCCGCC
AGCCTGCTGCCGCTCTCCAAGAGCCCCACGCCCGCCGACGCGGGCGCCAAGGACTCTAAG
AACGTCGAGATCCCGGAGGTCATCGTCGGGGCCATCCTGGGCCCCGGCGGCCGCAGCCTG
GTGGAGATCCAGCAGATGTCGGGTGCCAACATCCAGATTTCCAAAAAGGGCACGTTCGCC
CCGGGCACCCGCAACCGCATCGTGACCATCTCGGGCACCGCCACCGCCATCAGCAATGCG
CATTACCTCATCGAACAGAAGATCCAGGAGGAGGAGCTCAAGCGCACGCGCCACAACGCG
CTCTCCGGCCTCATGCAGTAG
Protein sequence:
MAADTGMDTCPSPEITDSRKRPLDGDSENGDVKRSHFSSVQDLVTALPLANGHGNITSHF
AVEPTYHFKVLVPSMVAGAIIGKGGETIAQLQKDTGARVKMSKSHDFYPGTTERACLITG
SVEGIMVVLDFIMEKIKEKPELVKPFPEGVDAKMPQDRDKQVKILVPNSTAGMIIGKGGN
YIKQIKEQSGSYVQISQKAKELSLQERCITVVGEKENNKKACLMILQKVVDDPQSGSCPN
VSYADVAGPVANYNPTGSPYAVPTTESHALVGGGSVGGVGGAGALGGVGGVGGVLVNGSG
LGSLSLSLSLAPPGTPPPSPLTQHTLDHIKAALRQAGYSEAGLSEIGAALALLVKHGVLG
LALPAALPAPLSAAYFPLQPSDSPAVFGPLAQVQLGGARGGSLERFAEVAFEALRPPAVA
PISLSGGVGGVGGVPGFPSASLLPLSKSPTPADAGAKDSKNVEIPEVIVGAILGPGGRSL
VEIQQMSGANIQISKKGTFAPGTRNRIVTISGTATAISNAHYLIEQKIQEEELKRTRHNA
LSGLMQ