New model in OGS2.0 | DPOGS214990  |
---|---|
Genomic Position | scaffold361:- 79307-85225 |
See gene structure | |
CDS Length | 2076 |
Paired RNAseq reads   | 1105 |
Single RNAseq reads   | 2824 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012163 (0.0) |
Best Drosophila hit   | CG7757, isoform A (2e-163) |
Best Human hit | U4/U6 small nuclear ribonucleoprotein Prp3 (1e-122) |
Best NR hit (blastp)   | Trisn small nuclear ribonucleoprotein, putative [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Trisn small nuclear ribonucleoprotein [Tribolium castaneum] (4e-174) |
GeneOntology terms    | GO:0000398 nuclear mRNA splicing, via spliceosome GO:0005688 U6 snRNP GO:0005687 U4 snRNP GO:0005681 spliceosomal complex GO:0030532 small nuclear ribonucleoprotein complex GO:0071011 precatalytic spliceosome |
InterPro families    | IPR013881 Pre-mRNA-splicing factor 3 IPR010541 Domain of unknown function DUF1115 |
Orthology group | MCL14713 |
Nucleotide sequence:
ATGGCGTTACAGTTGTCCAAACGTGAAGTAGAAGACCTAAGATCCTCCCTCGATCGTGCA
ATCTATAGAACTATAGGAAAATCAGATAGTTCACTACTATACACGGTGTCTTCGTGCCTG
ACAAACGGGTATGAGCGCCGTAAGATTATTGATAAAATATCATCACACATTGATTCGAAG
AAGGCCAGCAAACTTGCTGACAAGATCATAGCCCTCGCGCAGGAGCTGATCTCATCATCC
AAGAGTCAGAAACGGAAATATGAAGATAAAGAAAAAGACAAAGACAGTAAAAGATCTCGT
CACGAATCCCGCGAGGAGAGACGGGAGAAGGATGACAGAGAGAGAAAGAGTGACAACGGA
GAGGAGCTACCGACCATCTCGGACGGGGACACCATTGGCTCAAAGATGACTGGACTCAGT
GCTGATAAGATTAAGGTCATGATGGCGAATGCTCAGAAGGAAATTGAGGAGAGGAAGCGA
GCTCTGATGGCTATCAAGGGGGAGTCTCGGAACGTGAGCACGGCCGCGGCCGCCGCGGTC
GTCGAGTCCCGGGTGCACCGCGGGGGCATAGCACCCCCTAGCGTCATCAAGCCGATATTG
TACTCCAAACCAGGTCGGGTCACGCCGACAACCGCCGAGGAGTTGGAGAAGCAGAGGAAG
ATAGCGGAGCTGCAGGCCAGGATACAGAGGAAGCTGGCGGGTGGCGCGCTGGCTGCGACC
GGGGGCTCCGGGCCCGCGCCCCTCATACTGGACAGGGAGGGTCGCACCGTGGACACCAGC
GGCAAGAGGGTGCAGCTCACACACGTGGCGCCCACCTTGAAGGCCAACATAAGGGCGAAG
CGGCGCGAGGAGTTCCGCGCCCAGCTGAGCGGGCAGACCACGGAGGCGGTCAACGAAGCC
CCCTGGCAGGACGAGCGGCTGGCCAGCAAGCCCCCGGCGAGAACGCGCAGGGCGCTCCGC
TTCCACGAGCCCGGGAAGTTCACACAGTTGGCGGAGAGACTCCGTATGAAGGCCCAGCTG
GAGAAGCTGCAGACTGAGATATCTCAGATAGCTCGGAAGACGGGCATCTCCTCCGCCACC
AAACTGGCGTTACTGGCCGCCGACACGCCGGAGGCACAGAGAGTGCCGGACATAGAGTGG
TGGGACAGCGTGATCCTGATGACCCCCGAGGAGAGGGAGGCGAGGGCGAAGGCCGGCGAC
GACGAGAGGTCATTCAGCGAGCGCGTGGAGGCCTGCAACACGGGACACGACGACATCGTG
GAGAACCTCAACGAGGACGCCATCACCAACCTGGTGGAACACCCGCAGCAGCTCAGACCG
CCCACCGAACCTCTGAAACCGACTTACATGCCGGTGTTCCTCACCAAGAAGGAAAGGAAG
AAGCTCCGCAGGCAGAGCAGGAGAGAGGCCTGGAAGGAGGAGCAGGAGAAGGTGCGCCTG
GGGCTGGAGGCGCCGCCGGAGCCCAAGCTCAGGATATCCAACCTGATGCGCGCCCTGGGG
ACGGAGGCTGTGCAGGACCCCACGGCCATTGAGGCCAGGGTCAGGGAGCAGATCGCCAAG
AGGCAGAAGACACACCTCGAGGCCAACAAGGCGAGGGCTCTCACCAAGGAACAGCGGAGA
GAGAAGGTGGATAGGAAAATACGCGAGGACACGTCTATGGGGGTGCACGTGGCGGTGTAC
AGGGTGAAGGACCTGTTCGAGAGCGCGTCCGCCAAGTTCAAGGTGGAGGTGAACGCGCGC
CAACTGCACATGACGGGCTGCGTGGTGCTGCACCGAGCCTGCTGCGTGCTGGTGCTGGAG
GGGGGCCCGCGGCAGCACGAGAAGTACAAGCGCCTGATGCTGCACAGGATAAAGTGGGAA
GAGGAGACCGTGAAGAACGCCGACGACAGCGAGGGTCCAAACTCGTGTACGCTGGTCTGG
GAGGGGGTCGCCGCGAGGAGGGCCTTCGGGGACATTAAGTTTAAGGTGATGCCGACGGAG
AAGCAGGCGAGGGAGTTCTTCGCCAAGCACGGCGTCGAGCATTACTGGGACCTATCGTAC
AGCGGGGCCGTGCTGGGGCCAGCGGAGGAGCCCTAG
Protein sequence:
MALQLSKREVEDLRSSLDRAIYRTIGKSDSSLLYTVSSCLTNGYERRKIIDKISSHIDSK
KASKLADKIIALAQELISSSKSQKRKYEDKEKDKDSKRSRHESREERREKDDRERKSDNG
EELPTISDGDTIGSKMTGLSADKIKVMMANAQKEIEERKRALMAIKGESRNVSTAAAAAV
VESRVHRGGIAPPSVIKPILYSKPGRVTPTTAEELEKQRKIAELQARIQRKLAGGALAAT
GGSGPAPLILDREGRTVDTSGKRVQLTHVAPTLKANIRAKRREEFRAQLSGQTTEAVNEA
PWQDERLASKPPARTRRALRFHEPGKFTQLAERLRMKAQLEKLQTEISQIARKTGISSAT
KLALLAADTPEAQRVPDIEWWDSVILMTPEEREARAKAGDDERSFSERVEACNTGHDDIV
ENLNEDAITNLVEHPQQLRPPTEPLKPTYMPVFLTKKERKKLRRQSRREAWKEEQEKVRL
GLEAPPEPKLRISNLMRALGTEAVQDPTAIEARVREQIAKRQKTHLEANKARALTKEQRR
EKVDRKIREDTSMGVHVAVYRVKDLFESASAKFKVEVNARQLHMTGCVVLHRACCVLVLE
GGPRQHEKYKRLMLHRIKWEEETVKNADDSEGPNSCTLVWEGVAARRAFGDIKFKVMPTE
KQAREFFAKHGVEHYWDLSYSGAVLGPAEEP