DPGLEAN17177 in OGS1.0

New model in OGS2.0DPOGS214990 
Genomic Positionscaffold361:- 79307-85225
See gene structure
CDS Length2076
Paired RNAseq reads  1105
Single RNAseq reads  2824
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012163 (0.0)
Best Drosophila hit  CG7757, isoform A (2e-163)
Best Human hitU4/U6 small nuclear ribonucleoprotein Prp3 (1e-122)
Best NR hit (blastp)  Trisn small nuclear ribonucleoprotein, putative [Aedes aegypti] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Trisn small nuclear ribonucleoprotein [Tribolium castaneum] (4e-174)
GeneOntology terms




  
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005688 U6 snRNP
GO:0005687 U4 snRNP
GO:0005681 spliceosomal complex
GO:0030532 small nuclear ribonucleoprotein complex
GO:0071011 precatalytic spliceosome
InterPro families
  
IPR013881 Pre-mRNA-splicing factor 3
IPR010541 Domain of unknown function DUF1115
Orthology groupMCL14713

Nucleotide sequence:

ATGGCGTTACAGTTGTCCAAACGTGAAGTAGAAGACCTAAGATCCTCCCTCGATCGTGCA
ATCTATAGAACTATAGGAAAATCAGATAGTTCACTACTATACACGGTGTCTTCGTGCCTG
ACAAACGGGTATGAGCGCCGTAAGATTATTGATAAAATATCATCACACATTGATTCGAAG
AAGGCCAGCAAACTTGCTGACAAGATCATAGCCCTCGCGCAGGAGCTGATCTCATCATCC
AAGAGTCAGAAACGGAAATATGAAGATAAAGAAAAAGACAAAGACAGTAAAAGATCTCGT
CACGAATCCCGCGAGGAGAGACGGGAGAAGGATGACAGAGAGAGAAAGAGTGACAACGGA
GAGGAGCTACCGACCATCTCGGACGGGGACACCATTGGCTCAAAGATGACTGGACTCAGT
GCTGATAAGATTAAGGTCATGATGGCGAATGCTCAGAAGGAAATTGAGGAGAGGAAGCGA
GCTCTGATGGCTATCAAGGGGGAGTCTCGGAACGTGAGCACGGCCGCGGCCGCCGCGGTC
GTCGAGTCCCGGGTGCACCGCGGGGGCATAGCACCCCCTAGCGTCATCAAGCCGATATTG
TACTCCAAACCAGGTCGGGTCACGCCGACAACCGCCGAGGAGTTGGAGAAGCAGAGGAAG
ATAGCGGAGCTGCAGGCCAGGATACAGAGGAAGCTGGCGGGTGGCGCGCTGGCTGCGACC
GGGGGCTCCGGGCCCGCGCCCCTCATACTGGACAGGGAGGGTCGCACCGTGGACACCAGC
GGCAAGAGGGTGCAGCTCACACACGTGGCGCCCACCTTGAAGGCCAACATAAGGGCGAAG
CGGCGCGAGGAGTTCCGCGCCCAGCTGAGCGGGCAGACCACGGAGGCGGTCAACGAAGCC
CCCTGGCAGGACGAGCGGCTGGCCAGCAAGCCCCCGGCGAGAACGCGCAGGGCGCTCCGC
TTCCACGAGCCCGGGAAGTTCACACAGTTGGCGGAGAGACTCCGTATGAAGGCCCAGCTG
GAGAAGCTGCAGACTGAGATATCTCAGATAGCTCGGAAGACGGGCATCTCCTCCGCCACC
AAACTGGCGTTACTGGCCGCCGACACGCCGGAGGCACAGAGAGTGCCGGACATAGAGTGG
TGGGACAGCGTGATCCTGATGACCCCCGAGGAGAGGGAGGCGAGGGCGAAGGCCGGCGAC
GACGAGAGGTCATTCAGCGAGCGCGTGGAGGCCTGCAACACGGGACACGACGACATCGTG
GAGAACCTCAACGAGGACGCCATCACCAACCTGGTGGAACACCCGCAGCAGCTCAGACCG
CCCACCGAACCTCTGAAACCGACTTACATGCCGGTGTTCCTCACCAAGAAGGAAAGGAAG
AAGCTCCGCAGGCAGAGCAGGAGAGAGGCCTGGAAGGAGGAGCAGGAGAAGGTGCGCCTG
GGGCTGGAGGCGCCGCCGGAGCCCAAGCTCAGGATATCCAACCTGATGCGCGCCCTGGGG
ACGGAGGCTGTGCAGGACCCCACGGCCATTGAGGCCAGGGTCAGGGAGCAGATCGCCAAG
AGGCAGAAGACACACCTCGAGGCCAACAAGGCGAGGGCTCTCACCAAGGAACAGCGGAGA
GAGAAGGTGGATAGGAAAATACGCGAGGACACGTCTATGGGGGTGCACGTGGCGGTGTAC
AGGGTGAAGGACCTGTTCGAGAGCGCGTCCGCCAAGTTCAAGGTGGAGGTGAACGCGCGC
CAACTGCACATGACGGGCTGCGTGGTGCTGCACCGAGCCTGCTGCGTGCTGGTGCTGGAG
GGGGGCCCGCGGCAGCACGAGAAGTACAAGCGCCTGATGCTGCACAGGATAAAGTGGGAA
GAGGAGACCGTGAAGAACGCCGACGACAGCGAGGGTCCAAACTCGTGTACGCTGGTCTGG
GAGGGGGTCGCCGCGAGGAGGGCCTTCGGGGACATTAAGTTTAAGGTGATGCCGACGGAG
AAGCAGGCGAGGGAGTTCTTCGCCAAGCACGGCGTCGAGCATTACTGGGACCTATCGTAC
AGCGGGGCCGTGCTGGGGCCAGCGGAGGAGCCCTAG

Protein sequence:

MALQLSKREVEDLRSSLDRAIYRTIGKSDSSLLYTVSSCLTNGYERRKIIDKISSHIDSK
KASKLADKIIALAQELISSSKSQKRKYEDKEKDKDSKRSRHESREERREKDDRERKSDNG
EELPTISDGDTIGSKMTGLSADKIKVMMANAQKEIEERKRALMAIKGESRNVSTAAAAAV
VESRVHRGGIAPPSVIKPILYSKPGRVTPTTAEELEKQRKIAELQARIQRKLAGGALAAT
GGSGPAPLILDREGRTVDTSGKRVQLTHVAPTLKANIRAKRREEFRAQLSGQTTEAVNEA
PWQDERLASKPPARTRRALRFHEPGKFTQLAERLRMKAQLEKLQTEISQIARKTGISSAT
KLALLAADTPEAQRVPDIEWWDSVILMTPEEREARAKAGDDERSFSERVEACNTGHDDIV
ENLNEDAITNLVEHPQQLRPPTEPLKPTYMPVFLTKKERKKLRRQSRREAWKEEQEKVRL
GLEAPPEPKLRISNLMRALGTEAVQDPTAIEARVREQIAKRQKTHLEANKARALTKEQRR
EKVDRKIREDTSMGVHVAVYRVKDLFESASAKFKVEVNARQLHMTGCVVLHRACCVLVLE
GGPRQHEKYKRLMLHRIKWEEETVKNADDSEGPNSCTLVWEGVAARRAFGDIKFKVMPTE
KQAREFFAKHGVEHYWDLSYSGAVLGPAEEP