DPGLEAN20012 in OGS1.0

New model in OGS2.0DPOGS203517 
Genomic Positionscaffold1045:+ 20424-24826
See gene structure
CDS Length1347
Paired RNAseq reads  448
Single RNAseq reads  1026
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004347 (3e-148)
Best Drosophila hit  CG8833 (6e-73)
Best Human hitG patch domain-containing protein 1 (5e-59)
Best NR hit (blastp)  PREDICTED: similar to CG8833-PA [Apis mellifera] (1e-87)
Best NR hit (blastx)  GK12430 [Drosophila willistoni] (8e-77)
GeneOntology terms


  
GO:0003676 nucleic acid binding
GO:0071011 precatalytic spliceosome
GO:0071013 catalytic step 2 spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
InterPro families
  
IPR000467 D111/G-patch
IPR011666 Domain of unknown function DUF1604
Orthology groupMCL13275

Nucleotide sequence:

ATGAGTGGATCAGATTCCGAAGAAGAAAACACCTTGCGCTATGGAACACCGTTGGAACCA
TACGAAGAAGATGAACTACCATCGAAAAAGAAGTATCAACCTCAGGAGCAATATGCGGTG
GATGAGCATGGAAAGCGCCGCTTTCACGGTGCATTCACTGGTGGATTCTCAGCTGGGTTT
GGTAATTCAGTCGGTACACCAGAAGGATGGACACCAAGCTCGTTTAAGAGTTCACGAGCT
GAGAAAGCACTAATTGCTAACCAAAGACCTGAAGATTTCATGGATGAAGAAGATCGCTCT
GAGTTCGGTATAGCACCTCGCAGCTTGCAAGCCAACCAAGAGTTCACGGGACAGAAGCGG
ACGAGAACTAGTTACTATGACGGCACCATACCGGGAGTTCCAGTACTGGAGCAGCTTCTA
CGACCGGTCCATGAAACAACGGCTGTGAAGATTCTACGGAAGATGGGATGGAGGGATGGC
CAGGGGGTCGGAGACAGAGTCACTAGGACGGAGAAGAAGAAAACAAACAAGCAACATAAA
GTGTACGGGTGTTACATGCCAGATGATATGAGACAGGTCAGCCGGGAATCGGATAGCTCG
GAATCGGAGTTCGAGTACGAGCAGCTGTTCGCGCCGGACGATTATGAGCCTTATGTGTTA
AAGAATAAGAACGATAAATTCGGTCTGGGATATGAAGGGCTGAGTCGGCACTCGGTGCTG
GGGAATATGGTCGGGGAGTACGACGCTGGAGGATCCAAGAGCCACCTCCTCATGAGGGAT
AAGGGCAGGAAGGTGTCGATCCGCGGCCAGGCGTTCGGGGTTGGCGCCTTCGAGGCTGAT
GACGAAGATATATACAGCAGAGAGGACATGACGCATTACGACTACGAGATGGGACCGGAA
GTGACGACGAAGAAGCACGTCGCTAAACAGCAAAATAATGTGCTCCAGGGTTTCGTTAAG
TCCAAGGCTAAACCGCCGTCGGTGGCCTCGTACCCCCCGCCCACTCTGCCACGGGGCTAT
GTCCCAGCAACCAGACGCTCAAGATTCGACGAGAGTAGGCCTCCGCAGAGGGACCAAGGT
CTCGGACGACATGAACTGACATCAGCGGCGAGAGGAGCTCTCTTAGGAGAGACGCCCGTG
GACAGCGCAGAACACACACAACGCAAGGAGCCGGAGCCGCAGGCCGCGTCTGACACGAAG
ATAGTTGGAATAGTTACAGACATCACCGTCCCGCCGGACGCCGGGGAGCTGAATTACATA
TCGATAAAAAATACAGACAAATCCAATCTGAAGGTTTTCAAGCCGTTCGCCAGCGACCCT
CAGAAGCAGCTGAGATATGAGAGGTAA

Protein sequence:

MSGSDSEEENTLRYGTPLEPYEEDELPSKKKYQPQEQYAVDEHGKRRFHGAFTGGFSAGF
GNSVGTPEGWTPSSFKSSRAEKALIANQRPEDFMDEEDRSEFGIAPRSLQANQEFTGQKR
TRTSYYDGTIPGVPVLEQLLRPVHETTAVKILRKMGWRDGQGVGDRVTRTEKKKTNKQHK
VYGCYMPDDMRQVSRESDSSESEFEYEQLFAPDDYEPYVLKNKNDKFGLGYEGLSRHSVL
GNMVGEYDAGGSKSHLLMRDKGRKVSIRGQAFGVGAFEADDEDIYSREDMTHYDYEMGPE
VTTKKHVAKQQNNVLQGFVKSKAKPPSVASYPPPTLPRGYVPATRRSRFDESRPPQRDQG
LGRHELTSAARGALLGETPVDSAEHTQRKEPEPQAASDTKIVGIVTDITVPPDAGELNYI
SIKNTDKSNLKVFKPFASDPQKQLRYER