DPGLEAN17402 in OGS1.0

New model in OGS2.0DPOGS202283 
Genomic Positionscaffold664:- 149337-156386
See gene structure
CDS Length2901
Paired RNAseq reads  5043
Single RNAseq reads  11992
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004939 (0.0)
Best Drosophila hit  CG1646, isoform D (1e-88)
Best Human hitpre-mRNA-processing factor 39 (7e-103)
Best NR hit (blastp)  PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum] (0.0)
GeneOntology terms




  
GO:0005488 binding
GO:0008380 RNA splicing
GO:0006397 mRNA processing
GO:0005730 nucleolus
GO:0005634 nucleus
GO:0005622 intracellular
InterPro families

  
IPR008847 Suppressor of forked
IPR011990 Tetratricopeptide-like helical
IPR003107 RNA-processing protein, HAT helix
Orthology groupMCL13208

Nucleotide sequence:

ATGGATGATGAAATGAGTTCAGATTTACAGAAATGTTTATCAGAGGGTGCTATGGACACT
GATGAGGGTCCATTAATAAATGAAAATTCAAATGGTTTGTCCCATTTATTAACCGACGGA
GAGTTTAACAGTGTTGACGCTGCGGTTGTTGACGAGTCGTCTACCTCGAATCAGGCTTTT
TTAAATGCAGTTGAGTTATCGACTGGCAGCGTCGGTGATTATCTCGATAACAACACGGGC
GGTTTTAACACTGATTCATTCGATGTCGGCGACAACGCTGTCAACTTCCCCAGCGACTCT
TTAAACAATGTACTGTCAGACGGCGATTCCCGGTTGAGTGATGCTTTCAAAACGACGCCA
AATGATATAGATCTTACAGAGAATCAGAAGACGGTGTCTCAATTAGAGCTTGATATGGAG
TTGACGCAGGATACGTCTAACGAAACTAGAAGCAATCAGGATTTTAACATGTCCTCTTTT
AACGAGAGAAATGATACGTTACCAAGTGACCAAGACAACACCAACGACACGCTTGTCGAA
CAGTCAGTAAAGAAATCTAGAAAGAAAAGACGTTCATCAAAAGATGAAGATGATAACAAG
AAAAACCAGAACTCCAATGATAGGAATAGATTAAGCGAAAGGAAACAGGATTCGGTATCA
GATAATACTCAAGGAAACAGCCAGGATAGTTCATATGGAAGATCGAAAGAAAAAGTGAAG
GTTGGAAGTGCTGAAGAGACCGAGGTAGTTTCCGAAGATGAACTGCCGGTTATTCAGAAA
CCGAGTGTAAAAGATGCAGAGAATGTGTCCGATGATGAATTGCCGGGACCTAAACCAGCC
GAACTACCGGCAGATACCGAGGTTGTATCCGAGGACGAACTGCCGACATCAAAGAAGGAT
GGAAAAGAGTCTCGGAAACGAAAAACTGAGGAAGGTGACGGTTATGACCCTGGTTCACCA
ACATCTGAATCGGAGTCGGCTAACAAAAAACAAGCTGTTTCTAAGAACGGTGAAAGCAAA
CCAGTGTCGGCAGAAAAAAGATTCTCAGGGGATGAAAAGCCTAAGAAGAAAACTCTCCCC
GACCTAGACAAGTACTGGAAGGTGGTTAACGATGATCCGACAGATTTCACAGGCTGGACG
TACTTGTTGCAGTACGTCGACCAAGAGAGCGACGCCGAAGCGGCCCGCGAAGCCTACGAC
GCGTTCCTCTCCCACTACCCGTACTGCTACGGCTACTGGCGCAAGTACGCCGATTACGAG
AAGCGTAAAGGAAGTAAAAAGAAATGCCTGGAGGTGTTGGAAAGGGGGCTGAAAGCCATA
CCACTGTCGGTTGACCTTTGGATACATTATCTCAACCACATTAAGACCACGAGGACTGAA
GATCACACCTTCATTAGGTCGCAGTACGAGAGGGCTATCGAAGCATGCGGCCTGGAGTTT
CGTTCCGACCGTCTCTGGGAGTCGTACATCAAGTGGGAGGCCGAGAACGGCTCGGCCCTC
AACGTCACCAACATATATGACCGTCTGCTGGCAACACCGACACTTGGATACACCTCGCAC
TTTGACAATTTCCAGGAGCACGTGATGTCGGAGCCGGCTTGCGGAGCGGTTTCCGCTGAG
GAGCTCGTTCGCCTCCGCGCTGAGGTGAGGGACTCCGCCCCCGCTCAGCCGCCGCCCGAC
CTGCCCCCCGGCGAGGACGTCGGGCGACTAGCTTCAGAAGACGAGGCTCAAGCCATCAAA
GAGCGAATTATAGCAGCGCGAAGAAAAGTTCACAAGACGACAGGAGAAGAAGTAGCGGCC
AGGTGGGCATTTGAAGAAGGGATAAAACGCCCATACTTCCACGTGAAGCCTCTCGAGAGA
TGTCAGCTGAAGAACTGGAAGGCGTACCTGGAGTGGGAGAAGCAGCACGGCTCCTTTAAA
CGAGCACTGGTGTTACACGAGCGCTGTCTCATAGCATGTGCTCTGTATGAAGAGTTCTGG
ATGAGGTTAATAAAGTTTCTGGAAGAACATTCAGCCTCGGACCCCTCAGTGATTCCCCTC
CAGCGGGATGCTCTAGAGCGAGCGTGTACTGTACATCACCTGGACAAGCCCGAGCTGCAC
CTGCACTGGGCGCACTTCGAGGAGGCTAATGGGAACACGAGTCGTGCTGCTGAAATATTA
GATAGGATCGAGAAGACCTGCCCCAACCTGGTGCAGATACAGTACAGGCGAATCAATCTT
GAGAGGCGTCGCGGGGAGTACGATAAGTGCGTCCAGCTGTATGAAGGTTACATTTCATCA
GCTAAAAACAAAGCTATAGCATCCGCGCTCGCTATTAAATACGCACGCTTCCTGTTTCAC
GTGAAGAGGGAACCGGAAGCCGCGAGGAAGGTGCTGGATGATGCGGTACTTAAGGATCCT
CTCAACGCCAGACTACACATGCAGCGGTTGGATCTGGCCCTCCACACACCAGGCACCAAG
TACGAGGAGTTGGAAGAATTGCTGATGAGCTACGAGAAGCAAGAGGGTGCGGAGATCGAG
ACGAGTACGGCGCTGGCGGTGCGGAGGAGGGAACTGGCCGAGGAGCTCGGAGACGCGGCC
TCGGCCAGACAAGCACACACGCACGCACGAACACTCTACAAACACATGAGGAAGAGGGCG
CGGGCGGCCAAACATGACACGCACCATCACACGGCTTGCGCGGACCCGTCAAAGAAGAAA
GAGAACTGTGCAACCACCACCAGCACCACCACAGCCAGTAGCGCAAACCAATACTACCAG
AACGCGGCGGCGACTGCGCAGTCATACGACCAATCGTATGCACAGCCCTACACGCCGCCG
TGGGGCTACCAGCAAGCGGCAGGGCCTTACCCCCACCACCCCCACCCGCACCCCTGGCCG
CAGTACCCCAACTACTATTAA

Protein sequence:

MDDEMSSDLQKCLSEGAMDTDEGPLINENSNGLSHLLTDGEFNSVDAAVVDESSTSNQAF
LNAVELSTGSVGDYLDNNTGGFNTDSFDVGDNAVNFPSDSLNNVLSDGDSRLSDAFKTTP
NDIDLTENQKTVSQLELDMELTQDTSNETRSNQDFNMSSFNERNDTLPSDQDNTNDTLVE
QSVKKSRKKRRSSKDEDDNKKNQNSNDRNRLSERKQDSVSDNTQGNSQDSSYGRSKEKVK
VGSAEETEVVSEDELPVIQKPSVKDAENVSDDELPGPKPAELPADTEVVSEDELPTSKKD
GKESRKRKTEEGDGYDPGSPTSESESANKKQAVSKNGESKPVSAEKRFSGDEKPKKKTLP
DLDKYWKVVNDDPTDFTGWTYLLQYVDQESDAEAAREAYDAFLSHYPYCYGYWRKYADYE
KRKGSKKKCLEVLERGLKAIPLSVDLWIHYLNHIKTTRTEDHTFIRSQYERAIEACGLEF
RSDRLWESYIKWEAENGSALNVTNIYDRLLATPTLGYTSHFDNFQEHVMSEPACGAVSAE
ELVRLRAEVRDSAPAQPPPDLPPGEDVGRLASEDEAQAIKERIIAARRKVHKTTGEEVAA
RWAFEEGIKRPYFHVKPLERCQLKNWKAYLEWEKQHGSFKRALVLHERCLIACALYEEFW
MRLIKFLEEHSASDPSVIPLQRDALERACTVHHLDKPELHLHWAHFEEANGNTSRAAEIL
DRIEKTCPNLVQIQYRRINLERRRGEYDKCVQLYEGYISSAKNKAIASALAIKYARFLFH
VKREPEAARKVLDDAVLKDPLNARLHMQRLDLALHTPGTKYEELEELLMSYEKQEGAEIE
TSTALAVRRRELAEELGDAASARQAHTHARTLYKHMRKRARAAKHDTHHHTACADPSKKK
ENCATTTSTTTASSANQYYQNAAATAQSYDQSYAQPYTPPWGYQQAAGPYPHHPHPHPWP
QYPNYY