New model in OGS2.0 | DPOGS202283  |
---|---|
Genomic Position | scaffold664:- 149337-156386 |
See gene structure | |
CDS Length | 2901 |
Paired RNAseq reads   | 5043 |
Single RNAseq reads   | 11992 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004939 (0.0) |
Best Drosophila hit   | CG1646, isoform D (1e-88) |
Best Human hit | pre-mRNA-processing factor 39 (7e-103) |
Best NR hit (blastp)   | PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005488 binding GO:0008380 RNA splicing GO:0006397 mRNA processing GO:0005730 nucleolus GO:0005634 nucleus GO:0005622 intracellular |
InterPro families    | IPR008847 Suppressor of forked IPR011990 Tetratricopeptide-like helical IPR003107 RNA-processing protein, HAT helix |
Orthology group | MCL13208 |
Nucleotide sequence:
ATGGATGATGAAATGAGTTCAGATTTACAGAAATGTTTATCAGAGGGTGCTATGGACACT
GATGAGGGTCCATTAATAAATGAAAATTCAAATGGTTTGTCCCATTTATTAACCGACGGA
GAGTTTAACAGTGTTGACGCTGCGGTTGTTGACGAGTCGTCTACCTCGAATCAGGCTTTT
TTAAATGCAGTTGAGTTATCGACTGGCAGCGTCGGTGATTATCTCGATAACAACACGGGC
GGTTTTAACACTGATTCATTCGATGTCGGCGACAACGCTGTCAACTTCCCCAGCGACTCT
TTAAACAATGTACTGTCAGACGGCGATTCCCGGTTGAGTGATGCTTTCAAAACGACGCCA
AATGATATAGATCTTACAGAGAATCAGAAGACGGTGTCTCAATTAGAGCTTGATATGGAG
TTGACGCAGGATACGTCTAACGAAACTAGAAGCAATCAGGATTTTAACATGTCCTCTTTT
AACGAGAGAAATGATACGTTACCAAGTGACCAAGACAACACCAACGACACGCTTGTCGAA
CAGTCAGTAAAGAAATCTAGAAAGAAAAGACGTTCATCAAAAGATGAAGATGATAACAAG
AAAAACCAGAACTCCAATGATAGGAATAGATTAAGCGAAAGGAAACAGGATTCGGTATCA
GATAATACTCAAGGAAACAGCCAGGATAGTTCATATGGAAGATCGAAAGAAAAAGTGAAG
GTTGGAAGTGCTGAAGAGACCGAGGTAGTTTCCGAAGATGAACTGCCGGTTATTCAGAAA
CCGAGTGTAAAAGATGCAGAGAATGTGTCCGATGATGAATTGCCGGGACCTAAACCAGCC
GAACTACCGGCAGATACCGAGGTTGTATCCGAGGACGAACTGCCGACATCAAAGAAGGAT
GGAAAAGAGTCTCGGAAACGAAAAACTGAGGAAGGTGACGGTTATGACCCTGGTTCACCA
ACATCTGAATCGGAGTCGGCTAACAAAAAACAAGCTGTTTCTAAGAACGGTGAAAGCAAA
CCAGTGTCGGCAGAAAAAAGATTCTCAGGGGATGAAAAGCCTAAGAAGAAAACTCTCCCC
GACCTAGACAAGTACTGGAAGGTGGTTAACGATGATCCGACAGATTTCACAGGCTGGACG
TACTTGTTGCAGTACGTCGACCAAGAGAGCGACGCCGAAGCGGCCCGCGAAGCCTACGAC
GCGTTCCTCTCCCACTACCCGTACTGCTACGGCTACTGGCGCAAGTACGCCGATTACGAG
AAGCGTAAAGGAAGTAAAAAGAAATGCCTGGAGGTGTTGGAAAGGGGGCTGAAAGCCATA
CCACTGTCGGTTGACCTTTGGATACATTATCTCAACCACATTAAGACCACGAGGACTGAA
GATCACACCTTCATTAGGTCGCAGTACGAGAGGGCTATCGAAGCATGCGGCCTGGAGTTT
CGTTCCGACCGTCTCTGGGAGTCGTACATCAAGTGGGAGGCCGAGAACGGCTCGGCCCTC
AACGTCACCAACATATATGACCGTCTGCTGGCAACACCGACACTTGGATACACCTCGCAC
TTTGACAATTTCCAGGAGCACGTGATGTCGGAGCCGGCTTGCGGAGCGGTTTCCGCTGAG
GAGCTCGTTCGCCTCCGCGCTGAGGTGAGGGACTCCGCCCCCGCTCAGCCGCCGCCCGAC
CTGCCCCCCGGCGAGGACGTCGGGCGACTAGCTTCAGAAGACGAGGCTCAAGCCATCAAA
GAGCGAATTATAGCAGCGCGAAGAAAAGTTCACAAGACGACAGGAGAAGAAGTAGCGGCC
AGGTGGGCATTTGAAGAAGGGATAAAACGCCCATACTTCCACGTGAAGCCTCTCGAGAGA
TGTCAGCTGAAGAACTGGAAGGCGTACCTGGAGTGGGAGAAGCAGCACGGCTCCTTTAAA
CGAGCACTGGTGTTACACGAGCGCTGTCTCATAGCATGTGCTCTGTATGAAGAGTTCTGG
ATGAGGTTAATAAAGTTTCTGGAAGAACATTCAGCCTCGGACCCCTCAGTGATTCCCCTC
CAGCGGGATGCTCTAGAGCGAGCGTGTACTGTACATCACCTGGACAAGCCCGAGCTGCAC
CTGCACTGGGCGCACTTCGAGGAGGCTAATGGGAACACGAGTCGTGCTGCTGAAATATTA
GATAGGATCGAGAAGACCTGCCCCAACCTGGTGCAGATACAGTACAGGCGAATCAATCTT
GAGAGGCGTCGCGGGGAGTACGATAAGTGCGTCCAGCTGTATGAAGGTTACATTTCATCA
GCTAAAAACAAAGCTATAGCATCCGCGCTCGCTATTAAATACGCACGCTTCCTGTTTCAC
GTGAAGAGGGAACCGGAAGCCGCGAGGAAGGTGCTGGATGATGCGGTACTTAAGGATCCT
CTCAACGCCAGACTACACATGCAGCGGTTGGATCTGGCCCTCCACACACCAGGCACCAAG
TACGAGGAGTTGGAAGAATTGCTGATGAGCTACGAGAAGCAAGAGGGTGCGGAGATCGAG
ACGAGTACGGCGCTGGCGGTGCGGAGGAGGGAACTGGCCGAGGAGCTCGGAGACGCGGCC
TCGGCCAGACAAGCACACACGCACGCACGAACACTCTACAAACACATGAGGAAGAGGGCG
CGGGCGGCCAAACATGACACGCACCATCACACGGCTTGCGCGGACCCGTCAAAGAAGAAA
GAGAACTGTGCAACCACCACCAGCACCACCACAGCCAGTAGCGCAAACCAATACTACCAG
AACGCGGCGGCGACTGCGCAGTCATACGACCAATCGTATGCACAGCCCTACACGCCGCCG
TGGGGCTACCAGCAAGCGGCAGGGCCTTACCCCCACCACCCCCACCCGCACCCCTGGCCG
CAGTACCCCAACTACTATTAA
Protein sequence:
MDDEMSSDLQKCLSEGAMDTDEGPLINENSNGLSHLLTDGEFNSVDAAVVDESSTSNQAF
LNAVELSTGSVGDYLDNNTGGFNTDSFDVGDNAVNFPSDSLNNVLSDGDSRLSDAFKTTP
NDIDLTENQKTVSQLELDMELTQDTSNETRSNQDFNMSSFNERNDTLPSDQDNTNDTLVE
QSVKKSRKKRRSSKDEDDNKKNQNSNDRNRLSERKQDSVSDNTQGNSQDSSYGRSKEKVK
VGSAEETEVVSEDELPVIQKPSVKDAENVSDDELPGPKPAELPADTEVVSEDELPTSKKD
GKESRKRKTEEGDGYDPGSPTSESESANKKQAVSKNGESKPVSAEKRFSGDEKPKKKTLP
DLDKYWKVVNDDPTDFTGWTYLLQYVDQESDAEAAREAYDAFLSHYPYCYGYWRKYADYE
KRKGSKKKCLEVLERGLKAIPLSVDLWIHYLNHIKTTRTEDHTFIRSQYERAIEACGLEF
RSDRLWESYIKWEAENGSALNVTNIYDRLLATPTLGYTSHFDNFQEHVMSEPACGAVSAE
ELVRLRAEVRDSAPAQPPPDLPPGEDVGRLASEDEAQAIKERIIAARRKVHKTTGEEVAA
RWAFEEGIKRPYFHVKPLERCQLKNWKAYLEWEKQHGSFKRALVLHERCLIACALYEEFW
MRLIKFLEEHSASDPSVIPLQRDALERACTVHHLDKPELHLHWAHFEEANGNTSRAAEIL
DRIEKTCPNLVQIQYRRINLERRRGEYDKCVQLYEGYISSAKNKAIASALAIKYARFLFH
VKREPEAARKVLDDAVLKDPLNARLHMQRLDLALHTPGTKYEELEELLMSYEKQEGAEIE
TSTALAVRRRELAEELGDAASARQAHTHARTLYKHMRKRARAAKHDTHHHTACADPSKKK
ENCATTTSTTTASSANQYYQNAAATAQSYDQSYAQPYTPPWGYQQAAGPYPHHPHPHPWP
QYPNYY