New model in OGS2.0 | DPOGS203517  |
---|---|
Genomic Position | scaffold1045:+ 20424-24826 |
See gene structure | |
CDS Length | 1347 |
Paired RNAseq reads   | 448 |
Single RNAseq reads   | 1026 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004347 (3e-148) |
Best Drosophila hit   | CG8833 (6e-73) |
Best Human hit | G patch domain-containing protein 1 (5e-59) |
Best NR hit (blastp)   | PREDICTED: similar to CG8833-PA [Apis mellifera] (1e-87) |
Best NR hit (blastx)   | GK12430 [Drosophila willistoni] (8e-77) |
GeneOntology terms    | GO:0003676 nucleic acid binding GO:0071011 precatalytic spliceosome GO:0071013 catalytic step 2 spliceosome GO:0000398 nuclear mRNA splicing, via spliceosome |
InterPro families    | IPR000467 D111/G-patch IPR011666 Domain of unknown function DUF1604 |
Orthology group | MCL13275 |
Nucleotide sequence:
ATGAGTGGATCAGATTCCGAAGAAGAAAACACCTTGCGCTATGGAACACCGTTGGAACCA
TACGAAGAAGATGAACTACCATCGAAAAAGAAGTATCAACCTCAGGAGCAATATGCGGTG
GATGAGCATGGAAAGCGCCGCTTTCACGGTGCATTCACTGGTGGATTCTCAGCTGGGTTT
GGTAATTCAGTCGGTACACCAGAAGGATGGACACCAAGCTCGTTTAAGAGTTCACGAGCT
GAGAAAGCACTAATTGCTAACCAAAGACCTGAAGATTTCATGGATGAAGAAGATCGCTCT
GAGTTCGGTATAGCACCTCGCAGCTTGCAAGCCAACCAAGAGTTCACGGGACAGAAGCGG
ACGAGAACTAGTTACTATGACGGCACCATACCGGGAGTTCCAGTACTGGAGCAGCTTCTA
CGACCGGTCCATGAAACAACGGCTGTGAAGATTCTACGGAAGATGGGATGGAGGGATGGC
CAGGGGGTCGGAGACAGAGTCACTAGGACGGAGAAGAAGAAAACAAACAAGCAACATAAA
GTGTACGGGTGTTACATGCCAGATGATATGAGACAGGTCAGCCGGGAATCGGATAGCTCG
GAATCGGAGTTCGAGTACGAGCAGCTGTTCGCGCCGGACGATTATGAGCCTTATGTGTTA
AAGAATAAGAACGATAAATTCGGTCTGGGATATGAAGGGCTGAGTCGGCACTCGGTGCTG
GGGAATATGGTCGGGGAGTACGACGCTGGAGGATCCAAGAGCCACCTCCTCATGAGGGAT
AAGGGCAGGAAGGTGTCGATCCGCGGCCAGGCGTTCGGGGTTGGCGCCTTCGAGGCTGAT
GACGAAGATATATACAGCAGAGAGGACATGACGCATTACGACTACGAGATGGGACCGGAA
GTGACGACGAAGAAGCACGTCGCTAAACAGCAAAATAATGTGCTCCAGGGTTTCGTTAAG
TCCAAGGCTAAACCGCCGTCGGTGGCCTCGTACCCCCCGCCCACTCTGCCACGGGGCTAT
GTCCCAGCAACCAGACGCTCAAGATTCGACGAGAGTAGGCCTCCGCAGAGGGACCAAGGT
CTCGGACGACATGAACTGACATCAGCGGCGAGAGGAGCTCTCTTAGGAGAGACGCCCGTG
GACAGCGCAGAACACACACAACGCAAGGAGCCGGAGCCGCAGGCCGCGTCTGACACGAAG
ATAGTTGGAATAGTTACAGACATCACCGTCCCGCCGGACGCCGGGGAGCTGAATTACATA
TCGATAAAAAATACAGACAAATCCAATCTGAAGGTTTTCAAGCCGTTCGCCAGCGACCCT
CAGAAGCAGCTGAGATATGAGAGGTAA
Protein sequence:
MSGSDSEEENTLRYGTPLEPYEEDELPSKKKYQPQEQYAVDEHGKRRFHGAFTGGFSAGF
GNSVGTPEGWTPSSFKSSRAEKALIANQRPEDFMDEEDRSEFGIAPRSLQANQEFTGQKR
TRTSYYDGTIPGVPVLEQLLRPVHETTAVKILRKMGWRDGQGVGDRVTRTEKKKTNKQHK
VYGCYMPDDMRQVSRESDSSESEFEYEQLFAPDDYEPYVLKNKNDKFGLGYEGLSRHSVL
GNMVGEYDAGGSKSHLLMRDKGRKVSIRGQAFGVGAFEADDEDIYSREDMTHYDYEMGPE
VTTKKHVAKQQNNVLQGFVKSKAKPPSVASYPPPTLPRGYVPATRRSRFDESRPPQRDQG
LGRHELTSAARGALLGETPVDSAEHTQRKEPEPQAASDTKIVGIVTDITVPPDAGELNYI
SIKNTDKSNLKVFKPFASDPQKQLRYER