New model in OGS2.0 | DPOGS214942  |
---|---|
Genomic Position | scaffold103:- 113346-120523 |
See gene structure | |
CDS Length | 2538 |
Paired RNAseq reads   | 2729 |
Single RNAseq reads   | 6531 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004823 (0.0) |
Best Drosophila hit   | CG3605 (9e-169) |
Best Human hit | splicing factor 3B subunit 2 (1e-147) |
Best NR hit (blastp)   | PREDICTED: similar to CG3605 CG3605-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC008829 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005681 spliceosomal complex GO:0000398 nuclear mRNA splicing, via spliceosome GO:0005686 U2 snRNP GO:0071013 catalytic step 2 spliceosome GO:0071011 precatalytic spliceosome |
InterPro families    | IPR007180 Domain of unknown function DUF382 IPR006568 PSP, proline-rich |
Orthology group | MCL12528 |
Nucleotide sequence:
ATGGGCCCACCGCCAGGAATGCCGAGTTTTCCTCCCATGCCTCCATCGTCAGGCCCCATG
GGTCCTGGAAGTATGCCGCCTCCTCCAGTGGGTCCACCTGGTACAATGCCTGCAGTCACA
ACATCTGGTGGTCCACCAAACATGCCGCCACCAGGAATGGGTCCACCTCCTAACATGATG
GGTATGGGTCCTCCTGGTATCGGTCCTCCACCCCCGCCCGGTTTGGGACCCCCAGGAATC
AACATGGGACCTCCTCCGATGGGACCGCCAGGCCTTCCGTCACGAATGCCTCCTAACATG
ATGCGGGGAACATCTAATATGAAGAGTAACTACAATCAAACTATAGATATGGGACCGCCT
GGTATGGTGCCACCCTCTAGTATGAATCCTTGGGACAATCAATGCCCTCCTGGTTGGGGG
CGACAAGGGAGAGGGGATGGCCCTCCAGGATGGGACGATCAGGACGATGATGAAGATGAT
AATGATGATGAAAGTGATCCTTCAGGACCTCCACTACCATCCTTGTTGACCATGAAAATA
GATACACCCGAGGAGTTCAGAAATAAACCCCCTTCTGCTGTGGGTGGTGTTGTGCTACCA
AAAGCCTTGGAGGAGGCACTCGCTTACAAAGATCAAAGACAAGCTGCCTTAGGAGATGAA
GCAGATAAAGTAACAGAGCAAACAAAGAAACCTGAACCTCCACCGGCACCTGTGATCAGT
ACAGAGTATGATGGTGAAGAAGAAGGAGACTCGGATGAAGATAACATACCAGAAGCTCCC
TTACCACCAATAATATCTAAGCAAGAGAATCAAACCAAAGCGAGTAAAACTAAACGGAAA
AAGAAGAAGAAGAAGGCGGCGAAACAGAAGAGAAAAGAAGCAAAGTCGGCCGACGAAAGT
AGCAAAGAAGCCCAGAAGACCAGCGACAAAGAAAACGAAAAGGAAGCTGAAATCGAATAC
GTCCAAGAGAACATACAGTTCCACGAACTGGAGCCCATGTACCGTCAGTTCCACCGCATC
CTGGAATCGTTTAAGATAACGGAGAGGAAGGAGGAGATCAAGGATGAACCCGGGAAAGAT
GCACCGAAACCGAGCAAGCCGCTGGAGAAAGTTACCGACCAATTTGCAGCTGACGAAGAG
GCTGTTGAGAAACATGCAGCCGATGAGAAGGAGCGGCTCTCAAAACGCAAGTTAAAGAAG
CTGTCTCGTCTGTCCGTGGCGGAGCTGAAGCAACTGGTGGCCCGGCCGGATGTAGTGGAG
ATGTACGACGTCACCGCCAGGGACCCCAAACTGCTGGTACAGCTGAAGGCTCACAGGAAC
ACTGTCCAAGTGCCGCGCCACTGGTGTTACAAACGGAAGTATCTGCAAGGCAAGCGCGGT
ATCGAGAAGCCGCCGTTCGACCTGCCGGACTTCATCAAGAAGACCGGCATCATGGAGATG
AGAGCCTCGCTCCAGGACAAGGAGGAAACTAAGACATTGAAGGCGAAGATGAGGGAGAGG
ACGCGACCCAAGCTCGGGAAGATTGACATCGACTACCAGAAGCTGCACGACGCGTTCTTC
AAGTGGCAGACGAAACCTCGCATGACCATCCACGGTGACCTCTACTACGAGGGTAAGGAG
TATGAAACTCGACTGAGAGAAAAGAAACCGGGAGATCTCTCAGAGGAACTGAGAACCGCA
CTGGGCATGCCGGTGGGACCTGGCTCTCATAAGGTGCCGCCGCCGTGGCTGATCGCCCAG
CAGCGTTACGGACCGCCTCCGTCTTACCCAAACCTCAAGATCCCGGGCCTGAACGCTCCT
ATACCCGAGGGTTGCGCCTTCGGGTACCACGCGGGCGGCTGGGGTAAGCCTCCCGTCGAT
GAAGCCGGCAAACCTCTCTACGGAGACGTGTTCGGACATCAGAGCAGCGGCCAAGATGAT
GCCGAGGATCAAGATATAGACAGGACCATGTGGGGTGAACTGGAGTCGGAGTCAGAGGAG
GAATCGGAAGAAGAGGAATCAGATGAGGGCGAGAAGGCCGGTGAGGGTGAGGCCGTGGCA
GCGGGCGTGGCGACTCCTGGTGAGGGACTCGTCACACCGCTGGGCACCAGCTCTGTACCG
CCCGGACTGGAGACACCTGACACCATCGAGCTCAGGAAGAAGAAGATGGAGGATCTAGAA
GGCGGTGAGACACCGGCCTTGTATCAAGTGGTCCCCGAGAGACGAGTTGGTCTCACGTCT
GGTATGATGGCGTCCACACATGTGTATGACATCAATGCCGCAAATCCTGGTAAACGAGCT
CCGACCGGTGCAACCAGTGAGGTTGGTCCCAGCGCTGCAGCTGGTGTAGAAGTGGCGCTG
GACCCCTCGGAGCTGGAGCTGGAGCCCGAGGCTGTGGCGGCCAGGTACGAGAGACACCTG
CGGGAACACAGGCCCAAGGGACGCGAGGACCTCTCAGATATGTTGGCCGACCACGTCGCC
AGACAGAAGAATAAACGAAAGCGTCAACAAAACACAGATTCCAAGCAAGCGAAGAAATAC
AAAGAATTCAAGTTCTAA
Protein sequence:
MGPPPGMPSFPPMPPSSGPMGPGSMPPPPVGPPGTMPAVTTSGGPPNMPPPGMGPPPNMM
GMGPPGIGPPPPPGLGPPGINMGPPPMGPPGLPSRMPPNMMRGTSNMKSNYNQTIDMGPP
GMVPPSSMNPWDNQCPPGWGRQGRGDGPPGWDDQDDDEDDNDDESDPSGPPLPSLLTMKI
DTPEEFRNKPPSAVGGVVLPKALEEALAYKDQRQAALGDEADKVTEQTKKPEPPPAPVIS
TEYDGEEEGDSDEDNIPEAPLPPIISKQENQTKASKTKRKKKKKKAAKQKRKEAKSADES
SKEAQKTSDKENEKEAEIEYVQENIQFHELEPMYRQFHRILESFKITERKEEIKDEPGKD
APKPSKPLEKVTDQFAADEEAVEKHAADEKERLSKRKLKKLSRLSVAELKQLVARPDVVE
MYDVTARDPKLLVQLKAHRNTVQVPRHWCYKRKYLQGKRGIEKPPFDLPDFIKKTGIMEM
RASLQDKEETKTLKAKMRERTRPKLGKIDIDYQKLHDAFFKWQTKPRMTIHGDLYYEGKE
YETRLREKKPGDLSEELRTALGMPVGPGSHKVPPPWLIAQQRYGPPPSYPNLKIPGLNAP
IPEGCAFGYHAGGWGKPPVDEAGKPLYGDVFGHQSSGQDDAEDQDIDRTMWGELESESEE
ESEEEESDEGEKAGEGEAVAAGVATPGEGLVTPLGTSSVPPGLETPDTIELRKKKMEDLE
GGETPALYQVVPERRVGLTSGMMASTHVYDINAANPGKRAPTGATSEVGPSAAAGVEVAL
DPSELELEPEAVAARYERHLREHRPKGREDLSDMLADHVARQKNKRKRQQNTDSKQAKKY
KEFKF