New model in OGS2.0 | DPOGS204526  |
---|---|
Genomic Position | scaffold943:- 4750-27070 |
See gene structure | |
CDS Length | 2964 |
Paired RNAseq reads   | 15344 |
Single RNAseq reads   | 39681 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004309 (5e-12) |
Best Drosophila hit   | Dek, isoform A (2e-17) |
Best Human hit | protein DEK isoform 1 (2e-14) |
Best NR hit (blastp)   | PREDICTED: similar to LOC398543 protein [Tribolium castaneum] (2e-23) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC008666 [Tribolium castaneum] (7e-21) |
GeneOntology terms    | GO:0006397 mRNA processing GO:0005634 nucleus GO:0003676 nucleic acid binding |
InterPro families   | IPR014876 DEK, C-terminal |
Orthology group | MCL24234 |
Nucleotide sequence:
ATGTCGGGTGATACCGATAAAGCGAAAATCAGTGACAATCAGGATGAAGACAAGAAATCC
GGAACAGGCGACGGGCAGACGACGGAAGACGAATCATCCCAGGACTCGAAGGGTGAGTCG
GCGCCAATGGACGCGCAAGGTAACAACGTAGCAGACAATATAGAAGGAGACGCGAAGTCA
ACGACGGCGAACGGCAAAGAAGATGACTGTCAGGGCGAGGACTCCAAGGATGATAGGGCT
GTGAAGACGGAACACGCTAAGAAGGTAATGTACAAAAACGGTGAAACCCACGACGATGAC
GACAAAGAGGATTTGAAGGAAAATGACAAAGGGGCTAAGAAGTTAGCGCCGAAGAAGAAG
CCCAAGAAAGAGGACACCGAAGAAGAGGAAGAAGATGAAGAGGAGGAAGATGAGGAGGAG
GAGGATGAAGAAGGAGAAGAGGGCGATGAGGAGGAAGAGCAGAAGACAAAAGAGAAAGTA
AAGAAACCAGTCAAGAAACCTAAAGACGGAGAGGAGGAAGGTGACGACGAGGAAGAGGGG
GAGGAGGAGGGGGAGGAAGAAGAGGAAGAGGAGGAGGAAGAAGAGGCTCCGAAACCGAAA
CCCAAAAAGGAGCCAACTGAACCCCCTGTACCACTACCAGCTGGTAAAGGAATACCCTTG
GGGCATATCAGCAACGTGGAAGTGTCACTGTCGCGCTTCAAGACCCAAGACCAGAAGATA
TTACACCAGTATTTGTACGGACAACTCTGCCTGGATCGCAACGTCAAAAGGAACATCAAG
AAGTTCAAAGGCTACGAATGGGCGATCGGTTCCACGGAGTACAAAGCTAAACTAGAAGAA
ACAGCCAAAATGGAGCCCAAACAGTTGAGGACGATGTGCGAGATGTTGGACTTGGACAAA
AAAGGTGGCGCCAGTGAGTTGGCTGCTCGTCTGGTCGGTTTCCTTCAGCAGCCGGTCGCG
AACTCTCCCCACGCCCGCGGCGTTGCTCGCCCCCCTACCACGCAAGCGGCGACACCCGGC
GGCCGACCGAGACGATCAGCGGCCGTCAAGATACACAACAGAGATATTAATATCACGTCT
ACTGTGGCGGCTGCGATGCCCCCTTCCCGCGCTCCCTGTGTTGTCTACCCCGAGCACACC
CTTCCAACAGCCGAGGCCACTCCCCCGGCGGTTTCGCACGTGCCACTCGGACCTCAGTGT
TCGTTAAAAAGTGTGCGGCGTCGTAGGTGTCACCACCCGCACCCCTATCACGGCGCGAAA
CACGTGGTTGTAACGATGTCGTGTAGCTACTCGGACGAAGAGTATGAATCTGATCCGGAG
ACCAAGGTGAAGGGTCCCAAGCAGCCCAAGGACGGCTCGGAGGACTCTGATGGCTCCTTC
AACCCGAGCGGATCTGAGGCGGACTCGGACTTCGACCCTGAAGGTGGTGAGGGTGTGAGC
GGAGCCGCGCGCAAGAGGAAGAGCTCTGGACGACGCCGGTCCAGTAAGGGGAAGAGGGGG
CGCAAGAGCAAGGGGAGGAAGAAGGGTGGCAGCAGAGGCCGCGGCCGACGGGCACGGTCA
GACAGCGAGGACGAGAGTGAACGCTCTGATAGTGACAGCGAATTGGACTCGGCCAGCGAC
GGAGACGAATCAGATGAACCGAAGTCCAAACGTGGTAGGCCCGCGGGGTCCGTGTCTAAG
GGTCGCAAGGGAGCTGTAGCAAAGGCTAGCGCTAAAGCGACTCCCGCTAAGCGGAAAGCG
CCTACACCCACAGGGAAGAAGAAGGCCGGTGCCAAGCCAGTCGGTAGACCAGCCAAGAAG
GGCAAGCGGGCGTCCTCCGACGAATCTGGAGATGGAGAAGAAGGCAGCGAGGAAGAAGAT
GAAGAAGGCAGCGAGGAGGAGGAGAGCGGAGAAGAGGATGACGAGCCAACTGACAAGAAA
GCCAAGCGTCCACCTACAGACGAGGAGATTAAGAAGTACGTGAAGCAGATCCTGGAGGGC
GCGAACCTGGAGCAGATCACCATGAAGACGGTCTGCAAGCAGGTCTACAGCCACTATCCG
GACTTTGACCTGGCGCACAAGAAGGACTTCATTAAAGCTACTGTCAAATCGACCTGCTGT
TGCAAGGGCTCTGATGATGAATCAGATCAAGGGGATCCGGGGGCGGTGCTGAATGTGCCT
GAGTTTCGAATACCAAATGGCCTGGGTGTACCATTGGGATACTTACACAATGTAAACGAC
GCCCTCAACCGATACCATGTTGTGGACCTGAAGATGTTGCATCTCTATTTATATGGAGTA
AGCGGGACGAGGGATCAGGCTTTCAGACTCGTGGAGTTCCTCGTGAAGCCGGAGTCCAAC
TCTCCATATTTCAGATGTGTTCGTACGATCGAGAGCATTGCTAAGACTTACCATAGAGCT
GTTTCGGACAACGACTATGAATCCGATGCTGAGACGATGGTGGCGTGTTCAGAGAGGGCC
AGAGATGGGTCAGACTACTCCGAAGGTTCGTGTGTTGGAAGTTCAGAGTGTTGTCGCGCG
ATCTGTGACATCAGTGACAGCGAGAAGGAGGCTGACACTAAGATGATATCTGACAAGGAA
ATATCCTCTTCTATCGATGACACCACTGTGCTATCGCAGCCAGATGAAAGTGGCTCTTTG
GCGTCCGGGGAGATAACAAGCGTGTGTGATGAAAAAGTAGCCAAACCAGGGATTGGACAG
GAAATTGCTTTACCCGGTTATGCTAGAGCGAGACGTCTACCAACAGACGAGGATATAAGG
AAGTTTTTGCAGAAAATTTTGACGGGATTGAATCTGGAGAAGATTACTATGAACACCATC
TGCAGAGCGGTTTACAGCCAGTTCCCGGACTACGATCTGGTCAACAAGAGGGACTTCATC
ACAGCTACAGTCAAATCGGTAATTATTGAACCAAAATTAAAGGCCCCTAGTGGCGACTTT
TGTATTTGTCTATCCATATGCTAG
Protein sequence:
MSGDTDKAKISDNQDEDKKSGTGDGQTTEDESSQDSKGESAPMDAQGNNVADNIEGDAKS
TTANGKEDDCQGEDSKDDRAVKTEHAKKVMYKNGETHDDDDKEDLKENDKGAKKLAPKKK
PKKEDTEEEEEDEEEEDEEEEDEEGEEGDEEEEQKTKEKVKKPVKKPKDGEEEGDDEEEG
EEEGEEEEEEEEEEEAPKPKPKKEPTEPPVPLPAGKGIPLGHISNVEVSLSRFKTQDQKI
LHQYLYGQLCLDRNVKRNIKKFKGYEWAIGSTEYKAKLEETAKMEPKQLRTMCEMLDLDK
KGGASELAARLVGFLQQPVANSPHARGVARPPTTQAATPGGRPRRSAAVKIHNRDINITS
TVAAAMPPSRAPCVVYPEHTLPTAEATPPAVSHVPLGPQCSLKSVRRRRCHHPHPYHGAK
HVVVTMSCSYSDEEYESDPETKVKGPKQPKDGSEDSDGSFNPSGSEADSDFDPEGGEGVS
GAARKRKSSGRRRSSKGKRGRKSKGRKKGGSRGRGRRARSDSEDESERSDSDSELDSASD
GDESDEPKSKRGRPAGSVSKGRKGAVAKASAKATPAKRKAPTPTGKKKAGAKPVGRPAKK
GKRASSDESGDGEEGSEEEDEEGSEEEESGEEDDEPTDKKAKRPPTDEEIKKYVKQILEG
ANLEQITMKTVCKQVYSHYPDFDLAHKKDFIKATVKSTCCCKGSDDESDQGDPGAVLNVP
EFRIPNGLGVPLGYLHNVNDALNRYHVVDLKMLHLYLYGVSGTRDQAFRLVEFLVKPESN
SPYFRCVRTIESIAKTYHRAVSDNDYESDAETMVACSERARDGSDYSEGSCVGSSECCRA
ICDISDSEKEADTKMISDKEISSSIDDTTVLSQPDESGSLASGEITSVCDEKVAKPGIGQ
EIALPGYARARRLPTDEDIRKFLQKILTGLNLEKITMNTICRAVYSQFPDYDLVNKRDFI
TATVKSVIIEPKLKAPSGDFCICLSIC