New model in OGS2.0 | DPOGS207031  |
---|---|
Genomic Position | scaffold1:+ 1204182-1214614 |
See gene structure | |
CDS Length | 3603 |
Paired RNAseq reads   | 1029 |
Single RNAseq reads   | 2458 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012974 (4e-11) |
Best Drosophila hit   | Cap-G, isoform G (2e-57) |
Best Human hit | condensin complex subunit 3 (6e-64) |
Best NR hit (blastp)   | PREDICTED: similar to mCG21477 [Nasonia vitripennis] (1e-95) |
Best NR hit (blastx)   | PREDICTED: similar to mCG21477 [Nasonia vitripennis] (7e-92) |
GeneOntology terms   | GO:0005488 binding |
InterPro families    | IPR016024 Armadillo-type fold IPR011989 Armadillo-like helical |
Orthology group | MCL12878 |
Nucleotide sequence:
ATGCCACCTACCGATGCAGAAGTGCGTCGAGAAATTGTAAAAGCAAATCCACGAAATGAT
AAGACAATGTTTAAAATATTTCAAAATGTTCAATATAATGTTGTTCAACATAGGAAGTAT
GTGAAAGAAATGACGAAACTTTACAAAAAGACTGAAGCGGATGACTTTAAGGAAAGCTTT
AAAAATGCATTATTTTACCTCTTTACTTTTGGAGATACAAGTACAAATGTAGATCGTGTT
ATTCAATTTGTAGCAACATTCTGTACATTACTTGACGATGAAGAGGAGTTTCTAATGTTT
ATATTTGATATTATTTTCGAATGTCAGTGTGTATCTGGTCAGTCAGTAAGATACCGAGCA
AGCCAGTTGCTGGCAGCCGTACTAGCTGCACTCGGTGATGAAGCCTCTTTAGATGATGAC
CTCTGTGACAAGCTGTTACTTCATCAGATGCAACGTCTCCAAGATACACGTGGTGCTGTC
AGATGTCGGGCAGCATTAGCCCTCAATAGGTTGCAAAACCCAAGTGATCCAGATGACGAG
GTAACCAGGGGTTACCGATTTCACATGAGCTGTGACCCTAGCTCCTCTGTTAGAAGGGCT
GTAGTGATGTCAATAGCAAAATGCACTCGGAATGTCCCCTTTGTATTGGAGCGCCTCTGT
GACGTTGATGAAGCTGTAAGAAGAGCCGCATTCCTATACATAGCGGCTATGAATGTAACA
CAATTAAGAGTTAGACAGAGAGTTCTTACATTGAAGGTTGGCCTCACTGAACGCAGCCCG
CGAGTGCGTCGTGTGGTAGAAGAGATTTTAATACCCAGCTGGTTGAGTACCTTCCAAGGC
AACATCATAGACTTTCTTAAAGCAATACGTCTGGATAATTCACACGACGCGAAAGATTCG
CAATACGTCGCAGAGAAGCTCTTGGAGTCGCTTTTCAAACGTCTACCGATATCAGAGCTT
CTAGAATGGCTGCCAACTGACAAGTCACTCCGAGTTATCCCCGCTGACAAGTTGAACAAG
GAAACAGTTTGGTACTGGCGCCACCTTGCGGAGCATTTACAAAAGAATGATGATGACGAG
ACCCTCGAGACTGTGCTACCTGATCTAGTTGTACTGACTGGATATATTAAAGCTATCGTG
GAATCACCATGTCCGAATGAGGAGGCGGATCCGGTGTCGTATAGCACTCGTCAGTATGTG
CTTCACGAGCTGGCGAGATTACTACGGACTTACGACGCCAGCGACCCCGCAGGTAGAGAC
GCCCTACAGACATTGATCACCGACACACTTACAGGTGACTACGGTCCTATGAGCGGGGAC
GTGATCCGCGCGTTTGTATCAGCCCTGCAGTTGGTTTTGCCAGATGTGACGAGTAGAGTG
GAACTTGTATGCAATGTTCTGTCAACACTACGGGAACCACCGGAAATGGAGGAGGAAGTA
CCTCCGCCGACCTTGGACGATACAGAGGCAAAATTACAGAGAGCCAGATTGCGTGTCTCC
CTAAATGTGGCAATGGAAGCTCAAGAAGAAGCTGTTAGACATGAAAACTATACTCTTGCT
GCTCAATGCAAAGCTAAGGTTGCCGACATTCAGAAAAAATTGGAGGAATTAACATTTCAA
ACAAAACCAGAACAACCATTAACTACAATCAAAGAGAAACAATGTGATGTGACAACATTA
AATAAGTGTCTAATAATACTAAATACATTACTGGACACACCACAACTGAACAATGTAACA
CCGATGTTGAATCTTATGTTCAGCGAGCTAGAAGTTGAAATATTTTCCAAGCCCGAACTA
TTGGACAATGCTCTTGAAACGGTGGCACTATTTGGCATGCTGGATAAAGAATTTGCAAGA
GATCATAAATCATTCTTCTTTGCTAATTTAGTTGATTCAACAAACGAACCAACAGTGTGT
AAAGTGCTTAAGTGTATAGTTGACCTGTTGTGTGTACACGGAGCTAAGGTTTTTGACGAT
GGTACAGAATCCATCGAGGCTTCCAGGAACAGATCTAAACATTCCATAAATACAACTACT
ATGGATTTTGATGAATCTGTATTGTCATCATCTCAAGCGCACAGCAACGTTATTGAATTA
CTCCTTAAGTTGATGGATAACGCTTGCCCATCATATAGACTGATAATAGTAGAGGGTCTA
TGTCGCCTGATGTATCTAGGACATTTAGAATCGCCTTATATACTGAGCAGATTGATACTA
CTCTGGTTTAACCCAGTCTCAGCGGAAGAAGATGTACTACGACAGACTATAGGCATTTTC
TTCCAGACGTTCCCTAGTACTGTCGACGGTGCCCAAGATCAAATACAAAAATCTATGATA
CCGACACTGCGTGCTCTTTGTTGTGCGCCGTCAAGCTCTCCGATCTGTGAGATCGACCAG
GAGGCGGTTGTGAAGTTTTTTGTATCACTAACAAAAGTCAGCTCGGAGTTGACAGACAGC
CAGGGTGCTATGGCGTTGACACTATGCGAGTATCTAGTTCGTAAACCGACGGGTCCCGCG
TCTGCTCTACTTTGTCGGGCGCTGGCTCTTCTCTCACCGCCTAAAGACGTTCGCACTGCT
GCTAATCTGGCGACTATGATCAAAGATCTCTGTCTGAAATTACCAGATAAACAATCCTGT
AGGAATCTGACGCGTTATCTCGGTGCCCTGGAGGCGTTGGAAAAGAGTAATCTCAATAAG
ATGTCTAACATAGGTGAAACAATTGACAGTATACAGTGTGAAGATACAATGAACATGATG
GGACGTTCGGCCACCTCATTACCTCAACCTTTAGCTCGTAGTACCCACGTGGTTGTTAAT
GAGACTGTAGAAGAAGAACCAGAAATTGAAGAAACTTCTTCGGGTGAATCTCCTACCGAC
CCTATCTCTAGGATATCTGAAGAAAACAAAAATCAAACCGAAGACATGACGGCCGCTGAG
CAAACGGAGACGGAAGTTCCTGAAAAGGAGGACAGTGGATCAGACAGCAGTTCTGTGTCG
CCTGTAAAGAAAGCTAAAATAAGCAAAGACAAGAAAAAGAACATGACATCAAAGAATGAA
AAAGATTTGCGTAAACAAAGGCAACCGAGAAATAAGAAGGACGCTAAAGATAAAGTCAGA
AATGAAGAGGATAAAGAGAAGGGAGTCAAGCGTAGTTCACGATCCACAACAGCCGCAATA
AGAGCCGAAACTGATAAGAAGTGTCAGGAGCATGACTTACAAGAATCACCACCATCGGAC
GGTTCTAATACGACGGTACGTAGATCAAGCCGTGGCCTGCAATCCGGGTCAAGCACCGAG
TCGACCGGATCAAAAGGGAAAAAGAAAGCCAGTCAGATGGCCAGTAGCGAGCGCTCGTCC
CCGAGTCACAGCAGCAATGACAGCGCTCAATTTGATTCTGACACCACAGAACTACACACC
ATCGTCTACGACGCACCATTGGAACAAGAACTTCTGGATGACTCAATTGAGTTGATGGAC
AGCAGACGATCGTCCAGAAATAACGTCACTATTCCTGAAACTCCAGAGGCTAGTGAAGAG
TCCGACTCTGAACTTGAAGTGGCCATAAAAGGAAAGAGGAGATGTAAGGGAAAAAAGAAT
TAA
Protein sequence:
MPPTDAEVRREIVKANPRNDKTMFKIFQNVQYNVVQHRKYVKEMTKLYKKTEADDFKESF
KNALFYLFTFGDTSTNVDRVIQFVATFCTLLDDEEEFLMFIFDIIFECQCVSGQSVRYRA
SQLLAAVLAALGDEASLDDDLCDKLLLHQMQRLQDTRGAVRCRAALALNRLQNPSDPDDE
VTRGYRFHMSCDPSSSVRRAVVMSIAKCTRNVPFVLERLCDVDEAVRRAAFLYIAAMNVT
QLRVRQRVLTLKVGLTERSPRVRRVVEEILIPSWLSTFQGNIIDFLKAIRLDNSHDAKDS
QYVAEKLLESLFKRLPISELLEWLPTDKSLRVIPADKLNKETVWYWRHLAEHLQKNDDDE
TLETVLPDLVVLTGYIKAIVESPCPNEEADPVSYSTRQYVLHELARLLRTYDASDPAGRD
ALQTLITDTLTGDYGPMSGDVIRAFVSALQLVLPDVTSRVELVCNVLSTLREPPEMEEEV
PPPTLDDTEAKLQRARLRVSLNVAMEAQEEAVRHENYTLAAQCKAKVADIQKKLEELTFQ
TKPEQPLTTIKEKQCDVTTLNKCLIILNTLLDTPQLNNVTPMLNLMFSELEVEIFSKPEL
LDNALETVALFGMLDKEFARDHKSFFFANLVDSTNEPTVCKVLKCIVDLLCVHGAKVFDD
GTESIEASRNRSKHSINTTTMDFDESVLSSSQAHSNVIELLLKLMDNACPSYRLIIVEGL
CRLMYLGHLESPYILSRLILLWFNPVSAEEDVLRQTIGIFFQTFPSTVDGAQDQIQKSMI
PTLRALCCAPSSSPICEIDQEAVVKFFVSLTKVSSELTDSQGAMALTLCEYLVRKPTGPA
SALLCRALALLSPPKDVRTAANLATMIKDLCLKLPDKQSCRNLTRYLGALEALEKSNLNK
MSNIGETIDSIQCEDTMNMMGRSATSLPQPLARSTHVVVNETVEEEPEIEETSSGESPTD
PISRISEENKNQTEDMTAAEQTETEVPEKEDSGSDSSSVSPVKKAKISKDKKKNMTSKNE
KDLRKQRQPRNKKDAKDKVRNEEDKEKGVKRSSRSTTAAIRAETDKKCQEHDLQESPPSD
GSNTTVRRSSRGLQSGSSTESTGSKGKKKASQMASSERSSPSHSSNDSAQFDSDTTELHT
IVYDAPLEQELLDDSIELMDSRRSSRNNVTIPETPEASEESDSELEVAIKGKRRCKGKKN