New model in OGS2.0 | DPOGS211550  |
---|---|
Genomic Position | scaffold8729:+ 621-13594 |
See gene structure | |
CDS Length | 3111 |
Paired RNAseq reads   | 2114 |
Single RNAseq reads   | 4836 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014363 (1e-07) |
Best Drosophila hit   | cramped, isoform A (2e-43) |
Best Human hit | protein cramped-like (7e-35) |
Best NR hit (blastp)   | PREDICTED: similar to cramped [Tribolium castaneum] (1e-57) |
Best NR hit (blastx)   | PREDICTED: similar to cramped [Tribolium castaneum] (2e-55) |
GeneOntology terms    | GO:0045449 regulation of transcription GO:0007379 segment specification GO:0005634 nucleus GO:0003677 DNA binding GO:0006260 DNA replication |
InterPro families    | IPR009057 Homeodomain-like IPR017884 SANT, eukarya IPR001005 SANT domain, DNA binding |
Orthology group | MCL13233 |
Nucleotide sequence:
ATGGTGGGGATAGCCATGAGGATTCTTTGGAAGTGTACTCCACAGGTAGCGATCAAAGCA
GGCGTCATGTCTTCGGGAGTTCAACAGGATGAAGAGCGGACCGAGCTGTTGGGATCGTTA
ACAACACAACAACAACAGAGGACCAGCGCCAGGGTCATTAAGAAACTGAGACTGGAACCG
CAAATTGACAAACGAGACGTCATAGAATGCGAGACTCCAAACAAAATAGACGATAAAGAC
CCTTTAAAGTTCCCAACGGTCAAACAACGTATGCCCAAGGCGTTGTGGTCGGCAGATGAG
AAAAGCCTTTTCTTCGAAGCCTTGAACGAATACGGCAAGGACTTCGACTCAATCACCTCG
TACATTTGCGCTAAAATGAAGAAGAAGGGCATGCTAGATGGAAATCTGAAGACTAAGACC
CAAGTCAGCCATTTTTATTACAGGACGTGGCATAAGCTGTCCAAACACGTGCGTTTTGAT
GAAAATGTCAAAAAAGTAGCCCAGGAGCTTTACGCTCTGATAAATTATGGCGAGTTGAGA
AGGAAGCTGGTGTCTGTTAATGAGAAGATCTGCGCTCGTTTGGGAGAAATGGTCCGTGGA
GGATCCATAGCTGTGAGGACCAAGGGAAAGACGATCAGAGTCAGAACACCAATGTGCAGA
GCTCTCAGACGACTCAATCAAATCGCGGAGCGCGCGTACGGCGCTCGTGTGTGCACCCGG
GCTCAAGTGATATTGCGCGCTCGGGACGCGGCGTCTTGGACGCGCGTGCAGGCCGCCGCA
CACAACCCGCGGGCGGTCGTCGCGCTCACTCTCAGGACGAGGCTCGTGGCGCTGCTGTGG
GCTCTCGAGAGACGGTGGAATTGTAAGCCATGTGTTGTAAAAAAAAGTGATTTAAAAACC
GAGGAATATTCTTCGCCATTTTGTGAGAACGGAGACAAGCTGGACGAACACCTCAACTTG
GAAACAGACAGGACCATAATCCAAGAGCGTGCGCCCCCCGGTGTGTCGCTCCACGTCGGT
CCTCGGCCTGAGGCCGACGTCCGTCTGCCGGAGCTTCGCCCTCGCGAGCCTCTGTCGAGT
CAGAAGATCTGCTTTGCTTCATATCTGGAAAGAATGGGCGCTTTGAGAAGACAGGACGGA
GATGTCAAAATTCGTACGCCGAAACGGCACAGAAAAGACAGCGTGTCCGATAAAGATAAG
GAAAATGACAAGAAAATTAAAATAGAGGACATAGAAACAAACAAACTGATTAACATAGAA
GAAACAGCCATAGACGGCATAGAACTGATGGCGCATTACAAGAACAACCAGGAAGATGAA
GAGAAACCCAGCACTGAGGAAGAGAAGGAGATGAGGTCGGAGGAGGTCGTGGAGGAAGAT
TCAGAGAGGGACAAAGACGTGATAGAGAGAGACAATGACGTGTTGGACAGAGACAGAGAC
ATGCGAGAAAGAGACAAAGATATGTTGGACAGAGAGAAAGAGATGCCAGAGAGAGACAAG
GATATGTTAGAGAGAGAGAAAGACAGCTTCTCAGAAATGGAAGACGATGAGAAATATAAC
AAGAGTGACACAGACAATGAGAGCGATGGACGAGAGAAGAAACAGATGAAATATAAGAAT
TTGAAGGTGAAGTTCCGTATACGTCCCAAGAAGAGAGGTGGCTCCGTCTACACCCTCGTG
TTGGATCATGATAAGAATAAAGAGGAAGATGCCAAGATAGATGAAAAGGAGGAGAAGGAA
GAAGAACAGAAGCCGGATATAGATGTAGACTTCGCGATGAGGCAGGTCAGGAAAGGATGG
AGCGTTTGGGACGCGGGCGATCTTACCATAGGAGATTTGTATTTGATGTTCGGTTCCCGC
TCGAGATTAGAATTGGACTACTGGTGGGCGGAGCCTACACCCCCACTACCTAAGACGAGG
ACGGAAGATAGGCTCGATAGGAGAGAGAAAGGGAACAAAACGCCAGACAAGGGAGACAGC
TCAGTAGAAGACGAGAGGGAGAGGTGTGACAACGACATACTCTCACCCAAGAACACGTTC
AGCCAAGACAGTAACGACGGCCTGTCCGGCGACGAGAGGAAGAGTGAGCAGCTGTCGTCG
CCGGACCACAAGTCCGGGACCTTGAAGCTGGTCAGTAAACTGATCAACCGACCCACGCAC
GTCTCCACCAACAACGGCTACTCGCTGGTTAGCGACAGACTTCGGCGACTGCTGGCGCTG
GCGGGGAACAGCCACTTTGGTGGAGGGGCGAGAGGAGTGCACGCGCGGAAACACGGAGCG
ACACATGTGCAGAAGCCCCCCGCGTGTAGATTGAGTCCGACGAGGTCTCCACCCGCCACG
GGCCCCGCGCTCTTCAGACACCCCGCACCCATCGCTCCCAAGCCAGAGCCGGAGTCGTGT
CAGTCGCCTATTAGTCTCAATGGTCTGCCTAAGTGGCGCCGCGGGCGACCTTCCACCGAC
AGACGGGTCGTGGTCCAACGTCTCCTGCCCCTCATGCCCAAACTACCACCACCAAATAAT
CTGATTCCCGTGAAGATGGTGTCCAACTCCCAGCCGGTCCAACCGAGGCTGGTCCCCAAG
CCGCCGCCGTCAAGCTCCTCCGACCTGTCGATGTACTACGTGCTGAGTCAGTCCAACGGA
CAGTTCTTCTTCCACGACGGCGACAGACGGATCCCCATCCTCACCGACTCCGGGAACACC
TCCCAGAACGCCGAAGACGCCAAGGCTGATGATGGAGAGGTTAAGGAAGACAAGGAGTCG
GACATCAAGATAGTTAAAGTAGAAGCTGGCGTCGAGAACGGGGGGGAGGCAGGGGAGGGG
CGGACATACGAGCAGAATGATATCTCAAGTTTCCTCCCGTCCGAGTCCCTGTCGCTGTCT
CCATCCCGCTTGCTCCGCGCGCCGGGCGAGGGGGGCGAGGCGGACTGGCTCGACACACAC
GACTTCTCACTCAGCAGCTTCCTCTCCCACCTGGAGAAGGCCCAGCAGGAGCTACCGGTG
GATTCCCATCTCCAGTCGTTGATGGCTGAAAGCAGTGTGGACTACGTGGCCAAGTTCGCT
GACCTCGCCGCTGAGGTGGACGACCAAGACCTCAGTGACGACCTGCCTTAA
Protein sequence:
MVGIAMRILWKCTPQVAIKAGVMSSGVQQDEERTELLGSLTTQQQQRTSARVIKKLRLEP
QIDKRDVIECETPNKIDDKDPLKFPTVKQRMPKALWSADEKSLFFEALNEYGKDFDSITS
YICAKMKKKGMLDGNLKTKTQVSHFYYRTWHKLSKHVRFDENVKKVAQELYALINYGELR
RKLVSVNEKICARLGEMVRGGSIAVRTKGKTIRVRTPMCRALRRLNQIAERAYGARVCTR
AQVILRARDAASWTRVQAAAHNPRAVVALTLRTRLVALLWALERRWNCKPCVVKKSDLKT
EEYSSPFCENGDKLDEHLNLETDRTIIQERAPPGVSLHVGPRPEADVRLPELRPREPLSS
QKICFASYLERMGALRRQDGDVKIRTPKRHRKDSVSDKDKENDKKIKIEDIETNKLINIE
ETAIDGIELMAHYKNNQEDEEKPSTEEEKEMRSEEVVEEDSERDKDVIERDNDVLDRDRD
MRERDKDMLDREKEMPERDKDMLEREKDSFSEMEDDEKYNKSDTDNESDGREKKQMKYKN
LKVKFRIRPKKRGGSVYTLVLDHDKNKEEDAKIDEKEEKEEEQKPDIDVDFAMRQVRKGW
SVWDAGDLTIGDLYLMFGSRSRLELDYWWAEPTPPLPKTRTEDRLDRREKGNKTPDKGDS
SVEDERERCDNDILSPKNTFSQDSNDGLSGDERKSEQLSSPDHKSGTLKLVSKLINRPTH
VSTNNGYSLVSDRLRRLLALAGNSHFGGGARGVHARKHGATHVQKPPACRLSPTRSPPAT
GPALFRHPAPIAPKPEPESCQSPISLNGLPKWRRGRPSTDRRVVVQRLLPLMPKLPPPNN
LIPVKMVSNSQPVQPRLVPKPPPSSSSDLSMYYVLSQSNGQFFFHDGDRRIPILTDSGNT
SQNAEDAKADDGEVKEDKESDIKIVKVEAGVENGGEAGEGRTYEQNDISSFLPSESLSLS
PSRLLRAPGEGGEADWLDTHDFSLSSFLSHLEKAQQELPVDSHLQSLMAESSVDYVAKFA
DLAAEVDDQDLSDDLP