New model in OGS2.0 | DPOGS206067  |
---|---|
Genomic Position | scaffold84:+ 108306-128560 |
CDS Length | 2457 |
Paired RNAseq reads   | 2051 |
Single RNAseq reads   | 5948 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006848 (8e-31) |
Best Drosophila hit   | CG12104 (3e-11) |
Best Human hit | TOX high mobility group box family member 3 isoform 2 (3e-23) |
Best NR hit (blastp)   | high mobility group protein, putative [Ixodes scapularis] (2e-46) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC012723 [Tribolium castaneum] (2e-42) |
Gene Ontology terms    | GO:0005634 nucleus; GO:0003677 DNA binding |
InterPro families    | IPR000910 High mobility group, HMG1/HMG2; IPR009071 High mobility group, superfamily |
Orthology group | MCL19548 |
Nucleotide sequence:
ATGTCCACAATAGGCTCCCCTGGGAAAGGCGGCGGCGGCGTTCCGACAACAGACAGCGGG
TGTCAGCCGCCACGTGCTACCCAAGGCCACTCGCTTAGTGTCAAGGTCATTGCCACTCCA
GCTCGGTACACGCGGCACCCACGCTTCTCTGTCACCACAAATAGACCGACCCACTCAGGC
AACCTGTGTACCGCAAGAACACACCTCAACACGTACCACAAACGTGCCCGCCACGACCAT
AACTTGTTCCACATAGCGGTGCTCCCTCGCGGATTCGCAAGTGTGTCCCGTGCCACGGCC
ACGTGGTCGCCGCTCGCCGCTTGCCCACTGACGGTGACGACACCTTACCACTTACGCGTT
CATCTTCCATTGATAAATTCCCAAGAACCGACTTTAGAGGCTTTCGATGGCACAGTGGCG
CCACAGCGTGGCTCGGACGAAATAGAAACTAACAGCGCCCCACCGCAGCGCGCGCCTCCG
CCACGCAGTCTTCCCGCGCGGGCCGCACGTGTGTCTAGCCGGCCCGAAAGCCTTGACACG
TCTCTATCCAACACCCGCGACACAAAAGGAAGTGTCTACGCTATGAATGATCAGACTTTT
CACACGCCATCTTTCGGAGACGAAGAGTTTGACATTCCTCTGATCCACGGCCAGCATGCT
GCCAGTGGACAGAACACACATATGCAATATTCACAACTACATCATTCAGCTCCTCAGGTT
GGCATGATGAATCCAGCTCAAGACGGTCTAGCTCCTCCAGGAGGTGCTCCTTCATATCAA
CAGCCTTTATACTTACAAGAACCCCATACACCTGTCACATCTCACAGCAATGCAACCGCG
CCGGCTGGCAATTATATGATACAGCAACAACCTGGAGGCCAGCAGTTATTAATGCTGCAA
CCGAGCCAAGTAATGAGTGGCCCACCAACTCCTAGCACCCCGACACAGGCTGCCCCTGTC
TATGGATCACCACAGAGAGCATCTCCACCTGGAACTACCAGTGATGATTCTGATGATAGT
GTACCATCTCAACATTCCCAGTTGCCTGGTGATGCGGAAAGCCTTCTGCCTATCCATTAT
TGCTCAGCGAGCCGGCAGCAACAAACTACGCTTCATCAGATTGGTGTAAGTAATATGGCC
GTAAAGAGATCATCACCAGAACCAATGGATAATGGAATAAGTAGGGGACAAATGCAAAAA
AAACCTAAGGTTCAGAAGAAAAAGAAAAAGAGAGATCCTAATGAACCCCAAAAACCTGTA
TCTGCCTATGCCTTATTTTTTCGAGACACTCAAGCTGCCATTAAAGGTCAAAACCCTAAT
GCCAGTTTTGGAGAAGTGTCAAAAATTGTTGCATCTATGTGGGATGGCCTCGATTCAGAA
CACAAAAGCGTATATAAACAAAAAACAGAAGTGGCAAAGAAAGAATATTTAAAAGCATTA
GCAGCATATCGGGCCAGTTTGGTTTCAAAGGGCGGAGAACAAGAAAATCAAGTCATGTAT
AATCACAACAACACAAATGCAAATTATGGGAATTATTATCAAGGTCAAGCCTATGGCAAT
GGTCATCCACCACAGGGCTATGCACCAAATTCGACACCACAAGGTTACACACCACAGAAT
TTCCCCGGAGGACAACCCCAACCTCCATATGGTGGTAATGGACCACAAGGATACCCAACA
AATCCCCAAGCACCATCTCAAAATTATCAAGGAAACATGGGACATAATCCCCAAACATAT
CAAAATGTACCTGGACAACCACCTCAAGGTTACCAAGTGAATACAACATCTTCGTCTCAA
GTGTGTCAACCTAACATTGCTCAATCACCAAGAAACTATCAGCCCGCACAATCTCCTAAC
TATGGAGCCAATAATGCCCTGTCCCCGCCAGGCTACAGACAAGTTCAACCACAATCGCCA
CCTGTGCAACAAGTCCATCCTGCTATGCAGTATCATCACTCCCAACAAATGCAGCAAGTA
CATCAAGTCCAATACCAGCAGCAACAACAAATACATCAGCAAAGTCAGCAGCAAGCTATG
CAACATCAGCAACAGCAACAACACCAGCAGCAGCAGCAACAACAACATCAGCAGCAGCAG
CAACATCAACAGCAACAACACCAGCAGCAGCAGCATCAGCAACAACATCAACAACAAACA
CCACAACAAATACCACCTCAACCATTGCCACCACAAGCACCAATTATAAAAAATGAACAG
CAGTCTCCGAACAACAATGGAACAGGTGTACCTCATCAGTCCCCTGAGCAAACTGAAAAT
AGAAGTACTCCATCATGTATACGCCAAGGCTGCACCAATCCTGCCATCCCAAATAGCGAA
TGGGAAGATGAATATTGTTCTAATGAATGTGTTGTTAGTCACTGCAGGGATGTCTTCAGC
TCATGGGTAGCATCAAATACCAACAATCAGATACAAAATTTTTCTGCTGTGAAGTAA
Protein sequence:
MSTIGSPGKGGGGVPTTDSGCQPPRATQGHSLSVKVIATPARYTRHPRFSVTTNRPTHSG
NLCTARTHLNTYHKRARHDHNLFHIAVLPRGFASVSRATATWSPLAACPLTVTTPYHLRV
HLPLINSQEPTLEAFDGTVAPQRGSDEIETNSAPPQRAPPPRSLPARAARVSSRPESLDT
SLSNTRDTKGSVYAMNDQTFHTPSFGDEEFDIPLIHGQHAASGQNTHMQYSQLHHSAPQV
GMMNPAQDGLAPPGGAPSYQQPLYLQEPHTPVTSHSNATAPAGNYMIQQQPGGQQLLMLQ
PSQVMSGPPTPSTPTQAAPVYGSPQRASPPGTTSDDSDDSVPSQHSQLPGDAESLLPIHY
CSASRQQQTTLHQIGVSNMAVKRSSPEPMDNGISRGQMQKKPKVQKKKKKRDPNEPQKPV
SAYALFFRDTQAAIKGQNPNASFGEVSKIVASMWDGLDSEHKSVYKQKTEVAKKEYLKAL
AAYRASLVSKGGEQENQVMYNHNNTNANYGNYYQGQAYGNGHPPQGYAPNSTPQGYTPQN
FPGGQPQPPYGGNGPQGYPTNPQAPSQNYQGNMGHNPQTYQNVPGQPPQGYQVNTTSSSQ
VCQPNIAQSPRNYQPAQSPNYGANNALSPPGYRQVQPQSPPVQQVHPAMQYHHSQQMQQV
HQVQYQQQQQIHQQSQQQAMQHQQQQQHQQQQQQQHQQQQQHQQQQHQQQQHQQQHQQQT
PQQIPPQPLPPQAPIIKNEQQSPNNNGTGVPHQSPEQTENRSTPSCIRQGCTNPAIPNSE
WEDEYCSNECVVSHCRDVFSSWVASNTNNQIQNFSAVK
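
The CDS length, nucleotide sequence, and protein sequence listed in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that ends in a single stop codon. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of any annotation pipeline.

```python
# Standard genetic code (NCBI translation table 1), codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First 18 and last 9 nucleotides of the sequence in this record.
    print(translate("ATGTCCACAATAGGCTCC"))   # MSTIGS
    print(translate("GTGAAGTAA"))            # VK*
    print(expected_protein_length(2457))     # 818
```

The listed CDS length of 2457 implies 818 residues, which matches the protein sequence above (13 lines of 60 residues plus a final line of 38). Passing the full nucleotide sequence to `translate` and stripping the trailing `*` would allow a residue-by-residue comparison with the listed protein.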