DPGLEAN14572 in OGS1.0

New model in OGS2.0DPOGS206067 
Genomic Positionscaffold84:+ 108306-128560
See gene structure
CDS Length2457
Paired RNAseq reads  2051
Single RNAseq reads  5948
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006848 (8e-31)
Best Drosophila hit  CG12104 (3e-11)
Best Human hitTOX high mobility group box family member 3 isoform 2 (3e-23)
Best NR hit (blastp)  high mobility group protein, putative [Ixodes scapularis] (2e-46)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC012723 [Tribolium castaneum] (2e-42)
GeneOntology terms
  
GO:0005634 nucleus
GO:0003677 DNA binding
InterPro families
  
IPR000910 High mobility group, HMG1/HMG2
IPR009071 High mobility group, superfamily
Orthology groupMCL19548

Nucleotide sequence:

ATGTCCACAATAGGCTCCCCTGGGAAAGGCGGCGGCGGCGTTCCGACAACAGACAGCGGG
TGTCAGCCGCCACGTGCTACCCAAGGCCACTCGCTTAGTGTCAAGGTCATTGCCACTCCA
GCTCGGTACACGCGGCACCCACGCTTCTCTGTCACCACAAATAGACCGACCCACTCAGGC
AACCTGTGTACCGCAAGAACACACCTCAACACGTACCACAAACGTGCCCGCCACGACCAT
AACTTGTTCCACATAGCGGTGCTCCCTCGCGGATTCGCAAGTGTGTCCCGTGCCACGGCC
ACGTGGTCGCCGCTCGCCGCTTGCCCACTGACGGTGACGACACCTTACCACTTACGCGTT
CATCTTCCATTGATAAATTCCCAAGAACCGACTTTAGAGGCTTTCGATGGCACAGTGGCG
CCACAGCGTGGCTCGGACGAAATAGAAACTAACAGCGCCCCACCGCAGCGCGCGCCTCCG
CCACGCAGTCTTCCCGCGCGGGCCGCACGTGTGTCTAGCCGGCCCGAAAGCCTTGACACG
TCTCTATCCAACACCCGCGACACAAAAGGAAGTGTCTACGCTATGAATGATCAGACTTTT
CACACGCCATCTTTCGGAGACGAAGAGTTTGACATTCCTCTGATCCACGGCCAGCATGCT
GCCAGTGGACAGAACACACATATGCAATATTCACAACTACATCATTCAGCTCCTCAGGTT
GGCATGATGAATCCAGCTCAAGACGGTCTAGCTCCTCCAGGAGGTGCTCCTTCATATCAA
CAGCCTTTATACTTACAAGAACCCCATACACCTGTCACATCTCACAGCAATGCAACCGCG
CCGGCTGGCAATTATATGATACAGCAACAACCTGGAGGCCAGCAGTTATTAATGCTGCAA
CCGAGCCAAGTAATGAGTGGCCCACCAACTCCTAGCACCCCGACACAGGCTGCCCCTGTC
TATGGATCACCACAGAGAGCATCTCCACCTGGAACTACCAGTGATGATTCTGATGATAGT
GTACCATCTCAACATTCCCAGTTGCCTGGTGATGCGGAAAGCCTTCTGCCTATCCATTAT
TGCTCAGCGAGCCGGCAGCAACAAACTACGCTTCATCAGATTGGTGTAAGTAATATGGCC
GTAAAGAGATCATCACCAGAACCAATGGATAATGGAATAAGTAGGGGACAAATGCAAAAA
AAACCTAAGGTTCAGAAGAAAAAGAAAAAGAGAGATCCTAATGAACCCCAAAAACCTGTA
TCTGCCTATGCCTTATTTTTTCGAGACACTCAAGCTGCCATTAAAGGTCAAAACCCTAAT
GCCAGTTTTGGAGAAGTGTCAAAAATTGTTGCATCTATGTGGGATGGCCTCGATTCAGAA
CACAAAAGCGTATATAAACAAAAAACAGAAGTGGCAAAGAAAGAATATTTAAAAGCATTA
GCAGCATATCGGGCCAGTTTGGTTTCAAAGGGCGGAGAACAAGAAAATCAAGTCATGTAT
AATCACAACAACACAAATGCAAATTATGGGAATTATTATCAAGGTCAAGCCTATGGCAAT
GGTCATCCACCACAGGGCTATGCACCAAATTCGACACCACAAGGTTACACACCACAGAAT
TTCCCCGGAGGACAACCCCAACCTCCATATGGTGGTAATGGACCACAAGGATACCCAACA
AATCCCCAAGCACCATCTCAAAATTATCAAGGAAACATGGGACATAATCCCCAAACATAT
CAAAATGTACCTGGACAACCACCTCAAGGTTACCAAGTGAATACAACATCTTCGTCTCAA
GTGTGTCAACCTAACATTGCTCAATCACCAAGAAACTATCAGCCCGCACAATCTCCTAAC
TATGGAGCCAATAATGCCCTGTCCCCGCCAGGCTACAGACAAGTTCAACCACAATCGCCA
CCTGTGCAACAAGTCCATCCTGCTATGCAGTATCATCACTCCCAACAAATGCAGCAAGTA
CATCAAGTCCAATACCAGCAGCAACAACAAATACATCAGCAAAGTCAGCAGCAAGCTATG
CAACATCAGCAACAGCAACAACACCAGCAGCAGCAGCAACAACAACATCAGCAGCAGCAG
CAACATCAACAGCAACAACACCAGCAGCAGCAGCATCAGCAACAACATCAACAACAAACA
CCACAACAAATACCACCTCAACCATTGCCACCACAAGCACCAATTATAAAAAATGAACAG
CAGTCTCCGAACAACAATGGAACAGGTGTACCTCATCAGTCCCCTGAGCAAACTGAAAAT
AGAAGTACTCCATCATGTATACGCCAAGGCTGCACCAATCCTGCCATCCCAAATAGCGAA
TGGGAAGATGAATATTGTTCTAATGAATGTGTTGTTAGTCACTGCAGGGATGTCTTCAGC
TCATGGGTAGCATCAAATACCAACAATCAGATACAAAATTTTTCTGCTGTGAAGTAA

Protein sequence:

MSTIGSPGKGGGGVPTTDSGCQPPRATQGHSLSVKVIATPARYTRHPRFSVTTNRPTHSG
NLCTARTHLNTYHKRARHDHNLFHIAVLPRGFASVSRATATWSPLAACPLTVTTPYHLRV
HLPLINSQEPTLEAFDGTVAPQRGSDEIETNSAPPQRAPPPRSLPARAARVSSRPESLDT
SLSNTRDTKGSVYAMNDQTFHTPSFGDEEFDIPLIHGQHAASGQNTHMQYSQLHHSAPQV
GMMNPAQDGLAPPGGAPSYQQPLYLQEPHTPVTSHSNATAPAGNYMIQQQPGGQQLLMLQ
PSQVMSGPPTPSTPTQAAPVYGSPQRASPPGTTSDDSDDSVPSQHSQLPGDAESLLPIHY
CSASRQQQTTLHQIGVSNMAVKRSSPEPMDNGISRGQMQKKPKVQKKKKKRDPNEPQKPV
SAYALFFRDTQAAIKGQNPNASFGEVSKIVASMWDGLDSEHKSVYKQKTEVAKKEYLKAL
AAYRASLVSKGGEQENQVMYNHNNTNANYGNYYQGQAYGNGHPPQGYAPNSTPQGYTPQN
FPGGQPQPPYGGNGPQGYPTNPQAPSQNYQGNMGHNPQTYQNVPGQPPQGYQVNTTSSSQ
VCQPNIAQSPRNYQPAQSPNYGANNALSPPGYRQVQPQSPPVQQVHPAMQYHHSQQMQQV
HQVQYQQQQQIHQQSQQQAMQHQQQQQHQQQQQQQHQQQQQHQQQQHQQQQHQQQHQQQT
PQQIPPQPLPPQAPIIKNEQQSPNNNGTGVPHQSPEQTENRSTPSCIRQGCTNPAIPNSE
WEDEYCSNECVVSHCRDVFSSWVASNTNNQIQNFSAVK