DPGLEAN16790 in OGS1.0

New model in OGS2.0  DPOGS205464
Genomic Position  scaffold224:- 70220-72116
CDS Length  1554
Paired RNAseq reads  1012
Single RNAseq reads  2658
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008274 (4e-178)
Best Drosophila hit  bobby sox, isoform C (2e-42)
Best Human hit  HMG box transcription factor BBX isoform 2 (8e-23)
Best NR hit (blastp)  PREDICTED: similar to bobby sox CG1414-PC [Tribolium castaneum] (2e-60)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL011161 [Aedes aegypti] (2e-56)
GeneOntology terms
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003677 DNA binding
InterPro families
IPR009071 High mobility group, superfamily
IPR000910 High mobility group, HMG1/HMG2
Orthology group  MCL17874

Nucleotide sequence:

ATGAACGCCTTTCTCATATTCTGCAAACGTCACCGTAGCGTCGTCAGAGACAAGTATCCA
AATTTAGAAAACAGATCTATAACTAAAATTCTCGGAGAATGGTGGGCCAATTTGGACCAG
GAAGAGAAAGCTTGTTATACAGGTCTTGCTAAACAGTACAAAGACGCGTTCTTCAACGCC
AACCCTGACTTCAAATGGTACAAGTTGCCCGCACCACCGCTACGTACACTATCCTCCCGG
CCTCGGGAGTCCACTGAGAAAGCAGAATCCTCCAACGAATATAACGACCATGAACTGGAA
AAAACAAATAATAATTCCACAAAATTCGACTACAATAAGAAACCAGACAGTGATCCAGAA
ACTGAAGCTAAACCACTATCCATGTTCACGCCTGGAAAGTTAGCCGATGAAGCTCAAATG
GGAGGTCTGAGTAGCCTCCTAGCTACCAAGACTGAAGTCCAAACTCCTAATCCATATTAT
TCTCCTCCTTCCTGTAAATTTAACGCGATCTCAACTCCAATCGAATACAGATCTCATGAT
TCAATAGACAGACCCAAAACCAAACGAGAAGACAGCATACGTGAGCTTCAAAATGCACTT
ACAGAAACAACTCGAATGTTCGAAGAAGATTTCGATGAAAAGGAACAATTACGTTATTAC
GGTGCAGCTAACGACCAATTCACGAACCAAGATGTTATCGACCAAATAGTTGATAAACGC
TATTCTAAAGATGATGAAGGATACCAGAGAAACTGGTCGGATGATGAGAAAAATTCCAAG
TCTGGTAGAACTTGCAAAGGGAAAAGATATCAAGAGTTCATGGCTGTTGGAGGACTGATA
GTGAATAAGAGGCCGAGGAGAGATTATCCCGATAGATTGTCGGACGAAGGCTACAATGCA
TCTTGTAGCTGGGATCCTGGATCTTCGCTCGAGGAATCAACAATGACAATGGCAGACGAG
TCGACGCCCGACACTAACTATATACAGCACGACATAACAGTCGAAAGCGAACCAAATGTG
GAACCAAACGACACGCCCGAAATAGACAATAACTGTAACAAATCATTTAAGGCTGCCGAC
TTCGATTTAGAAGCTAAAATAAGAGCTCTACCCTCCCTTAGTTTAGAGAAGTTTCAACAG
AAAAAACGCGAGAATAAACGTAAGAAAAAAAACGTTAGCCTGAGAACTAAATCCGTCAAG
TCATCGCAAATAATAAACTCGGTTCCGCGTCCGGTTATGGACGAGCGTCATGAAATGGCG
GAGAATTGGCGCGAGACCGTCATAGGGAGCCAGAAACGGAAACCGAGGAAAATAAGCATC
ACACGACTCGAAATCAACAGCCTCGTCTCCAGTAACATGAACGGCGGCAATAAAATCAGT
CCAGAAATAAAAATTGCCACAGAAGCTCCTTGCACCATCCAAAGCATGGACATATGCAAT
CAGAGCCATGGTAATGTCGACCTGTTCGCGTTGGCCACGTTGGCCGAGGTCGCTGCCAAC
ACGTCCAAAATAGAGCAGACCAATGCGGCAAGCGAAGACGCTTCCAAGGTATGA

Protein sequence:

MNAFLIFCKRHRSVVRDKYPNLENRSITKILGEWWANLDQEEKACYTGLAKQYKDAFFNA
NPDFKWYKLPAPPLRTLSSRPRESTEKAESSNEYNDHELEKTNNNSTKFDYNKKPDSDPE
TEAKPLSMFTPGKLADEAQMGGLSSLLATKTEVQTPNPYYSPPSCKFNAISTPIEYRSHD
SIDRPKTKREDSIRELQNALTETTRMFEEDFDEKEQLRYYGAANDQFTNQDVIDQIVDKR
YSKDDEGYQRNWSDDEKNSKSGRTCKGKRYQEFMAVGGLIVNKRPRRDYPDRLSDEGYNA
SCSWDPGSSLEESTMTMADESTPDTNYIQHDITVESEPNVEPNDTPEIDNNCNKSFKAAD
FDLEAKIRALPSLSLEKFQQKKRENKRKKKNVSLRTKSVKSSQIINSVPRPVMDERHEMA
ENWRETVIGSQKRKPRKISITRLEINSLVSSNMNGGNKISPEIKIATEAPCTIQSMDICN
QSHGNVDLFALATLAEVAANTSKIEQTNAASEDASKV
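
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked against each other and against the CDS Length field. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code and a CDS that starts in frame and ends with a stop codon. The names translate_cds and lengths_consistent are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon order (assumption:
# the record uses the standard nuclear code, translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as a multi-line block.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def lengths_consistent(cds_length, protein_length):
    """True if the CDS length equals three bases per residue plus one stop codon."""
    return cds_length == 3 * (protein_length + 1)


# First line of the nucleotide sequence in this record.
first_line = "ATGAACGCCTTTCTCATATTCTGCAAACGTCACCGTAGCGTCGTCAGAGACAAGTATCCA"
print(translate_cds(first_line))
print(lengths_consistent(1554, 517))
```

Under these assumptions, a CDS of 1554 bases corresponds to 518 codons, that is 517 residues plus the terminal stop codon. To check the whole record, pass the full nucleotide block to translate_cds and compare the result with the protein block.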