DPGLEAN04356 in OGS1.0

New model in OGS2.0DPOGS213271 
Genomic Positionscaffold1029:- 15346-25968
See gene structure
CDS Length3114
Paired RNAseq reads  306
Single RNAseq reads  857
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001187 (2e-46)
Best Drosophila hit  CG17689, isoform A (2e-81)
Best Human hitfamily with sequence similarity 48, member A isoform b (5e-31)
Best NR hit (blastp)  AGAP012403-PA [Anopheles gambiae str. PEST] (3e-128)
Best NR hit (blastx)  AGAP012403-PA [Anopheles gambiae str. PEST] (1e-100)
GeneOntology terms  GO:0070461 SAGA-type complex
InterPro families  IPR021950 Spt20 family
Orthology groupMCL16613

Nucleotide sequence:

ATGGATGGTTTAATTCACGCGGCATTGGAAGCAGAGGTAATACTTAACCGGGCTAAACAT
GTGAATACAAAGCTAACAAATTTTGACAGTAGTGTCTCAGATCATAAAATGACTTGGACC
CAAGAAAAGATGCACCTAGCGGAAACTGTTGATGAATCAAGGATGAAGTTTCAGAAAAAT
TCTGTCAGTGGTTCAGCAAAAACTGCAGAAAAATTTGATTTATTTAAGAAATTACATGAA
TTGTACAACGAGTTAAGTAGAGATGAAACTTCACAAGCAAACTATCAAGGGCTAAAGACG
ACATCATATTTATTGGAAAAGTTGTTAGCAACCTACAACCTTAATACATTAATAATTAAT
TTATATCCTGGTAACAAGGGTTACTCTCTGTCCCTTAAGATAAATGGAACTGCTCAGAAC
ATAAACCCACCGGATGCAAATGCATCATCAAGTCAAGAAGAAACTTTAATAGAAACTCCT
CGTTGGCCATATGAAGAAGAAGAGTTATTAAGTTATATAGATAATGAAGAATTGCCCGTA
GTCTTGTTAGATCTCCTTGAGTCCGAACACTCGTGTTTATTCTACTCTGGCTGCATCATA
GCTCAGATAAGAGATTATAGGCAAGCATATCCAAACTTTGTCTGTGATACACACCATGTG
CTCTTAAGGCCCACAAACCAGAGTATAATAACGGATGCGATGTGTATCGGTCGAAGCGGT
TGGGCGGGTGAAGAGCGGGGGGCTTTGGAGGCTGTCGAGGCGGCGTTAGTGCACGCGGCC
GCGCCCCCGCTGTGTCTAGAGCCTCGGCCGGCGGTCGGTCTGCTAGCGGCTAGGCTTCAT
GCTGCCCCAAGACTGTTTAATACACCGAGGATACGACGTCAGGCTCGGAGATTTTCACAG
GTGTCGGTTAATAGAAAAAGGAAATTGGACCAGTTCACTCATTATCATGGTCTGGAGTTG
TTGGAGTTAATACACCGTCAGAGAGCGAAAAACAGCCGCCAAACTGTTCCACACACACGG
TTAACATCGAAATTCCCAAAGAAACCACCGGAGGTGTTCAAACCTATAGAACCTCCAAAA
ATGGATCCGTTGCCGCTCGCACTACCGTCTGAACCGAACGCCCCGTTACGGTTGGCCCGC
GCCTATGAGCGTCCACGCCCCACACCGGACTGCCAGCCGCAGTTGGTGGAAGAGTACATC
CTGGAAACTGAGAAGAGCTCCCCGCACGCCGGAGCTGGTTTCTTCCACATCAAACTGTCT
ATACTACAGAGGCCATCCGACCAAGAGTTCCTTGGTGAACTGTATGTTGATAGAGATCAC
GTGGAAGGTGAAAGAAATGGAGCAGCCTGTAGATTCTCATTAGGTTCGCGACTTCAAGCC
AACAAATACATACAACAGTTCACAGAAATTTTCACAGAGGAAGGTAGAAAATCTGTTCGG
ATAAAGCATATTGTGCCCGGACAGTTACCGAGAGTTTCCTTCACAGGAGGCATGAGAGAT
ATGCAAAGAACAGCACAAGCGAACAACTCCACAGTTCAGACTCATGCCACAACCGTGCCT
GTTGTAACCTCCGCCATAGCTGCTACACCCAATGCCCATTCAAACGCAAGACAGCTGCCT
ATACTGCAGGCACAATTGCAACAAGTTGGCAATGTTAACGTGACAGCCGTGGGCACTGTT
GTGGGCAGTGTGGGCACGGTGGGCAACGTGGGCACAGTGGGCAGTGTGGGCAACGTCGGC
ACCGTGGGCACCGTTGGCAACGTGGTAAATGTTGGCCCTGCGACCGGTGTAACAGAAGCA
TTGAAACAGCAACCATCTCCAACAACACCCAGGCTTTCGCCACAGGCGTCAACGAATCAA
TTGCTAGCACAACAGCTCACTAATCCGCCGCAACCTCTCAACCCTCAGAAGATGCAATCA
GCCATCATACACATACAGCATCCCTTGATGTCGTCTTCGGGAACATCACAGGTTCAGAGT
ATACAGTATACAAATACAACGACCAATCAGCAGAAGACGACTATAACTAAAGCGAGATCG
ACGAACCCAGCGATCAACGCGCTCGTTACTAGTCTTATGAATTCAGCGCAACAGTTTCAG
CAAGCGGCAAGTCAAAATGCGGCTAAAGCGGTGGTGAGCACTTCAAGCAGTAACGCTACC
ATCCTGAATCTATTGAACAGCGCACCGGCTGCCATGACTCACGTCACCACCAGCGATAGC
GACACACACAAGCTTCTGACCCGAACTGTTTCTATAGCGGGGGCTAGACTCATAGCTTCG
ACGAGCAGTCATACATTACCTACATATACACAACAGGTAATGACTGGTTACACACGTGAG
AACGAGTCAACGAACGTGTCTAGTAGTGAGAGTGCTTTGCTAGAGAGGTTGATGGGTCCC
GAGCCGTCCTCAACACCGCCGTCTCAGACGCCACAGACGGCACAACCACAGCCACAACCA
CAACCACAACCTGTCTGTCATTTGCAGGTAATGACTGGTTACACACGTGAGAACGAGTCA
ACGAACGTGTCTAGTAGTGAGAGTGCTTTGCTAGAGAGGTTGATGGGTCCCGAGCCGTCC
TCAACACCGCCGTCTCAGACGCCACAGACGGCACAACCACAGCCACAACCACAACCACAA
CCTGTCTGCCATTTGCAGGGTCTAAGTTTAACATCGCTTCAGGGCCTACAGAGTATCCAG
GGGTTGCAAAACGTTCAAGTCCAGATACCTGGTCTATCTGCGCCTATATCATTGTCACTG
AACGTGTCAGGTGCACCCAGCGGGTTGTTGGTCTCAGTGCCGCCTACTACTTCTGTGGTA
CTCACAAATCAGCCTTCAGTGTTGTCATTGCCTATAGCTCAACTGATGTCCGGCGGTGTG
AAGGGCGGCGTCCGCAGTGGATCGGTTCAGGTGGTCCGAGCGCCGCGACCGGCGAGACTA
GTCCGACCCACCCGACCGTCGCTGCCAAATATTACAAACATCACCAATATCACTAACATG
ACGAACATACCATCCACTCCGGGCACAACTCAGTTTATAGCCCAATCGCAGGGACAGAGT
CAAGTGTTGAACGCCCACCAGGTGCGAAGAAAATCAAACCCTGACAGTTCATAG

Protein sequence:

MDGLIHAALEAEVILNRAKHVNTKLTNFDSSVSDHKMTWTQEKMHLAETVDESRMKFQKN
SVSGSAKTAEKFDLFKKLHELYNELSRDETSQANYQGLKTTSYLLEKLLATYNLNTLIIN
LYPGNKGYSLSLKINGTAQNINPPDANASSSQEETLIETPRWPYEEEELLSYIDNEELPV
VLLDLLESEHSCLFYSGCIIAQIRDYRQAYPNFVCDTHHVLLRPTNQSIITDAMCIGRSG
WAGEERGALEAVEAALVHAAAPPLCLEPRPAVGLLAARLHAAPRLFNTPRIRRQARRFSQ
VSVNRKRKLDQFTHYHGLELLELIHRQRAKNSRQTVPHTRLTSKFPKKPPEVFKPIEPPK
MDPLPLALPSEPNAPLRLARAYERPRPTPDCQPQLVEEYILETEKSSPHAGAGFFHIKLS
ILQRPSDQEFLGELYVDRDHVEGERNGAACRFSLGSRLQANKYIQQFTEIFTEEGRKSVR
IKHIVPGQLPRVSFTGGMRDMQRTAQANNSTVQTHATTVPVVTSAIAATPNAHSNARQLP
ILQAQLQQVGNVNVTAVGTVVGSVGTVGNVGTVGSVGNVGTVGTVGNVVNVGPATGVTEA
LKQQPSPTTPRLSPQASTNQLLAQQLTNPPQPLNPQKMQSAIIHIQHPLMSSSGTSQVQS
IQYTNTTTNQQKTTITKARSTNPAINALVTSLMNSAQQFQQAASQNAAKAVVSTSSSNAT
ILNLLNSAPAAMTHVTTSDSDTHKLLTRTVSIAGARLIASTSSHTLPTYTQQVMTGYTRE
NESTNVSSSESALLERLMGPEPSSTPPSQTPQTAQPQPQPQPQPVCHLQVMTGYTRENES
TNVSSSESALLERLMGPEPSSTPPSQTPQTAQPQPQPQPQPVCHLQGLSLTSLQGLQSIQ
GLQNVQVQIPGLSAPISLSLNVSGAPSGLLVSVPPTTSVVLTNQPSVLSLPIAQLMSGGV
KGGVRSGSVQVVRAPRPARLVRPTRPSLPNITNITNITNMTNIPSTPGTTQFIAQSQGQS
QVLNAHQVRRKSNPDSS