New model in OGS2.0 | DPOGS213271  |
---|---|
Genomic Position | scaffold1029:- 15346-25968 |
See gene structure | |
CDS Length | 3114 |
Paired RNAseq reads   | 306 |
Single RNAseq reads   | 857 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001187 (2e-46) |
Best Drosophila hit   | CG17689, isoform A (2e-81) |
Best Human hit | family with sequence similarity 48, member A isoform b (5e-31) |
Best NR hit (blastp)   | AGAP012403-PA [Anopheles gambiae str. PEST] (3e-128) |
Best NR hit (blastx)   | AGAP012403-PA [Anopheles gambiae str. PEST] (1e-100) |
GeneOntology terms   | GO:0070461 SAGA-type complex |
InterPro families   | IPR021950 Spt20 family |
Orthology group | MCL16613 |
Nucleotide sequence:
ATGGATGGTTTAATTCACGCGGCATTGGAAGCAGAGGTAATACTTAACCGGGCTAAACAT
GTGAATACAAAGCTAACAAATTTTGACAGTAGTGTCTCAGATCATAAAATGACTTGGACC
CAAGAAAAGATGCACCTAGCGGAAACTGTTGATGAATCAAGGATGAAGTTTCAGAAAAAT
TCTGTCAGTGGTTCAGCAAAAACTGCAGAAAAATTTGATTTATTTAAGAAATTACATGAA
TTGTACAACGAGTTAAGTAGAGATGAAACTTCACAAGCAAACTATCAAGGGCTAAAGACG
ACATCATATTTATTGGAAAAGTTGTTAGCAACCTACAACCTTAATACATTAATAATTAAT
TTATATCCTGGTAACAAGGGTTACTCTCTGTCCCTTAAGATAAATGGAACTGCTCAGAAC
ATAAACCCACCGGATGCAAATGCATCATCAAGTCAAGAAGAAACTTTAATAGAAACTCCT
CGTTGGCCATATGAAGAAGAAGAGTTATTAAGTTATATAGATAATGAAGAATTGCCCGTA
GTCTTGTTAGATCTCCTTGAGTCCGAACACTCGTGTTTATTCTACTCTGGCTGCATCATA
GCTCAGATAAGAGATTATAGGCAAGCATATCCAAACTTTGTCTGTGATACACACCATGTG
CTCTTAAGGCCCACAAACCAGAGTATAATAACGGATGCGATGTGTATCGGTCGAAGCGGT
TGGGCGGGTGAAGAGCGGGGGGCTTTGGAGGCTGTCGAGGCGGCGTTAGTGCACGCGGCC
GCGCCCCCGCTGTGTCTAGAGCCTCGGCCGGCGGTCGGTCTGCTAGCGGCTAGGCTTCAT
GCTGCCCCAAGACTGTTTAATACACCGAGGATACGACGTCAGGCTCGGAGATTTTCACAG
GTGTCGGTTAATAGAAAAAGGAAATTGGACCAGTTCACTCATTATCATGGTCTGGAGTTG
TTGGAGTTAATACACCGTCAGAGAGCGAAAAACAGCCGCCAAACTGTTCCACACACACGG
TTAACATCGAAATTCCCAAAGAAACCACCGGAGGTGTTCAAACCTATAGAACCTCCAAAA
ATGGATCCGTTGCCGCTCGCACTACCGTCTGAACCGAACGCCCCGTTACGGTTGGCCCGC
GCCTATGAGCGTCCACGCCCCACACCGGACTGCCAGCCGCAGTTGGTGGAAGAGTACATC
CTGGAAACTGAGAAGAGCTCCCCGCACGCCGGAGCTGGTTTCTTCCACATCAAACTGTCT
ATACTACAGAGGCCATCCGACCAAGAGTTCCTTGGTGAACTGTATGTTGATAGAGATCAC
GTGGAAGGTGAAAGAAATGGAGCAGCCTGTAGATTCTCATTAGGTTCGCGACTTCAAGCC
AACAAATACATACAACAGTTCACAGAAATTTTCACAGAGGAAGGTAGAAAATCTGTTCGG
ATAAAGCATATTGTGCCCGGACAGTTACCGAGAGTTTCCTTCACAGGAGGCATGAGAGAT
ATGCAAAGAACAGCACAAGCGAACAACTCCACAGTTCAGACTCATGCCACAACCGTGCCT
GTTGTAACCTCCGCCATAGCTGCTACACCCAATGCCCATTCAAACGCAAGACAGCTGCCT
ATACTGCAGGCACAATTGCAACAAGTTGGCAATGTTAACGTGACAGCCGTGGGCACTGTT
GTGGGCAGTGTGGGCACGGTGGGCAACGTGGGCACAGTGGGCAGTGTGGGCAACGTCGGC
ACCGTGGGCACCGTTGGCAACGTGGTAAATGTTGGCCCTGCGACCGGTGTAACAGAAGCA
TTGAAACAGCAACCATCTCCAACAACACCCAGGCTTTCGCCACAGGCGTCAACGAATCAA
TTGCTAGCACAACAGCTCACTAATCCGCCGCAACCTCTCAACCCTCAGAAGATGCAATCA
GCCATCATACACATACAGCATCCCTTGATGTCGTCTTCGGGAACATCACAGGTTCAGAGT
ATACAGTATACAAATACAACGACCAATCAGCAGAAGACGACTATAACTAAAGCGAGATCG
ACGAACCCAGCGATCAACGCGCTCGTTACTAGTCTTATGAATTCAGCGCAACAGTTTCAG
CAAGCGGCAAGTCAAAATGCGGCTAAAGCGGTGGTGAGCACTTCAAGCAGTAACGCTACC
ATCCTGAATCTATTGAACAGCGCACCGGCTGCCATGACTCACGTCACCACCAGCGATAGC
GACACACACAAGCTTCTGACCCGAACTGTTTCTATAGCGGGGGCTAGACTCATAGCTTCG
ACGAGCAGTCATACATTACCTACATATACACAACAGGTAATGACTGGTTACACACGTGAG
AACGAGTCAACGAACGTGTCTAGTAGTGAGAGTGCTTTGCTAGAGAGGTTGATGGGTCCC
GAGCCGTCCTCAACACCGCCGTCTCAGACGCCACAGACGGCACAACCACAGCCACAACCA
CAACCACAACCTGTCTGTCATTTGCAGGTAATGACTGGTTACACACGTGAGAACGAGTCA
ACGAACGTGTCTAGTAGTGAGAGTGCTTTGCTAGAGAGGTTGATGGGTCCCGAGCCGTCC
TCAACACCGCCGTCTCAGACGCCACAGACGGCACAACCACAGCCACAACCACAACCACAA
CCTGTCTGCCATTTGCAGGGTCTAAGTTTAACATCGCTTCAGGGCCTACAGAGTATCCAG
GGGTTGCAAAACGTTCAAGTCCAGATACCTGGTCTATCTGCGCCTATATCATTGTCACTG
AACGTGTCAGGTGCACCCAGCGGGTTGTTGGTCTCAGTGCCGCCTACTACTTCTGTGGTA
CTCACAAATCAGCCTTCAGTGTTGTCATTGCCTATAGCTCAACTGATGTCCGGCGGTGTG
AAGGGCGGCGTCCGCAGTGGATCGGTTCAGGTGGTCCGAGCGCCGCGACCGGCGAGACTA
GTCCGACCCACCCGACCGTCGCTGCCAAATATTACAAACATCACCAATATCACTAACATG
ACGAACATACCATCCACTCCGGGCACAACTCAGTTTATAGCCCAATCGCAGGGACAGAGT
CAAGTGTTGAACGCCCACCAGGTGCGAAGAAAATCAAACCCTGACAGTTCATAG
Protein sequence:
MDGLIHAALEAEVILNRAKHVNTKLTNFDSSVSDHKMTWTQEKMHLAETVDESRMKFQKN
SVSGSAKTAEKFDLFKKLHELYNELSRDETSQANYQGLKTTSYLLEKLLATYNLNTLIIN
LYPGNKGYSLSLKINGTAQNINPPDANASSSQEETLIETPRWPYEEEELLSYIDNEELPV
VLLDLLESEHSCLFYSGCIIAQIRDYRQAYPNFVCDTHHVLLRPTNQSIITDAMCIGRSG
WAGEERGALEAVEAALVHAAAPPLCLEPRPAVGLLAARLHAAPRLFNTPRIRRQARRFSQ
VSVNRKRKLDQFTHYHGLELLELIHRQRAKNSRQTVPHTRLTSKFPKKPPEVFKPIEPPK
MDPLPLALPSEPNAPLRLARAYERPRPTPDCQPQLVEEYILETEKSSPHAGAGFFHIKLS
ILQRPSDQEFLGELYVDRDHVEGERNGAACRFSLGSRLQANKYIQQFTEIFTEEGRKSVR
IKHIVPGQLPRVSFTGGMRDMQRTAQANNSTVQTHATTVPVVTSAIAATPNAHSNARQLP
ILQAQLQQVGNVNVTAVGTVVGSVGTVGNVGTVGSVGNVGTVGTVGNVVNVGPATGVTEA
LKQQPSPTTPRLSPQASTNQLLAQQLTNPPQPLNPQKMQSAIIHIQHPLMSSSGTSQVQS
IQYTNTTTNQQKTTITKARSTNPAINALVTSLMNSAQQFQQAASQNAAKAVVSTSSSNAT
ILNLLNSAPAAMTHVTTSDSDTHKLLTRTVSIAGARLIASTSSHTLPTYTQQVMTGYTRE
NESTNVSSSESALLERLMGPEPSSTPPSQTPQTAQPQPQPQPQPVCHLQVMTGYTRENES
TNVSSSESALLERLMGPEPSSTPPSQTPQTAQPQPQPQPQPVCHLQGLSLTSLQGLQSIQ
GLQNVQVQIPGLSAPISLSLNVSGAPSGLLVSVPPTTSVVLTNQPSVLSLPIAQLMSGGV
KGGVRSGSVQVVRAPRPARLVRPTRPSLPNITNITNITNMTNIPSTPGTTQFIAQSQGQS
QVLNAHQVRRKSNPDSS