New model in OGS2.0 | DPOGS205269  |
---|---|
Genomic Position | scaffold766:- 3008-11059 |
See gene structure | |
CDS Length | 3585 |
Paired RNAseq reads   | 2446 |
Single RNAseq reads   | 5826 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011069 (0.0) |
Best Drosophila hit   | CG2469, isoform A (0.0) |
Best Human hit | RNA polymerase-associated protein CTR9 homolog (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to tpr repeat nuclear phosphoprotein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to SH2 domain binding protein 1 [Apis mellifera] (0.0) |
GeneOntology terms   | GO:0005488 binding |
InterPro families    | IPR011990 Tetratricopeptide-like helical IPR019734 Tetratricopeptide repeat IPR013026 Tetratricopeptide repeat-containing IPR013105 Tetratricopeptide TPR2 IPR001440 Tetratricopeptide TPR-1 |
Orthology group | MCL12535 |
Nucleotide sequence:
ATGTCCACCGATGAGGTGATAGAGCTTGATCCCGAATCTTTACCGTGTGGTGAAGAAGTG
CTTAGTATATTGCAACAAGAGAGATCCCAGCTAAATGTCTGGATTAATGTCGCCCTTGCC
TACTACAAACAAAACAAGATCGATGATTTCCTTAAAATTCTCGAGGCATCCCGAGTGGAT
GCAAATATTGACTATAGGGACTTTGAAAGGGATCAAATGCGAGCACTTGATATGTTGGCT
GCATACTATGTTCAAGAGGCAAATAAAGAAAAGTCTAAAGACAAGAAAAAAGAGCTTTTT
ACTGAAGCTACTTTGCTTTATACTATGGCAGATAAAATTATTATGTATGATCAGAATCAT
CTTCTTGGGCGAGCATATTTCTGTCTTCTTGAGGGTGATAAAATGGCACAAGCAGACACA
CAGTTCAATTTTGTCCTCAATCAATCACCAAATAATGTGCCATCACTGCTCGGTAAAGCC
TGTATTGCTTTTAACCGAAAGGATTATAGAGGAGCTCTAGCATTTTACAAAAAAGCTCTA
AGGACGAATCCTAACAGCCCAGCTGCTTTACGTTTGGGAATGGGCCATTGTTTCATGAAA
TTAAATAATCAGGAAAAAGCCAGAATGGCGTTTGAGAGGGCATTGCAACTTGATCCTCAA
TGTGTTGGAGCTTTAGTTGGGCTGTCGATCTTGAAATTGAATTTACAAGAGAGTGAATCC
AATAAGATGGCAGTCATCATGTTGTCTAAGGCATACGCAATTGATCCCAAAAACCCAATG
GTTTTAAATCATTTGGCAAATCATTTCTTCTTTAAAAAGGATTACAGCAAAGTCCAACAT
CTTGCTCTGCACGCTTATCACAACACTGAGAATGACGCAATGAGGGCGGAAAGTTGTCAT
CATTTGGCCAGAGCTTATCATGCGCAAGGTGATTGCGTTAAAGCATTCCAATACTACTAC
CAAGCTACGCAGTTTGCGCCACCGAATTTCGTACTACCGCATTATGGCCTCGGACAAATG
TATATTTACAGAGGTGACACTGAAAATGCGGCTCAATGCTTTGAAAAAGTTCTTAAAGCT
CAACCAGGCAACTACGAAACTATGAAGATTCTAGGATCTCTATATGCTAACTCTCCATCT
CAATTACAACGGGATATAGCGAGACAGCATCTCAAGAAAGTAACCGAACAATTTCCTGAC
GATGTAGAGGCTTGGATTGAATTGGCACAAATTTTGGAACAAAATGATTTACAGGGTTCA
CTGAATGCATATACCACTGCCATGAAAATACTTAAAGAAAAAGTAAATGCTGAGATTCCA
GCAGAAATATTAAACAATGTTGCCGCATTGCATTACCGACTTGGTAATTTAAATGAAGCT
ATGAAATATTTGGAGGAAGCCTTGGAAAGAGAGAAAGTGGAAGCGGAGACTCTCGATGCC
CAGTACTACAATTCAATATTAGTGACAACCATGTACAACCTGGCGAGACTTAACGAAGCG
CTCTGTGTATACAACAAGGCTGAGAAACTGTACAAAGATATATTGAAGGAGCATCCCAAT
TATATTGATTGCTACTTGAGATTGGGCTGCATGGCAAGAGATAAAGGGCAGATATACGAA
GCGTCGGACTGGTTCAAAGAAGCTTTGAAAGTGAATATAGAGCACCCGGACACGTGGTCG
CTGCTGGGGAACCTTCACCTGGCGCAGCAGGAGTGGGGTCCGGGGCAGAAGAAGTTTGAA
CGGATCTTACAAAACTCCACCACGTCCAACGACGCCTACTCGCTGATTGCGCTCGGCAAC
GTGTGGCTTCAGACTCTACACCAGCCAGGTCGTGAGAAGGATCGCGAGAAGAGACATCAA
GAACGAGCTCTCGCTTTATATAAACAGGTGCTGAAGAACGATCCGAAGAATATATGGGCA
GCCAACGGTATAGGATGCGTGCTCGCGCACAAAGGTTGCATCAACGAAGCTCGCGATATC
TTCGCCCAAGTGCGGGAAGCGACGGCTGATTTTCCAGACGTCTGGATGAACATTGCTCAT
ATATACGTGGATCAGAAACAATACATAAATGCCATACAAATGTACGAGAACTGCATAAGA
AAGTTTCGGACCCATCACGACGTGGAGTGGTTGACGTGGCTCGGGCGCGCTCAGACACTA
GCGGGTAGAGCGCGTGCCGCTCGCACGTCTCTACTGAGAGCACGTCGGGTAGCGCCCCAC
GACCCCGCCCTACTCTACAACACCGCGCTCGCTCTACGACGCCTGGCTGCCCACGTGCTG
AAAGACGAACGATCCGAACTCAGGGTCGTACTGAGAGCGGTTCATGAACTACATGTCTCA
CATAGATACTTCCAACGTCTTGGGGCAGCGGCCGCGGCCGAGGCCAGGACATGCGCCGAC
CTACTCTCACAGGCGCAGTGGCATGTAGCGAGAGCGAGACGGCAACACCAGGAGGAACTC
ACACTCAGGGACAAGCAACGAGAACAACGAGAGGCCTTCAGGAAACAACAGGAGGAAGAA
CGCAAACGGAGGGAGGAGGAACAAGCGAAGAGCACAGTGGAAATGTTACAGAAGCGACAA
GAATACAAGGAGAAGACAAAGAACGCTTTGTTATTCGCGGATATGCCGTCTGAGAGCAAA
CAGAAAGGACGAGGCCGCAGGAGAGACGAGTATATATCGGACTCTGGCAGCGAACAAGAC
AGGCCGAGGGAAGAAGGGGAACCGAAACAACGTAAACGCAAGCGCGATGCCGAAGGACGT
AAAGGTAAAACTAAGAAGAAGAATCAACGAACCAGCGACACTGATAGTGATGCACCGAGG
AATAAGAGCAGAAAGAAGGGCGAAAGGGGTATCGGTAAACGCGAGAAAGCCAAAATGGCT
GAGGATAAGCTGGGAGCTAAACAGCGCGCTAAGATCGTATCAAAAGAAACTATATCGACT
TCGGAATCAGACTCCGATGGTGGTAAGCGCAAGTCACGCAGCCGGAGTCGCGGCGGAAGT
CGCAGCGGAAGCGGGAGTCGAAGTCCTCCCGCCAATAAGGGACGGAAGAGGATTATGTCG
GCATCAGACAATCACGTTCAAAAAGTCGATCGAAATCGAGAAGTCGCTCGCGGTCCGGTT
CAGCTAAGAGCCGACCAAAATCAAAGAGTCGATCCAGGTCTAAAGACAGATCCAGATCGA
AGAGCCGCTCGAAAAGCCGATCTAGATCAAAAAGTAGATCTCGCTCAAAAAGCCCTTCCA
GGTCGAAGAGTCGCTCTCGGTCCAAAAGTGGATCTAGATCCAAAAGCCCATCGAGATCGC
GTTCAAGATCCAAAAGCCGTTCTAGATCAAAGTCGAAGAGTGCGTCGCGTTCCAGGTCAA
GATCCAAAAGTGGATCACGCAGTCGTTCTGGATCGAGATCTCACTCTGGATCTAGATCTA
GATCCGGTTCAAGAAACTCTAGACCCGCGACACCCGAATCTAGAAAATCAGTCTCAGCTA
GTGAAGATGAAGCTTAGTCGGATATTTAAGATGACGTATAAACACAAGATATCTCTAAGA
TATTCGGAAATCGCTTTCATTGTGTTTTTAATTTTCTTCGGTTAA
Protein sequence:
MSTDEVIELDPESLPCGEEVLSILQQERSQLNVWINVALAYYKQNKIDDFLKILEASRVD
ANIDYRDFERDQMRALDMLAAYYVQEANKEKSKDKKKELFTEATLLYTMADKIIMYDQNH
LLGRAYFCLLEGDKMAQADTQFNFVLNQSPNNVPSLLGKACIAFNRKDYRGALAFYKKAL
RTNPNSPAALRLGMGHCFMKLNNQEKARMAFERALQLDPQCVGALVGLSILKLNLQESES
NKMAVIMLSKAYAIDPKNPMVLNHLANHFFFKKDYSKVQHLALHAYHNTENDAMRAESCH
HLARAYHAQGDCVKAFQYYYQATQFAPPNFVLPHYGLGQMYIYRGDTENAAQCFEKVLKA
QPGNYETMKILGSLYANSPSQLQRDIARQHLKKVTEQFPDDVEAWIELAQILEQNDLQGS
LNAYTTAMKILKEKVNAEIPAEILNNVAALHYRLGNLNEAMKYLEEALEREKVEAETLDA
QYYNSILVTTMYNLARLNEALCVYNKAEKLYKDILKEHPNYIDCYLRLGCMARDKGQIYE
ASDWFKEALKVNIEHPDTWSLLGNLHLAQQEWGPGQKKFERILQNSTTSNDAYSLIALGN
VWLQTLHQPGREKDREKRHQERALALYKQVLKNDPKNIWAANGIGCVLAHKGCINEARDI
FAQVREATADFPDVWMNIAHIYVDQKQYINAIQMYENCIRKFRTHHDVEWLTWLGRAQTL
AGRARAARTSLLRARRVAPHDPALLYNTALALRRLAAHVLKDERSELRVVLRAVHELHVS
HRYFQRLGAAAAAEARTCADLLSQAQWHVARARRQHQEELTLRDKQREQREAFRKQQEEE
RKRREEEQAKSTVEMLQKRQEYKEKTKNALLFADMPSESKQKGRGRRRDEYISDSGSEQD
RPREEGEPKQRKRKRDAEGRKGKTKKKNQRTSDTDSDAPRNKSRKKGERGIGKREKAKMA
EDKLGAKQRAKIVSKETISTSESDSDGGKRKSRSRSRGGSRSGSGSRSPPANKGRKRIMS
ASDNHVQKVDRNREVARGPVQLRADQNQRVDPGLKTDPDRRAARKADLDQKVDLAQKALP
GRRVALGPKVDLDPKAHRDRVQDPKAVLDQSRRVRRVPGQDPKVDHAVVLDRDLTLDLDL
DPVQETLDPRHPNLENQSQLVKMKLSRIFKMTYKHKISLRYSEIAFIVFLIFFG