New model in OGS2.0 | DPOGS214287  |
---|---|
Genomic Position | scaffold225:- 84632-92102 |
See gene structure | |
CDS Length | 3396 |
Paired RNAseq reads   | 6607 |
Single RNAseq reads   | 16751 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005999 (8e-106) |
Best Drosophila hit   | CG6621 (5e-92) |
Best Human hit | tetratricopeptide repeat protein 14 isoform a (5e-51) |
Best NR hit (blastp)   | PREDICTED: similar to CG6621 CG6621-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG6621 CG6621-PA [Tribolium castaneum] (5e-169) |
GeneOntology terms    | GO:0005575 cellular_component GO:0008150 biological_process GO:0005488 binding |
InterPro families    | IPR001440 Tetratricopeptide TPR-1 IPR019734 Tetratricopeptide repeat IPR003029 Ribosomal protein S1, RNA-binding domain IPR013026 Tetratricopeptide repeat-containing IPR011990 Tetratricopeptide-like helical |
Orthology group | MCL18020 |
Nucleotide sequence:
ATGGAACCAACTTTGGATGCGTCTTTGGTGGCGCAGTCCATAAATTATCATGGGCAGCAG
CTACAAAAAACGTGGGAGGCCGAACGAGGTGAAGATGATTTATCCAAGATTGGTGTGGGG
CCTTTAGACTTTGCTGTGTACCAGTCCAGGCACAAGCATTTGACTTTTCAAGACAGAGGG
AAGAGACTCAAATTACATCAGTTCATCGCGAAGGAAGCATCAGCATTATTCGATGCATCT
CTACTGGACGAAACTCCGTCATCTTCCAGCGTAAGCGCTGAAGCTCCTACACCAGAAGAC
AATCTATTTGCACTGATGCCGCCATTTGAAACATTTTTACATGTTGACAAATCAGACAGG
CTGAGACATTTTTTTGATAATGTGAAAACAGGGGAGCTGATAATAGGTGCTGTTATCAAC
AGAACAGCATCAGGGATGATGCTTAAGGTGCTATGTACTGCTGGACCTACTTCTAGATAT
GTTGCTGACATCAATGTTAAGGCATTCTTACCTGTTGCTAATATCATACCGGCGGTGGAC
AAGAAAAATGTATCGAGAAACTACCTGATGAATGATACTGTGTGCTGTGAAGTAATTGAA
GTTATTCCCGACACTGACAAAATGGTGTGCGGCATGAAGGGTGTTACTCGGAAGCCTGAA
GATTCTCCACCGAAGCCTCCGCTAGGCCTCCTCAGCACTGACGACTTTCCTTTGATATAC
AAGAAAACTATGGAAATGAAGGGAGAGAGTTATGAAGCTATTTTGGAGAAGAGCCCAGGA
TTCAATAATCCTAACTGCGTCCAATATCTCTCAGAACTTCTTGGCATATCAAATATGCAT
TGCAGCAATTTTTCAACATTAAGGGGAGGATTTTCAGCTGCAGAATATGCTGATGAACTC
CGTCAAGCTCAAGCAAGCAAGTGGGCATTCCGGTCAGTAGCTGAAGGAATTGAACACTTT
AAAGCAGGAAGACATTCAGAAGCATTTCAATGTCTCAATAAAGCACTCAGCATTGATCCC
CGAAACGTAGAAGGCCTTGTTGCGAGAGGTGCTCTGTATGCTAATAGTGGAACATTTAAG
AAAGCCATAGAAGACTTTGAAACTTCTCTAAAACTGAACCCTAACCATGCAAATGCACGA
AAATATTTGGGAGAAACGTTAGTCGCTCTCGGACGCAGCTATGAAGATGAGAACAAAATT
GCTGAAGCCCAAAAAGCTTACGAGGATTGTTTGGCGATTATACCATTCCATGAGGAGGCT
CAGAATTCACTAGACTTTCTGAAGAGTAAGACGTCTACCACCAAGCCATTAATAGAGCCG
GCCGAGTTACTTCTACCTGGATTAACAGGAGCTAAATCGTTTGAGATGAAAGAAACACTG
AAGCAATTGTTGAATCTAACAGAGAAGAAGGAAAAGAAGAAGAAGAAGAAGCGTGGGAAA
GGCAAAAAGAAGCGCTCCAGTAGTTCCTCGTCGTCTTCGAGTGACTCGTCGAGCTCGAGC
TCCTCGTCCGAGTCATCGTCCTCGTCAACAGAATCTAGTGGTTCAGAGGGTCCAAATAGG
AAGAAGAAACGTCGCTCGCAATCAAACAACAAGCGACAGAGGTCGCTGTCGCCTCTGAGC
AAGCGTATGGCTATGCTGGGAGACGCTGAGTCGGCGTCACGTACACACAACTCGCAGTTC
AACCACCCGTATGGTTATCAGCCGCCGCCGCCCGCAGAAGAACCCGCCGCGCCCGGCAGG
TCTCAGGCCGATATTGATTATGAATTGAAGGTACGCAAGTTCCTGGACATGACGAAAGAA
GATTCTGATTATGAAGAAAAAGTTCGAAACTTCTTGGAAGAGACGGCGCAATACAAACGA
AATCGAAAAATGCAAGAACTCGGTCAGCAGACACAACCGGGCGCTGAACATGATAAGAAG
AAGAAGAGAAAGAAGGATAAGAAAAAGAAGAAGGAATCAAAACGCAAACGCAAAGAACAA
GAGAGAGAAGAGAAAAGAAAAAATAAGATCGCTCGTATGTCAAACAGTTCCGATTATAAT
CTACGTGATATAGAAAATATTGGTGATAAAAAACTGAGAGATGCTATAAGAAAAGAATTG
AAAGGAAAATCAAAGAGAGATCACAGTTCAGATGGTGAATATGAAAAAAAACACAATGAA
AAGAGTCGCATACTTGATGAAATGCACGGACTGGAGGAGCTTGAATCCAAGCTGAGTGCG
TACCACGTGATGGTGGAAAAGGAAATCGGTAAACGAGACAGATCTCTCAGTCCGCTGGAC
CAGGTGCCGCCGCCGCCGCTTGACAAGCCCAAGTGGAAAATGTCAATGAACGCTGTCAAA
GAAACGGTCAAGAAGAAGGATACTCCAGTACAAAAAGGATACAAGGAGCGTTACGCATTT
GAAGATAGCTCTGACGACTCTCAAGATCCTCGAAAGCCGTCACCATCGAGCGGCGACAAG
AATGTGTCTGTTCGACGCGCAATGGCCATGTCTATGAAGGAGCCGCCGCCGCTACCGTCA
GCGCCGCCACCCAAGAGCAGCCGCGAGCCCGACCCGCCTGGCACGGACCCGCCGCACACA
CATCAACACCCACACACGCACCCGCACCCACATCTGCACCCTCCTCACGCACCCCCGGTG
CGTAAAGGTAATATAGTGCTGGACAAGTTTGGATCATTCCGATTGGCTCAAGAAGGTGAG
ACGCCGGTGTCTGTAGGAGACGGACGACCAGAACAGTTCGTGACCCGCATCAAGCCGCCG
ACGCCCTCACAACGAAGACCTCGCTCACCACCATCACCCAGGAGAAGGTCATCCAACTCC
TCTGACGATAGACGCTCGGCTAAACGATCTAGAAGCCGTTCCATGCCACGGAAGTATCGT
TCCCGCTCCGGATCCCGTTCCCGTTCCGGTTCCAGCGCGAGTGGCTCGGTGGCGTCTCGC
CGCAGTCGCACCGTGTCGCCGAGATATCGCTCCAGATCTGATTCCTACTCACGAAGCAGA
TCACGCTCTCGATCGGGATCGCGCGACAGAAATCGTCGCATGAATCGTCGCGGCAATTGG
CGCGGACGCGGCGGTTTCGAGCGTGGCACCTACTACCGTCCCCGTTTCCACACTTACAAT
GGTGGCGGGAACCGTGGTAGGGGGCGCGGTGACTTCCGCAGGGACGACGGACGTCGCTTC
CAACACGAGTGGAGGGATAATCGATCTCGTGGAGGACGGCCTTTCAGACCCAGGAGAGGA
GGCGGCGGACGCGGCAGACCTTTCAGGGGTGGCTTCCGTGACTTCCGCGACAGACGCGGC
GGTAGATATTCCCGGTCCCGCAGCCCCGACAGAACACGCAGGTCCAGGTCATACAGCCCG
GAGAGAAGAGACAAGGACAGAGACAGGTTAGTGTAA
Protein sequence:
MEPTLDASLVAQSINYHGQQLQKTWEAERGEDDLSKIGVGPLDFAVYQSRHKHLTFQDRG
KRLKLHQFIAKEASALFDASLLDETPSSSSVSAEAPTPEDNLFALMPPFETFLHVDKSDR
LRHFFDNVKTGELIIGAVINRTASGMMLKVLCTAGPTSRYVADINVKAFLPVANIIPAVD
KKNVSRNYLMNDTVCCEVIEVIPDTDKMVCGMKGVTRKPEDSPPKPPLGLLSTDDFPLIY
KKTMEMKGESYEAILEKSPGFNNPNCVQYLSELLGISNMHCSNFSTLRGGFSAAEYADEL
RQAQASKWAFRSVAEGIEHFKAGRHSEAFQCLNKALSIDPRNVEGLVARGALYANSGTFK
KAIEDFETSLKLNPNHANARKYLGETLVALGRSYEDENKIAEAQKAYEDCLAIIPFHEEA
QNSLDFLKSKTSTTKPLIEPAELLLPGLTGAKSFEMKETLKQLLNLTEKKEKKKKKKRGK
GKKKRSSSSSSSSSDSSSSSSSSESSSSSTESSGSEGPNRKKKRRSQSNNKRQRSLSPLS
KRMAMLGDAESASRTHNSQFNHPYGYQPPPPAEEPAAPGRSQADIDYELKVRKFLDMTKE
DSDYEEKVRNFLEETAQYKRNRKMQELGQQTQPGAEHDKKKKRKKDKKKKKESKRKRKEQ
EREEKRKNKIARMSNSSDYNLRDIENIGDKKLRDAIRKELKGKSKRDHSSDGEYEKKHNE
KSRILDEMHGLEELESKLSAYHVMVEKEIGKRDRSLSPLDQVPPPPLDKPKWKMSMNAVK
ETVKKKDTPVQKGYKERYAFEDSSDDSQDPRKPSPSSGDKNVSVRRAMAMSMKEPPPLPS
APPPKSSREPDPPGTDPPHTHQHPHTHPHPHLHPPHAPPVRKGNIVLDKFGSFRLAQEGE
TPVSVGDGRPEQFVTRIKPPTPSQRRPRSPPSPRRRSSNSSDDRRSAKRSRSRSMPRKYR
SRSGSRSRSGSSASGSVASRRSRTVSPRYRSRSDSYSRSRSRSRSGSRDRNRRMNRRGNW
RGRGGFERGTYYRPRFHTYNGGGNRGRGRGDFRRDDGRRFQHEWRDNRSRGGRPFRPRRG
GGGRGRPFRGGFRDFRDRRGGRYSRSRSPDRTRRSRSYSPERRDKDRDRLV