New model in OGS2.0 | DPOGS208896  |
---|---|
Genomic Position | scaffold53:57030-63778 (- strand) |
CDS Length | 3441 |
Paired RNAseq reads   | 1528 |
Single RNAseq reads   | 3600 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002469 (0.0) |
Best Drosophila hit   | ND |
Best Human hit | tetratricopeptide repeat protein 21B (2e-169) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL012104 [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | tetratricopeptide repeat domain 21B [Culex quinquefasciatus] (0.0) |
GeneOntology terms | GO:0005488 binding; GO:0005856 cytoskeleton; GO:0005929 cilium; GO:0005737 cytoplasm; GO:0035085 cilium axoneme |
InterPro families | IPR019734 Tetratricopeptide repeat; IPR013026 Tetratricopeptide repeat-containing; IPR011990 Tetratricopeptide-like helical |
Orthology group | MCL13149 |
Nucleotide sequence:
ATGCATATGTCTGTAGGAAATTTGAAAAGCTCTATATTAGATTGCTTTGAATTAGCTTTA
CAAAAGAGTGATAAAAATGTTGAAGCACTCATTGGTTTGGCTAAATTTAAATACCTTACT
AAGAATTACGATGCATCTAATTTGACTTTAGATAAACTTATTGTTAACAATCCCGGTCAA
GTGGTACCGCTTGTTGAGAAAATGAGAAATGAATTTGCTGCACAGAAGTGGGATGTTGTG
TATGACACAATCGAAAGGGTTTTCTCTATAGATCCCGTTAATATTGAAGCTTTGAAGATA
AAAATTTTTATTGCACTCTGTAAAAATGCTGACTATATTGAGGCAGTTGACGAACTAAAC
AAGTTCTTTTCTATTCTTGAGGCTGAAGAGGCTACTAACGGTTATCAGTTCTATACCACG
GCCCAGTTGTTTTCAAAATTATGTGGAAGATCAAGCGCTGTACTGTCGCAGGCATATAGA
TTTGCTCAATATGCATCAGAATTACAACCGAGTAATGTTGATTACTTAAGTGAAGTCGGC
TACCAATGTATCTTGCAGGGGAAATACAAAGATGCCTTAAACTTTTTCAGAGCAGCAAGC
AAAGTTGATAGCAATTCCATCACTGCTTTATGCGGACTTACTTTATGTCAAATGTATGAT
AACGGCACCACCAGTCAGATAGCAGAACAAGTTGAACTCCTATATGAAATGCAGGGTACT
GAAAAATTTACAGTTTTATATTTACTATCAGCACAATTAAATAGCAAGAATGCAGAGAAA
TTTTTATACAATGCCATTGAAACGCAAATCTCGCAGGCAGAATGTTATCCATACAGTATG
GTGTATCTAAAGAAACTGGATCCAGATTTCTTGTTACAAGTATACAAAGAACTCAAAAAA
CTTCTGCCAAAGAAGCCATTTATTGTTGTTGGTCATCTCCTTTATTCTCAAGATGGCAAC
AACACTTTAGTGGCAAATTGTTTCAAACTGTTGACAGCAATTACAAATTCATGTCCCGGT
TTGATACCTGCTTTGTATGAATTGGCGAAATTGAAGTTCCTATTTGGTTACTCCAACGAG
GCAAAGGCATTTGCCCAACAAATTATCGATCTTGACAACACACACGCCGGTAGCCAAGTG
CTATTGGCCCAAATATACGTCCAACAAAGCAATTTTAATAAAGCCCTACAGAGTTTGGAA
ATGTGTTTGAGTTACAACTTTAAAATTCGTGACAGTGCAATGTATCACTTTCTGAATGGC
GTCATAATCAAAAATATGAATCAAATTCAGGACGCTCTCGCAAGTTTTAATACAAGCTTG
CAGATAGCCAATAATAAGAATAATATAACAAGGCAGTATGATTCGGATCTCAATATAATT
GATAAGGCTACATTATTCTTTCTGATAATAGAACTACAGACGTCTTTGGGACAATTTGCG
GAGGCTGGGAAAACTATGCAAGAAGCAATACAAGAAATATCTTACACGTCGGAGGAAAGT
CGTTTACAAATAGCTCGCGCAGACTTAGCATTAAGCAATGGAGATGTCGACAACGCTATA
GAAATCCTCAACGAAGTAAAACCGGGACAGAAGTATTATTTCCAAGCTCACAGCAAAATG
GCGCATATCTACCTAAAAGAGAAGCAAGATAAGGCCATGTTCACATCGTGTTTTAAAGAT
ATAGTGAACAATCATCCTATGGTAGACGCTCATACGATGATGGGAGATGCATTCATGTCC
ATACACGAACCAAATCAAGCGGCTGCATCATATGATGTAGCGCTTAAAGGGAATCCGAAA
GACATTCGGTTAATAAAGAAAATGGGTACAGCGCTGTTGAAGATGCATGAATATGATAAA
CTTACACAACACTATGAGAACGCTATCAGAACGTTGAACGATGATGAACTGAGATTTGAG
TATCTGGAACTACTTATAAAGCTCAAACAGTACGACAAGGTCGACACAACGATCTCATCA
GAACTAAACCAACATTATAACAAGGAAAAAGACATCAATACATTGAGACGACGCGTTAGA
TTGCTTCTAATACAAGCGAGGAGTAGGGAAATGAAAAATCCAACAGCCGGGAACACCAAC
CTGATACTGGCAGAAGCCAGAGACCTACAAATGTCGGTATTGAAAAGAGTTGAAATAGAT
TCGAGGGCAGATATAGAAGAAGAAAAGAAAATGCTTTCGTCCATCCTCTGTTTATTGGCG
AAAGCGAGATCTGCGAAGGAACCAGCCGTGGCAACGAACTTATACTCGGAGGCTTTGATA
TATTCACCGCGGGACCCTGATACACTATTGGCGTTGGCGAAGTTATACGCACAGATGAAC
AATCCCGAGCGCTGTGAACAAACTTGCACGTTGCTATTGAACGCCGATCCTAACAACGAG
GCAGCGGCAGTCATGATGGCGGACCTCGCGTTCCGTAAGGCTGATCTGGAAACGGCTCAG
CGCCATCTAAGCCAGATATTATCAGTGAGGCCGTTGAGTTTCGATGCTTTGTCACAACTC
ATTGAGGTGCAATGGCGCAGAGGAATGCTAACAGCCGTCGAGTCAGCTATAGAGGACGCG
AGGAGAGAACTCTCTGGAAAGGAAGATCCTGGGTTCTCGTATTGCTGTGGTGTGTTTAGT
TCCTACAACGGCGCCGTGAACCAGGCTCTCCGTCAATTGAACGTGTCGCGGCGTTCTCGC
TGGTCGACTCTCGCCGCTAGACGGATGATACTGCTGTGTATGGAACAAGCCACTGAGACA
GACGGACAGAATGCCGACTCTGATACACATCTGCTAGCTCTTCGAACAGCGGAGAAGCTT
TTGAACGAAGTGTCGTCATCCGAGCGCAGGTCCCTATCAGCGTTATTACAGTTAGCAACG
AGGCAAAAGTCACAAGCGGAGAGAGTTTTGCAGGAACTCCTACCGTTAGCGACCGAAGAC
ATTCACCAAGATGACCCATACCTTATATTGGCCATTGCTAATGCGTATAACATTGTAAAA
CAACCAACTCGAGCAAAGAACATTCTGAAGAGAACTATATCTTCTATTCCCTGGACGCCT
GAGAGAGCGGATGGCTTGGAGAGATGTTGGTTGGAGGTGGTAGATAACCAGATAAGTTCA
GGGAGAATGGATGCCGCAAAAGAGATTTTAACAAAAATTCTCAATCACAACAAATCCTGT
GCTCAGGCTTATCAATATTTAGGTTTCTTGGCAGAAAAGGAACAAAATTACAAGAGCTCG
GCCGTCAACTATGACAACGCTTGGAAATATACAGGAAAAAATGACCTGGCCGTTGGATAC
AAACTAGCGTACGCCTACCTCAAGCTAAAGAAATACCCTGAATGTATAGTCGTGTGTAGA
CATATATTAAAAGTTCACCCAGACTACCCCAAGATAAAAAAAGAAATTCTAGAAAAAGCT
AAGACCAATTTGAGAACGTGA
Protein sequence:
MHMSVGNLKSSILDCFELALQKSDKNVEALIGLAKFKYLTKNYDASNLTLDKLIVNNPGQ
VVPLVEKMRNEFAAQKWDVVYDTIERVFSIDPVNIEALKIKIFIALCKNADYIEAVDELN
KFFSILEAEEATNGYQFYTTAQLFSKLCGRSSAVLSQAYRFAQYASELQPSNVDYLSEVG
YQCILQGKYKDALNFFRAASKVDSNSITALCGLTLCQMYDNGTTSQIAEQVELLYEMQGT
EKFTVLYLLSAQLNSKNAEKFLYNAIETQISQAECYPYSMVYLKKLDPDFLLQVYKELKK
LLPKKPFIVVGHLLYSQDGNNTLVANCFKLLTAITNSCPGLIPALYELAKLKFLFGYSNE
AKAFAQQIIDLDNTHAGSQVLLAQIYVQQSNFNKALQSLEMCLSYNFKIRDSAMYHFLNG
VIIKNMNQIQDALASFNTSLQIANNKNNITRQYDSDLNIIDKATLFFLIIELQTSLGQFA
EAGKTMQEAIQEISYTSEESRLQIARADLALSNGDVDNAIEILNEVKPGQKYYFQAHSKM
AHIYLKEKQDKAMFTSCFKDIVNNHPMVDAHTMMGDAFMSIHEPNQAAASYDVALKGNPK
DIRLIKKMGTALLKMHEYDKLTQHYENAIRTLNDDELRFEYLELLIKLKQYDKVDTTISS
ELNQHYNKEKDINTLRRRVRLLLIQARSREMKNPTAGNTNLILAEARDLQMSVLKRVEID
SRADIEEEKKMLSSILCLLAKARSAKEPAVATNLYSEALIYSPRDPDTLLALAKLYAQMN
NPERCEQTCTLLLNADPNNEAAAVMMADLAFRKADLETAQRHLSQILSVRPLSFDALSQL
IEVQWRRGMLTAVESAIEDARRELSGKEDPGFSYCCGVFSSYNGAVNQALRQLNVSRRSR
WSTLAARRMILLCMEQATETDGQNADSDTHLLALRTAEKLLNEVSSSERRSLSALLQLAT
RQKSQAERVLQELLPLATEDIHQDDPYLILAIANAYNIVKQPTRAKNILKRTISSIPWTP
ERADGLERCWLEVVDNQISSGRMDAAKEILTKILNHNKSCAQAYQYLGFLAEKEQNYKSS
AVNYDNAWKYTGKNDLAVGYKLAYAYLKLKKYPECIVVCRHILKVHPDYPKIKKEILEKA
KTNLRT
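
The record can be checked for internal consistency: a CDS of 3441 nt that ends in a stop codon implies a protein of (3441 - 3) / 3 = 1146 residues, and translating the nucleotide sequence should reproduce the protein sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1). The helper names `translate` and `expected_protein_length` are hypothetical and are not part of any annotation pipeline.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon.

    Whitespace and line breaks are ignored, so the sequence can be
    pasted as the 60-column block shown in the record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Protein length implied by a CDS that includes one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last codons of the nucleotide sequence in this record.
print(translate("ATGCATATGTCTGTAGGA"))     # MHMSVG
print(translate("AAGACCAATTTGAGAACGTGA"))  # KTNLRT*
print(expected_protein_length(3441))       # 1146
```

To check the whole record, pass the full nucleotide block to `translate`, strip the trailing `*`, and compare the result with the protein block after removing line breaks.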