DPGLEAN03531 in OGS1.0

New model in OGS2.0DPOGS208896 
Genomic Positionscaffold53:- 57030-63778
See gene structure
CDS Length3441
Paired RNAseq reads  1528
Single RNAseq reads  3600
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002469 (0.0)
Best Drosophila hit  ND
Best Human hittetratricopeptide repeat protein 21B (2e-169)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL012104 [Aedes aegypti] (0.0)
Best NR hit (blastx)  tetratricopeptide repeat domain 21B [Culex quinquefasciatus] (0.0)
GeneOntology terms



  
GO:0005488 binding
GO:0005856 cytoskeleton
GO:0005929 cilium
GO:0005737 cytoplasm
GO:0035085 cilium axoneme
InterPro families

  
IPR019734 Tetratricopeptide repeat
IPR013026 Tetratricopeptide repeat-containing
IPR011990 Tetratricopeptide-like helical
Orthology groupMCL13149

Nucleotide sequence:

ATGCATATGTCTGTAGGAAATTTGAAAAGCTCTATATTAGATTGCTTTGAATTAGCTTTA
CAAAAGAGTGATAAAAATGTTGAAGCACTCATTGGTTTGGCTAAATTTAAATACCTTACT
AAGAATTACGATGCATCTAATTTGACTTTAGATAAACTTATTGTTAACAATCCCGGTCAA
GTGGTACCGCTTGTTGAGAAAATGAGAAATGAATTTGCTGCACAGAAGTGGGATGTTGTG
TATGACACAATCGAAAGGGTTTTCTCTATAGATCCCGTTAATATTGAAGCTTTGAAGATA
AAAATTTTTATTGCACTCTGTAAAAATGCTGACTATATTGAGGCAGTTGACGAACTAAAC
AAGTTCTTTTCTATTCTTGAGGCTGAAGAGGCTACTAACGGTTATCAGTTCTATACCACG
GCCCAGTTGTTTTCAAAATTATGTGGAAGATCAAGCGCTGTACTGTCGCAGGCATATAGA
TTTGCTCAATATGCATCAGAATTACAACCGAGTAATGTTGATTACTTAAGTGAAGTCGGC
TACCAATGTATCTTGCAGGGGAAATACAAAGATGCCTTAAACTTTTTCAGAGCAGCAAGC
AAAGTTGATAGCAATTCCATCACTGCTTTATGCGGACTTACTTTATGTCAAATGTATGAT
AACGGCACCACCAGTCAGATAGCAGAACAAGTTGAACTCCTATATGAAATGCAGGGTACT
GAAAAATTTACAGTTTTATATTTACTATCAGCACAATTAAATAGCAAGAATGCAGAGAAA
TTTTTATACAATGCCATTGAAACGCAAATCTCGCAGGCAGAATGTTATCCATACAGTATG
GTGTATCTAAAGAAACTGGATCCAGATTTCTTGTTACAAGTATACAAAGAACTCAAAAAA
CTTCTGCCAAAGAAGCCATTTATTGTTGTTGGTCATCTCCTTTATTCTCAAGATGGCAAC
AACACTTTAGTGGCAAATTGTTTCAAACTGTTGACAGCAATTACAAATTCATGTCCCGGT
TTGATACCTGCTTTGTATGAATTGGCGAAATTGAAGTTCCTATTTGGTTACTCCAACGAG
GCAAAGGCATTTGCCCAACAAATTATCGATCTTGACAACACACACGCCGGTAGCCAAGTG
CTATTGGCCCAAATATACGTCCAACAAAGCAATTTTAATAAAGCCCTACAGAGTTTGGAA
ATGTGTTTGAGTTACAACTTTAAAATTCGTGACAGTGCAATGTATCACTTTCTGAATGGC
GTCATAATCAAAAATATGAATCAAATTCAGGACGCTCTCGCAAGTTTTAATACAAGCTTG
CAGATAGCCAATAATAAGAATAATATAACAAGGCAGTATGATTCGGATCTCAATATAATT
GATAAGGCTACATTATTCTTTCTGATAATAGAACTACAGACGTCTTTGGGACAATTTGCG
GAGGCTGGGAAAACTATGCAAGAAGCAATACAAGAAATATCTTACACGTCGGAGGAAAGT
CGTTTACAAATAGCTCGCGCAGACTTAGCATTAAGCAATGGAGATGTCGACAACGCTATA
GAAATCCTCAACGAAGTAAAACCGGGACAGAAGTATTATTTCCAAGCTCACAGCAAAATG
GCGCATATCTACCTAAAAGAGAAGCAAGATAAGGCCATGTTCACATCGTGTTTTAAAGAT
ATAGTGAACAATCATCCTATGGTAGACGCTCATACGATGATGGGAGATGCATTCATGTCC
ATACACGAACCAAATCAAGCGGCTGCATCATATGATGTAGCGCTTAAAGGGAATCCGAAA
GACATTCGGTTAATAAAGAAAATGGGTACAGCGCTGTTGAAGATGCATGAATATGATAAA
CTTACACAACACTATGAGAACGCTATCAGAACGTTGAACGATGATGAACTGAGATTTGAG
TATCTGGAACTACTTATAAAGCTCAAACAGTACGACAAGGTCGACACAACGATCTCATCA
GAACTAAACCAACATTATAACAAGGAAAAAGACATCAATACATTGAGACGACGCGTTAGA
TTGCTTCTAATACAAGCGAGGAGTAGGGAAATGAAAAATCCAACAGCCGGGAACACCAAC
CTGATACTGGCAGAAGCCAGAGACCTACAAATGTCGGTATTGAAAAGAGTTGAAATAGAT
TCGAGGGCAGATATAGAAGAAGAAAAGAAAATGCTTTCGTCCATCCTCTGTTTATTGGCG
AAAGCGAGATCTGCGAAGGAACCAGCCGTGGCAACGAACTTATACTCGGAGGCTTTGATA
TATTCACCGCGGGACCCTGATACACTATTGGCGTTGGCGAAGTTATACGCACAGATGAAC
AATCCCGAGCGCTGTGAACAAACTTGCACGTTGCTATTGAACGCCGATCCTAACAACGAG
GCAGCGGCAGTCATGATGGCGGACCTCGCGTTCCGTAAGGCTGATCTGGAAACGGCTCAG
CGCCATCTAAGCCAGATATTATCAGTGAGGCCGTTGAGTTTCGATGCTTTGTCACAACTC
ATTGAGGTGCAATGGCGCAGAGGAATGCTAACAGCCGTCGAGTCAGCTATAGAGGACGCG
AGGAGAGAACTCTCTGGAAAGGAAGATCCTGGGTTCTCGTATTGCTGTGGTGTGTTTAGT
TCCTACAACGGCGCCGTGAACCAGGCTCTCCGTCAATTGAACGTGTCGCGGCGTTCTCGC
TGGTCGACTCTCGCCGCTAGACGGATGATACTGCTGTGTATGGAACAAGCCACTGAGACA
GACGGACAGAATGCCGACTCTGATACACATCTGCTAGCTCTTCGAACAGCGGAGAAGCTT
TTGAACGAAGTGTCGTCATCCGAGCGCAGGTCCCTATCAGCGTTATTACAGTTAGCAACG
AGGCAAAAGTCACAAGCGGAGAGAGTTTTGCAGGAACTCCTACCGTTAGCGACCGAAGAC
ATTCACCAAGATGACCCATACCTTATATTGGCCATTGCTAATGCGTATAACATTGTAAAA
CAACCAACTCGAGCAAAGAACATTCTGAAGAGAACTATATCTTCTATTCCCTGGACGCCT
GAGAGAGCGGATGGCTTGGAGAGATGTTGGTTGGAGGTGGTAGATAACCAGATAAGTTCA
GGGAGAATGGATGCCGCAAAAGAGATTTTAACAAAAATTCTCAATCACAACAAATCCTGT
GCTCAGGCTTATCAATATTTAGGTTTCTTGGCAGAAAAGGAACAAAATTACAAGAGCTCG
GCCGTCAACTATGACAACGCTTGGAAATATACAGGAAAAAATGACCTGGCCGTTGGATAC
AAACTAGCGTACGCCTACCTCAAGCTAAAGAAATACCCTGAATGTATAGTCGTGTGTAGA
CATATATTAAAAGTTCACCCAGACTACCCCAAGATAAAAAAAGAAATTCTAGAAAAAGCT
AAGACCAATTTGAGAACGTGA

Protein sequence:

MHMSVGNLKSSILDCFELALQKSDKNVEALIGLAKFKYLTKNYDASNLTLDKLIVNNPGQ
VVPLVEKMRNEFAAQKWDVVYDTIERVFSIDPVNIEALKIKIFIALCKNADYIEAVDELN
KFFSILEAEEATNGYQFYTTAQLFSKLCGRSSAVLSQAYRFAQYASELQPSNVDYLSEVG
YQCILQGKYKDALNFFRAASKVDSNSITALCGLTLCQMYDNGTTSQIAEQVELLYEMQGT
EKFTVLYLLSAQLNSKNAEKFLYNAIETQISQAECYPYSMVYLKKLDPDFLLQVYKELKK
LLPKKPFIVVGHLLYSQDGNNTLVANCFKLLTAITNSCPGLIPALYELAKLKFLFGYSNE
AKAFAQQIIDLDNTHAGSQVLLAQIYVQQSNFNKALQSLEMCLSYNFKIRDSAMYHFLNG
VIIKNMNQIQDALASFNTSLQIANNKNNITRQYDSDLNIIDKATLFFLIIELQTSLGQFA
EAGKTMQEAIQEISYTSEESRLQIARADLALSNGDVDNAIEILNEVKPGQKYYFQAHSKM
AHIYLKEKQDKAMFTSCFKDIVNNHPMVDAHTMMGDAFMSIHEPNQAAASYDVALKGNPK
DIRLIKKMGTALLKMHEYDKLTQHYENAIRTLNDDELRFEYLELLIKLKQYDKVDTTISS
ELNQHYNKEKDINTLRRRVRLLLIQARSREMKNPTAGNTNLILAEARDLQMSVLKRVEID
SRADIEEEKKMLSSILCLLAKARSAKEPAVATNLYSEALIYSPRDPDTLLALAKLYAQMN
NPERCEQTCTLLLNADPNNEAAAVMMADLAFRKADLETAQRHLSQILSVRPLSFDALSQL
IEVQWRRGMLTAVESAIEDARRELSGKEDPGFSYCCGVFSSYNGAVNQALRQLNVSRRSR
WSTLAARRMILLCMEQATETDGQNADSDTHLLALRTAEKLLNEVSSSERRSLSALLQLAT
RQKSQAERVLQELLPLATEDIHQDDPYLILAIANAYNIVKQPTRAKNILKRTISSIPWTP
ERADGLERCWLEVVDNQISSGRMDAAKEILTKILNHNKSCAQAYQYLGFLAEKEQNYKSS
AVNYDNAWKYTGKNDLAVGYKLAYAYLKLKKYPECIVVCRHILKVHPDYPKIKKEILEKA
KTNLRT