New model in OGS2.0 | DPOGS210728  |
---|---|
Genomic Position | scaffold314:+ 9747-20868 |
See gene structure | |
CDS Length | 3609 |
Paired RNAseq reads   | 984 |
Single RNAseq reads   | 2443 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006258 (4e-89) |
Best Drosophila hit   | CG16908 (3e-26) |
Best Human hit | TEL2-interacting protein 1 homolog (1e-36) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC001200 [Tribolium castaneum] (3e-82) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC001200 [Tribolium castaneum] (4e-66) |
GeneOntology terms    | GO:0005488 binding GO:0003674 molecular_function GO:0005575 cellular_component GO:0008150 biological_process |
InterPro families    | IPR016024 Armadillo-type fold IPR011989 Armadillo-like helical |
Orthology group | MCL12695 |
Nucleotide sequence:
ATGAACGCACACTTGAAAGAGGCATTTACACGCATAAAGCCAATATGCGACATGGTGATG
GTAAATCCATCGCCGGAGCATATAACAGCGTTTGCAGCACTGGTAGGAGAGCTAAAGATG
GAAATAATCCAAGAGCTGCAGCAGTATATGCTGTTTCCGTTTATCACACACTTACAGTCC
AAAGAAATGGAAACAAAATATGAAATGCAAGGACTGTTAATAGACAGCATGCGGGAAGTG
CTACAGAGGGTGTGTGTAACCAGTTTTGAGATGTGCATGAAGATTGAAATGGGCCTTTTG
AGTCTCGTGTTTGAAAAACCAAAGCCGGGAATGATTGCGGATGTTCCCGAGGAACTCAAA
CTAAGTGTGATGCAGTGCTTGACGGTGTTGATGTTGCATATCGATCAGCCGACGAGGGTG
AAGATGCTGGAGAACCAAGTGCCATTGTTGGCTCAAGCTGTGTTTGTGTCGGTGCACCTC
GCAAAGTTAGAGAAACTAAGGAGCTTGAGGCTAGCTGCCATAACGTGTCTGTGTGCGCAC
ACGGGCTGCCACCCTCAGCAGACGGACGCGCGTGGTCTAGTTCCAGACCCCGCTATAGAG
ACTGCTGTCTTAGGGCTGCTAGCATGCATCCTGCCGGGCGTGCTGGCTGCGCTGCAGGAC
GTTGCCATGAGTGACAACCCTGGACATGCAGTTGTTGTTGCCGCACTGAATGCAACCCAT
CGCGTGCTGTGCCTCACGATGCACAACAAACACTTGGCGACCAAAGAAAACATCACGGCC
GATGACTTCGTGGCCATGTTAGCCGAAAAGGCCAAGCCCTCCAATGATGTAAGCAAGGAA
AGCATCAGAGATAGTCGGCTAGGTGCGATCATAGCGGCAAGAGATTATCACTCGCTAGCC
GAGTTCATACGATCTGGTAGGGACCAAACGGATCCTCCCAGGAGGGAGAGAAAGATTCCC
AAACAAAAAAAGAATAGAAATCAGAATGCCAACCAAAAAAACGTTAATTACATGGAAAGG
CTCGGCAGCTATGAAAGAACGCCGAATACTTCACAGAATGCAGCGAAGGCACCCCGCGAT
ATTCCCAAGCGGACATCCGAATGGTACACGATGGCCGGGGACAAACTGGCCATAGTGATC
AAGAGTCTCATACCGCTGGTCTCTCACGAACACTTCAAAGTGAGGAAGGAGCTCGCCATA
CTCTGCTATAGGATCATATCAGAGTGTAGTGCCACCATGCAGCCTTCTCTGCCGATGTCT
TTGGATGTACTGATCTCCCTCTGCCATGACTCCTACCAGGAGGTCTCGGACTACTGTGAT
GCAGCACTGAAGGCACAGTTTTCGAACCCCGAGAGAGAAACTATGGACAGTCTCTGTGAG
AACTTCTTCGCCACCATCAACTGCTTGCCGAGAATTATGAACAATATTGACGAGAACCGC
AAGCTGTCCGCCCTGAACCTGATAGCCGGCTACCTCACCATCCTGTGTGACGGCGGCCGC
CCCCAGCGCCTGACGAGTCTCTTGACAGCTAGCGACGGCTTCGACCGCGTGTGCGACGCG
CTAATCGCTACGGCTGACATGTACACCGACATGTCTCTCCTCGCCAGACCGGCCGGAGGC
GACATCACAGGGATGTCCTCGTGTGAGTTGTCCGGTCCGTGTCCCTGGCGCCGTCTCCGT
CACCTGTCTCCGGCTGGAAGTCAACAGCTGCAAGGCCTGCTGGCGTCGCTGGGCGCGGCG
GAGTGCGCGGAGCTACTGCTGGACAGATTTCTGGAACTCTTCCACGAGCGACGCTCCTGC
GACCTCATATACATCATCAACTATCTTGGATCAGGTCCAGATTCAAACCCGGATCTAGCT
CGTCGCATCATCAGCGTGTATATAACCGAGGACGTGTGGTACCAGCCGCTGGAGGTGCAG
AGCGGGGAGAAGCCGCTCTCAGCGGACGAGACCCTCGACGAGAGCATTTACAACCCACGG
TCCTGGACCAGGGACACCGTACCCGGTCTGTTCGAGGGCGCCATAGAGACTCGCTTCACC
GACATCAGCTCGACCCTACCGCGGGTCCGCCTGGAGCCCAACACGTGCGTGACCCTGGGA
CACGCGCGGAGGAACCTCACCAGGAGCTGCCTCCTCACTGAGGGGCTCGGCCTCATGGCG
CTGCGCCTGGGCAGGGACTACCAGCAGTACCTACTGAAGACACTGTGTCTCTTGTTGGAG
AGAGTCGGCAGCCGGTACCCTCCGCTCCGCTCATCGGGTCTCATCGCCCTGCAGCAAGTG
GCCGCGGCCACCGAGGCCAGTGACGTCACAGACCTCATCGGCAGGAACGCCGACTACTTC
ACCAGCCAGGTCACCGGCCGGCTTAAGAAGGCCTGGAACACGCAGTCGGCCCTACAGATA
CTGTCAGTGGTTATGGAGTACAGTGACGTCACTATACTGGACTACTTGTACGGGATAGTG
GAGGACGCCTGGAACACGCAGTCGGCCCTACAGATACTGTCAGTGGTTATGGAGTACAGT
GACGTCACTATACTGGACTACTTGTACGGGATAGTGGAGGACGTGTTAGTGCAGAGCTGC
GACCAGTACTACGAGAAGAATTTGTATTCATATTTACAAGTGTTCCTGACATTCATAAAC
TGCATACGGAAATGGTTTCTGATCCAAGACACGGTCAAGAGGAAGGAGAGTCACGGGCTG
GAGATAGACGTGCTGAAGGACGTCATAGAGTTCGCCAACAACAAGGAGGAAGTAGAGAGG
TTGTTGAACACGAAGGAGTTCGAGGAGGAGACGGGCAGGAGTGTGGAGGAGATGTACCAG
GAGGACCTGCAGCGGAAGGAGGAGGACCTCCTGGACTACGACGACACCGTGACGCAGGAG
GCCGTACCTCTGCCACAACACGTGCGGGTCACGATCACTATACTGAAGCGCTGCGTACAC
TTCGTGTCGTACAAGAGTCGAGACGAGGCGCTGGTGGCGATGGAGGCGCTGTGGCGAGGC
CTGGAGCTGCTGCGAGGACACGACGACGAGCTTCTGCCGCTGGTGCACGCGCTGTGGGAG
CCGTTGGCCGCCCGCCTGCAGGCCGAGCCCGTGCTGGCGCGAGCGGCGCTGCGGGTGTTG
GCGCTGGTGGCCGACCTGGCCGGGGACTTCGTCCGGGAGAGAGTCGTCAAGGACGTGCTC
CCCCGCGTGTGTTCGGTGCTCCGCTCGTGTTCTCGCCGCAGCGTGTTGGCGGACGCGGGG
TCGTCGTACCGCCTCACGGCGTCGTACTCCCTTCAGCGCTCCGCCCTCGAAGCGCTGGGT
CCGCTCGCCGCTAACGTGGGGCTCCGGGGCATCGCGCTCCTCGACGCCCTGAAGGCCGGG
GCGCTCTACCTTCACAAAAACCAACCCAAACCCCTACAGCTGTTGGCGGTGAAGTTCTTC
AAGGACATGCTGGAGTACGACTACGGCAGCTCGTGGCAGTTCCTCCGCCGCCTCGCCAAC
AACAAGCAGCCGCTGACGCCGCCCGCCAACCGGTTCCTGCACCTGGAGCCCGTCGTGGGC
TCGCCCTACGAGTGTACGGACCCGCACTACGACAACAACATCAAGCTCATCTTTTACGTA
CACAAGTGA
Protein sequence:
MNAHLKEAFTRIKPICDMVMVNPSPEHITAFAALVGELKMEIIQELQQYMLFPFITHLQS
KEMETKYEMQGLLIDSMREVLQRVCVTSFEMCMKIEMGLLSLVFEKPKPGMIADVPEELK
LSVMQCLTVLMLHIDQPTRVKMLENQVPLLAQAVFVSVHLAKLEKLRSLRLAAITCLCAH
TGCHPQQTDARGLVPDPAIETAVLGLLACILPGVLAALQDVAMSDNPGHAVVVAALNATH
RVLCLTMHNKHLATKENITADDFVAMLAEKAKPSNDVSKESIRDSRLGAIIAARDYHSLA
EFIRSGRDQTDPPRRERKIPKQKKNRNQNANQKNVNYMERLGSYERTPNTSQNAAKAPRD
IPKRTSEWYTMAGDKLAIVIKSLIPLVSHEHFKVRKELAILCYRIISECSATMQPSLPMS
LDVLISLCHDSYQEVSDYCDAALKAQFSNPERETMDSLCENFFATINCLPRIMNNIDENR
KLSALNLIAGYLTILCDGGRPQRLTSLLTASDGFDRVCDALIATADMYTDMSLLARPAGG
DITGMSSCELSGPCPWRRLRHLSPAGSQQLQGLLASLGAAECAELLLDRFLELFHERRSC
DLIYIINYLGSGPDSNPDLARRIISVYITEDVWYQPLEVQSGEKPLSADETLDESIYNPR
SWTRDTVPGLFEGAIETRFTDISSTLPRVRLEPNTCVTLGHARRNLTRSCLLTEGLGLMA
LRLGRDYQQYLLKTLCLLLERVGSRYPPLRSSGLIALQQVAAATEASDVTDLIGRNADYF
TSQVTGRLKKAWNTQSALQILSVVMEYSDVTILDYLYGIVEDAWNTQSALQILSVVMEYS
DVTILDYLYGIVEDVLVQSCDQYYEKNLYSYLQVFLTFINCIRKWFLIQDTVKRKESHGL
EIDVLKDVIEFANNKEEVERLLNTKEFEEETGRSVEEMYQEDLQRKEEDLLDYDDTVTQE
AVPLPQHVRVTITILKRCVHFVSYKSRDEALVAMEALWRGLELLRGHDDELLPLVHALWE
PLAARLQAEPVLARAALRVLALVADLAGDFVRERVVKDVLPRVCSVLRSCSRRSVLADAG
SSYRLTASYSLQRSALEALGPLAANVGLRGIALLDALKAGALYLHKNQPKPLQLLAVKFF
KDMLEYDYGSSWQFLRRLANNKQPLTPPANRFLHLEPVVGSPYECTDPHYDNNIKLIFYV
HK