New model in OGS2.0 | DPOGS212766  |
---|---|
Genomic Position | scaffold3:+ 489774-497327 |
See gene structure | |
CDS Length | 2691 |
Paired RNAseq reads   | 244 |
Single RNAseq reads   | 619 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013138 (2e-08) |
Best Drosophila hit   | CG32668 (6e-21) |
Best Human hit | armadillo repeat-containing protein 2 (8e-21) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC012901 [Tribolium castaneum] (9e-66) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC012901 [Tribolium castaneum] (3e-42) |
GeneOntology terms   | GO:0005488 binding |
InterPro families    | IPR011989 Armadillo-like helical IPR016024 Armadillo-type fold |
Orthology group | MCL17418 |
Nucleotide sequence:
ATGAGTGGTGCGAGCGGGTTAGGCCTTCGCCCTTTACGCACACGCCGGCCGTTTACACCT
CGGGAGCCCCAGAGGACTCTACTGTCGGACACGAGACGAGTTGACATACGTCCCACCAGC
GGCTTTGATCTGAAGTATCAAACGGTTCAAGAGAGTAGCGAGGATGCGTTTATGAATTAC
CAGGTATCGGAACAAGAGCACGTCTCAAACGGTGTTCCGTTCGGAGAAAATCAGCAGCAA
AGAAAGAAAACGTTGAAGTCTACGAAATCTATTAAAGGCGCCGACGCTTGGAGCGGTTTT
CCTAAGCTACCTCACCTCAGCGGAAAAAGCAAGCCGTTACATAGAAGGAATACAATAGGA
CAAAACGATGCATCCGATTCAAGTAAGGAGCTGGCTCAATCTATATCAGTTACACGTCCT
TTATCTGTTAGTAACGGTCCACTCTCGTATCTCAGATCGTTTCACGAAAAAAGTCAATTT
TGCAGTACAGATAGCTCTAGCAGAAGTAAAACTTTGGGTGAGAAGTCGGTATCTTACGAC
GAGGGTACCCTGGGCGAGGTGTCCGTGAGACACCTCGATGTACAGCTTCCTGTCACCAGC
GAGGATTGTCACAACATGACAGCCTTAGAGATATCAGAGGCGCTGACCCAGAAGAATCAG
AGCGTTGATCGCGTGCTGTTTCTACTGGACGCTCTCCAAAAGACCGTCGAGGAGACCAGC
CCCGGGGACAGTCTCCGCGAGCTGGTGCTCCGAGCACTGCTCTCTCGAACTCGCGATGAT
AGTGAGAGGGTCCTCGTCAAGGTCGCCAGAGTCATGCTGACGATGCGTGTCACTGGAGCG
TATTTGACTGCTGCCAGCAAACTTGTCTTCAAAATTGCCAGCAACGATAAGAATGATAGT
GTCTTCAAGAATGGGAACCTTCTCGAACTGATCGTGGAGTCGTGTGCACGCGCGTGTCCC
CTGTCCGAGAGCTCGAGTGTGTTGCACGCAATGGGTGCTCTGCGAGTGTTGGCCCTGGAG
CCTTCCCTGGCGGCCCGCAGCCGGACGGCCGGGGCTCTACACCTCGCCGTCCTGCACCTC
AAAATCATTAATAACGCAAAAGCTGAACGTCCGAGGCAAGTGACGGAGGAGACGACGCAC
GCGCTGTACCAGTTGACTGGGGCTCTGAGGAACCTCGCTGGCAGCGGGTGGCAGGAGAGG
GAGGGGGAGGAGGAGCGGAGCGGGGAGGGGGACGCGGGGAGGGAGTTCGCCAGCAGCGGA
GCCATAGGAGAGCTCATCAACGCTTTGACATTACATACAGATCGAGATGTACTCACTAAC
GTTGCGAGGTGTCTAAGCGTGTTGTCATCATGGCAGTCTTGTTTGAACGCCTTGTGTTCC
TGTCCGGGAGCGTCTCGTGCCTTGCTAACAGCGCTGGGAGCGTGCGCCGCGAGGGCAGCG
CTAGCTGTGAGACTCGCCTACACGTTGGGCAACATGGCCGCCCACTGCGATCAGGCTAGG
ATAGACATATACAGCGAGAAGGGCAGCATCGATGTACTGTTGACTATACTGGAGTCGTAC
ACGCAACGTAACGACAACGACACGAGAGACCACGACAACGATCCGGACTTACATCTAATA
GGTTCCGATCTCGGCGGATCGGACGGCTCAAACGAGGACGTCCTCATAAAGACAGTTCGA
GTGGTCGCGAATCTCTGTCTGACGGAGGAGACAGGTCGCGGGCTGGTGGAACACGCCGAT
AGAACTGTGAGGGCGATGCTGAGCTGTCTGGAGGTGGCAGCCAGGGTTGGGGGGAAGGAA
CTAACAGATGCAGAGCGGAGGTCCAGCGAGGAACGTCGCGAGGAACTGGCGACGGCCGCA
CTGGCCACCATCAACAACATCACCTTCTACTTTGAGCCGACAGACTCCACACACTTTGAT
ACCTTGGACCACTTGGTTAAAGTAACATGTGGCTGGTTGTCTCACGGTGGGCTTCCAACA
CACGAGGCGGTTCGAGCGCTCGGTAACCTCACCCGGTGTGATCGCGCCGCTCGTGCCGCC
GTCCTGTACGGGGCCTTAGACGCTTTACCGCCTTTACACGCTCACGACGACGAGGAGGTT
CGCAGCGCGTCCGCTGGTGTTTTGGTGAACGTGTGCGGTGTGAGCGTGGGCGGCGGAGTG
CAGGAAGCCGGGGGCGCGGCCGCTAGGGCTCTGGCCGCGGCCGCGAGGATGAAGGACGTC
CGCTCGGGGGCGCTGCTGGCGCGAGCCGTGTGGAACGCTCTCGAACAACGGCCGTTGGAC
CTCCACAACGCTAGGATGGCGGCCGCGGCGCTAGCGACCTTCATAGAAGACGAGTCATTG
TTCGCTATGTGTGAAGCCGCAAAGTGCGAGGAGCGACGCGCAAGCGACCCAGATATAATG
AAAAACCACAGTGTAAAATTGGGTTTGGAAGGTCGAGGCTACCACCGGGAGCATGTTGAA
TCGAAATTTTCGTTGAGCGTGGAAGAAGACTTGCACCTGGAAGAGGAGTTTGAAGAAGAA
GGGGAAAGATGGTCAGGCTCGGACCTGGGTTTCGAGGAGGGCAGTCCGGAGCCCTGCTCG
TGTGGACGCTGCGCCCGCGGCAGCTCATGGAGGGCCCTCGCCGACGTGGCGCTGCCGTTG
CTGCAAAGACTGCTGCCAGCACGACGAGATGCAGCAGTCGGCACAGATTAA
Protein sequence:
MSGASGLGLRPLRTRRPFTPREPQRTLLSDTRRVDIRPTSGFDLKYQTVQESSEDAFMNY
QVSEQEHVSNGVPFGENQQQRKKTLKSTKSIKGADAWSGFPKLPHLSGKSKPLHRRNTIG
QNDASDSSKELAQSISVTRPLSVSNGPLSYLRSFHEKSQFCSTDSSSRSKTLGEKSVSYD
EGTLGEVSVRHLDVQLPVTSEDCHNMTALEISEALTQKNQSVDRVLFLLDALQKTVEETS
PGDSLRELVLRALLSRTRDDSERVLVKVARVMLTMRVTGAYLTAASKLVFKIASNDKNDS
VFKNGNLLELIVESCARACPLSESSSVLHAMGALRVLALEPSLAARSRTAGALHLAVLHL
KIINNAKAERPRQVTEETTHALYQLTGALRNLAGSGWQEREGEEERSGEGDAGREFASSG
AIGELINALTLHTDRDVLTNVARCLSVLSSWQSCLNALCSCPGASRALLTALGACAARAA
LAVRLAYTLGNMAAHCDQARIDIYSEKGSIDVLLTILESYTQRNDNDTRDHDNDPDLHLI
GSDLGGSDGSNEDVLIKTVRVVANLCLTEETGRGLVEHADRTVRAMLSCLEVAARVGGKE
LTDAERRSSEERREELATAALATINNITFYFEPTDSTHFDTLDHLVKVTCGWLSHGGLPT
HEAVRALGNLTRCDRAARAAVLYGALDALPPLHAHDDEEVRSASAGVLVNVCGVSVGGGV
QEAGGAAARALAAAARMKDVRSGALLARAVWNALEQRPLDLHNARMAAAALATFIEDESL
FAMCEAAKCEERRASDPDIMKNHSVKLGLEGRGYHREHVESKFSLSVEEDLHLEEEFEEE
GERWSGSDLGFEEGSPEPCSCGRCARGSSWRALADVALPLLQRLLPARRDAAVGTD