New model in OGS2.0 | DPOGS209892  |
---|---|
Genomic Position | scaffold82:- 42436-47080 |
See gene structure | |
CDS Length | 2130 |
Paired RNAseq reads   | 1686 |
Single RNAseq reads   | 4292 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | ND |
Best Drosophila hit   | cactin (2e-72) |
Best Human hit | chromosome 19 open reading frame 29 (6e-68) |
Best NR hit (blastp)   | PREDICTED: similar to cactin CG1676-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to cactin CG1676-PA [Tribolium castaneum] (8e-165) |
GeneOntology terms    | GO:0005515 protein binding GO:0005634 nucleus |
InterPro families    | IPR019134 Cactin protein, cactus-binding domain, C-terminal IPR018816 Cactin, domain |
Orthology group | MCL15428 |
Nucleotide sequence:
ATGGAAAATGGAGTTTGCAAATTGTCGTCACGACATATAAGTAAACACAGAGATATCTCT
CGCGGTCGTTCAGTTGAGCATCATCGAGATTGTGGAGGACGAAGTGAAGACAAAATCTCT
GAAAAAAGGTCAAGATCACCTAACAGAAAACGGAAGTCCGCCAGCAAAGATAGACGTAAG
TCACCACAAAAAAAACACACGTCTTCTAGTAGTGAAAGAAAGAAAGAATCAAAGAAGAAG
AAAAGTAAAAAGAAAAAAAAGTCACGTAATTCATCAAGTAGCAGCAGCAGCAGCAGCAGT
AGCAGTAGTGACAGTGACGAGGAAGAGTTAAAACTACTACAGAGACTTGAAGCAGAGAGG
TTGAGGCTGAAAGAAGAAAAAAGAAAGCAGAAAGAAATGATAAAAGCAAATGAAACACCC
GAGGAGAAAAGAGCTCGACGTCTAAAAGAAAAGCAAGAAAAGGAAAGGAAGAGACGTGAG
CGTATGGGGTGGGACAACGAGTATCAGTGTTACACCGATCAAGATAACCCCTTCGGAGAC
TCTGCTCTCACTGACACATTTGTATGGACAAAGAAACTGGCTAAGGAAGGTGTTAAGAAT
GTCTCTCACAACGAACTGGAGGCATTGAACAGGCAGAAACAATTAGAAAATAAAATTGAA
TTGGAGAAGGTGAAGCAGCGTCGACTGGAGCGCGAGGCTGAGCGTGCGGCACGCGAGGCC
GAGGCGGCGGCGGCGGCCCGGGCCCGGGAGGCCGCGCAGTTCAGCAGCTGGGCTCGACAC
GAAGACGAGTTCCATCTGCAGCAGGCCCGGCTGCGCTCGCAGATACGGATACGAGACGGA
AGAGCTAAGCCGATAGACCTCCTCGCGTGGTACGTGAGTTCCGAGCAGTGTGTCGATGCG
CTCGAAATGCACGAGCCGTACACGTACCTGAACGGCCTGCACGCACAGGACCTGGAGGAC
TTACTGGAGGATATCAAGGTGTACAAGGAGCTGGAGCAGGACGTGAACCAATCGTACTGG
GAGGACGTGCAGACTATCGTGTCGTCGGAGCTGGCGAAGCTGCGGCGCCTGGCTCCGGGC
CGGGACGGGGTGCACGCCGCCGTGGCCGAAGACGTGGCCGGCGTGTTCCGCGGGAAGAGC
ACGGCCGCCCTGCTTCAGTTGCAGGACGCCATCGAACACAAGATGGCCGCCAGGACCGCC
GGGATCGACGTGCACTACTGGGAGAGTCTGCTCAGCCAGCTCAAAGCTCACATGGCACGA
GCTCGTCTCCGAGACCGGCACCAGAACAACCTCCGCCGCAAGCTGCAGCTGCTGAAGAGG
GAACAAGGAGTCGCCGCGGACGAGCACGCGGAACACGAGGACAAACACACACACGGAGAG
GGCGCTGGTCCGGAGCAGAAGTCGCCTCGGACGGAGAGCGAGGCGGAGGAGGCGGAGGCG
GAGGGCGAGTCGTGGTGCGGAAGTTACTCCCCGCGGTACCTGGCGCCCGCCTCGCTGGAG
CCCGCCACGCTGCTGCTGGAGCCCCACGAGGACCGCCAGCGCCTCGCCTTCCTCCGAGCC
AGGCTGCATGCCGCCGCCGCCGCCGACCAGCACAAGGCCACGCTCGCTAAGCTTCCGGAG
GCAGCTGATGCAGTGCCGGGCACCAGCACGGGCGCTCTGGAGGCGGCCGCGAGGCGCTCC
ATGGAGGGAGGCAGTGAGGGCGGCGCCGCACAGTTCAGTGTGGAGCACGTGCTGCCCGAC
CAGCCTTGCTTGTGGGCGGACAAGTACAGACCCAGGAAACCAAGATACTTCAACAGAGTC
CACACCGGCTTCGAGTGGAACAAATACAACCAGACTCATTACGACATGGACAACCCTCCG
CCGAAGATCGTTCAAGGATACAAGTTCAACATCTTCTACCCGGACCTCATCGACAAGAGC
GCCACCCCTGAGTTCTCACTTAAGCCGTGTGCTGACAACCCTGAGTTTGCTGTGCTTCGT
TTCCATGCGGGCCCACCCTATGAAGACATCGCCTTCAAGATAGTGAACCGTGAGTGGGAG
TACTCCTACAAGAGAGGCTTCCGCTGTCACTTCCACAACAACATCTTCCAGTTGTGGTTC
CACTTCAAGAGATACAGATACAGGCGTTGA
Protein sequence:
MENGVCKLSSRHISKHRDISRGRSVEHHRDCGGRSEDKISEKRSRSPNRKRKSASKDRRK
SPQKKHTSSSSERKKESKKKKSKKKKKSRNSSSSSSSSSSSSSDSDEEELKLLQRLEAER
LRLKEEKRKQKEMIKANETPEEKRARRLKEKQEKERKRRERMGWDNEYQCYTDQDNPFGD
SALTDTFVWTKKLAKEGVKNVSHNELEALNRQKQLENKIELEKVKQRRLEREAERAAREA
EAAAAARAREAAQFSSWARHEDEFHLQQARLRSQIRIRDGRAKPIDLLAWYVSSEQCVDA
LEMHEPYTYLNGLHAQDLEDLLEDIKVYKELEQDVNQSYWEDVQTIVSSELAKLRRLAPG
RDGVHAAVAEDVAGVFRGKSTAALLQLQDAIEHKMAARTAGIDVHYWESLLSQLKAHMAR
ARLRDRHQNNLRRKLQLLKREQGVAADEHAEHEDKHTHGEGAGPEQKSPRTESEAEEAEA
EGESWCGSYSPRYLAPASLEPATLLLEPHEDRQRLAFLRARLHAAAAADQHKATLAKLPE
AADAVPGTSTGALEAAARRSMEGGSEGGAAQFSVEHVLPDQPCLWADKYRPRKPRYFNRV
HTGFEWNKYNQTHYDMDNPPPKIVQGYKFNIFYPDLIDKSATPEFSLKPCADNPEFAVLR
FHAGPPYEDIAFKIVNREWEYSYKRGFRCHFHNNIFQLWFHFKRYRYRR