New model in OGS2.0 | DPOGS214908  |
---|---|
Genomic Position | scaffold473:5157-9056 (- strand) |
CDS Length | 1332 |
Paired RNAseq reads   | 848 |
Single RNAseq reads   | 2134 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013889 (9e-27) |
Best Drosophila hit   | GDI interacting protein 3, isoform C (3e-84) |
Best Human hit | UBX domain-containing protein 6 isoform 1 (5e-61) |
Best NR hit (blastp)   | PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum] (6e-131) |
Best NR hit (blastx)   | PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum] (4e-131) |
GeneOntology terms   | GO:0005515 protein binding |
InterPro families | IPR006567 PUG domain; IPR001012 UBX; IPR018997 PUB domain |
Orthology group | MCL13334 |
Nucleotide sequence:
ATGGCTGATAAAATAAAGAAGTTTTTTCAAAAGAAAAAAATTGATGCTAAGTTCAAGTTA
GCTGGACCCGGGCATAAATTAACTGAATCATCTCAATCAAGCCAGTCTTCTTTTTATAAA
AAAGAAGTTCCTACAGTAAAAAGATCAGGGCTGTCAGAGGAAAGTAAAGTAGCAGCCGAT
GCTGCATTGGCAAGATTACAACAAAAAAGAGACAACCCTTCCTTTAACACATCTTTGGCT
GCTATAAAGGCTCAAGTGAAGAAAGAATTGGAGAATGAAGTAGCATCTTCTTCAAAAGAA
CCAATTCAAGTGAAAGAAACTACTGAAGGAAATGTAGACATTCCTAAAAACTTGGCCGCG
TCTGGTGTATACTTTAAATGTCCTATAATAAGCAATGATATTCTGTCTCGGGATGAATGG
AAGAAAAATATTAAGACTTTTTTGTATGAACAATTAGAAGAAGAAAGAGGTCTTACTGCA
TGTCTTATAATACAGTCCTGTAATAGCAATAGAGAGAAGGTTGATATATGCGTGGAAACT
CTATGCAAGTATTTAGAAAATATTGTGACACATCCCGATGATGAAAAGTATCAGAAGATT
CGAATGAGCAACAGAGCATTTTGCGAAAGAGTCCAACCCATTGAAGGCTCGATGGAATTA
TTATTGGCAGCGGGTTTCATGCAAGAAAAACTTTTGAATAATGAAGGCAATGAAGAAGAT
TTTTTAGTTTTTAAAAAGGAAAATATTCCTTCAGTTGAAAGCTTGACTATGTTGATAGAT
GCTCTACGTACATCGGAACCGATTCCATTGGAACTTGACAGGAATCTCCAAGTATTGTTA
CCTTCTCAAGCAGCCAATAAAGTGCAATTACCGAGTTCATTCTACGCGCTTAGTCCAGAA
GAAATTAAGAGAGAACAACAATTGAGAACCGAAGCCATGGAAAGAAGTCAAATGCTACGA
ACTAAAGCGATGAGGGAAAAGGACGAATTACGTGAAATGAGAAAATATAAATTTGCGATT
ATAAGAGTGCGTTTCCCTGACGGAATATTGTTGCAAGGCACATTTTCGGTGTACGAGCGT
TATAGTGAAATACATGAATTCGTTCAAGAAAATTTGGAACACAACGGCCTTCCGTTTATA
CTGAACACTCCAACCGGCCACAAGATAATATATGAAGAAGATGCGAATAAAACTCTTATA
GATCTAAGACTTGTACCAACAACAATGCTCACATTCGCCTGGCACAGTTCAGTCATAGAC
GAAATCAATAACAGCCCTAATAAGGACGTTTATTTGAAACCGGAAGTCATGGTCCTCGTA
CAAGAAATTTGA
Protein sequence:
MADKIKKFFQKKKIDAKFKLAGPGHKLTESSQSSQSSFYKKEVPTVKRSGLSEESKVAAD
AALARLQQKRDNPSFNTSLAAIKAQVKKELENEVASSSKEPIQVKETTEGNVDIPKNLAA
SGVYFKCPIISNDILSRDEWKKNIKTFLYEQLEEERGLTACLIIQSCNSNREKVDICVET
LCKYLENIVTHPDDEKYQKIRMSNRAFCERVQPIEGSMELLLAAGFMQEKLLNNEGNEED
FLVFKKENIPSVESLTMLIDALRTSEPIPLELDRNLQVLLPSQAANKVQLPSSFYALSPE
EIKREQQLRTEAMERSQMLRTKAMREKDELREMRKYKFAIIRVRFPDGILLQGTFSVYER
YSEIHEFVQENLEHNGLPFILNTPTGHKIIYEEDANKTLIDLRLVPTTMLTFAWHSSVID
EINNSPNKDVYLKPEVMVLVQEI
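
The nucleotide and protein sequences above can be cross-checked against each other: a CDS of 1332 nt corresponds to 444 codons, that is, 443 amino acids plus a terminal stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code; the helper name `translate` is illustrative and not part of any annotation pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    # Drop whitespace and line breaks so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and the final line of the nucleotide sequence above.
print(translate("ATGGCTGATAAAATAAAGAAG"))  # MADKIKK
print(translate("CAAGAAATTTGA"))           # QEI*
```

Passing the full nucleotide sequence to `translate` and dropping the trailing `*` should give a string that can be compared directly with the protein sequence listed above.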