DPGLEAN16982 in OGS1.0

New model in OGS2.0  DPOGS214908
Genomic Position  scaffold473:- 5157-9056
CDS Length  1332
Paired RNAseq reads  848
Single RNAseq reads  2134
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013889 (9e-27)
Best Drosophila hit  GDI interacting protein 3, isoform C (3e-84)
Best Human hit  UBX domain-containing protein 6 isoform 1 (5e-61)
Best NR hit (blastp)  PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum] (6e-131)
Best NR hit (blastx)  PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum] (4e-131)
GeneOntology terms  GO:0005515 protein binding
InterPro families
IPR006567 PUG domain
IPR001012 UBX
IPR018997 PUB domain
Orthology group  MCL13334

Nucleotide sequence:

ATGGCTGATAAAATAAAGAAGTTTTTTCAAAAGAAAAAAATTGATGCTAAGTTCAAGTTA
GCTGGACCCGGGCATAAATTAACTGAATCATCTCAATCAAGCCAGTCTTCTTTTTATAAA
AAAGAAGTTCCTACAGTAAAAAGATCAGGGCTGTCAGAGGAAAGTAAAGTAGCAGCCGAT
GCTGCATTGGCAAGATTACAACAAAAAAGAGACAACCCTTCCTTTAACACATCTTTGGCT
GCTATAAAGGCTCAAGTGAAGAAAGAATTGGAGAATGAAGTAGCATCTTCTTCAAAAGAA
CCAATTCAAGTGAAAGAAACTACTGAAGGAAATGTAGACATTCCTAAAAACTTGGCCGCG
TCTGGTGTATACTTTAAATGTCCTATAATAAGCAATGATATTCTGTCTCGGGATGAATGG
AAGAAAAATATTAAGACTTTTTTGTATGAACAATTAGAAGAAGAAAGAGGTCTTACTGCA
TGTCTTATAATACAGTCCTGTAATAGCAATAGAGAGAAGGTTGATATATGCGTGGAAACT
CTATGCAAGTATTTAGAAAATATTGTGACACATCCCGATGATGAAAAGTATCAGAAGATT
CGAATGAGCAACAGAGCATTTTGCGAAAGAGTCCAACCCATTGAAGGCTCGATGGAATTA
TTATTGGCAGCGGGTTTCATGCAAGAAAAACTTTTGAATAATGAAGGCAATGAAGAAGAT
TTTTTAGTTTTTAAAAAGGAAAATATTCCTTCAGTTGAAAGCTTGACTATGTTGATAGAT
GCTCTACGTACATCGGAACCGATTCCATTGGAACTTGACAGGAATCTCCAAGTATTGTTA
CCTTCTCAAGCAGCCAATAAAGTGCAATTACCGAGTTCATTCTACGCGCTTAGTCCAGAA
GAAATTAAGAGAGAACAACAATTGAGAACCGAAGCCATGGAAAGAAGTCAAATGCTACGA
ACTAAAGCGATGAGGGAAAAGGACGAATTACGTGAAATGAGAAAATATAAATTTGCGATT
ATAAGAGTGCGTTTCCCTGACGGAATATTGTTGCAAGGCACATTTTCGGTGTACGAGCGT
TATAGTGAAATACATGAATTCGTTCAAGAAAATTTGGAACACAACGGCCTTCCGTTTATA
CTGAACACTCCAACCGGCCACAAGATAATATATGAAGAAGATGCGAATAAAACTCTTATA
GATCTAAGACTTGTACCAACAACAATGCTCACATTCGCCTGGCACAGTTCAGTCATAGAC
GAAATCAATAACAGCCCTAATAAGGACGTTTATTTGAAACCGGAAGTCATGGTCCTCGTA
CAAGAAATTTGA

Protein sequence:

MADKIKKFFQKKKIDAKFKLAGPGHKLTESSQSSQSSFYKKEVPTVKRSGLSEESKVAAD
AALARLQQKRDNPSFNTSLAAIKAQVKKELENEVASSSKEPIQVKETTEGNVDIPKNLAA
SGVYFKCPIISNDILSRDEWKKNIKTFLYEQLEEERGLTACLIIQSCNSNREKVDICVET
LCKYLENIVTHPDDEKYQKIRMSNRAFCERVQPIEGSMELLLAAGFMQEKLLNNEGNEED
FLVFKKENIPSVESLTMLIDALRTSEPIPLELDRNLQVLLPSQAANKVQLPSSFYALSPE
EIKREQQLRTEAMERSQMLRTKAMREKDELREMRKYKFAIIRVRFPDGILLQGTFSVYER
YSEIHEFVQENLEHNGLPFILNTPTGHKIIYEEDANKTLIDLRLVPTTMLTFAWHSSVID
EINNSPNKDVYLKPEVMVLVQEI
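
The record's fields can be cross-checked against each other: a CDS Length of 1332 nt corresponds to 444 codons, that is, 443 residues plus one stop codon, which matches the length of the protein sequence listed above. The following is a minimal sketch of that check in Python, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE`, `translate`, `first_line` and `last_line` are hypothetical helpers introduced here for illustration and are not part of the annotation. Only the first and last lines of the nucleotide sequence are translated, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table is appropriate for the nuclear gene in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCTGATAAAATAAAGAAGTTTTTTCAAAAGAAAAAAATTGATGCTAAGTTCAAGTTA"
last_line = "CAAGAAATTTGA"

print(translate(first_line))  # first 20 residues of the protein
print(translate(last_line))   # last 3 residues; TGA is the stop codon
print(1332 // 3 - 1)          # residues expected from the CDS Length field
```

Pasting the full nucleotide sequence into `translate` in place of `first_line` gives the same kind of check for the whole protein. Every 60-nt line starts in frame because 60 is a multiple of 3, which is why the last line can be translated on its own.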