DPGLEAN02059 in OGS1.0

New model in OGS2.0  DPOGS214229
Genomic Position  scaffold323:- 1173-7553
CDS Length  1809
Paired RNAseq reads  113
Single RNAseq reads  455
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005954 (3e-83)
Best Drosophila hit  CG33966 (3e-71)
Best Human hit  alpha-tocopherol transfer protein-like (3e-24)
Best NR hit (blastp)  AGAP012165-PA [Anopheles gambiae str. PEST] (1e-89)
Best NR hit (blastx)  AGAP012165-PA [Anopheles gambiae str. PEST] (7e-87)
GeneOntology terms  GO:0008431 vitamin E binding
InterPro families  IPR001251 Cellular retinaldehyde-binding/triple function, C-terminal
                   IPR001071 Cellular retinaldehyde binding/alpha-tocopherol transport
                   IPR011074 Phosphatidylinositol transfer protein-like, N-terminal
Orthology group  MCL18138

Nucleotide sequence:

ATGTCTCTACGACAATTGTGTCCCGAATTAGCCGAAAAGGCTAAGTTGGAATTAAATGAA
GATCCAAAAACTATTGAGGGCGACATACAGCATATAAAGGACTGGTTGGCTAAGCAACCA
CACCTGAAAGTAAGAACAGATGACCAATGGTTGCTCGCTTTTATTAGAGGATGTAAGCAC
AGTCTCGAAAGGACTAAAGAGAAGTTGGATCTGTTTTACACATTACGAACAGTAGCACCG
GAGATTTACAAAGTGAAACATAATGAACCTCTATTCAATACAATCATGGATCTGGGGAGT
TACTTGATATTGCCAAAGTTGGAAAAGCCGGATTCACCTCGGATTGCTCTTATTCGGCCT
GGAATGTACAATCCAGATAAATTTTCTTTTTTCGATATATTCTCTTGTGGTGCTGTATTT
CAAAATATTCTGATGTACGAAGATGATGCCATAGTTATATCTGGGCTTACAACCCTTATA
GACTTAGAGAGTGTAACAATGGGTCATTTGTTGCAACTCACACCGAGTGTTATGAAAAAG
ATGGTCGTTTACACCCAGGATGCTCTTCCAATCCGCATGAAAGGCGTTCACTATATTAAC
ACTCCTCCAGGCTTCGAAACGGTATTCAACGCAATTAAGTCGTTGCTTAATGAGAAGAAT
AGAAACAGGTTGTATGTACACAACAAAAATTATAATGAATTATACAAACACATCTCCCAG
GAGGTTTTACCAGCGGAATATGGAGGAAAAGGTGGCAGCATACAGGAAATTAAGGGATAT
TGGAAGAATAAAATAGACGCATGCAGTTCATATTTGGAAGAAGATCTTAAGAATGGAACT
GATGAATCAAAACGTCCCGGAAAACCAAACACTTCTGAAAACCTATTCGGTCTAGAAGGA
TCTTTCCAGTTAGCCAAAAAGGCACAGGAGGAGTTAAATGAAGATCCGAAAAATATTCAA
CGTGACCTACAATATATTAAGGATTGGTTATCCAAGCAACCTCATTTAAAGGCTAGACTA
GATGATCAGTGGCTTGTCGCTTTTTTAAGAGGATGCAAGTACAGTCTAGAGCGCACGAAA
GAAAAAATAGACCTATATTATTCTATGAGGTCGTTGGCACCAGAACTATTTAGGGTGAAG
GCTACTGATTCTGTTTTTGATGAATTAATCAGTTTGGGGACTTACCTGATACTGCCGAAA
ACCGCTACCCCTGATTCACCGAGGATTATCATAATTCGAGCTGGTTGTTATGATCCCGCT
AAATACAACTTTATTGACATATTCTCTGCTACTGCACACATACAGAAGATTCTCATTTTC
GAAGATGACGCAATTGTTGTATCTGGTTTTAAAACAATTATGGACATGGAAGGCATCACT
CTCGCACACTTATTGCAAATCACGCCCAGCGTTATGAAGAAGATGGCTGTTCTTTCACAG
GACGCCTGGCCGCTACGTATGAAAGGAGCACATTACATTAATACACCGTCATGGTTTGAT
AATTTTTTTAACATGGTTAAAAATTTGTTAAATGAAAAAAATAGACAGCGTCTTTACGTA
CATAATAAAAATTTCGAAGAACTATACAAACATATTCCTCAGGAAATATTACCAAATGAA
TATGGTGGAAATGGTGGTAATATTAAGGAGATTTCAGAATATTGGAAGGCTAAGGTACAA
GAGTATAGCTCGTGGTTAGAAGATGATTTAAAATACGGTTCGGACGAATCAAAGCGAGTG
GGAAACCCAAGGACGGCTGAGACATTGTTTGGGGTCGAGGGTTCTTTCAGACAACTGGAG
TTTGATTAA

Protein sequence:

MSLRQLCPELAEKAKLELNEDPKTIEGDIQHIKDWLAKQPHLKVRTDDQWLLAFIRGCKH
SLERTKEKLDLFYTLRTVAPEIYKVKHNEPLFNTIMDLGSYLILPKLEKPDSPRIALIRP
GMYNPDKFSFFDIFSCGAVFQNILMYEDDAIVISGLTTLIDLESVTMGHLLQLTPSVMKK
MVVYTQDALPIRMKGVHYINTPPGFETVFNAIKSLLNEKNRNRLYVHNKNYNELYKHISQ
EVLPAEYGGKGGSIQEIKGYWKNKIDACSSYLEEDLKNGTDESKRPGKPNTSENLFGLEG
SFQLAKKAQEELNEDPKNIQRDLQYIKDWLSKQPHLKARLDDQWLVAFLRGCKYSLERTK
EKIDLYYSMRSLAPELFRVKATDSVFDELISLGTYLILPKTATPDSPRIIIIRAGCYDPA
KYNFIDIFSATAHIQKILIFEDDAIVVSGFKTIMDMEGITLAHLLQITPSVMKKMAVLSQ
DAWPLRMKGAHYINTPSWFDNFFNMVKNLLNEKNRQRLYVHNKNFEELYKHIPQEILPNE
YGGNGGNIKEISEYWKAKVQEYSSWLEDDLKYGSDESKRVGNPRTAETLFGVEGSFRQLE
FD