DPGLEAN21307 in OGS1.0

New model in OGS2.0DPOGS214055 
Genomic Positionscaffold421:- 68016-70130
See gene structure
CDS Length1344
Paired RNAseq reads  499
Single RNAseq reads  1354
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010388 (0.0)
Best Drosophila hit  CG7275 (1e-168)
Best Human hitDDB1- and CUL4-associated factor 13 (1e-149)
Best NR hit (blastp)  PREDICTED: similar to GA20229-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to GA20229-PA [Tribolium castaneum] (0.0)
GeneOntology terms  GO:0006911 phagocytosis, engulfment
InterPro families






  
IPR011046 WD40 repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR015943 WD40/YVTN repeat-like-containing domain
IPR001680 WD40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR007287 Sof1-like protein
IPR019781 WD40 repeat, subgroup
Orthology groupMCL13348

Nucleotide sequence:

ATGTCCAACTTGAAAATAAAAGTAATAAGTCGTAACCCAGAAGATTATCTGCGTTCCACT
AAAAGAGATATTCATAAAATTCCTAGGAATTATGACCCGAGTCTGCATCCCCTTGAGGGT
CCACGGGAGTATGTCAGAGCATTAAATGCGGTAAAACTAGAAAGAGTATTTGCGAAGCCT
TTTCTTGGCAGTCTTGATGGACACTCGGATGGTGTATCTAGTTTGGGAAAGCATCCTAGC
CGATTATCCGCTTTGGCTAGCGGAGCTTTTGATGGAGAAATTAGGATATGGGACTTAACT
AGTCGTAAATGTACCAGAAATTTCATTGCACATGAAGGTTGGGTTCGTGCTATTTGCTAC
ACACCAAACGGTCAACAGTTTATGAGTGTTGGTGATGATAAAACAATTAAAACCTGGAAA
GCTGATATTCAAGACCCTGATGACGAAGATCCTGTTAATACACTTCTCAGCATGTCAGTG
GTATCTGGTATTAGCCATCATAGAGCAAAACCAATATTTGCTACTTGCGGTGAACATTGT
CAGTTGTGGGAAAATACTAGGAGTGAACCTGTCAAAGTATTTCAATGGGGAGTAGATAGC
CTGCATCATGTTGCATTTAATCAGGTAGAAACAAATCTGTTAGCAGCATGTGCGAGTGAT
AGGAGCGTTATACTTTATGACTTCCGTGAGTCAGGACCTCTTAGGAAAGTAGTGATGGAA
CTGAGATCTAATGCACTATCTTGGAATCCCATGGAGGCATATATATTTACTGTAGCTAAT
GAAGACTATAACCTGTACACATTTGATATCAGAAAACTGAGACAACCAGTGAATGTTCAT
GTTGACCACACATCTGCGGTGATCGATGTGGATTATGCACCGACTGGGAGAGAATTTGTC
GCTGGTAGCTATGATAAGACTGTTAGGATATTCGAGAGCCTTAAAGGACACTCCAGAGAT
GTGTATCATACGAAGAGAATGCAGAGATTGACATGTGTTAAGTGGACATTGGATAATAAA
TATATTTTGACTGGATCAGATGAAATGAATATAAGAATGTGGAAGGCTAGAGCTTCGGAG
AAACTTGGTGTTCTCAAACCTCGAGAACGTACAGCTCTTAATTATTCGGAAGCTTTGAAG
GAGAAATTCAGTGGTCATCCACAGATCAAACGTATAGCTCGTCACAGGCACGTGCCGAAA
CACATATTGAACGCTCAGAAAGAACTTCGTACTATCAAAGAGAAGAGCAAACGTAAAGAG
GGCAACAAGCGCTCCCACAGCAAACCTGGAGCTGTGCCATTTGTACCTGAACGTAAAAAG
CATGTCGTTAAAGAAGATGAGTGA

Protein sequence:

MSNLKIKVISRNPEDYLRSTKRDIHKIPRNYDPSLHPLEGPREYVRALNAVKLERVFAKP
FLGSLDGHSDGVSSLGKHPSRLSALASGAFDGEIRIWDLTSRKCTRNFIAHEGWVRAICY
TPNGQQFMSVGDDKTIKTWKADIQDPDDEDPVNTLLSMSVVSGISHHRAKPIFATCGEHC
QLWENTRSEPVKVFQWGVDSLHHVAFNQVETNLLAACASDRSVILYDFRESGPLRKVVME
LRSNALSWNPMEAYIFTVANEDYNLYTFDIRKLRQPVNVHVDHTSAVIDVDYAPTGREFV
AGSYDKTVRIFESLKGHSRDVYHTKRMQRLTCVKWTLDNKYILTGSDEMNIRMWKARASE
KLGVLKPRERTALNYSEALKEKFSGHPQIKRIARHRHVPKHILNAQKELRTIKEKSKRKE
GNKRSHSKPGAVPFVPERKKHVVKEDE