DPGLEAN22088 in OGS1.0

New model in OGS2.0: DPOGS208973
Genomic Position: scaffold1599:- 29590-41160
CDS Length: 3087
Paired RNAseq reads: 5141
Single RNAseq reads: 11383
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA012538 (4e-35)
Best Drosophila hit: draper, isoform B (0.0)
Best Human hit: multiple epidermal growth factor-like domains protein 11 precursor (9e-151)
Best NR hit (blastp): PREDICTED: similar to draper CG2086-PB [Tribolium castaneum] (0.0)
Best NR hit (blastx): PREDICTED: similar to draper CG2086-PB [Tribolium castaneum] (0.0)
Gene Ontology terms:
GO:0007155 cell adhesion
GO:0008345 larval locomotory behavior
GO:0006909 phagocytosis
GO:0031224 intrinsic to membrane
GO:0005515 protein binding
GO:0035212 cell competition in a multicellular organism
GO:0048190 wing disc dorsal/ventral pattern formation
GO:0043277 apoptotic cell clearance
InterPro families:
IPR013032 EGF-like region, conserved site
IPR000742 Epidermal growth factor-like, type 3
IPR002049 EGF-like, laminin
IPR011489 EMI domain
IPR006210 Epidermal growth factor-like
Orthology group: MCL10755

Nucleotide sequence:

ATGTTGACGGCAGGAGTTCTGGTGTTCCTGGCTGCATCGGTCTCAGCCCTACTGGAAGGA
CCTAACGTTTGTACTAAGCAGGAATCATACACAACAGTAATACGAGTATCCGCACAGCAG
CCGTATCAAGTGAAGGAGTACGCTTGGTGTTTCAATGTGCCCCCACGATGTTCAAAATAT
AATATAAAATTCAAGCAAGTATTTAAAACACAAAAGCTGATAAAGCAGCGTCCTATAGAG
CAATGCTGTGATGGCTACGCTCCGGATCCTGAGGGGAGGCAGTGCGTGCCCGTGTGCGTG
GAGGCCTGTGTTCACGGGAAATGTGTGGCCCCCAACACTTGTGCGTGTGCGCACCGCTAC
GGAGGGCCCGCCTGTGATATATCATGCCCTGACGGTAAGTGGGGTAGGAATTGTCAGAAC
GAATGTCGTTGCATGCACGGCGGTTCATGCGAGCCGCTAACAGGCGACTGTGCATGCGCA
GTCGGCTGGTGGGGAGATGCTTGCGAGAAAGCCTGTGCACCTCCAACATTTGGTCAAAAC
TGTGAACAGATCTGTGCCTGTAAGAACAATGGCACCTGCGATCCCGCTAGCGGCGCCTGT
ACTTGTCCACGCGAATTCACCGGACCTTTGTGTGAGAAGGAGTGTCCCAAAAACGACATA
TGTCCGTTATCCTGTCTTTGTCAAAACGACGCTAAATGCAACCTGTTCACCGGCGACTGC
GAGTGTACTCCCGGCTGGACCGGTGAAGTATGCGCCAATAAATGTCCAGCCGGTCGCTGG
GGAGATAAATGTAGCAGGACCTGCGAGTGTTGGAACGGCGCTAGCTGTCATCACGTGACC
GGTCAATGTCAATGTGAAGCTGGGTTTACTGGGGACAAGTGTCTAGAGTCCTGTCCACTG
GGTACATACGGTGTGTCATGTGCTGGCGTGTGTAAGTGTCAACACGGGGGCGAGTGTTCG
CCCAAGGACGGTTCGTGCACCTGCAAACCGGGCTGGCAGGGCGACCGGTGTGAGAGGCGA
GCGTGTCCGAACGGTCTGTGGGGCCCGCGCTGTGATAAGACCTGCGAGTGTAATCCCACA
ACTACTGATATGTGTGATCCCTGGACGGGTGCGTGTGAGTGCGCCGCGGGGTGGGCGGGC
GAGTCGTGCGCTCGACAGTGCCCTCTGCTAACGTACGGCAAGGGCTGTCGATCGTCCTGT
CGCTGTGAGAACAGCGGACACTGCTCACCCGTTAATGGTTCATGTCTGTGTGCGCCGGGT
TACCGTGGTCTGCGCTGCGAGGAGCCCTGTCCCTACCCGTTCTATGGTGATAACTGTGCC
GACACGTGCGACTGTCGCAACAACGCTTCGTGCTCACACGAGACCGGCCAATGTGACTGC
AAACCGGGTTTCGATGGTTTGAAATGTGATCGGCCGTGCGACGGGAAGACATTCGGCTTG
AGGTGTCGTCAGCCGTGTAACTGCGAGAACGACGCTCCTTGTAATCCCGTAAACGGTGAG
TGTGTATGCGGGCCGGGCTATGAAGGTCCTCGTTGTGAGCGTCGTTGCCGCGCCGGTTAC
TACGGACAGAACTGTTCTTTTCCGTGCGACTGTACTGAGAACGCGGTCGGATGTCATCAC
GTCACCGGAGCCTGTGTCTGCGAGAGCAGCTGGAGAGGTATCCGCTGCGAGACTCAATGC
GAGGCTGGGCAGTACGGTGTGCAGTGCAGCGAGAAGTGTCCCTGCGCCAACAACTCGTCC
TGTGACGCGGAGAGCGGACGGTGTGAGTGCGCCCCGGGCTGGAGGGGCGAGCGCTGCGAC
GTACCCTGCGAGCCCGGCACGTACGGCGCCGCCTGTAAACAGCTGTGTCCTCATCATCCG
CTAGGTAATGTGACGTGTAACCCTACCACCGGCGAGTACAGCTGTGCTAGCGGGTACACG
GGTGTGTCCTGTGAGTATCCGTGTCCGCTGGGCACGTACGGCGAGGGATGTCAACAAAAA
TGCAACTGTAAGAACGGAGCCGACTGCCATCATGTCACTGGCGAGTGTCAGTGCCTGCCG
GGTTGGCGCGGGTCCCTGTGCGGCGAGGCTTGTCCCCCAGGTTGGTGGGGCGCGGGATGT
TCCCAGCCGTGCCGGTGCGCGCGTGGCGCTGCCTGTCGCCCCAACGACGGATACTGCAGG
TGCCCGCCCGGATACACCGGCAACTACTGTACACAGTTCTGTCCCGAGGGCTACTTCGGC
GATCACTGTATGGAAGCATGTAATTGTTCGTCTCACGGCAACTGGGTCTGCGAGCCGGTC
AGAGGCTGTGTGTGTCACCGGGGCTTCGTAGGAGAGAACTGTGACTTGAGAGCCAGCGAC
GCCATAGTTATAGATCAAGCTAGAGGTAGCTCAAATGCTGGTTTAACAGCTGTGATGTTG
GTAGCCGTTATAGCTTGCGGTGCTGCAGCTGTGCTCGTGCTGTTGTACTACAGAAGAAGG
GTGCGCTCGCTCAAAAGAGAGATAGCTCACGTTCACTACACCGCCGACCCCAACACTCAG
CCCGATCAACAACACTTCGACAACCCGGTGTACTCGTTCCAGAATTCAACGCGTAGCGAC
GATTCAACGACATTATTAAACAACTCAACAATATTCAACAATCTGGATAATAGCAGCAAA
ATTAGCAACGCGGCGTTAGAGAAACTAAGAATGACCGCCTCCAGCTCTAATGGAACCTAC
GATCCCTTCTCGTCTATAAAGAACAAGGACGCTGATATGACCAATCCTAATTTGTATCAC
TGTATCGAGGACGACAATAAATTAGACCACGTGTACGATGAGATCAAACACAAGGAGGGA
TACGAAATGGAGTACGATCACCTGAATTACACCCCTCCTGCGAACACGTGGAAGCCTCAC
TACGTTCGTATGAATGGTTCCATTGGGGGTGGTCAGACTTCGACTCCCCCTATACCGCCA
TTACCGAAACTTCACACTGCTGTCCCTATAGTCCCGGGCCCCCAAGTGCAAGGCCCTCTG
GAGCCTGGGGAGCCCTCAGTACTGAGTGACAACGAGGCCCCACCCCCACCCCCCACAAGA
GAGGAGCCTCACGAACAACCTCTATAG

Protein sequence:

MLTAGVLVFLAASVSALLEGPNVCTKQESYTTVIRVSAQQPYQVKEYAWCFNVPPRCSKY
NIKFKQVFKTQKLIKQRPIEQCCDGYAPDPEGRQCVPVCVEACVHGKCVAPNTCACAHRY
GGPACDISCPDGKWGRNCQNECRCMHGGSCEPLTGDCACAVGWWGDACEKACAPPTFGQN
CEQICACKNNGTCDPASGACTCPREFTGPLCEKECPKNDICPLSCLCQNDAKCNLFTGDC
ECTPGWTGEVCANKCPAGRWGDKCSRTCECWNGASCHHVTGQCQCEAGFTGDKCLESCPL
GTYGVSCAGVCKCQHGGECSPKDGSCTCKPGWQGDRCERRACPNGLWGPRCDKTCECNPT
TTDMCDPWTGACECAAGWAGESCARQCPLLTYGKGCRSSCRCENSGHCSPVNGSCLCAPG
YRGLRCEEPCPYPFYGDNCADTCDCRNNASCSHETGQCDCKPGFDGLKCDRPCDGKTFGL
RCRQPCNCENDAPCNPVNGECVCGPGYEGPRCERRCRAGYYGQNCSFPCDCTENAVGCHH
VTGACVCESSWRGIRCETQCEAGQYGVQCSEKCPCANNSSCDAESGRCECAPGWRGERCD
VPCEPGTYGAACKQLCPHHPLGNVTCNPTTGEYSCASGYTGVSCEYPCPLGTYGEGCQQK
CNCKNGADCHHVTGECQCLPGWRGSLCGEACPPGWWGAGCSQPCRCARGAACRPNDGYCR
CPPGYTGNYCTQFCPEGYFGDHCMEACNCSSHGNWVCEPVRGCVCHRGFVGENCDLRASD
AIVIDQARGSSNAGLTAVMLVAVIACGAAAVLVLLYYRRRVRSLKREIAHVHYTADPNTQ
PDQQHFDNPVYSFQNSTRSDDSTTLLNNSTIFNNLDNSSKISNAALEKLRMTASSSNGTY
DPFSSIKNKDADMTNPNLYHCIEDDNKLDHVYDEIKHKEGYEMEYDHLNYTPPANTWKPH
YVRMNGSIGGGQTSTPPIPPLPKLHTAVPIVPGPQVQGPLEPGEPSVLSDNEAPPPPPTR
EEPHEQPL
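
Consistency check (illustrative):

The nucleotide and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), and the names `translate` and `CODON_TABLE` are hypothetical helpers introduced here. For brevity it translates only the first and last lines of the nucleotide sequence. Both are in frame, because every full sequence line holds 60 bases (20 codons).

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTTGACGGCAGGAGTTCTGGTGTTCCTGGCTGCATCGGTCTCAGCCCTACTGGAAGGA"
last_line = "GAGGAGCCTCACGAACAACCTCTATAG"

print(translate(first_line))  # MLTAGVLVFLAASVSALLEG
print(translate(last_line))   # EEPHEQPL

# A CDS of 3087 nt is 1029 codons; minus the stop codon, 1028 residues.
print(3087 // 3 - 1)          # 1028
```

The two translated fragments match the start and end of the protein sequence above. The expected length of 1028 residues agrees with the protein as listed (17 lines of 60 residues plus a final line of 8). Passing the full nucleotide sequence to `translate` should reproduce the whole protein under the same assumption.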