DPGLEAN04644 in OGS1.0

New model in OGS2.0DPOGS203815 
Genomic Positionscaffold120:+ 194731-202691
See gene structure
CDS Length1734
Paired RNAseq reads  215
Single RNAseq reads  560
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003723 (3e-148)
Best Drosophila hit  CG31665, isoform B (8e-111)
Best Human hitdelta and Notch-like epidermal growth factor-related receptor precursor (3e-35)
Best NR hit (blastp)  GF14011 [Drosophila ananassae] (2e-126)
Best NR hit (blastx)  PREDICTED: similar to CG31665 CG31665-PB [Tribolium castaneum] (8e-133)
GeneOntology terms  GO:0005509 calcium ion binding
InterPro families





  
IPR000742 Epidermal growth factor-like, type 3
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR006209 EGF
IPR006210 Epidermal growth factor-like
IPR001881 EGF-like calcium-binding
Orthology groupMCL18145

Nucleotide sequence:

ATGACGTCGCCATTTACTAAGCTTATCCACTCAACTGCGTCTAGCGTAAAGCCATTAGGG
ACGACAGAGCCTCTCAGCACAGCTACTCCAACAACAAAGGAAAGCGTTAAAACAACGAAA
GAATTCGCCACAGAGTACCCAACCACATCACAAAAAGTTACCACTCAAGAGTACGTGAAT
ACAAGGAGTACACTTAAAAGCTCAAAAACGCCAAAGTTAAGCAATACAAATAGTACAAGT
CACTCTATAACGCAATTGCAAGATCGCTTGGGGGCTATAGACTGTGATTTGCCCGTATTG
CCGAGGGAATCTCGGCTCTGGCGAGGCAATGAGACACACGAGCTCAATTTACCCGTCACG
GAGTGTGTTGGCGAACGCAACAGAGGCGAGGAGGAGTGTTCTTCGGTTATCGTGTCGTGG
GAGGGTGTCGCTGCCATTCAAAGCGGAGATATTCTTATCGTTCGCATCGCTGACTCGTCT
TCTATTGTAATCTATAATTCTAAGAACGAAACTCTGGACGCGAACTCTTCACACACCGCT
GACAAGAGATCTCACATATCGGTTCAACCGGCGGTGTACCAAGTGACAAGGCAGGGGCAC
GAACACTGCGATGTGTCTGATGGGATGCTACTTGATATAACGCCTCTTGATGAGCACGGT
GCCAAAATTTTTACATTGTACGACAAAGATCTCACTGAAGGCGTTAATTTACTCATAGTG
GTATCGGAAAACTGGGGATCACAATGCGTTCGTCTAAAAGTTACCGTGAAATCGGATAAC
TGCGGAGAATCTCAAGAATGTTCCGGGAAAGGCGTTTGCTATACGAACGTCTCTATGGAA
GGTTATGAGTGCCAATGTTGCCGTGGGTACGTCGGTTCTCATTGCGAGGACAGGGAGGTT
TGCAATCCATCACCCTGTTTGAATAATGGCATATGTGTTGACCTCACGCAACCTTCGAAC
GGTGCCACTTATCACTGCCTATGTCCATACGGTTATACTGGTGACCGTTGCGAGTTGGAG
TACAACGAGTGTGAGTCCTCTCCTTGCGGTAACGGCGGTTCATGTACTGACAGGGTTGGT
GGTTATGACTGTTCCTGCACTCGGGGATATACAGGCGATAACTGCCAATTGAAGGTTGAC
CTTTGTTCACCCAACCCCTGTCCAACCCATCGCTACTGTATGGATCACGGCAGTAGTTAC
ACATGCGAATGTCCCAGCGGGTTCGTTGGCGAAGAGTGTCACATACCGGCCACTTCGGCA
TGTGATAACAACCCATGTGCTCACGGCGGTACGTGCTGGAGCGGTGTCGATTCGTTCTAC
TGTTCATGCCGGCCAGGATACACGGGGAAACTCTGCGAAGAGGACTTCATTCTGGAGTCA
GTGATGGACGAGAGCGAGAGAGATGAACAGACTGGAGGCGGCTCTGTACGGGAGATGAGA
TTACCATTAGGGCTGTATCACGACCGACTACACAACGTGTATATCGCAGCTGGGACATTG
GGCGCTGCCATCGCTATTGTTGGAGTTGTGGTAACAGCGTGTCACTGCCGTGTGAACAAG
ACCTATTCCCGCCTAATGTCCCGTCTGTCCCGTGTCACGGAGTCAGGTCCGCCACATCAC
TGGCTGGAAGACAAGCGCGCGCCACCAGCACCCCTCCCTCCAGCACTTGACACCACGGAC
ATGTACTACACCCTGGACTTCAGCGACAGTCAGAGCTCACCGCTCATACAATAG

Protein sequence:

MTSPFTKLIHSTASSVKPLGTTEPLSTATPTTKESVKTTKEFATEYPTTSQKVTTQEYVN
TRSTLKSSKTPKLSNTNSTSHSITQLQDRLGAIDCDLPVLPRESRLWRGNETHELNLPVT
ECVGERNRGEEECSSVIVSWEGVAAIQSGDILIVRIADSSSIVIYNSKNETLDANSSHTA
DKRSHISVQPAVYQVTRQGHEHCDVSDGMLLDITPLDEHGAKIFTLYDKDLTEGVNLLIV
VSENWGSQCVRLKVTVKSDNCGESQECSGKGVCYTNVSMEGYECQCCRGYVGSHCEDREV
CNPSPCLNNGICVDLTQPSNGATYHCLCPYGYTGDRCELEYNECESSPCGNGGSCTDRVG
GYDCSCTRGYTGDNCQLKVDLCSPNPCPTHRYCMDHGSSYTCECPSGFVGEECHIPATSA
CDNNPCAHGGTCWSGVDSFYCSCRPGYTGKLCEEDFILESVMDESERDEQTGGGSVREMR
LPLGLYHDRLHNVYIAAGTLGAAIAIVGVVVTACHCRVNKTYSRLMSRLSRVTESGPPHH
WLEDKRAPPAPLPPALDTTDMYYTLDFSDSQSSPLIQ