DPGLEAN17977 in OGS1.0

New model in OGS2.0  DPOGS200002
Genomic Position  scaffold967:+ 7884-13312
CDS Length  3018
Paired RNAseq reads  3235
Single RNAseq reads  7823
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014370 (2e-34)
Best Drosophila hit  toll, isoform A (3e-85)
Best Human hit  carboxypeptidase N subunit 2 (3e-26)
Best NR hit (blastp)  GF16456 [Drosophila ananassae] (1e-102)
Best NR hit (blastx)  GF16456 [Drosophila ananassae] (4e-98)
GeneOntology terms
GO:0019732 antifungal humoral response
GO:0019730 antimicrobial humoral response
GO:0006963 positive regulation of antibacterial peptide biosynthetic process
GO:0006967 positive regulation of antifungal peptide biosynthetic process
GO:0004888 transmembrane receptor activity
GO:0008063 Toll signaling pathway
GO:0005887 integral to plasma membrane
GO:0009950 dorsal/ventral axis specification
GO:0006952 defense response
GO:0030097 hemopoiesis
GO:0007416 synapse assembly
GO:0006955 immune response
GO:0009620 response to fungus
GO:0016021 integral to membrane
GO:0009880 embryonic pattern specification
GO:0070976 TIR domain binding
GO:0009617 response to bacterium
GO:0045087 innate immune response
GO:0019955 cytokine binding
GO:0050832 defense response to fungus
GO:0050830 defense response to Gram-positive bacterium
GO:0035007 regulation of melanization defense response
GO:0035172 hemocyte proliferation
GO:0000910 cytokinesis
GO:0031224 intrinsic to membrane
GO:0007165 signal transduction
GO:0007352 zygotic specification of dorsal/ventral axis
GO:0045610 regulation of hemocyte differentiation
GO:0005515 protein binding
GO:0043234 protein complex
GO:0005886 plasma membrane
InterPro families
IPR003591 Leucine-rich repeat, typical subtype
IPR000157 Toll-Interleukin receptor
IPR001611 Leucine-rich repeat
Orthology group  MCL10277

Nucleotide sequence:

ATGTATCATGTGTTGCTGTTGTCGCTGACGCTGGCTTTGGTAGCAGCGGAGTACGACTTC
ATGTCCGGTGTTATATGGGGTGGGCCTGAGAGAGGGTGCGACAGTTCGGAGGGTGCGGGA
CAGGCTGTCGTCAGCTGCACCCTCGCCAGCGGGAATATCACCCTCAACGTGGACAGATCA
GCCTCATGGCTGAAGATCACGTGTGAAGAGAACAGTTCGTTCTCGTGCAGCGAATTGTTA
GAAGCTCGCCCTCATATATCACGATATGTAACAGTAAATGGACAAAAAGATAGACAGATA
TCTAGATTAGATGTAGATAGTTGTAGATTGCCCGAGGAGAGCCTAGCTTGTCTACTAGAC
CTAGTGAACGCTAGCTCGGCCGTACTCTTGAGGCTCATACACTGTGAGGGTAGAGTGACT
GACAGCAGCTTGGCTGGCGTGGACACCGTGAAGTTCAGGATGAACTATGTCGATAAGAAC
ACCACCTCGGTGCCGTATCCAGCGTTATCAGAGCTCCCATCACTGCTGAGCTTCACTTTG
AAGGGGGGCTCGTTGGTGCTGGACGTTCAGAACGTTACTTTACCGAAGTTGAGGACCCTG
GAGCTGGCCGATGGCGGCCTCGAGGTCATACCTAGTAACGTGTTCACGAACACGCCGAAC
ATACAGACCTTGATGTTGTGGGGGAATAGAATTAGTAAGCTCGAAGAAGACGCTTTCAAA
GGTCTCAAGGAGCTAGCGAACGTGAGTCTCAACTCCAACAAAATCTCTTCTCTCCCCAAC
AAAATTTTCTCCCACACCCCCTTGACCAGGAGAGTTGACCTTTACGACAATAGACTGGTT
ATATTACAAAAGGATCTCTTCAGTGGTTTGCAACATTTGGAAGAGGTCATTATAACATCG
AATAAAGCGAATCTGACCCTAGAAGATACATTTGCTAATCTACCCTCACTGAAAAATCTC
AAACTGGAGACGAGTAATATAGAAGAACTGCCAGAGAACCTGTTCCGCAACTCCACGTCG
TTGAGAACACTGCTGTTGGGTGGCAACAAAATAGAGAATCTCCCGCCAACCATTTTCAGT
GATCAGAAATTGGTGGTACTGAATCTGTACGATAATAGGATTAGTGAACTACCCGCAGTG
CTGTTGAAGAATCAGTCGTCATTGGAAAGATTGGATCTGAGGCGGAATCTCATAAAGAAC
ATTCCTGGCGGGCTATTCTCTGACGCTAACAAGTTGAAGGTCTGTGATCTGAGCGACAAT
CAGATCGAGTTATTAGACAGCAATTCATTTATCGGAGCTTCAAATTTAATAGAGTTGTAT
CTAGCCGGGAATCGAATACACTACCTGAGGAGGGCAACGTTTGAGGTCATGCCTTCATTA
GAGATATTATCTCTAGCGAGAAATAACTTGACCATATCAGAAGGGATGAACACAATAGCC
CTGACGCCGTCCGACGAATACTACACCGGCGACTACTCAAGGACATTCCAATACTACTCG
GTGTTTAAAAGTCTCAAGTATCTGAAGACATTGAATTTGAGTAAAAACAACGTGTCCATT
ATATGTGAGGATTGGAGGCAGCTGGTTGGACTCAAGAAGTTGGATTTGTCCTACAACAGC
ATCGACTTCCTGTCGGATGTTTCGATGCACTTCGACTTAAGCGACGCCATCATAGACGTG
AGGCACAACAGAATAACAACAATAGTACCCCCTGTATACACGAGTGACTCGGACAAACCT
ACCTTCATATTAGACTACAACCCGTTCGCCTGCGACTGTTATCTTTACGAATTAATACAA
AGATATAAATCCGGGAAGAACACACCCATCCTACAAATGGACAAGACCAAGTGCGCTTCA
CCTCGCTCCCTAAGGAACACGCAGATAACCCAGCTGAGTCCCGAGCAGCTGTTCTGTGAC
GTCCCTTGCAGTGACTGCTCCTGCAAGATAAGGCCGTACAATCGGAGATTTGTCCTGGAC
TGTGACGAGATGCCGGCAGCGCCGCCCGAGGTCCCGGAAGTGTTCGAGGCTTTGGAACTG
TCCAACGAGATCCACTTGAAGCGGAGCACGGACTTCATACCGAGCTACTACCGGTACGTC
GACATGGCAAGCTTGAATCTCACCGCAGCGCCATCTGTCGCCGGTCCGCTGGAACTGAAC
TTGACCAACAACAACCTCCGAAGCGCCCCGCTGGCTTTGCTGGTTACTAACTGCTCGTTA
TATCTATCGAACAATCCGTTCCTGTGCGGTTGCGATGATTACGAGAGCGTTGAAAATCTT
ATTAGATACAAACATTTGATACGCGATTTCAAAGAAATTAGATGCGAGGATGGCGGGCTC
GTCTCTAACGTTAACACCGGCCAGATCTGCGTGGCGAGAGATGCCGCCATAATCGGCTCG
ACGATCGCCATGTTCGGCGTGATTCTGGCTATTTTCACCGCGACGGCGTACAAATATTCA
ACGGAGATACGAATTCTACTGAGGAAATATCATCTCTGGTGGGGAGACGAGTTCGACTGC
GAGAAGGAGTACGACGCCTTCGTGTCTTACTCGCACCAGGACGAGGGTTACGTAGTGGAG
CAACTGGTCCCGAACCTGGAAGGGGGGAAGCCGCCTCTGAGACTGTGCGTCCACTACCGG
AACTGGGTGATAGGCGACTTCATACCGAGCCAGATAGCGAGATCCGTGGAACAATCTAGA
AAAACCATAATAGTGCTTTCCAAACACTTCGTGAACTCGATATGGGGTCACATGGAATTC
AGGACGGCGCATGGCAAGGGCAAGGTGATAATACTCATGCTGGACGACCTCTCCGCCGAT
GACAGCCTGGACCCGGAGCTCAAGGCCTATATAGCCATGAACACGTACGTCAAGTCCAAA
GATCCCCTGGTCTTCGATAGGATAAGGGATGCTGTTCTCAGCAAGCCGCCGAACAAGTCA
CCGATGGGCCTAAATGTGCAGTTGAAAGACGGAAAGTTAGTCAATGTGAACAAGGATATT
GATATAGCAATAAAATGA

Protein sequence:

MYHVLLLSLTLALVAAEYDFMSGVIWGGPERGCDSSEGAGQAVVSCTLASGNITLNVDRS
ASWLKITCEENSSFSCSELLEARPHISRYVTVNGQKDRQISRLDVDSCRLPEESLACLLD
LVNASSAVLLRLIHCEGRVTDSSLAGVDTVKFRMNYVDKNTTSVPYPALSELPSLLSFTL
KGGSLVLDVQNVTLPKLRTLELADGGLEVIPSNVFTNTPNIQTLMLWGNRISKLEEDAFK
GLKELANVSLNSNKISSLPNKIFSHTPLTRRVDLYDNRLVILQKDLFSGLQHLEEVIITS
NKANLTLEDTFANLPSLKNLKLETSNIEELPENLFRNSTSLRTLLLGGNKIENLPPTIFS
DQKLVVLNLYDNRISELPAVLLKNQSSLERLDLRRNLIKNIPGGLFSDANKLKVCDLSDN
QIELLDSNSFIGASNLIELYLAGNRIHYLRRATFEVMPSLEILSLARNNLTISEGMNTIA
LTPSDEYYTGDYSRTFQYYSVFKSLKYLKTLNLSKNNVSIICEDWRQLVGLKKLDLSYNS
IDFLSDVSMHFDLSDAIIDVRHNRITTIVPPVYTSDSDKPTFILDYNPFACDCYLYELIQ
RYKSGKNTPILQMDKTKCASPRSLRNTQITQLSPEQLFCDVPCSDCSCKIRPYNRRFVLD
CDEMPAAPPEVPEVFEALELSNEIHLKRSTDFIPSYYRYVDMASLNLTAAPSVAGPLELN
LTNNNLRSAPLALLVTNCSLYLSNNPFLCGCDDYESVENLIRYKHLIRDFKEIRCEDGGL
VSNVNTGQICVARDAAIIGSTIAMFGVILAIFTATAYKYSTEIRILLRKYHLWWGDEFDC
EKEYDAFVSYSHQDEGYVVEQLVPNLEGGKPPLRLCVHYRNWVIGDFIPSQIARSVEQSR
KTIIVLSKHFVNSIWGHMEFRTAHGKGKVIILMLDDLSADDSLDPELKAYIAMNTYVKSK
DPLVFDRIRDAVLSKPPNKSPMGLNVQLKDGKLVNVNKDIDIAIK
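
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code (the record does not state which translation table was used) and assuming the CDS length of 3018 includes the terminal stop codon. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons are shown as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First and last lines of the nucleotide sequence in this record.
    first_line = "ATGTATCATGTGTTGCTGTTGTCGCTGACGCTGGCTTTGGTAGCAGCGGAGTACGACTTC"
    last_line = "GATATAGCAATAAAATGA"

    print(translate(first_line))          # MYHVLLLSLTLALVAAEYDF
    print(translate(last_line))           # DIAIK*
    print(expected_protein_length(3018))  # 1005
```

Both translated fragments agree with the start and end of the protein sequence listed above, and 1005 residues matches its length (16 lines of 60 residues plus a final line of 45).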