New model in OGS2.0 | DPOGS200002  |
---|---|
Genomic Position | scaffold967:+ 7884-13312 |
CDS Length | 3018 |
Paired RNAseq reads   | 3235 |
Single RNAseq reads   | 7823 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA014370 (2e-34) |
Best Drosophila hit   | toll, isoform A (3e-85) |
Best Human hit | carboxypeptidase N subunit 2 (3e-26) |
Best NR hit (blastp)   | GF16456 [Drosophila ananassae] (1e-102) |
Best NR hit (blastx)   | GF16456 [Drosophila ananassae] (4e-98) |
GeneOntology terms    | GO:0019732 antifungal humoral response; GO:0019730 antimicrobial humoral response; GO:0006963 positive regulation of antibacterial peptide biosynthetic process; GO:0006967 positive regulation of antifungal peptide biosynthetic process; GO:0004888 transmembrane receptor activity; GO:0008063 Toll signaling pathway; GO:0005887 integral to plasma membrane; GO:0009950 dorsal/ventral axis specification; GO:0006952 defense response; GO:0030097 hemopoiesis; GO:0007416 synapse assembly; GO:0006955 immune response; GO:0009620 response to fungus; GO:0016021 integral to membrane; GO:0009880 embryonic pattern specification; GO:0070976 TIR domain binding; GO:0009617 response to bacterium; GO:0045087 innate immune response; GO:0019955 cytokine binding; GO:0050832 defense response to fungus; GO:0050830 defense response to Gram-positive bacterium; GO:0035007 regulation of melanization defense response; GO:0035172 hemocyte proliferation; GO:0000910 cytokinesis; GO:0031224 intrinsic to membrane; GO:0007165 signal transduction; GO:0007352 zygotic specification of dorsal/ventral axis; GO:0045610 regulation of hemocyte differentiation; GO:0005515 protein binding; GO:0043234 protein complex; GO:0005886 plasma membrane |
InterPro families    | IPR003591 Leucine-rich repeat, typical subtype; IPR000157 Toll-Interleukin receptor; IPR001611 Leucine-rich repeat |
Orthology group | MCL10277 |
Nucleotide sequence:
ATGTATCATGTGTTGCTGTTGTCGCTGACGCTGGCTTTGGTAGCAGCGGAGTACGACTTC
ATGTCCGGTGTTATATGGGGTGGGCCTGAGAGAGGGTGCGACAGTTCGGAGGGTGCGGGA
CAGGCTGTCGTCAGCTGCACCCTCGCCAGCGGGAATATCACCCTCAACGTGGACAGATCA
GCCTCATGGCTGAAGATCACGTGTGAAGAGAACAGTTCGTTCTCGTGCAGCGAATTGTTA
GAAGCTCGCCCTCATATATCACGATATGTAACAGTAAATGGACAAAAAGATAGACAGATA
TCTAGATTAGATGTAGATAGTTGTAGATTGCCCGAGGAGAGCCTAGCTTGTCTACTAGAC
CTAGTGAACGCTAGCTCGGCCGTACTCTTGAGGCTCATACACTGTGAGGGTAGAGTGACT
GACAGCAGCTTGGCTGGCGTGGACACCGTGAAGTTCAGGATGAACTATGTCGATAAGAAC
ACCACCTCGGTGCCGTATCCAGCGTTATCAGAGCTCCCATCACTGCTGAGCTTCACTTTG
AAGGGGGGCTCGTTGGTGCTGGACGTTCAGAACGTTACTTTACCGAAGTTGAGGACCCTG
GAGCTGGCCGATGGCGGCCTCGAGGTCATACCTAGTAACGTGTTCACGAACACGCCGAAC
ATACAGACCTTGATGTTGTGGGGGAATAGAATTAGTAAGCTCGAAGAAGACGCTTTCAAA
GGTCTCAAGGAGCTAGCGAACGTGAGTCTCAACTCCAACAAAATCTCTTCTCTCCCCAAC
AAAATTTTCTCCCACACCCCCTTGACCAGGAGAGTTGACCTTTACGACAATAGACTGGTT
ATATTACAAAAGGATCTCTTCAGTGGTTTGCAACATTTGGAAGAGGTCATTATAACATCG
AATAAAGCGAATCTGACCCTAGAAGATACATTTGCTAATCTACCCTCACTGAAAAATCTC
AAACTGGAGACGAGTAATATAGAAGAACTGCCAGAGAACCTGTTCCGCAACTCCACGTCG
TTGAGAACACTGCTGTTGGGTGGCAACAAAATAGAGAATCTCCCGCCAACCATTTTCAGT
GATCAGAAATTGGTGGTACTGAATCTGTACGATAATAGGATTAGTGAACTACCCGCAGTG
CTGTTGAAGAATCAGTCGTCATTGGAAAGATTGGATCTGAGGCGGAATCTCATAAAGAAC
ATTCCTGGCGGGCTATTCTCTGACGCTAACAAGTTGAAGGTCTGTGATCTGAGCGACAAT
CAGATCGAGTTATTAGACAGCAATTCATTTATCGGAGCTTCAAATTTAATAGAGTTGTAT
CTAGCCGGGAATCGAATACACTACCTGAGGAGGGCAACGTTTGAGGTCATGCCTTCATTA
GAGATATTATCTCTAGCGAGAAATAACTTGACCATATCAGAAGGGATGAACACAATAGCC
CTGACGCCGTCCGACGAATACTACACCGGCGACTACTCAAGGACATTCCAATACTACTCG
GTGTTTAAAAGTCTCAAGTATCTGAAGACATTGAATTTGAGTAAAAACAACGTGTCCATT
ATATGTGAGGATTGGAGGCAGCTGGTTGGACTCAAGAAGTTGGATTTGTCCTACAACAGC
ATCGACTTCCTGTCGGATGTTTCGATGCACTTCGACTTAAGCGACGCCATCATAGACGTG
AGGCACAACAGAATAACAACAATAGTACCCCCTGTATACACGAGTGACTCGGACAAACCT
ACCTTCATATTAGACTACAACCCGTTCGCCTGCGACTGTTATCTTTACGAATTAATACAA
AGATATAAATCCGGGAAGAACACACCCATCCTACAAATGGACAAGACCAAGTGCGCTTCA
CCTCGCTCCCTAAGGAACACGCAGATAACCCAGCTGAGTCCCGAGCAGCTGTTCTGTGAC
GTCCCTTGCAGTGACTGCTCCTGCAAGATAAGGCCGTACAATCGGAGATTTGTCCTGGAC
TGTGACGAGATGCCGGCAGCGCCGCCCGAGGTCCCGGAAGTGTTCGAGGCTTTGGAACTG
TCCAACGAGATCCACTTGAAGCGGAGCACGGACTTCATACCGAGCTACTACCGGTACGTC
GACATGGCAAGCTTGAATCTCACCGCAGCGCCATCTGTCGCCGGTCCGCTGGAACTGAAC
TTGACCAACAACAACCTCCGAAGCGCCCCGCTGGCTTTGCTGGTTACTAACTGCTCGTTA
TATCTATCGAACAATCCGTTCCTGTGCGGTTGCGATGATTACGAGAGCGTTGAAAATCTT
ATTAGATACAAACATTTGATACGCGATTTCAAAGAAATTAGATGCGAGGATGGCGGGCTC
GTCTCTAACGTTAACACCGGCCAGATCTGCGTGGCGAGAGATGCCGCCATAATCGGCTCG
ACGATCGCCATGTTCGGCGTGATTCTGGCTATTTTCACCGCGACGGCGTACAAATATTCA
ACGGAGATACGAATTCTACTGAGGAAATATCATCTCTGGTGGGGAGACGAGTTCGACTGC
GAGAAGGAGTACGACGCCTTCGTGTCTTACTCGCACCAGGACGAGGGTTACGTAGTGGAG
CAACTGGTCCCGAACCTGGAAGGGGGGAAGCCGCCTCTGAGACTGTGCGTCCACTACCGG
AACTGGGTGATAGGCGACTTCATACCGAGCCAGATAGCGAGATCCGTGGAACAATCTAGA
AAAACCATAATAGTGCTTTCCAAACACTTCGTGAACTCGATATGGGGTCACATGGAATTC
AGGACGGCGCATGGCAAGGGCAAGGTGATAATACTCATGCTGGACGACCTCTCCGCCGAT
GACAGCCTGGACCCGGAGCTCAAGGCCTATATAGCCATGAACACGTACGTCAAGTCCAAA
GATCCCCTGGTCTTCGATAGGATAAGGGATGCTGTTCTCAGCAAGCCGCCGAACAAGTCA
CCGATGGGCCTAAATGTGCAGTTGAAAGACGGAAAGTTAGTCAATGTGAACAAGGATATT
GATATAGCAATAAAATGA
Protein sequence:
MYHVLLLSLTLALVAAEYDFMSGVIWGGPERGCDSSEGAGQAVVSCTLASGNITLNVDRS
ASWLKITCEENSSFSCSELLEARPHISRYVTVNGQKDRQISRLDVDSCRLPEESLACLLD
LVNASSAVLLRLIHCEGRVTDSSLAGVDTVKFRMNYVDKNTTSVPYPALSELPSLLSFTL
KGGSLVLDVQNVTLPKLRTLELADGGLEVIPSNVFTNTPNIQTLMLWGNRISKLEEDAFK
GLKELANVSLNSNKISSLPNKIFSHTPLTRRVDLYDNRLVILQKDLFSGLQHLEEVIITS
NKANLTLEDTFANLPSLKNLKLETSNIEELPENLFRNSTSLRTLLLGGNKIENLPPTIFS
DQKLVVLNLYDNRISELPAVLLKNQSSLERLDLRRNLIKNIPGGLFSDANKLKVCDLSDN
QIELLDSNSFIGASNLIELYLAGNRIHYLRRATFEVMPSLEILSLARNNLTISEGMNTIA
LTPSDEYYTGDYSRTFQYYSVFKSLKYLKTLNLSKNNVSIICEDWRQLVGLKKLDLSYNS
IDFLSDVSMHFDLSDAIIDVRHNRITTIVPPVYTSDSDKPTFILDYNPFACDCYLYELIQ
RYKSGKNTPILQMDKTKCASPRSLRNTQITQLSPEQLFCDVPCSDCSCKIRPYNRRFVLD
CDEMPAAPPEVPEVFEALELSNEIHLKRSTDFIPSYYRYVDMASLNLTAAPSVAGPLELN
LTNNNLRSAPLALLVTNCSLYLSNNPFLCGCDDYESVENLIRYKHLIRDFKEIRCEDGGL
VSNVNTGQICVARDAAIIGSTIAMFGVILAIFTATAYKYSTEIRILLRKYHLWWGDEFDC
EKEYDAFVSYSHQDEGYVVEQLVPNLEGGKPPLRLCVHYRNWVIGDFIPSQIARSVEQSR
KTIIVLSKHFVNSIWGHMEFRTAHGKGKVIILMLDDLSADDSLDPELKAYIAMNTYVKSK
DPLVFDRIRDAVLSKPPNKSPMGLNVQLKDGKLVNVNKDIDIAIK
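
The record above can be checked for internal consistency: a CDS length of 3018 corresponds to 1006 codons, that is, 1005 residues plus a stop codon, which matches the length of the protein sequence listed. The following is a minimal sketch of such a check, assuming the standard nuclear genetic code; the names `translate`, `first_line`, and `last_line` are illustrative and not part of the database. It translates only the first and last lines of the nucleotide sequence for brevity.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the CDS uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTATCATGTGTTGCTGTTGTCGCTGACGCTGGCTTTGGTAGCAGCGGAGTACGACTTC"
last_line = "GATATAGCAATAAAATGA"

print(translate(first_line))  # MYHVLLLSLTLALVAAEYDF
print(translate(last_line))   # DIAIK*

# 3018 nt -> 1006 codons -> 1005 residues once the stop codon is excluded.
expected_protein_length = 3018 // 3 - 1
print(expected_protein_length)  # 1005
```

Passing the full nucleotide sequence (all lines concatenated) to `translate` and stripping the trailing `*` should reproduce the protein sequence given in the record.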