Monarch geneset OGS2.0

DPOGS200002
Transcript        DPOGS200002-TA  2874 bp
Protein           DPOGS200002-PA  957 aa
Genomic position  DPSCF300466 + 36541-41969
RNAseq coverage   726x (Rank: top 18%)
Annotation
Heliconius        HMEL014257        5e-105  32.05%
Bombyx            BGIBMGA010304-TA  5e-43   35.45%
Drosophila        Tl-PB             3e-99   28.77%
EBI UniRef50      UniRef50_E0VZA1   4e-105  29.48%  Protein toll, putative n=2 Tax=Paraneoptera RepID=E0VZA1_PEDHC
NCBI RefSeq       XP_002431445.1    7e-106  29.48%  protein toll precursor, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|194745081      1e-100  31.81%  GF16456 [Drosophila ananassae]
NCBI nr blastx    gi|242022031      2e-107  29.89%  protein toll precursor, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0031224  2.9e-30  intrinsic to membrane
                  GO:0007165  2.9e-30  signal transduction
                  GO:0004888  2.9e-30  transmembrane receptor activity
                  GO:0045087  2.9e-30  innate immune response
KEGG pathway
InterPro domain   [784-924]  IPR000157  2.9e-30  Toll-Interleukin receptor
Orthology group   MCL10212  Insect specific

Nucleotide sequence:

>DPOGS200002-TA
ATGTATCATGTGTTGCTGTTGTCGCTGACGCTGGCTTTGGTAGCAGCGGAGTACGACTTCATGTCCGGTGTTATATGGGGTGGGCCTGAGAGAGGGTGCGACAGTTCGGAGGGTGCGGGACAGGCTGTCGTCAGCTGCACCCTCGCCAGCGGGAATATCACCCTCAACGTGGACAGATCAGCCTCATGGCTGAAGATCACGTGTGAAGAGAACAGTTCGTTCTCGTGCAGCGAATTGTTAGAAGCTCGCCCTCATATATCACGATATGTAACAGTAAATGGACAAAAAGATAGACAGATATCTAGATTAGATGTAGATAGTTGTAGATTGCCCGAGGAGAGCCTAGCTTGTCTACTAGACCTAGTGAACGCTAGCTCGGCCGTACTCTTGAGGCTCATACACTGTGAGGGTAGAGTGACTGACAGCAGCTTGGCTGGCGTGGACACCGTGAAGTTCAGGATGAACTATGTCGATAAGAACACCACCTCGGTGCCGTATCCAGCGTTATCAGAGCTCCCATCACTGCTGAGCTTCACTTTGAAGGGGGGCTCGTTGGTGCTGGACGTTCAGAACGTTACTTTACCGAAGTTGAGGACCCTGGAGCTGGCCGATGGCGGCCTCGAGGTCATACCTAGTAACGTGTTCACGAACACGCCGAACATACAGACCTTGATGTTGTGGGGGAATAGAATTAGTAAGCTCGAAGAAGACGCTTTCAAAGGTCTCAAGGAGCTAGCGAACGTGAGTCTCAACTCCAACAAAATCTCTTCTCTCCCCAACAAAATTTTCTCCCACACCCCCTTGACCAGGAGAGTTGACCTTTACGACAATAGACTGGTTATATTACAAAAGGATCTCTTCAGTGGTTTGCAACATTTGGAAGAGGTCATTATAACATCGAATAAAGCGAATCTGACCCTAGAAGATACATTTGCTAATCTACCCTCACTGAAAAATCTCAAACTGGAGACGAGTAATATAGAAGAACTGCCAGAGAACCTGTTCCGCAACTCCACGTCGTTGAGAACACTGCTGTTGGGTGGCAACAAAATAGAGAATCTCCCGCCAACCATTTTCAGTGATCAGAAATTGGTGGTACTGAATCTGTACGATAATAGGATTAGTGAACTACCCGCAGTGCTGTTGAAGAATCAGTCGTCATTGGAAAGATTGGATCTGAGGCGGAATCTCATAAAGAACATTCCTGGCGGGCTATTCTCTGACGCTAACAAGTTGAAGATATTATCTCTAGCGAGAAATAACTTGACCATATCAGAAGGGATGAACACAATAGCCCTGACGCCGTCCGACGAATACTACACCGGCGACTACTCAAGGACATTCCAATACTACTCGGTGTTTAAAAGTCTCAAGTATCTGAAGACATTGAATTTGAGTAAAAACAACGTGTCCATTATATGTGAGGATTGGAGGCAGCTGGTTGGACTCAAGAAGTTGGATTTGTCCTACAACAGCATCGACTTCCTGTCGGATGTTTCGATGCACTTCGACTTAAGCGACGCCATCATAGACGTGAGGCACAACAGAATAACAACAATAGTACCCCCTGTATACACGAGTGACTCGGACAAACCTACCTTCATATTAGACTACAACCCGTTCGCCTGCGACTGTTATCTTTACGAATTAATACAAAGATATAAATCCGGGAAGAACACACCCATCCTACAAATGGACAAGACCAAGTGCGCTTCACCTCGCTCCCTAAGGAACACGCAGATAACCCAGCTGAGTCCCGAGCAGCTGTTCTGTGACGTCCCTTGCAGTGACTGCTCCTGCAAGATAAGGCCGTACAATCGGAGATTTGTCCTGGACTGTGACGAGATGCCGGCAGCGCCGCCCGAGGTCCCGGAAGTGTTCGAGGCTTTGGAACTGTCCAACGAGATCCACTTGAAGCGGAGCACGGACTTCATACCGAGCTACTACCGGTACGTCGACATGGCAAGCTTGAATCTCACCGCAGCGCCATCTGTCGCCGG
TCCGCTGGAACTGAACTTGACCAACAACAACCTCCGAAGCGCCCCGCTGGCTTTGCTGGTTACTAACTGCTCGTTATATCTATCGAACAATCCGTTCCTGTGCGGTTGCGATGATTACGAGAGCGTTGAAAATCTTATTAGATACAAACATTTGATACGCGATTTCAAAGAAATTAGATGCGAGGATGGCGGGCTCGTCTCTAACGTTAACACCGGCCAGATCTGCGTGGCGAGAGATGCCGCCATAATCGGCTCGACGATCGCCATGTTCGGCGTGATTCTGGCTATTTTCACCGCGACGGCGTACAAATATTCAACGGAGATACGAATTCTACTGAGGAAATATCATCTCTGGTGGGGAGACGAGTTCGACTGCGAGAAGGAGTACGACGCCTTCGTGTCTTACTCGCACCAGGACGAGGGTTACGTAGTGGAGCAACTGGTCCCGAACCTGGAAGGGGGGAAGCCGCCTCTGAGACTGTGCGTCCACTACCGGAACTGGGTGATAGGCGACTTCATACCGAGCCAGATAGCGAGATCCGTGGAACAATCTAGAAAAACCATAATAGTGCTTTCCAAACACTTCGTGAACTCGATATGGGGTCACATGGAATTCAGGACGGCGCATGGCAAGGGCAAGGTGATAATACTCATGCTGGACGACCTCTCCGCCGATGACAGCCTGGACCCGGAGCTCAAGGCCTATATAGCCATGAACACGTACGTCAAGTCCAAAGATCCCCTGGTCTTCGATAGGATAAGGGATGCTGTTCTCAGCAAGCCGCCGAACAAGTCACCGATGGGCCTAAATGTGCAGTTGAAAGACGGAAAGTTAGTCAATGTGAACAAGGATATTGATATAGCAATAAAATGA

Protein sequence:

>DPOGS200002-PA
MYHVLLLSLTLALVAAEYDFMSGVIWGGPERGCDSSEGAGQAVVSCTLASGNITLNVDRSASWLKITCEENSSFSCSELLEARPHISRYVTVNGQKDRQISRLDVDSCRLPEESLACLLDLVNASSAVLLRLIHCEGRVTDSSLAGVDTVKFRMNYVDKNTTSVPYPALSELPSLLSFTLKGGSLVLDVQNVTLPKLRTLELADGGLEVIPSNVFTNTPNIQTLMLWGNRISKLEEDAFKGLKELANVSLNSNKISSLPNKIFSHTPLTRRVDLYDNRLVILQKDLFSGLQHLEEVIITSNKANLTLEDTFANLPSLKNLKLETSNIEELPENLFRNSTSLRTLLLGGNKIENLPPTIFSDQKLVVLNLYDNRISELPAVLLKNQSSLERLDLRRNLIKNIPGGLFSDANKLKILSLARNNLTISEGMNTIALTPSDEYYTGDYSRTFQYYSVFKSLKYLKTLNLSKNNVSIICEDWRQLVGLKKLDLSYNSIDFLSDVSMHFDLSDAIIDVRHNRITTIVPPVYTSDSDKPTFILDYNPFACDCYLYELIQRYKSGKNTPILQMDKTKCASPRSLRNTQITQLSPEQLFCDVPCSDCSCKIRPYNRRFVLDCDEMPAAPPEVPEVFEALELSNEIHLKRSTDFIPSYYRYVDMASLNLTAAPSVAGPLELNLTNNNLRSAPLALLVTNCSLYLSNNPFLCGCDDYESVENLIRYKHLIRDFKEIRCEDGGLVSNVNTGQICVARDAAIIGSTIAMFGVILAIFTATAYKYSTEIRILLRKYHLWWGDEFDCEKEYDAFVSYSHQDEGYVVEQLVPNLEGGKPPLRLCVHYRNWVIGDFIPSQIARSVEQSRKTIIVLSKHFVNSIWGHMEFRTAHGKGKVIILMLDDLSADDSLDPELKAYIAMNTYVKSKDPLVFDRIRDAVLSKPPNKSPMGLNVQLKDGKLVNVNKDIDIAIK-