Monarch geneset OGS2.0

DPOGS211472
Transcript       DPOGS211472-TA  2727 bp
Protein          DPOGS211472-PA  908 aa
Genomic position DPSCF300113 - 287651-292184
RNAseq coverage  460x (Rank: top 27%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL010927        1e-92    37.48%
Bombyx          BGIBMGA014370-TA  1e-69    29.41%
Drosophila      Tl-PB             2e-53    27.01%
EBI UniRef50    UniRef50_A4GVU0   7e-89    32.54%    Toll receptor n=1 Tax=Manduca sexta RepID=A4GVU0_MANSE
NCBI RefSeq     XP_001357966.1    4e-58    27.21%    Tl [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp  gi|126635756      3e-88    32.54%    toll receptor [Manduca sexta]
NCBI nr blastx  gi|126635756      1e-94    32.25%    toll receptor [Manduca sexta]
Group
Gene Ontology    GO:0031224  1.4e-31  intrinsic to membrane
                 GO:0007165  1.4e-31  signal transduction
                 GO:0004888  1.4e-31  transmembrane receptor activity
                 GO:0045087  1.4e-31  innate immune response
KEGG pathway     dre:403125  1e-19
                 K10159 (TLR2) maps->
                     Amoebiasis
                     Leishmaniasis
                     Malaria
                     Toll-like receptor signaling pathway
                     Chagas disease
                     Phagosome
InterPro domain  [739-894]  IPR000157  1.4e-31  Toll-Interleukin receptor
Orthology group  MCL20963  Lepidoptera specific

Nucleotide sequence:

>DPOGS211472-TA
ATGGAAGTTCTCAGGAGTCGTTCGTTGGTGTCAGTGTTAGTGTCAGTGTTAGTGTTTGTTTCGGCGGAGACTCCACGTTGTCCGCCAACGGCGAAATCTGACCTATGTGTCGACAAGGGATCAGCTCTTGATGAATACTTCTTCGTCATTAAAGGAAAAGCCTGGACTATTAAATATGACCTCAAGGAACTCAGCTTCACCTGTCGAAAAGATCTACAGCTGGATAGTGAAGAGCTGCCTCGGTTCACGACCCTGACGATAATCCCTGTTGTTTCTCTGACGGCATGTGCACCACCGTCTATTGGATTCGCAGCCGCGCTAAAAACGCTCAATGTCACTGTACGCGGGGGACTTGTGCTGGAAGAGATGCCGGCGACACCTACCCTCAACGCCACCCATCTACATGGCCTCGAGCTGCGCGCCTTTGATCTGAATGCACCATTTGCACCCGCCCCGTTCCTTGAACCCGAAGACAATTTTCTTGAACCGTTAAAAGGTCTTTTAGATCTGAGACTTGTCCGTGTAACTCTGACATTGAAAAACACGCAAAGTTTTCCTGTCGGGCTTCGCAGACTTATATTACACCATGCAAACCTAACAACACTCCCAGCGGAGGTGTTCGCTCGTCTTCCTGACTTAGTTGATCTCTTTGTTGTTGACGAAACGCTTCAGGAGCTGGACGTTAGCACGGCTTTGGCGCTTCGGGCTGTAGTTGTGGCGGCTCCTCTGAATGGAATCACTCTTGGCTCACGCGTGGATACGGCCACACTGAGAAAGATACGTAGTGCTGAAATACGTGGTAACTGTTCGGGGCTTCGTAGCTTAAGCATTGAGGAATTAAACGTCCCACCACCGACCGCATGGCTAGCGCGATGTGGCGTACGTGAACTACGTCTAAGATATTTATCTCCTGGTGTACTGGGTGCTGGTGAGCTGAGAGGTACTCGTGACCTTCTACGACTAGAAATACATGGATGTGGACTACGAATATTACCACCAGATTGGCTGGCTGACTCGAGTCGCCTGAAAATTCTCGATCTCTCCGAAAACGAACTTGAAACAATTCCAAGTGCAATTATTTCCGCATGTCCCCGGTTGTCGGAGGTGATACTTCCCAACAATCTCCTCAACTCAAGCGTGGTTAAAGCACTAGTAGCAGCACCACTAACAAAATTAGTTCTAGATGGCAACCCACTTGGCGATTTATGTCGAACCGGTTCAGGACTCATAGCAGACGCTATGTCGACCTTATGGAATAGACGGGAGCTCATATACTTATCTCTGCGAAGCACAGGGGTTACTCGTATATGTCACGACTGGTGCCGTCTGCCGAAACTTACACAAATTAATTTACGCAATAACAGCATATCAAAACTGGAGTTCGAGGATCTGCAATGGTCAAGCGCCGTGGTGGTGGATTTGCGTGGTAACCCAGTGAAGGATGTGTCATACTCGCAAGAACAGTATGAGAAAGTTCTGTTGGCTCCCGTTACCCGTGACAGTATTTCCGCTATGATAAGACTGGATAGTCACTTACGTTGCGACTGTAATGAGTACTGGTTTTCCCTGGCACTACGCGCACGACCGCAGCACGGACAGGCTTCGGTTCACGCCATCTGTGAGAACGAGAAAACATTCACTTCTGTGCCTCCAACAGATCTGCTATGCGAAGTGCCAGAGCTGTGCCCAGAAGGGTGTAAGTGTAAAGTCGACGGTGAACGATCGCATATATCGTGTCCGAATGCTGGATTGATAGCTTTGCCGACAATCCCTCCTCTCCCACCTCTCGCCTCACTGTCATTACCAGGAAATGACATTACAATATTGGATTTCACTAACATTTCGACCAGCTTCAAATTACTGGACTTGACTAATAACCAGATTAAGCATATCGACGCTGCGACGGTGGCCACTCTATTTGCTGATAACAGGAGGGTGCTACTAGACGGTAATCCTTTGTCTTGCGAATGTCGCGACGTGGCACTACAACGTGAACTTGC
AAATCGTGCGGAGGCTGGCGAAAATGGTGATGCTCGGCGACAGTGCAGGGACGTTGCTGAGGCTGCTTGCGCGATGGCGCTGTGGGCACTGCCGGTGGCAATACTAGTGCTCTCGGCGGCAGTACTTGCTACTTGCCTCGTGCGGCCGGCTGCACGCCGCCGTCTAAAGTTGTTCCTATTTGAGCGTGGGATGTGTGTGCGTTGGGTGCTAGGCGCAGCACCAGATGCTGTAGCTGAAGCTGCGCGTGAATATGATGCCTTCGTGTCGTTTTCACATCACGATAGCGAGTACGCAGCGGCAATAGCGGCGCGGCTGGAACGTGGACCACGCGCACGTCGCTTGTGTCTTCATGAACGCGACTGGACTCCGGGAGAATGGATTCCAGAACAAATAGCAACATCTGTGCGGCGCAGCCGCCGTACAGTAGCGCTGGTGTCAGAGAGCTTCCTCGTAAGCCAATGGGCCCGCGCAGAATTAAGGGAGGCATACACCGCTGCATTACGCGAGGGTCGCGCCCGTTTGCTAGCTGTGTTATTGCCAGGGATGGAACCTCCACGTGCAGCAACTGCACCAGAATTGCGAGCCTACCTTGCTGCCGTTACATACCTGCGCTGGGATGATCCACATTTCTGGGACAAACTTCTCCTAGCTGTGCCACCCCCACCAAATCCTTCAACTGCGGCACCCTCTCCGCCCACTCTTTGCTCTCCACCACCTCCGCCTTGA

Protein sequence:

>DPOGS211472-PA
MEVLRSRSLVSVLVSVLVFVSAETPRCPPTAKSDLCVDKGSALDEYFFVIKGKAWTIKYDLKELSFTCRKDLQLDSEELPRFTTLTIIPVVSLTACAPPSIGFAAALKTLNVTVRGGLVLEEMPATPTLNATHLHGLELRAFDLNAPFAPAPFLEPEDNFLEPLKGLLDLRLVRVTLTLKNTQSFPVGLRRLILHHANLTTLPAEVFARLPDLVDLFVVDETLQELDVSTALALRAVVVAAPLNGITLGSRVDTATLRKIRSAEIRGNCSGLRSLSIEELNVPPPTAWLARCGVRELRLRYLSPGVLGAGELRGTRDLLRLEIHGCGLRILPPDWLADSSRLKILDLSENELETIPSAIISACPRLSEVILPNNLLNSSVVKALVAAPLTKLVLDGNPLGDLCRTGSGLIADAMSTLWNRRELIYLSLRSTGVTRICHDWCRLPKLTQINLRNNSISKLEFEDLQWSSAVVVDLRGNPVKDVSYSQEQYEKVLLAPVTRDSISAMIRLDSHLRCDCNEYWFSLALRARPQHGQASVHAICENEKTFTSVPPTDLLCEVPELCPEGCKCKVDGERSHISCPNAGLIALPTIPPLPPLASLSLPGNDITILDFTNISTSFKLLDLTNNQIKHIDAATVATLFADNRRVLLDGNPLSCECRDVALQRELANRAEAGENGDARRQCRDVAEAACAMALWALPVAILVLSAAVLATCLVRPAARRRLKLFLFERGMCVRWVLGAAPDAVAEAAREYDAFVSFSHHDSEYAAAIAARLERGPRARRLCLHERDWTPGEWIPEQIATSVRRSRRTVALVSESFLVSQWARAELREAYTAALREGRARLLAVLLPGMEPPRAATAPELRAYLAAVTYLRWDDPHFWDKLLLAVPPPPNPSTAAPSPPTLCSPPPPP-