Monarch geneset OGS2.0

DPOGS205293
TranscriptDPOGS205293-TA870 bp
ProteinDPOGS205293-PA289 aa
Genomic positionDPSCF300021 + 489284-490153
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0174795e-16493.43% 
BombyxBGIBMGA011034-TA1e-15188.58% 
DrosophilaTollo-PA2e-3941.62% 
EBI UniRef50UniRef50_E7ELW16e-16092.04%Toll receptor 18 wheeler n=2 Tax=Obtectomera RepID=E7ELW1_SPOFR
NCBI RefSeqNP_001116821.15e-6152.10%18 wheeler [Bombyx mori]
NCBI nr blastpgi|3181049312e-15992.04%toll receptor 18 wheeler [Spodoptera frugiperda]
NCBI nr blastxgi|3181049311e-15392.04%toll receptor 18 wheeler [Spodoptera frugiperda]
Group
Gene OntologyGO:00312243.8e-29intrinsic to membrane
GO:00071653.8e-29signal transduction
GO:00048883.8e-29transmembrane receptor activity
GO:00450873.8e-29innate immune response
KEGG pathwaymdo:1000144678e-20 
 K10160 (TLR4)maps-> Amoebiasis
    Leishmaniasis
    Pathogenic Escherichia coli infection
    Malaria
    Toll-like receptor signaling pathway
    Chagas disease
    Phagosome
InterPro domain[59-200] IPR0001573.8e-29Toll-Interleukin receptor
Orthology groupMCL30807 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205293-TA
ATGATAAGCGCCTTCTTCGTTTCCCACAACATACCATTACTAGCGTCGGCTCTCACTGGCTTCATGCTGATTTTATTAATATTAGCTTTAGTATTCACATTTAGATACGCTTGCAGAATGTGGTTGTATTCTAATTGCGGTATTAAGCTTTCGCCCTTAGCGGGCGCCTTTAACGATGCTGATAAACTTTACGACGCTTACATTTGTTATAGTCCCAAGGATGAGGAATTTGTCGTGGAGTCTTTAGCGCGGGAGCTGGAAAACGGTTATCCTTCATACCATTTATGTCTACATTATAGAGACGTGCCGCAATTTGAAGCAACATATGCTCAGTTTCCCGATTTGGTAGTCGAAGCGACAGAGGCGTCGAGACGTATTATAGTAGTGTTATCTAAAAACTTTATATTAACTGAATGGTCACAAATAGAGTTTAGGCAAGCGTTACAGAGGGCTCTTCGTAAAAATCCTCATAAGTTAATTGTAGTCGTCGTAGGGCTACTAGCTCGAGATCCGGAATTAAAATCATATTTCAAAAGTGCTTTGGAGATAACATGGAAAGAGAAAAGATTTTGGGAGCGATTACGATATGCGATGCCGTCATGTAAGCGACGCGGGCACAAATTGAAGAGGCTTAATTATGGAAGAAATTCCAACACGTACACAATGGACGCGTCGGTGCTAAATAGTACCTGTCAAACGCTCTGCGGCAAATCCGCAAATTCAGTAGAACGTTCTCCTTGTGACAGGCCGTTGTCCGAGCACATATATTCCACTATAGATTCGGATTATTCGTCAAACGATTTCCAAGGCCGCCACAATCACCAACAATCAGTGGTATTGCATCACACAGTTCAAACGTACCTGGTCTAA

Protein sequence:

>DPOGS205293-PA
MISAFFVSHNIPLLASALTGFMLILLILALVFTFRYACRMWLYSNCGIKLSPLAGAFNDADKLYDAYICYSPKDEEFVVESLARELENGYPSYHLCLHYRDVPQFEATYAQFPDLVVEATEASRRIIVVLSKNFILTEWSQIEFRQALQRALRKNPHKLIVVVVGLLARDPELKSYFKSALEITWKEKRFWERLRYAMPSCKRRGHKLKRLNYGRNSNTYTMDASVLNSTCQTLCGKSANSVERSPCDRPLSEHIYSTIDSDYSSNDFQGRHNHQQSVVLHHTVQTYLV-