Monarch geneset OGS2.0

DPOGS203198
TranscriptDPOGS203198-TA3231 bp
ProteinDPOGS203198-PA1076 aa
Genomic positionDPSCF300035 + 322153-325563
RNAseq coverage66x (Rank: top 67%)
Annotation
HeliconiusHMEL0038430.084.38% 
BombyxBGIBMGA011084-TA0.087.55% 
DrosophilaToll-6-PC0.054.19% 
EBI UniRef50UniRef50_B4GUC80.053.35%GL24793 n=9 Tax=Neoptera RepID=B4GUC8_DROPE
NCBI RefSeqXP_971999.10.068.63%PREDICTED: similar to toll [Tribolium castaneum]
NCBI nr blastpgi|910764640.068.63%PREDICTED: similar to toll [Tribolium castaneum]
NCBI nr blastxgi|910764640.067.48%PREDICTED: similar to toll [Tribolium castaneum]
Group
Gene OntologyGO:00312246.3e-39intrinsic to membrane
GO:00071656.3e-39signal transduction
GO:00048886.3e-39transmembrane receptor activity
GO:00450876.3e-39innate immune response
KEGG pathwayxtr:1000366823e-37 
 K06838 (SLIT1)maps-> Axon guidance
InterPro domain[857-1011] IPR0001576.3e-39Toll-Interleukin receptor
[864-878] IPR0040752e-06Interleukin-1 receptor, type I/Toll precursor
Orthology groupMCL10048 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203198-TA
ATGAGTTTAGAAATCGCATCTGAAAGTTTTACAGCTGTAAGACAGTTAGAAAAGTTGGACTTGAGCTATAATAATATCTGGTCGTTTCCGGAAAACTTATTCTGCCCTTTGACTAACTTAGTTTACTTGAACGTGTCTTCAAATAGACTACAAGATGTAAGTGATTTGGGTTTTAGAGAACGGGCTATGCATCAAGCTCTTATAAGTGAACATGAAGGCCCTTCACCATCAACGTCCACTTCACCGCATACATCATGTTCTTTGGATATCGAAGTATTAGATGCATCGGTAAACCAATTTGTTCTTATGCCAGAAAATGGATTTATGGCTCTACGAAGACTTAAAGAACTACATATTCACGACAATGAGATTTCTATGGTCGCTGACAAAGCTTTATCCGGTCTGAAACAACTGCAAATAATTGATCTTTCCAATAACAAAATAGTGGCCTTACCTCAGGACTTATTTCGGGATTGTAGACCAGTGATTAAAGAAATATACTTACAAAATAATTCTATAAGTGTTCTTTCACCAAGTCTGTTTGCAAATTTAGATCAACTGCTGGCATTAGACTTATCAAACAACCATTTAACAAGCACTTGGATAAATGAAAACACGTTTACTGGTCTCATAAGGATGATTGATGCGTACGCTTTAAATGGGCTGTACGTTTTATCTTTACTATCCATTGATAATAACCACCTAGAAGAACTTCACCCTGAGGCTTTTCGGAATACTTCTTCTCTACAAGATTTGAATTTAAATGGAAATCGATTGAAAAAAGTGCCGACAGCTCTTAAAAATATGCGTTTATTAAGAACCCTTGATCTAGGAGAGAATCAGATTATGTCGTTAGAAGAACCTGGCTTTGTAGGTTTACATAATGTATATGGATTACGTCTTATAGGGAATAAAATTGAAAATATAAGCAAGGAAGTTTTCACAGATCTACCTTCCCTACAGATTTTAAATTTGGCGCGCAATAAAATAAAACAGATTGATATGGATGCCTTTGAAACATTATCAAATTTACAAGCTGTGAGATTAGATGCTAACCAACTCAAAGATATCCAGGGCCTTTTCGTAAATATACCCTCTCTTCTGTGGTTAAACGTGTCAGGTAACCAAATAGAATGGTTTGACTACGCTGTGATACCAGTTGGACTTCAATGGTTAGATGTTCACAGTAATAACATCAAAGAATTAAGAAATAACTACCGTTTGGATAAAGAACTACGGTTACAAACACTAGACGCCAGTTTCAATTTAATGACGAAAATCTTTACTTATTCTATCCCCAGTAGTATTGAGCTCTTGTTCTTAAATGACAATCAAATTACACAAGTTGAAGCCCAAACTTTTGTCGGAAAAACCAATTTAACGAGGGTCGATTTGTATGCAAATCAGATAACTAGCATGGACCTTAATGCTCTTCGTTTGACGCCAGTTGATCCTGGAAGACCATTGCCAGAATTTTATATCGGAGGAAATCCTTTTCAATGTGATTGTACGATGGAATGGTTGCAAAGAATAAATAAATTAGATCATTTAAGACAACATCCTCGAGTTATGGACTTGGAAAGTATATATTGTAAGCTACTATATAACCGTGAAAGAACTTATATTCCGCTTATTGAAGCGGAATCATCTCAGTTTTTGTGTACATATAAAACTCACTGTTTTACTTTGTGTCATTGTTGTGATTTTGACGCTTGTGATTGTGAAATGACGTGTCCGTCGAACTGTACTTGTTATCACGACCAACCGTGGTCAGCGAATATTGTGGACTGTTCCGGTGCTGGCTATGCTGAAATACCTAACACTATACCTATGGATGCAACTGAGCTATATTTAGATGGAAATAATTTTGGAGGGTTAACCAGTCACGCTTTTATCGGACGTAAAAATTTAAAAATATTATATGCAAATAACTCGAACATAGATGCACTGTACAATAATACTTTCAGTGGACTAAAACGATTAACCGTATTACATTTGGAAAAGAACAACATAAAGGAGTTGTTAGGATTTGAATTGTCGCCTCTTGAGAATTTACGAGAGTTACATCTTCAAGACAATAAAATACATTATATCGACAATCGAACATTTATGGAGTTAAGGCACTTAGAAGTACTACGTTTAGAAGGTAACAACATTTACAGTTTTGCTGTCTGGCAATTCACGATGAATCCATATTTAGTTGAGATAAGCCTATCACGAAACCCGTGGTCTTGCGACTGTCAGTACATGCATAAATTCCGAAATTGGTTTAAAAATAATCTTGGCAAAGTTGAAGCTTCTGATAAGATCACATGTATATTTGACAATGTAACAAACGCCGTTGGACCGTTAATGTCTGATTTTAACTCAACTATTTGTACAAGTCACGTAGGTGGAAGTTCATCAATCATCGAAAACCAAGTTATCAATGACTATCTGCCATTACTATTAATATCACTTTTTGTATTTGTAATGAGTTCTGCGCTTATTTGTGGAATATTTTATTGGAGAAGAGAACTCAGAGTGTGGGTTTATTATCATTGCGGATTCCGAATGTGCTATAAAAGTACCGCTTTCGACGACGAGGCCGACAAAGATAGGTTATTCGACGCCTACATTAGTTACAGCGTGAAAGACGAAGCATTTGTTGCTCAAATGTTAGCGCCCGGCCTAGAATCCACGGACCCAAGTTTCCGCCTTTGTCTTCATTACCGCGATTTTAATGCATCAGCCTACGTAGCGGACACCATTATTGAAGCAGTTGAATCTTCAAAGCGAACTATAATAGTGCTGTCTAAAAATTTTATTAACAACGAATGGTGTCGATTCGAATTTAAAACGGCACTTCATGAAGTTCTTAAAGAGAGACGAAGAAGACTGATAATAATATTATTGGGTGACTTGCCGAATAGAGACATGGATCCTGAATTAAGGTTGTGTTTAAAAGCGAATACGTGTATTGAGTGGGGTGATAGACAATTTTGGCAAAAACTAAGGTTCGCCATGCCTGATCTGCGGAAGTGTCAATATCATCGTTCAACTGTGAACATTTACGCGTCAGTGTCACCTGTAGGGGCCGGGCGTGCGCCAGCGCCGACCCCTCCTCCGCCGCCTGGCAAGCTGCCACCTCTGCTGGCTGATGGGCTAGCCGACAGACTTGGAATGCCGACCAGCGTACACCGCGATCATCATTCCCACCGAATGCCTCCACATGCTCAGCTGTGGGCGTAG

Protein sequence:

>DPOGS203198-PA
MSLEIASESFTAVRQLEKLDLSYNNIWSFPENLFCPLTNLVYLNVSSNRLQDVSDLGFRERAMHQALISEHEGPSPSTSTSPHTSCSLDIEVLDASVNQFVLMPENGFMALRRLKELHIHDNEISMVADKALSGLKQLQIIDLSNNKIVALPQDLFRDCRPVIKEIYLQNNSISVLSPSLFANLDQLLALDLSNNHLTSTWINENTFTGLIRMIDAYALNGLYVLSLLSIDNNHLEELHPEAFRNTSSLQDLNLNGNRLKKVPTALKNMRLLRTLDLGENQIMSLEEPGFVGLHNVYGLRLIGNKIENISKEVFTDLPSLQILNLARNKIKQIDMDAFETLSNLQAVRLDANQLKDIQGLFVNIPSLLWLNVSGNQIEWFDYAVIPVGLQWLDVHSNNIKELRNNYRLDKELRLQTLDASFNLMTKIFTYSIPSSIELLFLNDNQITQVEAQTFVGKTNLTRVDLYANQITSMDLNALRLTPVDPGRPLPEFYIGGNPFQCDCTMEWLQRINKLDHLRQHPRVMDLESIYCKLLYNRERTYIPLIEAESSQFLCTYKTHCFTLCHCCDFDACDCEMTCPSNCTCYHDQPWSANIVDCSGAGYAEIPNTIPMDATELYLDGNNFGGLTSHAFIGRKNLKILYANNSNIDALYNNTFSGLKRLTVLHLEKNNIKELLGFELSPLENLRELHLQDNKIHYIDNRTFMELRHLEVLRLEGNNIYSFAVWQFTMNPYLVEISLSRNPWSCDCQYMHKFRNWFKNNLGKVEASDKITCIFDNVTNAVGPLMSDFNSTICTSHVGGSSSIIENQVINDYLPLLLISLFVFVMSSALICGIFYWRRELRVWVYYHCGFRMCYKSTAFDDEADKDRLFDAYISYSVKDEAFVAQMLAPGLESTDPSFRLCLHYRDFNASAYVADTIIEAVESSKRTIIVLSKNFINNEWCRFEFKTALHEVLKERRRRLIIILLGDLPNRDMDPELRLCLKANTCIEWGDRQFWQKLRFAMPDLRKCQYHRSTVNIYASVSPVGAGRAPAPTPPPPPGKLPPLLADGLADRLGMPTSVHRDHHSHRMPPHAQLWA-