Monarch geneset OGS2.0

DPOGS205279
TranscriptDPOGS205279-TA3609 bp
ProteinDPOGS205279-PA1202 aa
Genomic positionDPSCF300021 - 388854-393518
RNAseq coverage51x (Rank: top 70%)
Annotation
HeliconiusHMEL0174870.084.78% 
BombyxBGIBMGA011082-TA0.086.97% 
DrosophilaToll-6-PC0.040.45% 
EBI UniRef50UniRef50_D6WCH90.051.67%Toll-7-like protein n=10 Tax=Coelomata RepID=D6WCH9_TRICA
NCBI RefSeqXP_972312.10.051.67%PREDICTED: similar to toll [Tribolium castaneum]
NCBI nr blastpgi|910764740.051.67%PREDICTED: similar to toll [Tribolium castaneum]
NCBI nr blastxgi|2420057590.050.98%toll, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00312242.8e-17intrinsic to membrane
GO:00071652.8e-17signal transduction
GO:00048882.8e-17transmembrane receptor activity
GO:00450872.8e-17innate immune response
KEGG pathwayxtr:7801954e-48 
 K06850 (SLIT3)maps-> Axon guidance
InterPro domain[913-1056] IPR0001572.8e-17Toll-Interleukin receptor
Orthology groupMCL10048 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205279-TA
ATGTCAATGGAATTTCATCGCGATTCATTTAGAGGACTGACAGATTTAAGGACTTTGGATCTTGGTGACAATAATATATGGATTTTGCCCTCTGAAATATTCTGTCCTCTTTATAACTTAAAAGAGTTAAATATAACTTTAAATAGGTTGCAGGATATCTCGAATTTGGGTTTCTCAGATTGGGGAAATGGTCCAACCGCACCAGGTAAATCCTGTAATACTGTCCTAGAAACACTAGACATGTCTTTCAATGAAATCAGTGCTCTTCCAGACAATGGTCTGTCGAGCTTGCGAGCACTTCAGAGGTTGCTCTTACAAAATAATAGAATTTCCACTGTCGCCGATCGTGCTTTTGTGGGATTAAGTGATTTGCAAATGCTTAATTTGTCAACAAACGCTTTGACTGCATTACCTCCAGAAATGTTCCAATCTTCACGAGACATTAAACAGATATATTTAAATAACAATTCATTAAGTGTTCTGGCGCCCGGTTTATTAGAAGGATTAGATCAACTGCAAATATTGGATTTATCTTATAACGAACTGACAAGTGAATGGGTCAATAGAGATACATTTTCAGGATTAGTGCGTCTCATTGTTTTGAACTTATCGCACAATAGAATAACAAAGATTGATGCTCTATTGTTCCAAGATTTAAACAATCTACAGTTTTTGAGTTTGGAATATAATAACGTAGCTCGAATAGCTGATGGCGCCTTTTCTTATTTGAAAAATCTACATTCTCTTTCTTTGGCGCATAATAATATAGTAGAGGTAGATAGTAACCATTTTTCAAATTTATACGTATTAAATCAGCTATTCCTCGATGGAAATAGAATAACGAAAGTTGATTTGCGTTCTTTTGAAAATATAACCAAATTACATGATTTGGGTTTAAGCGGGAATCAATTATCTGAGGTACCTGAGGCTATCAAAACTCTAAGATTTTTAACAGCTCTAGACTTGGGAATGAATAGAATAACAAAAGTTACTACAAATTTATTTGAAGGCTTGGATGATTTATTTGGTTTACGACTAGTGGGAAATAAAATCGAAAAAATCTCAAAAGACACATTTGCTGCCTTACCATCATTGCAAATATTAAATTTGGCTTCCAATAATATTGATCAAATAGATGATGGAGCTTTTGCCTCTAACTTACAGTTAAGAGCTATAGTATTAGATGGAAATAAGTTAGTAGATTTAAAAGGAATTTTTACTAAAACACAACCTCTTGTCTGGTTAAATGTTTCAAACAATGAATTACTGTGGTTCGACTATAGCCACATACCAACAAATCTCGAATGGTTGGACATGCACGAAAATAAAATAGAAAAACTTGAGGACACTTATGGTGTTAAAGAAACATGTAATGTGAAAATGCTTGATGTAAGCAACAATAAAATAAGAAACATTGATGAATTTTCATTTCCAAGTAGTATAGAGACAGTAGTATTAAATAACAATAATATTGAGAAAATAAATCCAGGAACATTTCTTCAGAAGTATAATTTAAACAAAGTTATGCTATATTCAAACAAGATAAAAACTTTAGATGTTGGTGCATTCGCAATTTCTTTTGTTCCTGAGGATAGAGATCTACCAGAATTCTATATCAGTGAGAATCCTTTCGAATGTGACTGTACAATGGAATGGCTACAAAGAATAAACCAATTAAGTGAACTTAGACAGCGTCCCAGAGTAATGGATTTAGAAAATGTAAGATGCTCCCTAACACATTCAAGGAGTAGTTCAGATGTACTACTGTTGGAAGTCAAATCGTTTGAATTTTTATGTGAATATGAATCACATTGTTTTACTTTATGTCACTGTTGCGATTTTGATGCATGTGACTGTAAAATGACTTGTCCAGATAAATGCTCCTGCTATCATGACCTCACGTGGAGCTCAAACGTAGTAGATTGCTCAAGTGGTGGTTATGACCATGTGCCAGACCAAATACCAATGGACGCTACTGAAATTTATTTAGATGGAAATGATCTTAAAGAACTTGGAAATCATGTTTTCATTGGCAAAAAACGCCTGCAAGTTTTATATCTTAATAACAGCAATATAAATTTAATACAAAATAAAACTTTTAATGGTATTGAATCATTACGAGTACTTCATTTGGAGAATAATAAATTGGAGGTGTTAAGAAATACACAGTTTACAAAATTACAAAACCTGAATGAACTTTATTTGCAAGACAATAACATAAGGTTTATTGAAAATGACACTTTTAACTATTTGCCTTCTTTGGAATATTTGAGTCTAGATAATAACGGTTACGTTGAGTACATGCCATGGCGAGTTATTACCGACAATAATCCACGTACGAGAGTATCTGTGGAGGGGAACAATTGGATATGTGACTGTAAAGATGTAGCACAACTAAATCAATGGTTGATTAAAAAATCCAAAGATACGGATAACATGATGTGTTTCTTTGCTCATGGTCAACCGATGAACAAAACTATAGCAACTGTTTCTAAAGAATGTAAGGCTGAGACTACCACCGAGGAAACAGGAACGGAGACTCAGAAACGATTATTTATTGAATCAAATGATAGTGTTGAGAATTACATCCCGTATATCGTGGCTGGATTGATAACAGTAACTGTGCTTTTGCTTATGTGTGCTTTCCTTTTTATGTTTAGAGACGATTTTAAACTGTGGATGCACTCAAAATATGGTATAAGAGTATTTAGTTCCGCCTCGAAGGATGTGAATGATAACAAAAACAAACGCTTTGCCGCTTTCTTTATGTATAATCCACAAGATGAAGGCGTCATGCGTGTCGTGAGTTCGGAGTTAGAGCAGTTGGGCCATACTCTGTGCTTACAACATCGAGATTTACAGTTAATCGAAAGACGATCTAGTGATAATTTAGTTAGTGCAGCAGAAAGTTCAAAAAGACTTATAATAGTGTTATCTACAAGCTTTTTAGAACAAGAGTGGGACATGCCGGCATCAAAAGCGGCTGTTCAATGTGCTATAAACTCAGTGAATGTGCGACATAGGCGACAAAAAATTATCTTTCTCGTAACGACTGACTTAAGTGCCATAAGTATCGATCCAGACTTGAAAGTGTTGCTTAAAACGTGTACAGTGATACTGTGGGGTGAAAAGTATTGTTTAGAGAAATTAAATTTTTGGTTGCCTGACGTGGACGTTACGTTACCAAATCGGACGATACATAACGCTAATAATATAAAAACTGGTAATAGGGAGGGGTTTGGAAGGCACGTGAATTTGAGGTATACAGCTCCTCCAACTTCTCATGATTCTTGGTACAAGTATGGAATGGATCGTCAAATGTCTATTATGTCAAGTCCGATGCATTCAACTAGTGCCAGTGCTGAGGTTTCGGCGCGGAGTACCGAAGACGAGACATGTTCGGTTGCCAGCAGTGAAGGACGACCGGATGATCGGTTGCCTCACCACCATAGTTACGTATCAATCGACAACCATCCCTGTGAGGAGAGACCATTAAGACCCGCCCAACAAATAGCTATGTCTAATACAAGACCCACTAATTCGCCGCGACAGTGCGCGCGCGCCGCTAACCCGCTGCGAAACTATAAATCATCGTTGGACGCGGGATGA

Protein sequence:

>DPOGS205279-PA
MSMEFHRDSFRGLTDLRTLDLGDNNIWILPSEIFCPLYNLKELNITLNRLQDISNLGFSDWGNGPTAPGKSCNTVLETLDMSFNEISALPDNGLSSLRALQRLLLQNNRISTVADRAFVGLSDLQMLNLSTNALTALPPEMFQSSRDIKQIYLNNNSLSVLAPGLLEGLDQLQILDLSYNELTSEWVNRDTFSGLVRLIVLNLSHNRITKIDALLFQDLNNLQFLSLEYNNVARIADGAFSYLKNLHSLSLAHNNIVEVDSNHFSNLYVLNQLFLDGNRITKVDLRSFENITKLHDLGLSGNQLSEVPEAIKTLRFLTALDLGMNRITKVTTNLFEGLDDLFGLRLVGNKIEKISKDTFAALPSLQILNLASNNIDQIDDGAFASNLQLRAIVLDGNKLVDLKGIFTKTQPLVWLNVSNNELLWFDYSHIPTNLEWLDMHENKIEKLEDTYGVKETCNVKMLDVSNNKIRNIDEFSFPSSIETVVLNNNNIEKINPGTFLQKYNLNKVMLYSNKIKTLDVGAFAISFVPEDRDLPEFYISENPFECDCTMEWLQRINQLSELRQRPRVMDLENVRCSLTHSRSSSDVLLLEVKSFEFLCEYESHCFTLCHCCDFDACDCKMTCPDKCSCYHDLTWSSNVVDCSSGGYDHVPDQIPMDATEIYLDGNDLKELGNHVFIGKKRLQVLYLNNSNINLIQNKTFNGIESLRVLHLENNKLEVLRNTQFTKLQNLNELYLQDNNIRFIENDTFNYLPSLEYLSLDNNGYVEYMPWRVITDNNPRTRVSVEGNNWICDCKDVAQLNQWLIKKSKDTDNMMCFFAHGQPMNKTIATVSKECKAETTTEETGTETQKRLFIESNDSVENYIPYIVAGLITVTVLLLMCAFLFMFRDDFKLWMHSKYGIRVFSSASKDVNDNKNKRFAAFFMYNPQDEGVMRVVSSELEQLGHTLCLQHRDLQLIERRSSDNLVSAAESSKRLIIVLSTSFLEQEWDMPASKAAVQCAINSVNVRHRRQKIIFLVTTDLSAISIDPDLKVLLKTCTVILWGEKYCLEKLNFWLPDVDVTLPNRTIHNANNIKTGNREGFGRHVNLRYTAPPTSHDSWYKYGMDRQMSIMSSPMHSTSASAEVSARSTEDETCSVASSEGRPDDRLPHHHSYVSIDNHPCEERPLRPAQQIAMSNTRPTNSPRQCARAANPLRNYKSSLDAG-