Monarch geneset OGS2.0

DPOGS203200
Transcript: DPOGS203200-TA; 3822 bp
Protein: DPOGS203200-PA; 1273 aa
Genomic position: DPSCF300035 + 527511-531332
RNAseq coverage: 67x (Rank: top 67%)
Annotation
Heliconius: HMEL011010; 0.0; 81.47%
Bombyx: BGIBMGA011085-TA; 0.0; 81.31%
Drosophila: Tollo-PA; 0.0; 55.26%
EBI UniRef50: UniRef50_E0VDA3; 0.0; 65.53%; Toll, putative n=3 Tax=Neoptera RepID=E0VDA3_PEDHC
NCBI RefSeq: XP_002424097.1; 0.0; 65.53%; toll, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242006518; 0.0; 65.53%; toll, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|189234217; 0.0; 65.25%; PREDICTED: similar to vasorin [Tribolium castaneum]
Group
Gene Ontology: GO:0031224; 2.2e-35; intrinsic to membrane
Gene Ontology: GO:0007165; 2.2e-35; signal transduction
Gene Ontology: GO:0004888; 2.2e-35; transmembrane receptor activity
Gene Ontology: GO:0045087; 2.2e-35; innate immune response
KEGG pathway: oaa:100082591; 2e-42
KEGG pathway: K06839 (SLIT2) maps to Axon guidance
InterPro domain: [1066-1207] IPR000157; 2.2e-35; Toll-Interleukin receptor
Orthology group: MCL10048; Multiple-copy universal gene
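
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the genomic coordinates and the InterPro domain range are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function and variable names are illustrative only.

```python
def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

# Values copied from the record above.
genomic_span = inclusive_length(527511, 531332)
transcript_bp = 3822
protein_aa = 1273
tir_start, tir_end = 1066, 1207  # InterPro IPR000157 range on the protein

codons = transcript_bp // 3  # includes the terminal stop codon

print(genomic_span)                                      # 3822
print(genomic_span == transcript_bp)                     # True
print(transcript_bp % 3 == 0, codons - 1 == protein_aa)  # True True
print(inclusive_length(tir_start, tir_end), tir_end <= protein_aa)  # 142 True
```

Under these assumptions the genomic span equals the transcript length, which is consistent with a gene model without introns, and the 142-residue Toll-Interleukin receptor domain lies near the C-terminal end of the 1273 aa protein.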

Nucleotide sequence:

>DPOGS203200-TA
ATGTGTCCGATGTGGATTACAAGCGCTATATTAGCGCTTAGCGCTGTTTTACCGGTATGGAGTGCTTCCCTGTCAGCAACCACAGGAACTCGATACCAAGCTCCAGACGAATGTCGATGGACAACTGACGAGGAAGACGCTGGAGTTGCGTTACAGTGTAGACTAAGAACTATAAATAGTGAATTAGAAAATACTAACTTTAGTGCTATTCAACCACATCTAACTGTTCGTTTACGACTGGAGTGTAGTGATGCGCTCTTTTTCCAAAGCTCCCTCGCACCGGCTAGTTTTCGTCAGTTAGTGGAATTGAGAGAACTTACGATTGAATATTGTAAAATAGGCAATCTATCTGATGGCGCCTTCACTGGCCTTCGTGAATTAAGAAATTTGACTATAAGAACACATAATACAGACTGGTCTTCAATGTCGCTAGAAATAACGCCAACAGCTTTTTCCAGAGATGTCCAAAATTTGGAGCGCCTTGATTTAAGCGAAAATAATATGTTGTCATTTCCTGAGGGATCTCTATGTTCTTTAAGAAATCTCGAATATTTAAATATGACTGGTAATAGAATGAGAGATGTCAGCCATTTTCAATTTTCAACTGCACATCGTCATCCGAATGAAAAATGTGGGGAAAATATTTTAGTGTTAGATTTATCTAAAAATGTAATTGATACATTACCACCAGGACTTCTTTCGGGATTAAGACGACTACAAAAATTCTATCTTCAAGGAAATGGGCTTAACTCTGTAGCAGACAGAGCTTTGGAAGGGTTAATATCATTGACAAAAATAAGATTTTCGGATAACCAACTTACTAGCTTACCTCCGGAGCTATTTAGCGATACTAAGGAATTAAAAGAAATTTATCTCAACAATAACACAATTACAGTACTTGCTCCAGGATTATTTAGTGATCTTTTACAACTTTTAATCTTAGATCTGTCCTATAACGAATTGACATCTGATTGGATAAATACTTCTACATTTTCGGGTCTGAAACGACTTGTTTATTTGGATATATCACATAACAGAGTGTCAAAAATGGAAATAGCATTGTTTAGAGATCTGCATAACCTTCAAATTCTAAAATTGCAAGATAATTTTATTGAACACATACCTGAAAACGTGTTTATTTCCCTACAGAATCTACATACTTTGATATTATCCAATAACAGGCTGACGAATATTGAAAGCTATGCTTTTATCGGTTTACCTGTTTTATCTGTATTGTCAATTGACAGTAATCGCATTTCAAAGATACATCCACATGCTCTACGTAATTGTACCTCCTTACAAGATTTACATATAAATGTTAATAGACTGGATGAAGTTCCTATAGCATTAAAAGAAATACCTCAATTGAAAACTTTAGATTTAGGAGAGAATTTAATCGTCAGTATAGAAAATGCATCTTTTATGACAATGCAACAAATGTACGGCTTAAGATTAACTGAAAACAACATCGGCAACATAAGCAAGGGTGTTTTTGACAAAATGACTTCACTTAAAATTTTAAACCTTTCTAGAAATAAAATTCATAAGATTGAATCAGGAGCTTTTGACAATAATATAAATTTACAGGCTATAAGATTAGATGGGAACTACTTGACCGATATAGGCGGTCTTTTTGCTAAATTACCCAATCTCGTATGGCTGAACATTTCCGATAATCGTCTTGAATGGTTTGACTATGCTATGATTCCAACAGGATTACAGTGGCTGGACATTCATGCCAACCGAATTGCTGAACTTGGAAATTACTTTGAAATTGAATCTCAACTTTCTCTTAGCACGTTTGATGCCAGTTCTAACAGACTTGCAGAAATAACAGGAAGTGCGATACCTAATTCAGTAGAAATGTTGTACTTGAACGATAATTTAATTTCAAAAGTACAGTCTTATACATTTTTCAAGAAGCCTAATCTGACAAGAGTAGATTTATATGGAAATAAAATAACAAGTTTAGACCCTAACTCTCTCAGAATATCAGCTGT
GCCACAAGATAAGACTGTACCCGAGTTTTTCATTGGCGGGAACCCATTAGAATGTGATTGTACAATGGAATGGTTACAAAAAATTAATACTGGAAACAGAGCAAGGACACAGCCTAAGTTAATGGACTTGGATAGCATATATTGCAAATTACTTTATAATCGTGGAAACTCATATGTGCCTTTAGTTGAAGCAGCATCACACCAATTTCTTTGTAAATATGACTTTCATTGCTTCGCACTATGCCATTGTTGTGATTTTGATGCGTGTGATTGTGAAATGACTTGCCCAAATAACTGCACTTGTTACCATGATCAGTCATGGTCCGCTAATGTTGTAGAATGTTCTAACTCGGGATACGTTAACTCACTGCCAGAAAGAATACCCATGGAAGCCACTCAGTTATACCTTGACGGTAATGATATTAAGATGCTACCAAGCCATGCATTTATTGGTAGAAAACGCCTTAAAGTATTATATTTGAATTCATCACACATTGAAACTATTCACAATAGAACGTTCAATGGATTAAAAGAACTGGAAGTTTTACATCTTGACTTCAACTTATTAACAATAGTTGAAGGACAAGAATTTGATGGATTGGATAATTTAAAAGAACTTTATCTTAATAATAATAAAATAAAAACAATCGGTAAAGACATGTTTAATCACATGGCAAAATTAAAAATATTATACCTTTCACACAACAGGCTGGTATCACTAACTGTCTGGCAAATAAATTCCGCTATAACCTCTATTACGCTTTCGTTTAACCCTTGGTCGTGCGATTGCGAATACACAGAAATCTTCCGTGAATGGACAAAACGAGTATCTTCAAGTATTATGGATCTATCGAACATTAGGTGCATTTATACGAAAACGAACAGTACAGATATCGCAGTACATAACGAAAGCGTATATGATGATCCAAACTCAGGATTTAAAATAATAGAAGAAAATGGTACTATATGTACCGGATTACCAAGTATTGATAATAGTATCAACGGCAACTTAACAGCAACCAAAACAATTATAACCAATGAAGATGTTCCTGATTACATTCCATTTCTTCTGGCGACTGCAGGGGCATCTCTATTTCTCATTATAACCGTTATCGTTATTTTTAAATACAGGCAAGAATTGCGAGTATGGGTTCATTCGAAATTTGGTGTAAGATTATTTTATACCAACGTGGACCGTGAAGAAAACCTATTTGATGCATTCGTAAGCTATAGTTCCAAAGATGAAGCATGGGTGACTGATAAACTTGCCCTGGTATTAGAGACAGGCAATCCTCAATACAAATTATATCTGCATTATCGTGATTTACCAGGAGGCGGTTACATAACACCACAAAGTATTACGCAAGCGGTGGAGTCCTCACGTCGTACTATTATGGTGCTCAGTGAAAATTTTATGAATTCGGAATGGAACCATGTCGAATTTAAATCAGCATATCTTCAACTTTTAAGAGACCGCCGGAAAAGACTTATCGTGATCCGAAAGGATAATATCCCGTTAAAGCAACTAGATACTGAAATCAGATTATATCTCAAAACTAATACTTATTTAAATTGGGGTGAAAATTTGTTCTGGGAAAAACTAAAATTTGCTTTGCCAGATGTTTCTGATAAACAAAGGTGCCGAAGTATGCCGAGTCCGGGCCCAGGTGCCGTACCGGTGCATAGACCTCATTTACCAAGAAACCATCTAGGGGCATTGCCTCCGCCACCTCATGTTCCCCATCAAATGTTACCCCCTCATCCGTCACACACACAATTTCCACCAAGAGCGTCTCCGCGGAATCTTTCTGCCCATGTGTAG

Protein sequence:

>DPOGS203200-PA
MCPMWITSAILALSAVLPVWSASLSATTGTRYQAPDECRWTTDEEDAGVALQCRLRTINSELENTNFSAIQPHLTVRLRLECSDALFFQSSLAPASFRQLVELRELTIEYCKIGNLSDGAFTGLRELRNLTIRTHNTDWSSMSLEITPTAFSRDVQNLERLDLSENNMLSFPEGSLCSLRNLEYLNMTGNRMRDVSHFQFSTAHRHPNEKCGENILVLDLSKNVIDTLPPGLLSGLRRLQKFYLQGNGLNSVADRALEGLISLTKIRFSDNQLTSLPPELFSDTKELKEIYLNNNTITVLAPGLFSDLLQLLILDLSYNELTSDWINTSTFSGLKRLVYLDISHNRVSKMEIALFRDLHNLQILKLQDNFIEHIPENVFISLQNLHTLILSNNRLTNIESYAFIGLPVLSVLSIDSNRISKIHPHALRNCTSLQDLHINVNRLDEVPIALKEIPQLKTLDLGENLIVSIENASFMTMQQMYGLRLTENNIGNISKGVFDKMTSLKILNLSRNKIHKIESGAFDNNINLQAIRLDGNYLTDIGGLFAKLPNLVWLNISDNRLEWFDYAMIPTGLQWLDIHANRIAELGNYFEIESQLSLSTFDASSNRLAEITGSAIPNSVEMLYLNDNLISKVQSYTFFKKPNLTRVDLYGNKITSLDPNSLRISAVPQDKTVPEFFIGGNPLECDCTMEWLQKINTGNRARTQPKLMDLDSIYCKLLYNRGNSYVPLVEAASHQFLCKYDFHCFALCHCCDFDACDCEMTCPNNCTCYHDQSWSANVVECSNSGYVNSLPERIPMEATQLYLDGNDIKMLPSHAFIGRKRLKVLYLNSSHIETIHNRTFNGLKELEVLHLDFNLLTIVEGQEFDGLDNLKELYLNNNKIKTIGKDMFNHMAKLKILYLSHNRLVSLTVWQINSAITSITLSFNPWSCDCEYTEIFREWTKRVSSSIMDLSNIRCIYTKTNSTDIAVHNESVYDDPNSGFKIIEENGTICTGLPSIDNSINGNLTATKTIITNEDVPDYIPFLLATAGASLFLIITVIVIFKYRQELRVWVHSKFGVRLFYTNVDREENLFDAFVSYSSKDEAWVTDKLALVLETGNPQYKLYLHYRDLPGGGYITPQSITQAVESSRRTIMVLSENFMNSEWNHVEFKSAYLQLLRDRRKRLIVIRKDNIPLKQLDTEIRLYLKTNTYLNWGENLFWEKLKFALPDVSDKQRCRSMPSPGPGAVPVHRPHLPRNHLGALPPPPHVPHQMLPPHPSHTQFPPRASPRNLSAHV-
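
The protein sequence above should correspond to a translation of the nucleotide sequence. The following is a minimal sketch of that check, assuming the standard genetic code and that the trailing `-` in the protein record marks the stop codon. The `translate` helper is hypothetical and is not the tool used to build the geneset; unknown or ambiguous codons are rendered as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0; stops become stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)

# First 21 and last 12 nucleotides of DPOGS203200-TA.
print(translate("ATGTGTCCGATGTGGATTACA"))  # MCPMWIT
print(translate("GCCCATGTGTAG"))           # AHV-
```

Both fragments agree with the start (`MCPMWIT`) and end (`AHV-`) of DPOGS203200-PA. Passing the full nucleotide record to `translate` and comparing the result with the full protein record extends the check to the whole sequence.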