Monarch geneset OGS2.0

DPOGS202899
TranscriptDPOGS202899-TA2157 bp
ProteinDPOGS202899-PA718 aa
Genomic positionDPSCF300126 - 110686-119079
RNAseq coverage76x (Rank: top 65%)
Annotation
HeliconiusHMEL0162947e-4634.82% 
BombyxBGIBMGA003268-TA1e-3029.10% 
DrosophilaCG7896-PA3e-3229.41% 
EBI UniRef50UniRef50_Q5TWN69e-3729.70%AGAP007061-PA n=5 Tax=Eukaryota RepID=Q5TWN6_ANOGA
NCBI RefSeqXP_565143.22e-3729.70%AGAP007061-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582863573e-3629.70%AGAP007061-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3287043201e-4332.68%PREDICTED: leucine-rich repeat-containing protein 15-like [Acyrthosiphon pisum]
Group
KEGG pathwaydme:Dmel_CG51955e-24 
 K05401 (TLR3)maps-> Toll-like receptor signaling pathway
Orthology groupMCL34466 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202899-TA
ATGTCGGTGATAGTGTTGCCTGATAAATCAAGTTCAGTCGCAACTGATTTCTTAGTATGCGTTTCGTGTCCAGCGTCAAGTATCATGGCGTTCGATGGAAACATACCTATGAGCTGTATGGACCTGTTGCTCTGGTGTTATGAACCCAACTACAAAGTCCACTGCGATTCACAATTCGATACATTTGCTGACTTAAGTAATGAGAACAGAGGTGTCACAACTTATTTGGCATATCTCTATGGATTTGAATGTATTCTATCTGGTAATTGCGAATCTGTTTTAAACAATGCAACCTGTTCGTTCGAAGTTACCTGCTTTGGTGATTTCAGTGAGATTGTTACGGAAATATTTGACAACGCAGGCCGTTGTCCTGCAACAGCCTTTAATGTGAATAATGGACATTATTATGTACCGATAACACTGATTTATAAATCCAGTATTTTTTTCCAAAGCCTTATTAATGACGGCTCATTTTTCATATACGTAGTAAGAACATTAGATCTCAGTGGTAATAATTTTGATTTTCAACCTACCATTTCTACAATGTATGGTTTAAATAGCTTAAATTTATCTCATAACGATATATCTAATGCACAACTTTTTAACCATGAATATGAACTTCCTAACTTAAGAGATATAGATCTTTCCCATAACGCCATATCTAACTTAGACGTAACCGATTCCAGTAGTGATTATTGGTTTGACAATCTCATTAAAATAAACCTATCATATAACAATATAATTAAAATACCTTATCAGTTTTTTAAATATTTCAAGGTTTTAACAACTTTGGATCTGTCTTATAATAATATTTCTATAGTAACGTCAGATACATTTGAAGGTATTTCAGGATTGAAATATTTAAATATTTCCTCAAATAAAATCATTGATATAAATACTTCACTATTCAGATTCGTTCAGCTCATTGAACTGAATCTCAGTTACAATCAAATATCAAATGTAAATAAAAACAATTTCGATCAATTATATGAGTTAAGAGAACTGGATATAAGTAATAACTTTATTGAAAATATAGATGATATTGTCTTTGTTGATGTTATGCCGAATTTAGCAATAATAAATGTGAAGTATAATCGTATAAGGACTTTACGAAAAAACTTGTTTTTGGACCTAAATAATTTGAAGGAAATTGATTTTTCATATAACAATATCACCTATTTACCCAAAAATATGTTCCTTAGAAGAAATATCACTTTTTTTAGTATAAAGGGCAACAACTTGTCAGGACCTCTAGAAAAAGGGCTATTTGAAGGAATGGATTCAATACCTATTTTAGATATAAGCCATCAAAGCATTAGATCTGTTGAGGACTATACATTTTTCGGTCTAACAAAACTGACACAGCTATTATTAAATGATAATCGCATAAGTAATTTGACGAAAAATTGTTTTTTATCACTCCAAACTTTACAGCGTATTGATTTATCGAATAACTTTATAATAAGAATAGATTTCGTTAAATCGGATCTCCTAAGTCTCCAGTACTTCGCTGTGTCAAATAATTTAGTTGAAAAAATAGATGTAACAGATTTTTTTTACCTGCACAAGTTGCAGTATTTGGATTTATCTCGAAACAATATTTCGAAGGTTTACCCAAAAACCTTTCAAATGTTAAGTTCTTTAGAGAACCTCTATCTATCAAAAAATCCTTTAGTTGGTAGCTTAGACGAGAACACTTTCGACGGTTTATACAGCGTCCCGTTATTAGATTTGTCAGATGTATCGTTTAATTCAATTGGGGATTTAGAAATGGCTCGTTTTGATGAACTACCGAATTTATCCGAACTAATTGTAGATCATAATAGAATATCTGAAATAGACGTGAATGACCTGCAACGCACGGGGCTACGTAAATTAAGTATTGGTGATAATCCTTTACCTTGCAACAAATTAAAAGAACTAAAGGATATTGGTGGTCCGATGCTAGAAATTACGGGCTTGAGATATGTGTATTCAAAGGACGAAAGCTTTAAAGGCGTAGGGTCCACGCCTATAATACAAACAAATTTGAATAACGACACTAATGACCTTTCTAAACTAAGGGAAGAACTTGCAAATAGTATTGTTATAGAGAAAGAAAAAATGTTGGCCGAGATAGAAAGAAAAGAAAGAATGATGAACAATTTAAACGACTGA

Protein sequence:

>DPOGS202899-PA
MSVIVLPDKSSSVATDFLVCVSCPASSIMAFDGNIPMSCMDLLLWCYEPNYKVHCDSQFDTFADLSNENRGVTTYLAYLYGFECILSGNCESVLNNATCSFEVTCFGDFSEIVTEIFDNAGRCPATAFNVNNGHYYVPITLIYKSSIFFQSLINDGSFFIYVVRTLDLSGNNFDFQPTISTMYGLNSLNLSHNDISNAQLFNHEYELPNLRDIDLSHNAISNLDVTDSSSDYWFDNLIKINLSYNNIIKIPYQFFKYFKVLTTLDLSYNNISIVTSDTFEGISGLKYLNISSNKIIDINTSLFRFVQLIELNLSYNQISNVNKNNFDQLYELRELDISNNFIENIDDIVFVDVMPNLAIINVKYNRIRTLRKNLFLDLNNLKEIDFSYNNITYLPKNMFLRRNITFFSIKGNNLSGPLEKGLFEGMDSIPILDISHQSIRSVEDYTFFGLTKLTQLLLNDNRISNLTKNCFLSLQTLQRIDLSNNFIIRIDFVKSDLLSLQYFAVSNNLVEKIDVTDFFYLHKLQYLDLSRNNISKVYPKTFQMLSSLENLYLSKNPLVGSLDENTFDGLYSVPLLDLSDVSFNSIGDLEMARFDELPNLSELIVDHNRISEIDVNDLQRTGLRKLSIGDNPLPCNKLKELKDIGGPMLEITGLRYVYSKDESFKGVGSTPIIQTNLNNDTNDLSKLREELANSIVIEKEKMLAEIERKERMMNNLND-