Monarch geneset OGS2.0

DPOGS209354
Transcript: DPOGS209354-TA (3216 bp)
Protein: DPOGS209354-PA (1071 aa)
Genomic position: DPSCF300118 - 460933-467338
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL006371 (E-value 0.0, identity 47.36%)
Bombyx: BGIBMGA005685-TA (E-value 0.0, identity 43.78%)
Drosophila: CG4168-PB (E-value 1e-96, identity 30.88%)
EBI UniRef50: UniRef50_D6WHI0 (E-value 9e-106, identity 29.20%). Chaoptic-like protein n=1 Tax=Tribolium castaneum RepID=D6WHI0_TRICA
NCBI RefSeq: XP_975409.1 (E-value 2e-106, identity 29.20%). PREDICTED: similar to putative GPCR class b orphan receptor 1 (AGAP009007-PA) [Tribolium castaneum]
NCBI nr blastp: gi|91079120 (E-value 3e-105, identity 29.20%). PREDICTED: similar to putative GPCR class b orphan receptor 1 (AGAP009007-PA) [Tribolium castaneum]
NCBI nr blastx: gi|91079120 (E-value 4e-110, identity 29.52%). PREDICTED: similar to putative GPCR class b orphan receptor 1 (AGAP009007-PA) [Tribolium castaneum]
Group
KEGG pathway: dme:Dmel_CG5195 (E-value 2e-31)
 K05401 (TLR3) maps-> Toll-like receptor signaling pathway
Orthology group: MCL12737 (Insect specific)

Nucleotide sequence:

>DPOGS209354-TA
ATGAATCAAGCCTCGGTGTGCTGCAGTCGCTCTGCGGAGGTGACCGGCGTGTGGACGGAGCTAGTTTTTGTACCAGCGTGTGGTGCGATGCGTGGATCGTGTCGTCCGAGGCTCTGCTTCGCCTCAAGTGCAGCGTTGTTTCGCTTTCTCACTCTCTTATATTGTATCACACCTGTCCGTCGAGCAGCACAGCTGCAGTTTGCAGCTCTGTTAGTGCTGCTGGTGTCGGCAACAGCTCGAGCACCGCGTCCCTGTGCCGCCAGCCCGCTATGCGTTTGCCGCGACGACCACTTCGCGTGCGACGCCGTACCCTTTCACAGATTTCCAGAGACCGAGACAGGCGTGCTTCACGTGTCAATATCCGCTGCTCGTCTGGGCGTACTGGGGGAAGCGGCTCTGGACGGACGGCCTCTACGTACGCTCGTGCTGGTCGCCTCGCGTCTGCATCAAGTCGACGGCGCCGCGCTAGCCTCCATGGCCACATCGCTCGCATCATTAGATATGAGTTATAACGAGTTTACCGAGGTTCCAATAGAGGCGTTGCGACATTTGAAAGTTTTAAATTGGCTAAATTTACAAAACAATTTCATAAGCGATTTAAACTCTGTGATGGATTGGGGCGGCCTCACCGACTCTTTGAGTAGCTTATCGTTAAGTAACAATCATATCTGTGTAATTAGCCAGGGCGTATTTTCTTCGCTCCGTCATTTAACTCAGTTAGAGCTTGACGGCAACAGACTAAGACAACTGGACGCCGAAGCTCTCCCCATCTCTCTGGCTATTCTACGCCTTTCCGATAATTTACTTTCGGGCCTTCCCTGCAGAGCATTAACTCACCTTCCTCGTCTACGTCACCTTCATTTAAGAAATAATATTCTGCAACCAAAGTTTAATATAACATGTCGCAGCGAGCGATCAAAAATAGATTCACTCGATCTTAGTCACAATGAACTTAGCGACGGTTTTAACTTTGACTTTCATCATAGTATTCAACTGAAGCAATTGGTTTTAGACCTCAATGACTTTACTGCTGTTCCAGCATTTGTTCTTGAATGTGGTCGGTTAGAAAAGTTATCCATTTCCTACAATAACTTACAACATGTATCAGACACTATAGTCCATGGTCTAAAGCATAGTTTACAGAGATTCGATTTGGACCACAATGAATTAACATTATTACCAGATTCTTTACGTGAGATGAACCGACTACGACATCTGTCTGTAACATACAATCGTTTGGAAGATATCAAACACTTACCACCAAAGTTACATTCCTTATCATTATCCGGAAATTATTTCAATGCATTTCCAAGTGCTCTCCAAAATTTAAGTGTAGCAACTTTGTCTTATTTAGATCTCGGCTACAATCGAATTTCCTACGTTGCTTCTGATAATTTCGGTGTATGGTCTAAGGCCCTGACAACTCTTGGTCTCCGAGGAAACAGGATAGCCCAGTTGTTGCTCGATTCCTTCCCACCCTTGCCGCTCCGTGAACTTGTACTTAGTTTTAATGACTTGTATTATATTGAAGCTGGCGTATTTTCAAATTTAACACAGTTAAGAATTTTAGAATTATCTTCTGCTGTATTTAGTGGTGATATTTCTACGGGGTCTGGTCTCAGAACTTTGACGTGGCTCGGTCTAGACAATAATAACATTCATTATATGTCGTCCGAAGACATTCTACAGTTTCCCTCTTTAGAATATCTAAATTTAGATTTTAACAAGATAATTGAATTCCCCAGCGATTTGGGAAATACACAAGGATCCAAACAGTTTCATAGCCTTCCCTGGTTGAGACTGTTACGATTGGAGGGAAACAGATTGCGGGCTCTACCTCGTGACGTCTTTAAGAATACTTTACTAGAATACTTAGATTTAAGTAACAACCAGTTATCTTTGTTTCCGAGCAGTGCACTGGCCCAAGTCGGTTTCACTTTACGTCGTCTTGAATTATCAAAAAATAAAATAGAATATCTCGATGCGGCTATGTTCCACGC
GACAGCTTTCTTACATGAACTTGGTTTAGCTCAGAACGCTTTGACTGTCTTGTCAGACAACACTCTCGCAGGGTTGCCAAGATTACGTAGACTCGATCTGTCGTTTAATGCTATAAAAACAAATTTCAAAGAATTATTTCACAACGTACCTCGTTTGCGGCGATTATCTTTAGCTAATACCGGATTAAAAACTGCTCCTCATATTCCACTGGCTAATCTCACGGAATTGAACCTGAGCAATAATTACATAACATCATACAGTGAGGTCGACATGAAGCATTTTCAAAATTTAAGAGAATTAGATATTGCAGGAAATAAATTTACAACACTTCGTCCTGCTATGTGGGTGGCTGTACCGAAATTGTTGTCACTCGATGTTTCGAGAAATCCAATAGTTCGAATACAACAGGGTTTGTTTGAGGGATTACAAAGACTTTTGTATCTCAAGATGGATGACTTAAAGTATTTGGAAACATTAGAACCTCGCGCTTTTCGCGCCTTGATATCCCTCAGGTCTCTTACTTTAGAGACGCCTGCGGGCGAAGGGAGGGCGGTTCCTATAACAGAAATCGTATCATCATCACCTTATATAGAAGTGTTAGCCGTCCACGTACATAAAGAAATCGTGGATTCTCAATTTTCGGGAATGGTTGCGCCAAAACTAAGATCACTAGAAGTACGAGGTGCCTCAATTAGGACTGTAACTGCAGATGCTTTTTCTGCCTTAAGTAAGCAACGAGCGCTGACTTTGCGTCTGACTGGTTCTTCAGTAGCGGAGCTGCCGGCAGGGCTCATACTGCCGTTAGTTCGAGTACCTCACCTCGCACTCGATTTAACTGACAATCAGTTAGTTAGTTTTGGTCCGTCAATTCTTTATCCAAATCTCACTGGGTGGAATCGTTACGCTACAAAAGTGTTGCCCGGTGGTCTTTTATTGGGTGGAAATCCTCTCCGCTGCGGTTGCTCTGCGTCCTGGGTGGGCGGTTGGTTGCGTCGCTGGACGAGCGAGGTCGGCGGCGGCTCCCGTCGAGCTCGAGCCGCAGCCCGCCACACCACCTGTCTGACTCCGTCCGGACCCCGCGCCTTGCTCGCGGTCAACGCTGACGATGCCGAGTGTCACGCCAGCGCTCTTTCCAGTCGCTCCTGTACGCTTGCGTATCGTCATGTTTTCTATTATATGTTTTTATTGGTGATGACGTTGTATTTTTTTAGTTAA

Protein sequence:

>DPOGS209354-PA
MNQASVCCSRSAEVTGVWTELVFVPACGAMRGSCRPRLCFASSAALFRFLTLLYCITPVRRAAQLQFAALLVLLVSATARAPRPCAASPLCVCRDDHFACDAVPFHRFPETETGVLHVSISAARLGVLGEAALDGRPLRTLVLVASRLHQVDGAALASMATSLASLDMSYNEFTEVPIEALRHLKVLNWLNLQNNFISDLNSVMDWGGLTDSLSSLSLSNNHICVISQGVFSSLRHLTQLELDGNRLRQLDAEALPISLAILRLSDNLLSGLPCRALTHLPRLRHLHLRNNILQPKFNITCRSERSKIDSLDLSHNELSDGFNFDFHHSIQLKQLVLDLNDFTAVPAFVLECGRLEKLSISYNNLQHVSDTIVHGLKHSLQRFDLDHNELTLLPDSLREMNRLRHLSVTYNRLEDIKHLPPKLHSLSLSGNYFNAFPSALQNLSVATLSYLDLGYNRISYVASDNFGVWSKALTTLGLRGNRIAQLLLDSFPPLPLRELVLSFNDLYYIEAGVFSNLTQLRILELSSAVFSGDISTGSGLRTLTWLGLDNNNIHYMSSEDILQFPSLEYLNLDFNKIIEFPSDLGNTQGSKQFHSLPWLRLLRLEGNRLRALPRDVFKNTLLEYLDLSNNQLSLFPSSALAQVGFTLRRLELSKNKIEYLDAAMFHATAFLHELGLAQNALTVLSDNTLAGLPRLRRLDLSFNAIKTNFKELFHNVPRLRRLSLANTGLKTAPHIPLANLTELNLSNNYITSYSEVDMKHFQNLRELDIAGNKFTTLRPAMWVAVPKLLSLDVSRNPIVRIQQGLFEGLQRLLYLKMDDLKYLETLEPRAFRALISLRSLTLETPAGEGRAVPITEIVSSSPYIEVLAVHVHKEIVDSQFSGMVAPKLRSLEVRGASIRTVTADAFSALSKQRALTLRLTGSSVAELPAGLILPLVRVPHLALDLTDNQLVSFGPSILYPNLTGWNRYATKVLPGGLLLGGNPLRCGCSASWVGGWLRRWTSEVGGGSRRARAAARHTTCLTPSGPRALLAVNADDAECHASALSSRSCTLAYRHVFYYMFLLVMTLYFFS-
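
The transcript and protein lengths given in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database: the function names are hypothetical, and the genomic span calculation assumes the coordinates are 1-based and inclusive, which the record does not state. It relies on the transcript ending in a stop codon (TAA here, shown as "-" at the end of the protein sequence).

```python
def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3216
protein_aa = 1071
locus_start, locus_end = 460933, 467338

# 1071 residues + 1 stop codon = 1072 codons = 3216 nt.
cds_matches = expected_cds_length(protein_aa) == transcript_bp

# Span of the locus on the scaffold (under the coordinate assumption above).
span_bp = genomic_span(locus_start, locus_end)

print("CDS length consistent with protein length:", cds_matches)
print("Genomic span (bp):", span_bp)
```

Under these assumptions the genomic span comes out longer than the transcript, which would be consistent with a gene model that contains introns, though the record itself does not list the exon structure.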