Monarch geneset OGS2.0

DPOGS208301
TranscriptDPOGS208301-TA3996 bp
ProteinDPOGS208301-PA1331 aa
Genomic positionDPSCF300079 + 648596-664150
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0214960.086.49% 
BombyxBGIBMGA006467-TA0.081.31% 
DrosophilaCG7896-PA0.053.42% 
EBI UniRef50UniRef50_B3LYV90.053.60%GF18811 n=10 Tax=Endopterygota RepID=B3LYV9_DROAN
NCBI RefSeqXP_968875.10.055.75%PREDICTED: similar to GA20668-PA [Tribolium castaneum]
NCBI nr blastpgi|1839793070.083.33%similar to CG7896 [Papilio xuthus]
NCBI nr blastxgi|1839793070.083.60%similar to CG7896 [Papilio xuthus]
Group
KEGG pathwaydme:Dmel_CG51955e-44 
 K05401 (TLR3)maps-> Toll-like receptor signaling pathway
InterPro domain[1115-1162] IPR0004831.9e-06Cysteine-rich flanking region, C-terminal domain
Orthology groupMCL15826 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208301-TA
ATGGCAAAGGGAACTCTGGTCACAATGTGGTTTCTTCTATTACTCTTCATGCCGTACGTCATCAGTCAGCAACCATGGGTACCGTGTTCAGAGTTGAACGATGACCTTCGGTACCCTTGTCGGTGCAGGGTTCAAGTCGACAGAGCATTGCAGTTACGAATATTGATGAACTGTGACCACGTGGTATTCGCGGGTGACTTTCCACCACTTCCTTACGGCGCCCCCATAGTTTCGTTCAGTCAACGTTGGGCTGGACAACAGTCATTACCAACACAGATTTTCTCATCTTACGGTCTTCCGTTAAAAGAGCTGGACTTTTCTCATAACAGTCTCCGTCGGTTGCCTGACCGTTTGTTAGCGGGAATCAAAGGCAATATTACCAAAGTAGTTTTAGAAGATAATCTCCTCGGTGACAATTTGAATCCAATCTTCTCGACCGCCGAGTTTCACAATCTTCCAGCGTTGGAAGAATTGGATTTAAGCGGAAATAATATAAGAGGACTTGAAGAAGGTCTCCTAATTGGTTGCGATGTGCTCAAGGTTTTACGCTTGAACCGTAACAATATGAATTTCGTCCCATCCTCTTCTCTCAACGGGCCACAGTCATTAAAAGTTCTTTCGCTTAGAGAAAATAGAATAGGCATAATAAGACAAGCAACTTTCATATCTCAAAAGTCTTTACAAGAAATAGATTTGCATGGGAACATGATATCTACGATTGAAGGAGGAGCATTTATAGGCTTGAAAGGTTTAGAGAGTCTGGATCTTGGACGGAACAGACTGTCCAAATTCAACAGTGACGTATTTCAAGGAATAGAGAACTTGGAGAAATTGGATTTGTCGGAAAACTTTATAGGCGATTTTCCGACAGTTGCACTTAAATTGTTCGCCGGATTAAAGCATTTGAATATGTCCAGCAATATGATAACGAACATGGATCACAGTCACCTTAATGCTCTATCAGCATTGGTAGTTTTGGATCTGAGCAGAAACAATTTAGTAAAACTCTCACCAGGAACTTTCGTTGGTTTAACTGAATTGAAATATCTTGATATTGGTGTGAATTCTTTACGTACTGTGGAGGACGATGCATTCGATGGCCTTACTAGTTTAGAAACATTGTTATTGAGGGACAATAACATTTTACTTATTCCTGCAGCTGCATTGTCTCGATTGCCTAGTCTGACGTCTATTCATTTAGGATTTAATAGAGTAACAGCGCTCTCTAGCGATATTTTACGGGCAGTCTCCGAAGGCATAAATTCGTTGGTTCTATCGAGAAACGTTATCAGGGAATTGCCCCCGGCTGCTTTTGAACATTTTAAATATATACGTCATTTAGATCTATCTGGAAATCTCTTAAATTCGATAACAGCAGACGTATTCAGCGGTCTAGAGACTACGCTTGAATTTTTGTCTCTCAGCCAAAACAGAATATTAGGATTCACTGGAGAATATTTAAAATTTGTGAACCTGTGGTTTCTAGATATATCTGGAAATCAAATATCAGAGATACCAGTTAACGCATTCGAATCAATAAAGAGTTTAACGCACCTTAATATGAGTCATAACTTACATATTAATGTGTTGCCACAGAATCTTTTCGATTATAATGAAGGACTTCTATCCGTAGATATAAGCCATGTTGGACTCAAAGCATTGCCGGTTAATTTGTTTTCAAAGACTCATAATTTGGAATACATATATTTATCACATAATTTGTTACAAGAAGTATCGGAAGGTACTTTTAAGAATCTTAAGAACCTAACTCATCTCGACCTTTCGTACAATAACATAGTTACAATAAGAACACCTGCCTTTGTAAATGTCATGTCAATACAATATTTATCTCTGAAAGGAAATCAACTGAATGCGTTTAAAGGTGAATTCTTTAATACTGGGACCAGCTTAGAGGTTTTAGATGTATCAGATAATCAGCTGAGCTACTTATTTCCATCCTCTTTTAAGATTCATCCTAGATTAAGAGAAATAATACTTGCTAATAACCAGTTCAATTTCTTCCCCGCAGAACTTATTAGTACCTTGCAATATTTGGAAAAAGTAGATTTGTCGGGCAATGTTTTGAAAAATGTGGATGAATTAGATTTTGCTCGACTGCCTAAATTACGTACGATCTTACTAGCAAGAAATGAACTCGAATCCGTAAGTGAAATGGCTTTCCATAATTCTACACAGATCCAGCGTTTAGATTTGTCTTACAATAGAATAGATCGTTTAGGTGATCGATTATTCGAGGGTCTCATTAGATTAGAACTTTTGAATTTAGCCGGAAATCTTCTATATGAACTACCAGATAATATATTTGACAGATCAAGGCTCCATATGCTGGAATCAATAGTACTCAGTCACAATTTATTTGAACATGCGCCGTTAAAAGCGCTGCAAAAACAATATTTCTTTGTGTCATCAGTAGATTTATCCTATAATGAAATCGTAGATATTCCCGCAGAAGATAGCGTAATGGTCAATATTAAGAAACTTGACCTCTCCTTTAACCCATTATCAGAGAAAACAATAGATAATGTCCTAACAGAACCAAAAACAGTAAGAGAATTAAATCTAGCTGGCACCGGGATAAAATATGTTAAACAATTGGAGACGCCGTTTTTATATCGATTAAATCTATCTCATAACAACATTACTAAATTACCCGAAAAGACCTTCGCAAGAACCACTATGCTTGAATCTTTAGATCTCTCCTTTAATCAGATCGGTGATGTGTCTAATTCCCTTTCTATATCCTGGCCTAAATTAAAAAATCTTCAAAAGTTAAATATTTCGAATAATCCTATAATAATGGTACTGGAAGGTAATTTTGAAGGACTAATTTCACTTCGATTTTTAAATATGGAAAATCTAGAAAAATGTACAAAAATAGAAAAGAATGCTTTCAGACCCCTATCAAATCTTGTAGAACTTCGCGCATATGGATATCCAAGATTGGGTTATTTCGATGTTCAAGGAGCTCTACAGTATGTATTAGCAATGGAAAAATTAGATGTTGAAGTAAAAGATACTAATGTTGGCCCAGACCAATTACATTCAACATTACATCCCCGTCTCGAAGAATTGGGTTTAAGAGGAAGTAGACTAAAGACAATCTCTTCTGGGGTACTTGCAGGTTTAAAAGCACCTTCAATCACTGTACGATTCCGTAATACATCTATTACTAATTTGCCTCCAGCACTATTGTTTCCTTTGCCACGTTCCTCACAAATTACAATCGACGTAGGAGGTAGTTCATTGACAACACTGCAGCCACAATTATTGGTAGCTCTTGATGATCGTCGTGCAGATTTGTCTATGTTTGGGCTAGATGCTAATCCAATACGTTGTGATTGTAACGCCAGAGCTTTAAGGAGATGGTTACCTACCGTAGGTATTCAAGGTGTGAGATGTCACTCACCCGACCATTTATCAGGATATTTAATAGTTGAAATAGGGGACGACGAGCTCTCATGTGATTCTAGAAAGAGAACTACAGCTACTTCTTCCAGTAGTATTGCTACAACATCACCTCCAAGACTTGTACATAAAACGTCAGCGGAGCCGGATATTATCTGGTCAGTGGCACCCTCGCATGATCGACCAAAAGCAACAGGAGAGCCTAAAGGAGCACCTGTTATCGGAATTGCCACTTCTAATAATGATGACAATTTGATAATAGGGATAGTAGGTGGTGTTGTTGCTTTTATAGCAATACTTATTGTTGCTATATGTATTGTGCGTTTACGTATGACTTCGACATCTTATCGCGGAGGGCCCTTAGCGAATAGTCCCGGTGCTGGGGCAGCTCAATTATGGGGTGCAGCCTGGCCTGGATATGCAGCGACTTTACCCCCACCATCATTGTCTACAGCAACATTACCTCATAAAGTGCAATCCGGGCCTGGTTCAGTACGTTATATGGCAGCTCCACCTCCAGCCCCTTACTTTATAAGCTTGCCACCTCATGACGATAAAATTTATCGATGA

Protein sequence:

>DPOGS208301-PA
MAKGTLVTMWFLLLLFMPYVISQQPWVPCSELNDDLRYPCRCRVQVDRALQLRILMNCDHVVFAGDFPPLPYGAPIVSFSQRWAGQQSLPTQIFSSYGLPLKELDFSHNSLRRLPDRLLAGIKGNITKVVLEDNLLGDNLNPIFSTAEFHNLPALEELDLSGNNIRGLEEGLLIGCDVLKVLRLNRNNMNFVPSSSLNGPQSLKVLSLRENRIGIIRQATFISQKSLQEIDLHGNMISTIEGGAFIGLKGLESLDLGRNRLSKFNSDVFQGIENLEKLDLSENFIGDFPTVALKLFAGLKHLNMSSNMITNMDHSHLNALSALVVLDLSRNNLVKLSPGTFVGLTELKYLDIGVNSLRTVEDDAFDGLTSLETLLLRDNNILLIPAAALSRLPSLTSIHLGFNRVTALSSDILRAVSEGINSLVLSRNVIRELPPAAFEHFKYIRHLDLSGNLLNSITADVFSGLETTLEFLSLSQNRILGFTGEYLKFVNLWFLDISGNQISEIPVNAFESIKSLTHLNMSHNLHINVLPQNLFDYNEGLLSVDISHVGLKALPVNLFSKTHNLEYIYLSHNLLQEVSEGTFKNLKNLTHLDLSYNNIVTIRTPAFVNVMSIQYLSLKGNQLNAFKGEFFNTGTSLEVLDVSDNQLSYLFPSSFKIHPRLREIILANNQFNFFPAELISTLQYLEKVDLSGNVLKNVDELDFARLPKLRTILLARNELESVSEMAFHNSTQIQRLDLSYNRIDRLGDRLFEGLIRLELLNLAGNLLYELPDNIFDRSRLHMLESIVLSHNLFEHAPLKALQKQYFFVSSVDLSYNEIVDIPAEDSVMVNIKKLDLSFNPLSEKTIDNVLTEPKTVRELNLAGTGIKYVKQLETPFLYRLNLSHNNITKLPEKTFARTTMLESLDLSFNQIGDVSNSLSISWPKLKNLQKLNISNNPIIMVLEGNFEGLISLRFLNMENLEKCTKIEKNAFRPLSNLVELRAYGYPRLGYFDVQGALQYVLAMEKLDVEVKDTNVGPDQLHSTLHPRLEELGLRGSRLKTISSGVLAGLKAPSITVRFRNTSITNLPPALLFPLPRSSQITIDVGGSSLTTLQPQLLVALDDRRADLSMFGLDANPIRCDCNARALRRWLPTVGIQGVRCHSPDHLSGYLIVEIGDDELSCDSRKRTTATSSSSIATTSPPRLVHKTSAEPDIIWSVAPSHDRPKATGEPKGAPVIGIATSNNDDNLIIGIVGGVVAFIAILIVAICIVRLRMTSTSYRGGPLANSPGAGAAQLWGAAWPGYAATLPPPSLSTATLPHKVQSGPGSVRYMAAPPPAPYFISLPPHDDKIYR-