Monarch geneset OGS2.0

DPOGS202907
TranscriptDPOGS202907-TA2538 bp
ProteinDPOGS202907-PA845 aa
Genomic positionDPSCF300126 + 213489-227552
RNAseq coverage334x (Rank: top 35%)
Annotation
HeliconiusHMEL0048979e-14271.23% 
BombyxBGIBMGA004180-TA6e-9462.77% 
Drosophilakey-PB1e-0831.71% 
EBI UniRef50UniRef50_E0VHH62e-2024.45%Optineurin, putative n=1 Tax=Pediculus humanus corporis RepID=E0VHH6_PEDHC
NCBI RefSeqXP_002425570.13e-2124.45%Optineurin, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420095986e-2024.45%Optineurin, putative [Pediculus humanus corporis]
NCBI nr blastxgi|1544134682e-3021.11%viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathwayxla:7350692e-07 
 K07210 (IKBKG, IKKG, NEMO)maps-> Prostate cancer
    Toll-like receptor signaling pathway
    MAPK signaling pathway
    B cell receptor signaling pathway
    Pathways in cancer
    Shigellosis
    Chemokine signaling pathway
    Adipocytokine signaling pathway
    Chagas disease
    T cell receptor signaling pathway
    RIG-I-like receptor signaling pathway
    Apoptosis
    Small cell lung cancer
    Cytosolic DNA-sensing pathway
    Pancreatic cancer
    Acute myeloid leukemia
    Primary immunodeficiency
    NOD-like receptor signaling pathway
    Epithelial cell signaling in Helicobacter pylori infection
    Chronic myeloid leukemia
InterPro domain[86-151] IPR0210636e-08NF-kappa-B essential modulator NEMO, N-terminal
Orthology groupMCL20413 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202907-TA
ATGGCATCCAGTCCTGACAATGAAGACGACTCCCTCATCGTGATCCTGGGTACATCCCCCGGCGGCAGTATGGTTGAAAAAAGTAATGGCACAACAAATGGAGATCTAGAAAGATCACAAATAGAAGATGCTATGAGATCACTATCAAATGAAGCTAATATGGCTTTTAAGGCTCAATTTAATTTGGGTGAAAGTCCGTCACCAGCCAGTATGATGGTAGCGAGTACTATAATAACCGAGGACAGGAGTACAGAGGAGTTGCAAAAGAGATTCGGCGATTTATTGGACGAGAATTTTGTATTGAAGGAGACTTTGAAGCAAAACAACGACTCTATGAAGGAGCAGTTCATACTGATAGCGTCGTGTCAGGAAGATATGCTGAAGACACACCGACTGCACAAGGAAAAGTTTGATGAAACAAGGGAGCTTGTCGAGCGGCTTAGACAAGAAAATAAGCAACTCAAAATGGATATATCACGTCTGGCAGAAGGTGAACAGAACAGTATCGGCCAAAAAAAACTGTCAGGCTTCGAACTGGTCACATCAGTTGAGGAAGATACTATCGAGAAGCTGTCATCACAGTTAGAACTCGTCGAGAAACAGAGACGACAGTTGATAGTTGACAACGAGAAGCTGAGCTGGCAGAAGGAATCCCTAGAACATATAGTTGATTCCATGAACAAGGAGAGAGACGACGCCAAGAAAAAGCTACACAAGGTCGAGCTTCAACTCTCCACTATGGAGAACGATCATGCTCAGGAAGTGAGCAAACTGCACTGCATCATAAGTGACCTGCAAAACAAAATGAAGACGCAGAGTGTGAATACATCTCCAGAGGAGGTTTCCAAACGTGATGTGTACATACAGAAGCTGGAGGGCAAGATGTCCTTATTACAGAATGAATTGAAGAAAGCTCAGATAAAGATTCTTGACCTGGAAAATATTAAGTTGGAATTCAGCCAGCACAAGTCCAACGTGTCTGAGACGGTGAAAATGTACAAAGACCAGATCCAGGAACTAAAGGATAGAATTAAAGAGGTCCAAACGACGCCGTCACCAGCCAGTATGATGGTAGCGAGTACTATAATAACCGAGGACAGGAGTACAGAGGAGTTGCAAAAGAGATTCGGCGATTTATTGGACGAGAATTTTGTATTGAAGGAGACTTTGAAGCAAAACAACGACTCTATGAAGGAGCAGTTCATACTGATAGCGTCGTGTCAGGAAGATATGCTGAAGACACACCGACTGCACAAGGAAAAGTTTGATGAAACAAGGGAGCTTGTCGAGCGGCTTAGACAAGAAAATAAGCAACTCAAAATGGATATATCACGTCTGGCTGAAGGTGAACACAACAGTATCGGCCAAAAAAAACTGTCAGGCTTCGAACTGGTCACATCAGTTGAGGAAGATACTATCGAGAAGCTGTCATCACAGTTAGAACTCGTCGAGAAACAGAGGCGACAGAATACATCTCCAGAGGAGGTTTCCAAACGTGATGTGTACATACAGAAGCTGGAGGGCAAGATGTCCTTATTACAGAATGAATTGAAGAAAGCTCAGATAAAGATTCTTGACCTGGAAAATATTAAGTTGGAATTCAGCCAGCACAAGTCCAACGTGTCTGAGACGGTGAAAATGTACAAAGACCAGATCCAGGAACTAAAGGATAGAATTAAAGAGGTCCAAACGACGGTGTTCCAGCCCATCCGCGTGTCCGTGTCGGAGCCGTCGAGTTCGTCCGAGTTCCTCAACAATGTCAAGCTCTACGACCGCACGCTCAAGCACCTGGCCGACTACCTCAACTCGCTCAGTAACGGGCTATCTGATAGTCTAGCTCACACCCTGGGCGTGGTGTCCAGTATACAGGATGTTAAGATCGACCGCGGCTCAGTGGACAAGGTCAAGTGTGGAGTCGGGGAGCTCAAGACTCTCATAGCAACACAGCACTCTAACGTTGTGTCTAACGTAGCTCATGTCCGAAGCACGCTGTCCACGTTCGAAGGCATCTTTAAGGATCACAACGAACTGTTGAAGAGATCCGTCACCAACACCGACACGGTGCAGGCTCCGTGTGTGCAGCAGTTGACGGAAGCGCTCGTGGCTCGCGGCCAGCAGGTGTCCGAACTGCTGGAAGAGCTGGCAGCTGTGAAGGCACGCACCGACGACGCCGACTTACTGAGGGCCCAGGTCGACTTGTACAAAAGCGACTTCGAAGCTGAGAGGGAATCCCGAGAGAAGATGGCCAGCGAGAAAGAAAATCTCCTCGCAGACCTCAGAGTGGCTCAGAAGAAGATACAAGACTTGACAACACAGTTGGAGGAGCTTCGTGTTCTGAGTCCAAGCCTGCACAAGAGCATCACCAGCCCTCGCCCGCGGTCCGCCGGCAAGCCCGCCCCCACCACCACCGCCCGCACTGCCCCCGCCACCGCTGTCGCAGCCAACGCTGCCTTCAGGTGTCCTAAATGCATGATGTTCTCCAGCGACCAGTACAACCTCATGGAGGAGCACTTCGAATACTGTCTAGACGACTTTTAA

Protein sequence:

>DPOGS202907-PA
MASSPDNEDDSLIVILGTSPGGSMVEKSNGTTNGDLERSQIEDAMRSLSNEANMAFKAQFNLGESPSPASMMVASTIITEDRSTEELQKRFGDLLDENFVLKETLKQNNDSMKEQFILIASCQEDMLKTHRLHKEKFDETRELVERLRQENKQLKMDISRLAEGEQNSIGQKKLSGFELVTSVEEDTIEKLSSQLELVEKQRRQLIVDNEKLSWQKESLEHIVDSMNKERDDAKKKLHKVELQLSTMENDHAQEVSKLHCIISDLQNKMKTQSVNTSPEEVSKRDVYIQKLEGKMSLLQNELKKAQIKILDLENIKLEFSQHKSNVSETVKMYKDQIQELKDRIKEVQTTPSPASMMVASTIITEDRSTEELQKRFGDLLDENFVLKETLKQNNDSMKEQFILIASCQEDMLKTHRLHKEKFDETRELVERLRQENKQLKMDISRLAEGEHNSIGQKKLSGFELVTSVEEDTIEKLSSQLELVEKQRRQNTSPEEVSKRDVYIQKLEGKMSLLQNELKKAQIKILDLENIKLEFSQHKSNVSETVKMYKDQIQELKDRIKEVQTTVFQPIRVSVSEPSSSSEFLNNVKLYDRTLKHLADYLNSLSNGLSDSLAHTLGVVSSIQDVKIDRGSVDKVKCGVGELKTLIATQHSNVVSNVAHVRSTLSTFEGIFKDHNELLKRSVTNTDTVQAPCVQQLTEALVARGQQVSELLEELAAVKARTDDADLLRAQVDLYKSDFEAERESREKMASEKENLLADLRVAQKKIQDLTTQLEELRVLSPSLHKSITSPRPRSAGKPAPTTTARTAPATAVAANAAFRCPKCMMFSSDQYNLMEEHFEYCLDDF-