Monarch geneset OGS2.0

DPOGS210990
TranscriptDPOGS210990-TA4005 bp
ProteinDPOGS210990-PA1334 aa
Genomic positionDPSCF300004 + 223137-248673
RNAseq coverage360x (Rank: top 33%)
Annotation
HeliconiusHMEL0250210.094.64% 
BombyxBGIBMGA014559-TA0.084.43% 
DrosophilaCG5873-PA0.079.14% 
EBI UniRef50UniRef50_Q9VEJ90.079.14%CG5873 n=32 Tax=Neoptera RepID=Q9VEJ9_DROME
NCBI RefSeqXP_973252.20.085.59%PREDICTED: similar to GA19195-PA [Tribolium castaneum]
NCBI nr blastpgi|1892414880.085.59%PREDICTED: similar to GA19195-PA [Tribolium castaneum]
NCBI nr blastxgi|1892414880.085.59%PREDICTED: similar to GA19195-PA [Tribolium castaneum]
Group
Gene OntologyGO:00069793.5e-176response to oxidative stress
GO:00200373.5e-176heme binding
GO:00046013.5e-176peroxidase activity
GO:00551143.5e-176oxidation-reduction process
KEGG pathwaytgu:1002183124e-84 
 K00431 (TPO)maps-> Cytokine-cytokine receptor interaction
    Autoimmune thyroid disease
    Tyrosine metabolism
    Hematopoietic cell lineage
    Jak-STAT signaling pathway
InterPro domain[143-722] IPR0102553.5e-176Haem peroxidase
[293-689] IPR0020073.3e-171Haem peroxidase, animal
[173-184] IPR0197911.2e-33Haem peroxidase, animal, subgroup
Orthology groupMCL16596 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210990-TA
ATGTTTTCGAGAGCGCTGTTAGTAGCGCTGTGTATAGTGTACGCGACCAGTCTTACTCCGTTGGCAGCGCCAGGAGATGATGACTGCGCTCTGTATCTCACAGGACCCGGCAGATCTTCCGCATATGATTACAGCCTCAACTTACTAAAGGGAAACTTCGCATATGACGGTCAACACACTTGCATAACATACGAGGCAATCAATCAAGCGTATCTTGACGCCAGAACAAGAATCTTGGTATCACAACCTAAGGGCGACTGGAAAGCCGAAGACTTCGCCAGTGTCGGAGAATTGGTGCTTGATATATCTATAAACTTGGCAAGAATATATGGGCTAACATATGAAGAAATAGAAAAAGGTCTACCACTGATAGACACCTCTCGCACGTTGATACGTGAAGTTTGCCCACCAGTGTTCTCTCATGTTGAGTGTCGCGCTGGGAAATACAGGAGGCTGGATGGTTTATGCAACAACCTGGTGCACCCCACGTGGGGCGCTACTATGGCGCCTTTCCAGAGGCTCATCGGTCCGTTATTTTCTGATGGCATCAATGCTCCCCGTATATCTCACACGGGACGGGACCTTCCCTTGTCCCGGGTGGTGTCCCGGACTATGCACCCCGATGAAGGCTTCCACGATCACGCTGGTACTGTGATGGTTATAGCATGGGGGCAGTTCATGGACCACGACTACACCCTCACTGGTACTCCACTAGATCCAATCCACCGCAACGATCCAGAGGAGTGCTGCAAGCGTCCACCTCACCTCAAGCATCCGTACTGCAACGAGATCCGCATTCCAGATGACGATTATTTCTACAGACTGTTCGGTGTCAAATGTATTGACTTCGTGAGAGGTTTCCCATCACCAAGACCCGGCTGCAGACTGGGTTCAAGGGTGCCTTTCAATACTCTGACGGGTACCATTGACGGTAACACTGTGTATGGAGTCACTGAAAAATTCTCAAGAAAGCTGCGTACTGGATACGGTGGTTTACTTCGAATGAATCCTGTATTCAAAGAGTACGGTCTGAAAGACCTTTTGCCGCTTAAACTGGACATCCCCGACGAAGGTTGCACTAGGCCTAACAAAAATATGTTCTGTTTCGAAGCCGGTGAGATTAGAGTCAATGAACAATTAGTTCTTACTGTGATGCACACTTTAATGGCTCGTGAACACAATCGAGTGGCCGAAGCTCTAGCTTTAGTGAACCCGCATTGGGATGACGAGACTCTCTTCCAAGAAGCTAGAAGGATTAACATCGCTGAAATACAACATATCACATATAACGAGTTCCTGCCTATACTTCTTGGGAAGGACGTCATGGAAAAATTCGGTTTGGTGCTTGAAAAAGAGGGCTACTGGGACGGTTATGACCAAAATGTTAATCCTGATGTAATAGCTGGATTCGCTGCTGCCGCTTACAGATTTGGTCATTCTTTGTTACCTACTGCCGTTGAGAGATGGTCTAAGGCCCATAAATTTATTGCATCAAAACGATTGTCAGACCTTATACGTAGACCATATGACTTATATCGTGCTGGTGTTCTCGACGAGTACGTTATGGGACTAATGAACCAAGTAGCTCAAGCTATGGACGATTCAATCACTCAAGAGGTGACAAATCACCTTTTCAAGAAAGTAGGTGCAAGATTTGGAATGGATCTGGTATCATTGAATATGCAAAGAGGACGAGAATTCGGCCTTCCAGGATATATGGAATTCAGAAAGTTCTGTGGATTGTCCGGAGCAGATTCCTTCCAAGACTTGTTCGGATCTATGGCCAATGAAACTATTCGCAAATATGAATCCATTTTTGAACACCCAGTTGATGTTGATTTGTGGTCTGGTGGTGTTTCTGAGAGACCTTTACCAGGTTCCATGTTAGGACCAACATTCGCTTGCATAATTGCTACACAGTTTTCGTATTCACGAAGAGGTGACAGATTCTGGTTCGAGTTACCCAATCAACCGTCATCGTTCACCCCGGAACAGTTAGTTGAAATAAGAAAAGCACGTCTCGCGCGTATCATTTGTGACAATACAGACATCATTGACACAGTACAATTATATCCAATGGTCCTGCCTGATCATGAGCTCAATCCACGAGTGCCATGTAGAAGTGGGATAATTCCATCGATGGACTTCAGCAAGTGGGCTGAACACACGCCTTTCGGAGGCGCTAAGGAATACGTGAGTAGCTTCTCGTACAACTCCTTCCACGGCAAGAAAAAAATAGAATACAACGAAAAATCCAAAAATACTAAAGACGACAGCAATAATTACGCTGAAGCGGGCGATATTTCTACAGAAGAAAACCTTAAGGAAAATAGTAAAACAGAACCATACGATAGTATTGAAGATCTTGAAGATAGTAGCAGGGAACGTTTTGAATATGATAATAAGGACGTATCGGAAAAAAACATAAATGACAGTTATGAAATTATCGAAAAAAATAATAAAAATATAACACCTAAAAGCGAAATTGACAAAAGTAAGGAGAAGAACAAAAAATATATTAATTCCAATGAGGAAAGTAACGAACGTGATGTTTCACTCAATAGAGAAAGTGTTAAACATATACCTCACAAAATAGATAAAAAAGATAAAAAGAATCGCTATGAGATTGACGAAAGCAGAGAATACGATAGCACAAAAGAGGATCTAGGTCAAAGTAACGAACACAAAGTAGGAGACAGTAAAAAACACAAAGAAAGAGAGCAACTCGGTGATAAAGTATCATTCGAAAAAGTTTTACCTACAAGTGAGGATTCGTTAAGCAGTGATAAATTGGAAAATGATCGTCACAGCGAAGACGATAGAAATTCAAATGAAACGAATTCCGAAAGTATTGAGAATTTTGATGACTTTGAAAAAACACAAAGGGTTGATAAAGGTAAATTGGGTTTGAACAATAACAATAACAACCCAATAAAGGAACAAAAGGTTGAGGATTTCAGTTATGAAAATTTAAATGGTAAAAACAATCAAATTAATAAAGATTTTAAAGATGATGATATTAAGGACTTTTCGGCAGAGATTTCACCAGATTCACAAACTACCTTAGAAAAACCTGATACGATTACTAATTCAAAAAGCTCTTCAATCGTTGACTCTACCAACGCAAAACAAGTCAAGATAAATGATGATGAAATTAAACCTGTTATAGAACTTAATAATGAAGACAGTAGTTCTCAAAGCGAAGAAGTAAGCAAAGAATATAGCGAAAGTGAAGAAATCAAACCAAGAGATGAATATATTAAGAAAAATTTGAAGAATATAGGTGTAAAAGATTTACTTTCAACATCTAAAACATTAAATGAAGACGTTAAATCACAATTTGAAAGAATACCAGAAAACTACAAGCATACAAATGCTAAAAGTGATAGTACGGAAGAACCGCCATCTCATAAAAATAACGACGAAGAAGGGACCCTAGATATATTGACTCCAGATAATCATGAGTATGATAAAGATTTAAATATCAAGTTTAGTGATTTAAGTATTAAATTACCTGAAATTAAACTGCCAAAAGATATATTAGCATACACTCACAAGGAAGGAGAAGACGATGACAAAGATCAAAAAAGAAAGCTTGAAGATACAGAAGAGGAACAAGATGATGAAGAGGAGGAGGAACATCACGAAAACTCGAAATACGAAGGTAAATCTAAAGAAAACAAAGACAATTTTTATCAATTTTACTCCTACCCTCAAGATGAGGATTCATCAGAAGAGCCCAAAAAGAAAAATAAAGGATATTTCAACTATCGTGACAGTGGCGAAGACTTGTATGAAAAATTTGTGAGAGAGAGATTTGGAAAATCAGGCAATTTAAAATCAAGATCTGAAAAATTGATAAATTACGAACCTATTATGCAAAACAAAAATCTCTACCAAAGTGTACAGAAAGTTTTGAAAAGAACAAACCAAATAGAAAAAGAAGCCCAAGATAGCAAAGATCCAAATTACATGTGGACTTTAGAATATGGGGGGAAATTATAA

Protein sequence:

>DPOGS210990-PA
MFSRALLVALCIVYATSLTPLAAPGDDDCALYLTGPGRSSAYDYSLNLLKGNFAYDGQHTCITYEAINQAYLDARTRILVSQPKGDWKAEDFASVGELVLDISINLARIYGLTYEEIEKGLPLIDTSRTLIREVCPPVFSHVECRAGKYRRLDGLCNNLVHPTWGATMAPFQRLIGPLFSDGINAPRISHTGRDLPLSRVVSRTMHPDEGFHDHAGTVMVIAWGQFMDHDYTLTGTPLDPIHRNDPEECCKRPPHLKHPYCNEIRIPDDDYFYRLFGVKCIDFVRGFPSPRPGCRLGSRVPFNTLTGTIDGNTVYGVTEKFSRKLRTGYGGLLRMNPVFKEYGLKDLLPLKLDIPDEGCTRPNKNMFCFEAGEIRVNEQLVLTVMHTLMAREHNRVAEALALVNPHWDDETLFQEARRINIAEIQHITYNEFLPILLGKDVMEKFGLVLEKEGYWDGYDQNVNPDVIAGFAAAAYRFGHSLLPTAVERWSKAHKFIASKRLSDLIRRPYDLYRAGVLDEYVMGLMNQVAQAMDDSITQEVTNHLFKKVGARFGMDLVSLNMQRGREFGLPGYMEFRKFCGLSGADSFQDLFGSMANETIRKYESIFEHPVDVDLWSGGVSERPLPGSMLGPTFACIIATQFSYSRRGDRFWFELPNQPSSFTPEQLVEIRKARLARIICDNTDIIDTVQLYPMVLPDHELNPRVPCRSGIIPSMDFSKWAEHTPFGGAKEYVSSFSYNSFHGKKKIEYNEKSKNTKDDSNNYAEAGDISTEENLKENSKTEPYDSIEDLEDSSRERFEYDNKDVSEKNINDSYEIIEKNNKNITPKSEIDKSKEKNKKYINSNEESNERDVSLNRESVKHIPHKIDKKDKKNRYEIDESREYDSTKEDLGQSNEHKVGDSKKHKEREQLGDKVSFEKVLPTSEDSLSSDKLENDRHSEDDRNSNETNSESIENFDDFEKTQRVDKGKLGLNNNNNNPIKEQKVEDFSYENLNGKNNQINKDFKDDDIKDFSAEISPDSQTTLEKPDTITNSKSSSIVDSTNAKQVKINDDEIKPVIELNNEDSSSQSEEVSKEYSESEEIKPRDEYIKKNLKNIGVKDLLSTSKTLNEDVKSQFERIPENYKHTNAKSDSTEEPPSHKNNDEEGTLDILTPDNHEYDKDLNIKFSDLSIKLPEIKLPKDILAYTHKEGEDDDKDQKRKLEDTEEEQDDEEEEEHHENSKYEGKSKENKDNFYQFYSYPQDEDSSEEPKKKNKGYFNYRDSGEDLYEKFVRERFGKSGNLKSRSEKLINYEPIMQNKNLYQSVQKVLKRTNQIEKEAQDSKDPNYMWTLEYGGKL-