Monarch geneset OGS2.0

DPOGS206025
TranscriptDPOGS206025-TA4266 bp
ProteinDPOGS206025-PA1421 aa
Genomic positionDPSCF300028 - 1708648-1733729
RNAseq coverage219x (Rank: top 45%)
Annotation
HeliconiusHMEL0127350.080.26% 
BombyxBGIBMGA000553-TA0.072.83% 
DrosophilaPxn-PA0.047.13% 
EBI UniRef50UniRef50_D6WFS40.053.41%Peroxidasin n=2 Tax=Endopterygota RepID=D6WFS4_TRICA
NCBI RefSeqXP_968570.10.053.41%PREDICTED: similar to peroxidasin [Tribolium castaneum]
NCBI nr blastpgi|910940430.053.41%PREDICTED: similar to peroxidasin [Tribolium castaneum]
NCBI nr blastxgi|910940430.053.41%PREDICTED: similar to peroxidasin [Tribolium castaneum]
Group
Gene OntologyGO:00069798.7e-205response to oxidative stress
GO:00200378.7e-205heme binding
GO:00046018.7e-205peroxidase activity
GO:00551148.7e-205oxidation-reduction process
KEGG pathwaybfo:BRAFLDRAFT_675151e-172 
 K00431 (TPO)maps-> Cytokine-cytokine receptor interaction
    Autoimmune thyroid disease
    Tyrosine metabolism
    Hematopoietic cell lineage
    Jak-STAT signaling pathway
InterPro domain[777-1355] IPR0102558.7e-205Haem peroxidase
[934-1329] IPR0020071.9e-190Haem peroxidase, animal
[807-818] IPR0197911.4e-42Haem peroxidase, animal, subgroup
[454-561] IPR0137835.5e-26Immunoglobulin-like fold
[474-561] IPR0130983.4e-18Immunoglobulin I-set
[486-551] IPR0035982.6e-13Immunoglobulin subtype 2
[480-562] IPR0035992.4e-11Immunoglobulin subtype
Orthology groupMCL11235 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206025-TA
ATGTCGAGAAGTGAATCTTTACGTAATATTTGTGGTGTAGAGGCGGAGCGCCCGCTAGGACCGGCGAGCGACGAGCGGCGGCCGTTCCTCATTCCCGCTTTGACTCTCGTTCCTCTCAGTCTATGCGAGGCGGCGGTAAACGCGAGTCACTGCCCACCGACCATGGGCACTATGAAGTTTCTAAACTTCCTTCTCATACTTAACTTTATAATTCTAACATCGTCCGAAGTATGTCCGCCGAAATGCCAGTGCAATAGAAATGCCATAAGATGCTTACACCAAGATTTGCTGGAGATCCCGAAGTCGCCGGCCGAAGCGCATACGTTGGACCTGCGCTTCAACCGGATCTTCGATATCGCTCCAGGAATTTTTAGCCATCTCCACCGGCTGCGATCTTTGCTTCTGAATGATAATCAGTTGAGAGAATTGCGCTCTGGCGCATTTCACGGCCTTCGACGTCTGAGATATCTCTACTTATACCGCAATCATATAAAACATATAGCTTCCGATGTCTTCCACGGCATGTCTCATTTGGAACAGCTGTACTTGCACGTAAACGAGATTCATCAGATCGAACCAGAAACCTTTTCCAACCTTCCCCGATTAGGTCGACTGTACCTTCATAACAATAACTTGAAAACGATACCCCCTGGTTCATTCCGGGGTATGCCGAAACTGAGCAAACTCCGATTAGACAGTAATGCGTTGGTTTGTGATTGTAATATGTTATGGTTTGCTCGAATGCTCGCTGAACATCGTAACATTACTATTGCCGCAACTTGCTATGAACCTGCGAAAGCAACTGGAACATCTTTAGCAGCAATGCAGGAAAAAGATTTCCACTGTCGTCAACCGGAGATTATGTCTGATCCTGAGGATGTGGTTGTTAATTTTGGAGATGAAGCTATTTTTACTTGTATGGCTAGTGGCGAACCAGCACCTGAAATAGTGTGGTTCCGCGACTCAGCCGCCTTACCTGACGATACAAGCAGATACGAAATTATGGATAATGGAACTCTTATGGTTCATCATGCAGATGAAAATGATATTGGTGTTTTCGAATGTTCGGCCAAAAATCCTGCTGGTGAAGCGCGATCCAAGCCAGCCAGAATGATGCTGCAAACTAAACCAGATAATAATGCCTTTCCTGTTTTTACCATCTTGCCCCGAAAAAGTGTGGTTAATATTAATCAACCCTACGCACGTTTCGATTGTGTGGCAAAAGGCAATCCAAAACCTCATATTTCTTGGTATTTCAATGGAGAGCGTATACTGTTAACTGATCGAATAACTATGCATCACAATGGATCTATAGTTATTGAAAATATAAAATACGAAGATACAGGATCTTACACATGTCAAGCTGAAAATGTCAACGGGAAGATAACGGCATCTGTTACTTTAGAAGTTATGGTGGCGCCTGCATTTATCATAGTTCCAAAAGACCAAACTGTAACAATTGGTGATTCAGCACATTTTCGATGCACAGCTAGAGGAACTCCGACACCTATTATAAAATGGTACAGAAACACTATGTCTTTGCCACCAAGTGAAAATATCGTTTTTAGCGATAACGATCAAAATTTGACAATCGTAGAAACTTCCGAAGATGATGCAGGATTATATCATTGCAGAGCAGAAAATTCCGAAGGTCTCACTGAAATATCTGCTGTTTTGAAAATAGAAAGTTTTGAAATAATTCCACCAAAAATTACCTTGAAACCAGAAGATACAGATGCATTTAAGGAAACAACAGTTCAGTTGCCTTGTGAATATGAGAGTGATCCACCAGCACTTGTGGAATGGAGAAAAGATGGAAGCCGTATTATAACTAATGACAGAATAAGTATATCTTTAATTGGGAGTTTGATTATTAACAACGTTTCCATAACCGATACTGGAAGTTACGAGTGTTCTGTTCACAACGAACATGGACGTGATACGGCTTCATCATTTTTGACGGTAAAAGATCACATTTTACCTGGCGATGAATATGTAAATATAGCTATAACTGAAGCTATAAGAGATGTTGATCAGGCAATAGCGAAAAGTATAGACAATTTGTTTAACAACAAAAGTTCCAACATTAGTTTTCAAGATCTGTATAGAATTACTAGATTCCCAAATGCCCCAGCTAGAGAAGTTGCTCGGGCGGCTGAAATATATGAGAGAACTTTAGATAAAGTAAAAGGATTTATACAATCTGGATTGAAAATAACATCGGCACAACCATTTAATTATGAAAATATATTATCAGCACAACATTTAGAAATCATAGCCAGACTCTCTGGTTGCGTAGCACACCGTGAAAGCAAAGACTGTTCTGACATGTGTTTCCATAAAAAATACCGAAGTATTGATGGCAGTTGCAATAACTTTGATCAACCAACATGGGGTACATCGCTCACTGCATTTCGACGCATTCTCTTTCCTATTTACGAAAATGGCTTTAGTGAACCAACAGGTTGGAACAAGAAAGTTAAATATAATGGTTATTCTTTACCGAGTGCTCGGTTAGTTTCTACAACAATTATTAGTACCACTGAAATTTCTGAGGATGTTCGGATTACTCATATGACAATGCAATGGGGTCAGTGGTTAGATCATGACTTAGATCACGCTTTGCCATCTGCAAGTTCTCAAACGTGGGATGGTGTTGACTGTAAAAAAACATGTGACTACGCGGCTCCTTGTTTTCCGATAGATGTCCCTAAAAATGATCCTCGAATAACCAACCGCCGATGCATTGATTTTATTCGAACTAGCGCTGTATGTGGATCGGGTATGACCTCGGTTTTATTTGGCAGACTGCAGCCAAGAGAGCAAATAAATCAACTCACGTCTTACATTGATGCCTCTCAAGTATATGGTTTTGAGAAATCTGTAGCTGAGGATCTCCGTGATTTGACGAACACTAACGGTACTCTCCGAGTAGGAGCTAAGTTCCCGGGTAAGAAACCATTACTACCAACAACAGGTTTAAATGGTATGGACTGCAGACGTAATCTTGCAGAAAGCAATCGTAATTGCTTTGTTGCGGGTGATATAAGGGCAAATGAACAGATTGGTTTAGCTGCTATGCACACTATCTGGATGAGAGAGCATAACCGTATCGCAACAGAACTAAAAGCCATAAATCCCTTCTGGGACGGAGAAAAATTATACCAAGAAGCGAGAAAAATTGTCGGAGCGCAAATGCAAGTCATAACTTACGAACAATGGCTGCCTCTCATTCTTGGTCCAGAGGGATACGAACAGCTGGGAAAATACAAGGAATATGACCCTAATCTAAACCCTTCAGTCTCAAACGTTTTCGCCACTGCTGCTCTTCGATTTGGACACTCTATCATTAATCCACTTTTACATCGTTATGACGAGAACTTTGAGCCGATCCCTCAAGGTCATTTACTGTTGCGTCATGCATTTTTCTCCCCATGGAGACTAGTCGATGAGGGTGGAGTTGATCCGCTATTTAGAGGAATGTTCACGACGCCTGCTAAATTGAAGACACCAACACAGAATTTAAACTCTGAACTTACGGAAAAACTATTCCATACTGCACATGCAGTCGCTCTTGACTTAGCTGCAATAAATATTCAACGAGGACGTGATCATGCTATTCCACCGTACAATAAATGGCGGCAATTTTGCAATATGACCGAGGCTAACGATTTCGATGACTTGGCCAATGAGATCACTGACAAAACCGTACGAGACAAGCTAAGAGAATTGTATGGCTCTGTGCACAATATTGATGTTTGGGTTGGTGGCATTTTAGAGGATCAAGTTGAGGGAGGTAAAATAGGACCTCTTTTCCGATGCTTACTTATTGAACAGTTTCAACGATTACGTCATGGCGATCGTTTGTGGTATGAAAATCCGTCGACATTCTCAAGAGACCAATTGCGACAAATCAAAAACGCAAACTTTGCAAGGGTTTTATGTGATAATGGTGACAATATTGATACAATAAGTGAGAATGTATTCTTGTTACCTGAATTACAGGACGGTCTTGTATCTTGCGAGGATGTCCCTAAGATCGATCTACGTTTTTGGGCCGACTGTGAATCATGCGGCGATGATGATTACGAAACTGAATCAAATCGAGTGCGCAGAGATGTAATGTCAAGTGCCGATCTTTACACTGAACTGACAGAAAATGATCACCGTCTAAATACCCTAGAAGATTCTCACGAGGAATTGGTGAAAGCAATTAATAAGCTTAAAAAGAGGGTCAAAGAGTTAGAGAAAGCATGCAATAAGTAA

Protein sequence:

>DPOGS206025-PA
MSRSESLRNICGVEAERPLGPASDERRPFLIPALTLVPLSLCEAAVNASHCPPTMGTMKFLNFLLILNFIILTSSEVCPPKCQCNRNAIRCLHQDLLEIPKSPAEAHTLDLRFNRIFDIAPGIFSHLHRLRSLLLNDNQLRELRSGAFHGLRRLRYLYLYRNHIKHIASDVFHGMSHLEQLYLHVNEIHQIEPETFSNLPRLGRLYLHNNNLKTIPPGSFRGMPKLSKLRLDSNALVCDCNMLWFARMLAEHRNITIAATCYEPAKATGTSLAAMQEKDFHCRQPEIMSDPEDVVVNFGDEAIFTCMASGEPAPEIVWFRDSAALPDDTSRYEIMDNGTLMVHHADENDIGVFECSAKNPAGEARSKPARMMLQTKPDNNAFPVFTILPRKSVVNINQPYARFDCVAKGNPKPHISWYFNGERILLTDRITMHHNGSIVIENIKYEDTGSYTCQAENVNGKITASVTLEVMVAPAFIIVPKDQTVTIGDSAHFRCTARGTPTPIIKWYRNTMSLPPSENIVFSDNDQNLTIVETSEDDAGLYHCRAENSEGLTEISAVLKIESFEIIPPKITLKPEDTDAFKETTVQLPCEYESDPPALVEWRKDGSRIITNDRISISLIGSLIINNVSITDTGSYECSVHNEHGRDTASSFLTVKDHILPGDEYVNIAITEAIRDVDQAIAKSIDNLFNNKSSNISFQDLYRITRFPNAPAREVARAAEIYERTLDKVKGFIQSGLKITSAQPFNYENILSAQHLEIIARLSGCVAHRESKDCSDMCFHKKYRSIDGSCNNFDQPTWGTSLTAFRRILFPIYENGFSEPTGWNKKVKYNGYSLPSARLVSTTIISTTEISEDVRITHMTMQWGQWLDHDLDHALPSASSQTWDGVDCKKTCDYAAPCFPIDVPKNDPRITNRRCIDFIRTSAVCGSGMTSVLFGRLQPREQINQLTSYIDASQVYGFEKSVAEDLRDLTNTNGTLRVGAKFPGKKPLLPTTGLNGMDCRRNLAESNRNCFVAGDIRANEQIGLAAMHTIWMREHNRIATELKAINPFWDGEKLYQEARKIVGAQMQVITYEQWLPLILGPEGYEQLGKYKEYDPNLNPSVSNVFATAALRFGHSIINPLLHRYDENFEPIPQGHLLLRHAFFSPWRLVDEGGVDPLFRGMFTTPAKLKTPTQNLNSELTEKLFHTAHAVALDLAAINIQRGRDHAIPPYNKWRQFCNMTEANDFDDLANEITDKTVRDKLRELYGSVHNIDVWVGGILEDQVEGGKIGPLFRCLLIEQFQRLRHGDRLWYENPSTFSRDQLRQIKNANFARVLCDNGDNIDTISENVFLLPELQDGLVSCEDVPKIDLRFWADCESCGDDDYETESNRVRRDVMSSADLYTELTENDHRLNTLEDSHEELVKAINKLKKRVKELEKACNK-