Monarch geneset OGS2.0

DPOGS215049
Transcript         DPOGS215049-TA  4422 bp
Protein            DPOGS215049-PA  1473 aa
Genomic position   DPSCF300208 - 50432-71234
RNAseq coverage    205x (Rank: top 47%)
Annotation
Heliconius         HMEL002000        0.0  62.12%
Bombyx             BGIBMGA005680-TA  0.0  61.73%
Drosophila         CG42331-PD        0.0  53.34%
EBI UniRef50       UniRef50_E2BAM2   0.0  58.49%  Peroxidasin-like protein n=1 Tax=Harpegnathos saltator RepID=E2BAM2_HARSA
NCBI RefSeq        XP_001607719.1    0.0  58.97%  PREDICTED: similar to oxidase/peroxidase [Nasonia vitripennis]
NCBI nr blastp     gi|383865743      0.0  52.93%  PREDICTED: uncharacterized protein LOC100875470 [Megachile rotundata]
NCBI nr blastx     gi|383865743      0.0  48.06%  PREDICTED: uncharacterized protein LOC100875470 [Megachile rotundata]
Group
Gene Ontology      GO:0006979  5.2e-186  response to oxidative stress
                   GO:0020037  5.2e-186  heme binding
                   GO:0004601  5.2e-186  peroxidase activity
                   GO:0055114  5.2e-186  oxidation-reduction process
KEGG pathway       tgu:100218312  9e-96
                   K00431 (TPO) maps-> Cytokine-cytokine receptor interaction
                                       Autoimmune thyroid disease
                                       Tyrosine metabolism
                                       Hematopoietic cell lineage
                                       Jak-STAT signaling pathway
InterPro domain    [408-986]  IPR010255  5.2e-186  Haem peroxidase
                   [562-957]  IPR002007  8.4e-179  Haem peroxidase, animal
                   [438-449]  IPR019791  1.5e-39   Haem peroxidase, animal, subgroup
Orthology group    MCL15629   Insect specific

Nucleotide sequence:

>DPOGS215049-TA
ATGACACGCCGTAGCCTTATGGTGTCCACGAGGCATAAAAACATAATCATCGTCTTGTTTATATTAATCACCGCTGATTGCAAGACAACCAACAGCAAATCGGATGTTGAAAACACAACACACAACTGTCCCTTCACGAGAATCATCGAAGACATTTTCAACACTAAAAGAACGCCATCAAATGCACCGTACTGGAAGGAGAGAGACGCCGAGCTGTTCGTTAACGCGACCAGCTTCTTTGAAACGAACCAAAATATAAGCAAATCCATTCAAGAGGAGCTAGTCAATGACATAATAGCTTCTATCAAACGAATCAAAGCAAATCTGTCAACATCTGACGACAATAATAAACTCAACGAAGCGATAATTATAGCACCCGAACATAACGAAGACAATCATCATACGGACAAAAAAGTGACTCAAATAAAGAGTGGTATAGAGGACAAAACTGATATTCAAATGTCCGAACCAAAACAAAATAAAGAACAAAGGATCACGAAACAAACAGAGAAGCCTGACAGAAATACCTTTGTTGAAGTATTGACTATAGAACCTAACAACACGAAGTCTAATACGTATTCAATTTTAGATGTCCTAAAAAATCTAATGCCCAAACTGAACGCATCGATATCGAAGGATCTTCAGAGCATCACCATCATAGAGAAAAATCAAAATAAAAACCTCTCCGTGTCCGAAACGAAAAACGTGTCGACAATAGTTATTAAGTACTGCGATAAAGATAACTTGACGGAGACCATTAACACGGGGGACGATGAAAACAAAGGCGACGTTAAAACATTGATTGGACATGATGATAGTGACGATGAAGAGGATTATGGAGAAATAGCCGATGATAAGAGCTATAACAGAAGTCTCACAGAGAGCAGGAAAGATATATTGGAAGCTGCGAAATATGGAATGGAGAAAATGCATGAACTGTATGGTGTGTTAGAGCCAAAACTCTATTCTATGGGTCTCGTATTGGACGAAAAGGACCCAGCACGTTACGTGGCTGCCTTTAATGCACCATCAGAGAACGCTGACACATACGCCAAGTATGGATACGCGTCTCTCCAGGCATCAGCGAGGTTCAGGGAACTAGCCAGCGTTGACAGTGACAATCTTGAGAGTCGATCGGAAGAAAACCAGTTTCCTGATGCGAGCTCCCTGCGCCAGTCCCCACTAGTTCAACAATGTCCCCTTCGAGGCGCTCCCAAATGTCCGCCAGCATCCAAAAGATATCGCACTCACGATGGTACTTGCAACAACCTGAGTCGGCCTCGCTGGGGTGCCACCATGACCCCGGTACAGCGGTTCCTTCCACCAGTTTACTCCGATGGTATTCAAGCACCAAGGAAATCAATTTTCGGCTCTACCTTACCTTCAGCCAGAGAAATCAGCGCGCTAGTACACGAGGACAAGGATTCTGAAAATTCTGGAATAACGCATTTGCTCATGCAGTGGGGTCAATTTCTGGACCACGACATAACGTCTTCCTCCCAGTCCCGAGGGTTCAACGGTTCAGTGCCGCGGTGCTGCAAGGACGGAGGAAGAGACTTCATACCTCAAGAGTTCATGCACCCGGAATGTCTTCCGATCGCCGTCCCACCCTCGGATCCCTTCTACGGTCCCCGCGGTGTTCGTTGCCTGGACTTCGTTCGATCGTCTCCGGCGCCGCGGGAGGACTGCGCCCTCGGCTGGAGGGAACAGTTCAACCACGTGTCCTCGTACATCGACGGATCACCACTTTACGCCAGCTCCGCGAGACAGTCGGACAGGTTGAGACTGTTCAGGAATGGTATGCTGCAGTATGGGCGGGTGCAGCAGCGCCGTCCTCTGCTGCCGGCTGAACGTGATGAGTTGTGCCGCGGGGGCGCTCTATCCACGGACTGCTTCAAGTCGGGGGACGCGCGGGTCAATGAACACCCCGGTCTCGTCGCCAAACACATCGTCTGGCTCAGACAACATAACCGAATGGCCCAGGAACTGGCGCACCTTAA
CCCTCACTGGAGCGATGAAAAAATTTATCAGGAAACTCGAAAAATAGTGGGAGCTATGATACAACACATTACTTACAGGGAATTCCTACCGATCGTTTTGGGTCCTGAAGTGATGCGTCTTTTTGAGCTGGAGCTTCTTCCGAAGGGCTATTTCAAGGGCTACAGCGCCAAGACCAATCCGAACCCAGCCAGTTCTTTCGGTACAGCCGCTTTCCGCTTCGGACACAGTCTGGTTCAGTCGTCGATGATGCGCTTTGATAAGTTCCACAGACCGATCAATAACAACGTTTCCCTCCACGCGGAGCTTACAAACCCGTCCAACATCTGGAGCGTGGGTGCCGTGGACCGACTGCTGCTGGGGATGCTGAACCAGCACATACAGAAGAGGGACGAATTCATTACGGAAGAACTCACCAACCACTTGTTTCAAACCAATCACTTCAACTTTGGGATGGATTTGGCTGCTATTAACATCCAGCGAGGAAGAGACCACGGGGTGCCGCCGTATACCGCGTGGAGGGAGCCCTGCGGACTGACGCCCATCACGGACTTCGATGACTTAGTGAGGGTGATGCCGGCACGGGTCGTGAGGAAGTTGAAGGTGTTATACAGACACGTGGATGACCTGGACCTGTTCACGGGCGGCGTGTCCGAGCGCCCCGTGGCAGGCGCCCTCGTCGGGCCGGTGTTTGCATGCATCATAGCTCAACAGTTCGCAAACTTACGGAAAGGGGATCGCTTCTGGTACGAAAATGGTGGTTTCGATTCATCTTTCACTCCGGCTCAATTGCAACAGATAAGACGAATATCTCTGTCACAGGTCCTTTGTAGTACTCTGGACTCAATAGATAACATACAACCTTTCGCTTTCCTCTCACATGAAAATCCAAAAAATGACAGGATATCATGCCGTAATGGCTTACTTAACAATTTTGACCTATCCGCTTGGATCGAATTACATTCAGACTCAAATAATATTAAGAAATCTGACGAAAATCAACAGAGTTCTAAGACGAAAACCAAGCGAACAACCGTGAGACCGACCACGACAACTAAACATCCCCAAAAACTATCTCAAACTTTGACTCAACAAAAATCTGAAAAATTTAGACTTAATAATATGACAGAGACCGATGACACTGATAAAGACAAAGACAAAAACGCAGACGATGAACCGACTGGGATCAAACCTAACGCTACAGTAGTTATAGACGATAAACTAGACTTTAGAAACAAATCACGACGCTTTACCGACTTCGACGACGAGAGAAACCCCCCGACTAGACAATACAATGACTATTATGATGACGTACAAAGCGTACAATCAGTTGTTATCAACAACATACCAAATAACAGACCAAACAGACGACCTTACATATCCGTTACTGAGAATATTGCTGACAAATACACGTATCTTATTAACTATGTTCCCCGACCGACTCACTCCTGGCGGCAGACCACTAGACGTTCTCACGATCGTGACGTTGTTAAAGTCACATATCAGACTTACGAAGACACTTACGGCCGTCCAAACAGACCTTACTTCAACAGAGACGAACTTGACAATGACTTTGAATCGCGGCAACAGAAGCCTGTGACGGAAAGCTTCCAGTCATCAGCCAGATCGATTGACAACGAAGCACCGACTCCACAGTTGAAGTTATCAACTGAGAGCTCAGTGCAGACTGAAAAAAACAGACCAATAGATACACAGACAGACAAAACTGACTCAACAACAGAAAATTTGTACAAACTTTTAACTTTTGGTTATGTAGGAACTTATAAACGAGACAAGATTGTAAATGATGACACTAAAGACTCGAAAGACAATACGAACAAACATGACTCTGGCGACCATAACGTCAGTTTAGACTTCTCGACCGTAGTAAACAATGAGACGGACGATGATGACAAACAAAACGTAAAACTTTCAACTTTCATAGTTTACGATACAGCCACTAAACCTTACCTGACCAGCTCACAGAGACCGACGAGACGTA
ACGATGACGAGACCACGGAAAAGAAAGACAAATATTATTTCATTCAAAACGTCTTACATAAATACTCTGAAACAAAGAGCGACGACCTCAAGAAAACGAGCAGCGGAAAAGATAAAAACAACACTGACCAATACATAGGAATCGAGGAGAGGTTAGGCAACGACAGCTTGGACGATGACGAGAGACCAGTGAATGTGAGAGCGAAAATAAAATCAAGAAAACCATCGAGTTCAGCGAAAACTCCATCGGTCGCTTTTCAAATTATTCCTAGCGAAAACAATCCATCACAATGGGCGGTTTATGAGGAGAAAGAAGATCTTTCGGGGCAAATACCACAGATGCCAAGCATTAAGATCGACCCACACGCTCTACGGGAAGTGCCAAGACCTATGAATTTCGGTTTTAGAAAACGACACGGATAA

Protein sequence:

>DPOGS215049-PA
MTRRSLMVSTRHKNIIIVLFILITADCKTTNSKSDVENTTHNCPFTRIIEDIFNTKRTPSNAPYWKERDAELFVNATSFFETNQNISKSIQEELVNDIIASIKRIKANLSTSDDNNKLNEAIIIAPEHNEDNHHTDKKVTQIKSGIEDKTDIQMSEPKQNKEQRITKQTEKPDRNTFVEVLTIEPNNTKSNTYSILDVLKNLMPKLNASISKDLQSITIIEKNQNKNLSVSETKNVSTIVIKYCDKDNLTETINTGDDENKGDVKTLIGHDDSDDEEDYGEIADDKSYNRSLTESRKDILEAAKYGMEKMHELYGVLEPKLYSMGLVLDEKDPARYVAAFNAPSENADTYAKYGYASLQASARFRELASVDSDNLESRSEENQFPDASSLRQSPLVQQCPLRGAPKCPPASKRYRTHDGTCNNLSRPRWGATMTPVQRFLPPVYSDGIQAPRKSIFGSTLPSAREISALVHEDKDSENSGITHLLMQWGQFLDHDITSSSQSRGFNGSVPRCCKDGGRDFIPQEFMHPECLPIAVPPSDPFYGPRGVRCLDFVRSSPAPREDCALGWREQFNHVSSYIDGSPLYASSARQSDRLRLFRNGMLQYGRVQQRRPLLPAERDELCRGGALSTDCFKSGDARVNEHPGLVAKHIVWLRQHNRMAQELAHLNPHWSDEKIYQETRKIVGAMIQHITYREFLPIVLGPEVMRLFELELLPKGYFKGYSAKTNPNPASSFGTAAFRFGHSLVQSSMMRFDKFHRPINNNVSLHAELTNPSNIWSVGAVDRLLLGMLNQHIQKRDEFITEELTNHLFQTNHFNFGMDLAAINIQRGRDHGVPPYTAWREPCGLTPITDFDDLVRVMPARVVRKLKVLYRHVDDLDLFTGGVSERPVAGALVGPVFACIIAQQFANLRKGDRFWYENGGFDSSFTPAQLQQIRRISLSQVLCSTLDSIDNIQPFAFLSHENPKNDRISCRNGLLNNFDLSAWIELHSDSNNIKKSDENQQSSKTKTKRTTVRPTTTTKHPQKLSQTLTQQKSEKFRLNNMTETDDTDKDKDKNADDEPTGIKPNATVVIDDKLDFRNKSRRFTDFDDERNPPTRQYNDYYDDVQSVQSVVINNIPNNRPNRRPYISVTENIADKYTYLINYVPRPTHSWRQTTRRSHDRDVVKVTYQTYEDTYGRPNRPYFNRDELDNDFESRQQKPVTESFQSSARSIDNEAPTPQLKLSTESSVQTEKNRPIDTQTDKTDSTTENLYKLLTFGYVGTYKRDKIVNDDTKDSKDNTNKHDSGDHNVSLDFSTVVNNETDDDDKQNVKLSTFIVYDTATKPYLTSSQRPTRRNDDETTEKKDKYYFIQNVLHKYSETKSDDLKKTSSGKDKNNTDQYIGIEERLGNDSLDDDERPVNVRAKIKSRKPSSSAKTPSVAFQIIPSENNPSQWAVYEEKEDLSGQIPQMPSIKIDPHALREVPRPMNFGFRKRHG-
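
The transcript and protein lengths listed in this record can be cross-checked against each other: 4422 bp is 1474 codons, which is 1473 amino acids plus one stop codon. The Python sketch below illustrates that check and translates the first 30 and last 12 nucleotides of DPOGS215049-TA with the standard genetic code. It assumes the transcript is a complete coding sequence with no UTRs, which the record does not state explicitly. The function names are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Protein length implied by a CDS-only transcript (assumption: no UTRs)."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon


# First 30 nt and last 12 nt of DPOGS215049-TA, copied from the record above.
cds_start = "ATGACACGCCGTAGCCTTATGGTGTCCACG"
cds_end = "CGACACGGATAA"

print(translate(cds_start))           # N-terminal residues
print(translate(cds_end))             # C-terminal residues and stop
print(expected_protein_length(4422))  # compare with the listed 1473 aa
```

The translated ends can be compared with the start and end of DPOGS215049-PA, where the trailing "-" in the protein record corresponds to the stop codon shown as "*" here.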