Monarch geneset OGS2.0

DPOGS202319
TranscriptDPOGS202319-TA1422 bp
ProteinDPOGS202319-PA473 aa
Genomic positionDPSCF300032 + 459935-461356
RNAseq coverage169x (Rank: top 51%)
Annotation
HeliconiusHMEL0076470.085.65% 
BombyxBGIBMGA004926-TA0.083.70% 
DrosophilaCG6194-PA3e-12750.11% 
EBI UniRef50UniRef50_G6DEX60.0100.00%Autophagy related protein Atg4-like protein n=5 Tax=Endopterygota RepID=G6DEX6_DANPL
NCBI RefSeqXP_971091.27e-14356.00%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|2133900420.083.62%autophagy related protein Atg4-like protein [Bombyx mori]
NCBI nr blastxgi|2133900420.083.62%autophagy related protein Atg4-like protein [Bombyx mori]
Group
KEGG pathwaytca:6597202e-142 
 K08342 (ATG4)maps-> Regulation of autophagy
InterPro domain[53-474] IPR0050785.8e-218Peptidase C54
Orthology groupMCL13503 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202319-TA
ATGTTTAACGGAACAAGCAGCGCTATTACGGCAACTTCCTTGCTTAAAACTTCCACAGGCAGTGTTAGTCAAGGTACGGAGACTATGAAGGTCGAAAACGTTTCTAAACCTGCCACATCGAAAGATAATACGCGTGACAGTTCAGAGGATCTCCTTGATCTCAAGGGAAAAGTAGAATCACGTTTACTATCAATGTGGAACAATGTTAAATTCGGATGGACTGTTAAGATTAAAACAAATTTTTCCAAAGAGTCTCCTGTCTGGTTACTAGGTCGGTGTTACCATCGCAAATTGAGTCCTACGGGATCTTTGGAATCTTCAACCGAAATTGGCACAGAAGCCACAGCTCATGAACAGATGGAACAAATATATGGCGAAGGTATTGAAGGGTTTAAGTCGGACTTTATTAGTAAAATTTGGATGACATATCGAAGAGAGTTTCCCACTATGTCAGGATCCTCTTTCACAACAGATTGTGGTTGGGGTTGCATGCTTCGTAGTGGACAAATGATGTTAGCTCAAGCTCTTGTATGCCATTTCCTTGGTCGCTCGTGGAGGTGGTCGGAAAAACCAATACAAAATGGTAGAGAATTCCAAGAAGACTGCCTCCATCGCATGATTATTAAATGGTTTGGTGATAAATCATCTGTTAATAGCCCTCTTTCAATTCATCAGATGGTAACTTTAGGTGAAGCATTGGGGAAAAAGCCAGGTGACTGGTATGGTCCTGCGTCGGTAGCTCACTGTCTCAAATCAGTCATGGTTGAGGCTTCGAAAGAAAACTATGAATTTGATAAATTAGAAGTTTATGTTGCTCAAGATTCAACCATTTATATTCAGGATGTGTACACACACTGTAGATTGCCTAATGGTTGTTGGAAATCACTCATACTTCTGGTACCTGTAAAATTGGGTACTGAAAGGTTGAACCCTATTTATGGTCCCTGCTTAACATCGCTGTTGACGCTAGATTTCTGCATCGGAATTATTGGTGGCCGTCCCAAACATTCCCTTTATTTTGTGGGATATCAAGACGACAGACTTATACATTTGGATCCTCATTACTGCCAGGAAATGGTCGATGTGTGGCAGCCGAATTTTTCTTTACAAACATTTCATTGTCGTTCTCCAAGGAAGATGCCTATCAGTAAAATGGATCCATCTTGCTGCATAGGTTTCTATCTTCAAACGCATCACGACTTTGAAACCTTTGTGAATGTTATAAATACGTTCCTAACTCCACAAGGAGTTTCTTCCAGTAATGAGTACCCAATGTTCACGCTTCATAGTGGATCTCGCAGCACAGTTATGAACCCACCTAACATTCGATACTCGATATATGAATCGGAACACAATTGGGCGGCTCCAAATTTACAAGACAGTGACACCGACATGGAATCAGAAGAATTTGTTTTACTTTAA

Protein sequence:

>DPOGS202319-PA
MFNGTSSAITATSLLKTSTGSVSQGTETMKVENVSKPATSKDNTRDSSEDLLDLKGKVESRLLSMWNNVKFGWTVKIKTNFSKESPVWLLGRCYHRKLSPTGSLESSTEIGTEATAHEQMEQIYGEGIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWRWSEKPIQNGREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGPASVAHCLKSVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTHCRLPNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDRLIHLDPHYCQEMVDVWQPNFSLQTFHCRSPRKMPISKMDPSCCIGFYLQTHHDFETFVNVINTFLTPQGVSSSNEYPMFTLHSGSRSTVMNPPNIRYSIYESEHNWAAPNLQDSDTDMESEEFVLL-