Monarch geneset OGS2.0

DPOGS214126
TranscriptDPOGS214126-TA1152 bp
ProteinDPOGS214126-PA383 aa
Genomic positionDPSCF300014 - 1518379-1520275
RNAseq coverage401x (Rank: top 30%)
Annotation
HeliconiusHMEL0113771e-10880.44% 
BombyxBGIBMGA006175-TA0.088.05% 
DrosophilaAtg4-PA2e-10152.04% 
EBI UniRef50UniRef50_E0VF888e-11255.22%Cysteine protease ATG4A, putative n=9 Tax=Coelomata RepID=E0VF88_PEDHC
NCBI RefSeqXP_972923.18e-12962.18%PREDICTED: similar to Autophagy-specific protein, putative [Tribolium castaneum]
NCBI nr blastpgi|910831932e-12762.18%PREDICTED: similar to Autophagy-specific protein, putative [Tribolium castaneum]
NCBI nr blastxgi|3838611446e-13360.00%PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
Group
KEGG pathwaytca:6616822e-128 
 K08342 (ATG4)maps-> Regulation of autophagy
InterPro domain[1-384] IPR0050783.2e-190Peptidase C54
Orthology groupMCL13621 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214126-TA
ATGGATGCCATGTTTGATATTTGTTACCTTTCGCCAGACGGTATTAATGTAGAGCCTGATGATATTCCGGAAACAAAGGATAACGTATGGGTATTAGGAAAAAAATACAGTGCTATACAAGATCTGGAGCGTATAAGACGCGATATCACTTCGGTAATATGGTGTACCTACAGAAAGGGCTTTGTCCCTATAGGAGATGAAGGTTTGACATCGGATAAGGGATGGGGCTGCATGCTCCGTTGCGGACAAATGGTGCTTGGAGTAGCATTGATTAAAGTTCACTTATCTGCTGATTGGGTGTGGACTCCCGAAACAAGAGATCCAACATATTTAAAAATAGTCCAAAGATTGGAGGAGAGAAAACAAGCTCCATACTCAATTCATCAAGTGGCTTTAATGGGGGCATGCGAAGGAAAGGAAGTAGGCCAGTGGTTTGGCCCAAATACTGTGGCGCAAGTACTCAAAAAATTGGTGGTTTATGACAAATGGAGTTCTTTGGTTATTCATGTTGCTTTGGACAATACAGTTGTTAAGGAGGATATTTTGCAACAGTGTATTGTCAATAATGACAGAGGGGATTGCTCTGAGAATGTAGATGGATTTGTTGTAAGTGATTGGATGCCTCTTCTGTTAATAGTACCTCTTAGACTTGGGCTTAGTGAAATTAATCCTATTTATATGGAAGGGCTTAAGATATGTTTTCAATCACCTCAATCTATAGGTGTAATAGGAGGCAAGCCGAATCAGGCTCTTTATTTAATAGGTTGCGTTGGTGATGAAGTTATATACTTAGATCCACATACAACACAGAAATCTGGATTAGTGGAGAACAAACTTACAGATGAACAAAAAGAAATGGACTGTACATATCACTGTAAGTATGCTTCAAGAATTCCAATATTGTCTATGGATCCCTCTGTGGCAGTGTGTTTTCTTTGTCGCACAAGAAGTGATTTTGATGAACTATGTGAATTAATTGAGAAAAGATTAATGCAAGAGAGTCAACCATTGTTTGAAATATGTGAGAAAAGACCATCACATTGGGGCCCCAATACAAACGACATTGATTTACAGAACACTAATCTATTCACAGAATTTGAAGAAGTTGACAGACAATTTGATGATTCTGACATTGAATTTGAAATTCTATGA

Protein sequence:

>DPOGS214126-PA
MDAMFDICYLSPDGINVEPDDIPETKDNVWVLGKKYSAIQDLERIRRDITSVIWCTYRKGFVPIGDEGLTSDKGWGCMLRCGQMVLGVALIKVHLSADWVWTPETRDPTYLKIVQRLEERKQAPYSIHQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSSLVIHVALDNTVVKEDILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQSIGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIPILSMDPSVAVCFLCRTRSDFDELCELIEKRLMQESQPLFEICEKRPSHWGPNTNDIDLQNTNLFTEFEEVDRQFDDSDIEFEIL-