Monarch geneset OGS2.0

DPOGS202240
TranscriptDPOGS202240-TA3918 bp
ProteinDPOGS202240-PA1305 aa
Genomic positionDPSCF300032 - 897182-908403
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0025920.079.06% 
BombyxBGIBMGA004832-TA1e-16274.55% 
DrosophilaAtg2-PA3e-4141.47% 
EBI UniRef50UniRef50_UPI000224793F4e-12743.38%UPI000224793F related cluster n=3 Tax=unknown RepID=UPI000224793F
NCBI RefSeqXP_001122229.17e-12143.93%PREDICTED: similar to Autophagy-specific gene 2 CG1241-PA [Apis mellifera]
NCBI nr blastpgi|3454958231e-12643.38%PREDICTED: LOW QUALITY PROTEIN: autophagy-related protein 2 homolog A [Nasonia vitripennis]
NCBI nr blastxgi|3454958238e-13143.48%PREDICTED: LOW QUALITY PROTEIN: autophagy-related protein 2 homolog A [Nasonia vitripennis]
Group
KEGG pathway 
Orthology groupMCL30007 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202240-TA
ATGTTGTGGTATCTACCATGGTCCGAAAGCATAAAAAAAAGAGCCTGCCGATATTTACTCCAACGATACCTTGGAAATTATCTTGAAGAAAAACTTACTTTGGATCAGCTGAGTGTCGATCTTTACAATGGGACTGGTACCGTTTCAGATGTTAGCCTTGACTGCGAGGCCCTCAATGAGTTGGGGGACAGTCAAAATTGGCCATTGGAAATAGTTGATGGTCAAATGAAAGAAATAACAGTCACAGTACCCTGGTCTACGCTTCTCAAAGATGATTCTGTTGTGGAAATTAATGGACTCTCACTTACAGTCCAACCTAAAGTTAGGCCTGAACCTGCATCATCAATGCTGGAGTCTATGTGGTCTTCCATGTCATCATCAATGCAACTGGCGGCAGAATGTCTTAGGGAAGAGGCTGGCCCACAGGAATCAAACCCCGTTGAAGGCATTGAGATGTTTGCGCATGCTATAGATTCAATTTTGAGCAGAGTGAAGGTAAAATTCGTGAATACAAAAATAAGAATAGAACATGTTCCAAAAAATGGTGACAAGGGCATCGCACTGGAGGTCCATATTGAAAATATCGACTATTTCGACGAAGCCGGAACGGAACCATCACCGGAAACGACGGATCCAGATAAAACTAAAACCTACATCGTGTCAACATACACAAATAAAAAAATCAAATTCAATGGCGTCGTGTTCAACATAGACGAGTTCCCATCGAAGCTGCGTACAATGGCTCCGAGTCTGATGGAAAAGTCCGCGTCATCTGTCGACAGGTCCGACGGAGCGTCAGTCGACACCCCTAACTCAAATTACCAGAGCACAATGTCCGATGTGTTCTATGAAACTAGGAGTGTAATGTCTACAATAGATTCGGATCCTGTTAAGGAGATAATTGAAGAGAGACATATCGGAAGCGCAAGAGAGTCGACTCCGCCGGACAGAACCACGCAAGCAGACCCAATACTATTTGCAAAATTAACTGGTCAACAGGAATTAGCCCTCAAACTTAAGCATTCGGAGGAAGTTGAGGGGCCAAAGGTCGAGGTCAAAATTCTATTAGGGTCATTCTTGATATTCATAACGCCGAGGCAGATGCGTACAATCATTGAACTCATAGATGCTTTGAATCAACCGCATCTTGAAGATACGAGTAATATTCCAGTCCGACCCAGCAACATTAACATGCAATGCAAGCCGATGAAGCAGGCTGACTTTCAGCTGATAGAAGCACAACTGCTCGGTAACTTGGACAAGCAGCAAGCCAAACCCAGCAACATGTACGGCTGGTCCGGTCCTAGTTTTGAAGACGGTGAGACTGATGAGAAATTCCTGCCGATGACGTCACAGGGCCTTATGTCTGAGAGCTTCACCAGCTCCATCAGCTCAATGAGCAACAGCATGACGTCCAGCATGAACATATCCAGTCAGCCGAGGATCAAGAGAAACAAGAAAGTTCCTCACATTGAAGGCGATCCCACCGCTGAAGTGTCACATATAAGCTTGAGGGTGGCCTCAATATCGTGTGTACTACTACATAAGGATATCCTCGCCCCAACACCACTGTCCTTGGACAGCGTCTCCTACTCCAGCGCTTATAAAATGCAACAAGTAGCGACGGAATTCTTCAACCGTATAGAAACATTCAGCAAGTTCGACGAGAGAAGAAACATACATCATGTCAACGACTCATTGGATGCAGCCACCGACAGAGATCATTTAAGATTGTTGATGTCAGAGATCAGTGTGGATGGCAGCGAGAAGGTGACGTCACACGGCAGCCACACGGTCTGCGAGGCGGCCGTGCATGAAGCGTTACTCAGAGAGTGTCTGTACCACGAGGGACAGAGACAAACATATGACTTGATTCGATTCGATAGAGTTTTGAGCAGAGTGAAGGTAAAATTCGTGAATACAAAAATAAGAATAGAACATGTTCCAAAAAATGGTGACAAGGGCATCGCACTGGAGGTCCATATTGAAAATATCGACTATTTCGACGAAGCCGGAACGGAACCATCACCGGAAACGACGGATCCAGATAAAACTAAAACCTACATCGTGTCAACATACACAAATAAAAAAATCAAATTCAATGGCGTCGTGTTCAACATAGACGAGTTCCCATCGAAGCTGCGTACAATGGCTCCGAGTCTGATGGAAAAGTCCGCGTCATCTGTCGACAGGTCCGACGGAGCGTCAGTCGACACCCCTAACTCAAATTACCAGAGCACAATGTCCGATGTGTTCTATGAAACTAGGAGTGTAATGTCTACAATAGATTCGGATCCTGTTAAGGAGATAATTGAAGAGAGACATATCGGAAGCGCAAGAGAGTCGACTCCGCCGGACAGAACCACGCAAGCAGACCCTATACTATTTGCAAAATTAACTGGTCAACAGGAATTAGCCCTCAAACTTAAACATTCGGAGGAAGTTGAAGGGCCAAAGGTCGAGGTCAAAATTCTATTAGGGTCGTTCTTGATATTCATAACGCCGAGGCAGATGCGTACAATCATTGAACTCGTAGATGCTTTGAATCAACCGCATCTTGAAGATACGAGTAATATTCCAGTCCGACCCAGCAACATTAACATGCAATGCAAGCCGATGAAGCAGGCTGACTTTCAGCTGATAGAAGCACAACTGCTCGGTAACTTGGACAAGCAGCAAGCCAAACCCAGCAACATGTACGGCTGGTCCGGTCCTAGTTTTGAAGACGGTGAGACTGATGAGAAATTCCTGCCGATGACGTCACAGGGCCTTATGTCTGAGAGCTTCACCAGCTCCATCAGCTCAATGAGCAACAGCATGACGTCCAGCATGAACATATCCAGTCAGCCAAGGATCAAAAGAAACAAGAAAGTTCCTCACATTGAAGGCGATCCCACCGCTGAAGTGTCACATATAAGCTTGAGGGTGGCCTCAATATCGTGTGTACTACTACATAAGGATATCCTCGCCCCAACACCACTGTCCTTGGACAGCGTCTCCTACTCCAGCGCTTATAAAATGCAACAAGTAGCGACGGAATTCTTCAACCGTATAGAAACATTCAGCAAGTTCGACGAGAGAAGAAACATACATCATGTCAACGACTCATTGGATGCAGCCACCGACAGAGATCATTTAAGATTGTTGATGTCAGAGATCAGTGTGGATGGCAGCGAGAAGGTGACGTCACACGGCAGCCACACGGTCTGCGAGGCGGCCGTGCATGAAGCGTTACTCAGAGAGTGTCTGTACCACGAGGGACAGAGACAAACATATGACTTGATTCGATTCGATAGAGGTGATGAAGATACAACTGTATCAACGAAATCAAATATACGAATGAATTTCAAACAGACATCCAAATATATATCAACATCGGGAGAGAGGAAACTTGTCTACCCTACAACGGATATTGTGTTGAAGTGTACTCCGTTCTACATAGATGTGGACCTGACCTTATTGGAGCGCATGTCTTCAACATTCTTCGGTGGGCCCCCTCCCCCGCCCTCCCCGCACGTCGCTTCGCCATCAAACAAGTCACAGAACCAAGTCAACTTCTCACTACAATGTCCTAACTTGGATATTATACTAAGATTCCCCATAGCGGATCTTCGTCCAGGAGGTCGTTCTGAGGCTCGCTCTGTCCGTCCCGACTACCTCCTCTTCAAGTTACACAACACCAACGTCGGCCTCCAACAGCTCGCCAGCGCTCGGCCACTGCCGACCACTATATCAATACGAATGACCACCCTGGATCTATACTACTATGTATGCACTTATAATCCCCTCTCTAACTTAAACTTTTCCTCTATAATTGACCAAGTATATGATAAGATATATACATATATATCTATAATATACGATTGCTTACTTTCCCTTTTATTTCATAGCAAAATAATGTCATATATTCTATTGTAA

Protein sequence:

>DPOGS202240-PA
MLWYLPWSESIKKRACRYLLQRYLGNYLEEKLTLDQLSVDLYNGTGTVSDVSLDCEALNELGDSQNWPLEIVDGQMKEITVTVPWSTLLKDDSVVEINGLSLTVQPKVRPEPASSMLESMWSSMSSSMQLAAECLREEAGPQESNPVEGIEMFAHAIDSILSRVKVKFVNTKIRIEHVPKNGDKGIALEVHIENIDYFDEAGTEPSPETTDPDKTKTYIVSTYTNKKIKFNGVVFNIDEFPSKLRTMAPSLMEKSASSVDRSDGASVDTPNSNYQSTMSDVFYETRSVMSTIDSDPVKEIIEERHIGSARESTPPDRTTQADPILFAKLTGQQELALKLKHSEEVEGPKVEVKILLGSFLIFITPRQMRTIIELIDALNQPHLEDTSNIPVRPSNINMQCKPMKQADFQLIEAQLLGNLDKQQAKPSNMYGWSGPSFEDGETDEKFLPMTSQGLMSESFTSSISSMSNSMTSSMNISSQPRIKRNKKVPHIEGDPTAEVSHISLRVASISCVLLHKDILAPTPLSLDSVSYSSAYKMQQVATEFFNRIETFSKFDERRNIHHVNDSLDAATDRDHLRLLMSEISVDGSEKVTSHGSHTVCEAAVHEALLRECLYHEGQRQTYDLIRFDRVLSRVKVKFVNTKIRIEHVPKNGDKGIALEVHIENIDYFDEAGTEPSPETTDPDKTKTYIVSTYTNKKIKFNGVVFNIDEFPSKLRTMAPSLMEKSASSVDRSDGASVDTPNSNYQSTMSDVFYETRSVMSTIDSDPVKEIIEERHIGSARESTPPDRTTQADPILFAKLTGQQELALKLKHSEEVEGPKVEVKILLGSFLIFITPRQMRTIIELVDALNQPHLEDTSNIPVRPSNINMQCKPMKQADFQLIEAQLLGNLDKQQAKPSNMYGWSGPSFEDGETDEKFLPMTSQGLMSESFTSSISSMSNSMTSSMNISSQPRIKRNKKVPHIEGDPTAEVSHISLRVASISCVLLHKDILAPTPLSLDSVSYSSAYKMQQVATEFFNRIETFSKFDERRNIHHVNDSLDAATDRDHLRLLMSEISVDGSEKVTSHGSHTVCEAAVHEALLRECLYHEGQRQTYDLIRFDRGDEDTTVSTKSNIRMNFKQTSKYISTSGERKLVYPTTDIVLKCTPFYIDVDLTLLERMSSTFFGGPPPPPSPHVASPSNKSQNQVNFSLQCPNLDIILRFPIADLRPGGRSEARSVRPDYLLFKLHNTNVGLQQLASARPLPTTISIRMTTLDLYYYVCTYNPLSNLNFSSIIDQVYDKIYTYISIIYDCLLSLLFHSKIMSYILL-