Monarch geneset OGS2.0

DPOGS202241
TranscriptDPOGS202241-TA3666 bp
ProteinDPOGS202241-PA1221 aa
Genomic positionDPSCF300032 - 884670-895968
RNAseq coverage451x (Rank: top 27%)
Annotation
HeliconiusHMEL0025920.078.96% 
BombyxBGIBMGA004832-TA0.071.65% 
DrosophilaAtg2-PA2e-16131.68% 
EBI UniRef50UniRef50_UPI000224793F0.040.77%UPI000224793F related cluster n=3 Tax=unknown RepID=UPI000224793F
NCBI RefSeqXP_969083.10.038.78%PREDICTED: similar to autophagy-specific gene 2 [Tribolium castaneum]
NCBI nr blastpgi|3454958230.040.77%PREDICTED: LOW QUALITY PROTEIN: autophagy-related protein 2 homolog A [Nasonia vitripennis]
NCBI nr blastxgi|3454958230.040.85%PREDICTED: LOW QUALITY PROTEIN: autophagy-related protein 2 homolog A [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[1125-1213] IPR0154123.4e-16Autophagy-related, C-terminal
Orthology groupMCL11314 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202241-TA
ATGAGGGAATTCACAAACAGCTCCGTAGAACATTCCGCTATACATCTAGACTTCAGTCTGCCCATCCTCAGCTTACAACTGGAATCAAAGCAGCTGTACGAGATCCTGTACAACCGCATTAGTTCCGAGCTGCTGCTGTGGTCTCCTCGTGAAGAGTTTGATATCGCCCCCCCTCCGCCACCCTCCTTCGAACCATGCCGTGGAGGATACGATTCAGACTCGGAGAGCTCGTCGTCTCAAGAGGACAATTTATACTATTCAACATATGACAACAAATTGAAGAAAGGCATCGGCAACACAAGACCGTTCTCAGACATCAGACAATGTGAGACGCACAACTTCTGTCTCACATTCAATGTTGACAAGGGGCTTCTCTCTATATTAGCGCCTGTGAGGGATAGCAACAAAAGAGTTGTCCCGGGACAGATGGGTGAATTAGTTTTGGAGGCGCATAAGCTGTCTATGTGTCAAGTCAGCGGGCTCTATGGAAAAGCTAAGACGGCCCAAATGTGTCTGAGGGCTGCCAAGGCGACTCTATACCATGAACCGCTCCTGACTATACCGTCAGACAGGCCACCGTTACGTTTGTACGGCTCAGTGTTGCCATCACACCTCAAGAAGACAATATATCCGTCGAATAAAGGCGTCATAATAAAAGATAGATTGAAGCCTAAGGACATGTTCACAATGGCGCTGAAAACTGAACCTGACACTGAGACGCCCAATTTGAAGACAATATGCATAGCCCTGGGCATTGAACAGGCCACCCTCCGACACAGAGGCGACAAGGGTATAGCGTGGCTCAGTCAGCTGTTGGATGTACTAGATGTTATAGACTACCCTGTGCCGGGATACACGCCGTCGCCAGTACTATCAGAATTGCATGTGCATGTGTGGGACTGCGCTGTGGACTACAGGCCGCTGTATCTTCCAATACGTAGCGTGGTGACGCTTGGCAACTTCAGCGTTTCCAGCAACCTTATACCGGAAACCAACACGTCCTACCTCCGCTTCCTCGCTCAAGAATGCTCACTCCACCTCAGCTATCTCCACAGCAAGACTGTAGCGCCAGACGACAGAGCACCAGATCTCCACAAGGAATACGTCTGCGTCATTGATGTCGGACTGTTTGAACTGTCCCTTAGAATGGAAGATAAAAGCAATGGCAGCCAAGACCATCCTCAGGTGGACCTGACGGCGTCCAACAACATGGTGACTATGTTCACGTGTTGGGATTCCGCGTCCGCGCTGTGCCGTCTGTTGACTTATGTGGCGTCTGACGGGGACTCGCAGACTTACGACTCCCGACACACCAGCCTGTGCTCTGACCAGCCCTTGGAACAGTTGGTTGGGTTAGAAGATCGACCGATAGAAGAAATAAGAGAACTGTCGCCGAGTGAAATCCAACAAGTGAACGATTTGATGGCGGAAGCTATGAAAGAGAGTCCCAATAATACAATTGATGATGAGGATTTCGTGAGCTCGACGGAAAAGGAAGGTGTGGAACTGTTCTACTTTCCTGATGAGTCAAATGTGAAGCAAAAGCAACTCGAGACAGCGGACGCCGAGAGCGAAACTAAGTCAGTTGAATACGAAGACATGTCACACGTTGAAGAGGCCCAGGAGGCGACGCCGACCAACATGCAGGTCGCCAGGGATCTAGGGGACCCGACTGTCACGCCGAAGTCGACGCCAAAAAAATCAAAGCGGAAAAAGATGAGCTCGTGCGGCAGCGGCAGTAACACGGACGACGAGTACTGTGTGGTGGAACAGCTGGCTGGTGACATGGAGATGGAGGAGCCGGTGGTGACCTGGCTGGCTGGACCCGTCACTATGTTGAACGACCACTTCAGTGTACCACCAGCGAAGTCAGACGTACTCGCAGCGCCCAAGAGCTTCCCGCCACCAGTGCTCAGGTACACTCTGTGTGAACTGAGCTTAACCTGGAATATGTTTGGAGGCAGTGATTTCAAACCGAAAGAAACGTCCAAGAAATCAGTCTCCATTGATGATCCTAGGGGAGGGGGCTCGCCTGTTAGTTCTGCGCGCAGCAAGGACTACGAGCCATACGAGAGCCGTCGCTCGTTGGCGTCCTCATACCGGCACGGGGTCAGTTGGAGCGCGGGAACTGACCGGGTGCGGGCGACTCACACAAGAAAAAACGACTCCCGGGATCATCACACTTGTGTCAAGCTCTGTCTTACTAAGGTGAAGTTCCAACACGAGGTGTACCCGCCCGGATGCACGCAGGCTTCCAGACAGACCCTGGCTATCGCAAAAATAGAAGTCTTAGACAGATTAGTGTGCAGCGACATCAACAAACTGCTGAGTCAATATAAACTTAAAGACGAACCCGAGAGAAAAAACGCTCATATGTTAATAGTGAAAGCGGTCCACCTGCGAGCCGACGCCTCGCTCCCGGTGCAGGAGTGCTGTCTAAAGGTGTCTCTACTACCGCTACAATTCAACCTGGACCAGGACACTCTCGCCTTTTTAGTTGATTTCTTCTCTAAATTGGGCAGTGATGAGACCAATGAGGAAGACACAAAGAGCCTAGGGGCTGTCTCAACGGAGTCAGGATCCCGTCAAAGTACGCCCACACATAGGCCGCCCGTGATGAGCGTGGGTGCCCATTTAAAAGACCCACCGCCCACGCCCACATCCTTAGGAGATGCCGACTGTCTCTCGCTTAACGAAACTGTTATTCGTGACGACGAACCGCTCATGGAGACGTATGAAGCTGAACGGCTGGTGTCCGAGAATCTCATACAACTGGAGGAGGACTTTCAGCGGCTCGGCATCAGCCACGAGAAGCCGACCACCAAAGTGCAAGACTGTGAACCCGTCGATGACTCGCCTATATACTTCCGTCGTGTAGTATTTTCTCCTGAGGTGCCAATACGTCTGGACTATGTGGGTAAGCGTGTAGACCTGTCAGCTGGTCCTGTGGCCGGACTGCTCATGGGACTCGGACAGCTAAACTGCTCAGAGCTAACATTGAAAAGGCTCGATTATAAGTTGGGCCTGTTGGGCCTTGAGAAGCTGGTGCAATGGGCGCTACACGAATGGCTATCAGACATCAAAAGACATCAACTGCCGGGGCTACTCAGTGGCATTGGGCCCATGCATTCCTTACTACAGATAATCACCGGCATCCGCGACCTGGTCTGGTTGCCGGTGGAGCAGTGGCGTCGCGACGGGCGTCTGGTCCACGGTCTAAGACGCGGCGCCGCCTCCTTCACAGCTAGAACTGCTGTCGCTGCTCTGGACATCACCGCACGCATCCTACATCTCATACAGGCGACAGCTGAAACGGCGGTGGACATGTTGACACCGGCTCCGGCTCTGCCCCTGTCGACCCAGGGGAGGAGACGTCGCAGAGACCGCACTAGACAACCCGCTGATATACGGGAGGGAGTTACCAGCGCATATAACACTGTTAAAGAGGGTTTCGCGGAGACGGCCGCATCATTATCAGCGGCGGCTCGTCGGGGGAAGGGCGCGGGGGTGCTCCGTCAGTTGCCGGGGGCTGCGGTCGCGCCCCTCGCCCTGGCCGCGGCCGGCGCCGCCGACGTCCTGGGAGGTGTCCGAGCACACCTCGCACCGCACACCACGCGTGATCACGCAGACAAATGGCGCAGACCATTCACAGATACGACTGATTAA

Protein sequence:

>DPOGS202241-PA
MREFTNSSVEHSAIHLDFSLPILSLQLESKQLYEILYNRISSELLLWSPREEFDIAPPPPPSFEPCRGGYDSDSESSSSQEDNLYYSTYDNKLKKGIGNTRPFSDIRQCETHNFCLTFNVDKGLLSILAPVRDSNKRVVPGQMGELVLEAHKLSMCQVSGLYGKAKTAQMCLRAAKATLYHEPLLTIPSDRPPLRLYGSVLPSHLKKTIYPSNKGVIIKDRLKPKDMFTMALKTEPDTETPNLKTICIALGIEQATLRHRGDKGIAWLSQLLDVLDVIDYPVPGYTPSPVLSELHVHVWDCAVDYRPLYLPIRSVVTLGNFSVSSNLIPETNTSYLRFLAQECSLHLSYLHSKTVAPDDRAPDLHKEYVCVIDVGLFELSLRMEDKSNGSQDHPQVDLTASNNMVTMFTCWDSASALCRLLTYVASDGDSQTYDSRHTSLCSDQPLEQLVGLEDRPIEEIRELSPSEIQQVNDLMAEAMKESPNNTIDDEDFVSSTEKEGVELFYFPDESNVKQKQLETADAESETKSVEYEDMSHVEEAQEATPTNMQVARDLGDPTVTPKSTPKKSKRKKMSSCGSGSNTDDEYCVVEQLAGDMEMEEPVVTWLAGPVTMLNDHFSVPPAKSDVLAAPKSFPPPVLRYTLCELSLTWNMFGGSDFKPKETSKKSVSIDDPRGGGSPVSSARSKDYEPYESRRSLASSYRHGVSWSAGTDRVRATHTRKNDSRDHHTCVKLCLTKVKFQHEVYPPGCTQASRQTLAIAKIEVLDRLVCSDINKLLSQYKLKDEPERKNAHMLIVKAVHLRADASLPVQECCLKVSLLPLQFNLDQDTLAFLVDFFSKLGSDETNEEDTKSLGAVSTESGSRQSTPTHRPPVMSVGAHLKDPPPTPTSLGDADCLSLNETVIRDDEPLMETYEAERLVSENLIQLEEDFQRLGISHEKPTTKVQDCEPVDDSPIYFRRVVFSPEVPIRLDYVGKRVDLSAGPVAGLLMGLGQLNCSELTLKRLDYKLGLLGLEKLVQWALHEWLSDIKRHQLPGLLSGIGPMHSLLQIITGIRDLVWLPVEQWRRDGRLVHGLRRGAASFTARTAVAALDITARILHLIQATAETAVDMLTPAPALPLSTQGRRRRRDRTRQPADIREGVTSAYNTVKEGFAETAASLSAAARRGKGAGVLRQLPGAAVAPLALAAAGAADVLGGVRAHLAPHTTRDHADKWRRPFTDTTD-