Monarch geneset OGS2.0

DPOGS210890
Transcript        DPOGS210890-TA   966 bp
Protein           DPOGS210890-PA   321 aa
Genomic position  DPSCF300045 - 846597-850609
RNAseq coverage   1610x (Rank: top 8%)
Annotation
Heliconius      HMEL003955        2e-152  84.16%
Bombyx          BGIBMGA003767-TA  5e-149  82.15%
Drosophila      Aut1-PA           1e-123  66.87%
EBI UniRef50    UniRef50_Q9VVS6   2e-121  66.87%  Autophagy-related protein 3 n=29 Tax=Opisthokonta RepID=Q9VVS6_DROME
NCBI RefSeq     NP_001135961.1    6e-146  81.54%  autophagy related protein Atg3-like protein [Bombyx mori]
NCBI nr blastp  gi|215820604      1e-144  81.54%  autophagy related protein Atg3-like protein [Bombyx mori]
NCBI nr blastx  gi|215820604      1e-154  82.77%  autophagy related protein Atg3-like protein [Bombyx mori]
Group
KEGG pathway     aga:AgaP_AGAP011582  5e-124
                 K08343 (ATG3)  maps-> Regulation of autophagy
InterPro domain  [8-154]    IPR007134  4.6e-54  Autophagy-related protein 3, N-terminal
                 [209-271]  IPR007135  4e-27    Autophagy-related protein 3
                 [292-316]  IPR019461  1.8e-14  Autophagy-related protein 3, C-terminal
Orthology group  MCL13048  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210890-TA
ATGCAGAGCGTAATAAACACAGTAAAGGGAACCGCCCTCGGGGTAGCAGGGTACTTGACGCCTGTATTAAAGGAATCTAAATTTCAAGAGACTGGAGTGTTGACCCCTGAAGAGTTTGTTGCGGCCGGAGATCACTTAGTCCACCACTGCCCAACTTGGCAGTGGGCCAAGGGTGAAGAGTCCAAAATCAGGCCCTACTTGCCCTCTGACAAGCAGTTCCTCATAACCAGAAATGTTCCATGCTATAGGCGCTGTAAACAGATAGAATATTGTGATGAACATGAAAAGACAATAGAAGATGAGAATGACGAGGACGGAGGCTGGGTGGACACACATCACTACGCCTCACCAGGGTTCCCTGCAGTTGAAGAGAAGGTGTGTGAGATGACGTTAGAGGCGGCCGAGGCTGGGGACAGTGATGGCGAGGGAGGCGGAGACGGCGATGAAGCTGATGGTGACGATGACGCTGATGATGATGAGGCCGAGGACATGGAGAACTTCCAGGAGTCTGGACTACTCGATGAAGTGGACCCATCTACAGCGCTGACGACCCGCAAGGAGCCGAGGAAGACGGTGAAACACACGGACGGCGACGAGATAGTGAAGACGAGGACCTACGACCTCCACATCACATACGATAAGTACTACCAGACGCCGCGCCTCTGGCTCATCGGATACGACGAGGAGCGGCGGCTGCTGAGTGTGGAGGCCATGTACGAGGACGTGTCCCAGGACCACGCCAAGAAGACCGTCACCATGGAGACGCACCCACACCTCTCCGGACCCAGCATGGCGTCCGTACATCCCTGCAGACACGCGGAGGTGATGAAGAAGATCATCGAGACGGTGATGGAGAGCGGCGGCGCGCTGGCCGTGCACTCCTACCTCATAGTGTTCCTGAAGTTCGTGCAGACCGTCATCCCCACCATCGAGTACGACTTCACGCAGAACTTCTCCATGAACTGA

Protein sequence:

>DPOGS210890-PA
MQSVINTVKGTALGVAGYLTPVLKESKFQETGVLTPEEFVAAGDHLVHHCPTWQWAKGEESKIRPYLPSDKQFLITRNVPCYRRCKQIEYCDEHEKTIEDENDEDGGWVDTHHYASPGFPAVEEKVCEMTLEAAEAGDSDGEGGGDGDEADGDDDADDDEAEDMENFQESGLLDEVDPSTALTTRKEPRKTVKHTDGDEIVKTRTYDLHITYDKYYQTPRLWLIGYDEERRLLSVEAMYEDVSQDHAKKTVTMETHPHLSGPSMASVHPCRHAEVMKKIIETVMESGGALAVHSYLIVFLKFVQTVIPTIEYDFTQNFSMN-
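
The stated lengths in this record are mutually consistent: a 966 bp transcript is 322 codons, which is 321 residues plus one stop codon (the nucleotide sequence ends in TGA, and the protein record marks the stop with a trailing "-"). The sketch below shows one way to check a record like this. It assumes the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code; the function names `translate` and `expected_residues` are illustrative and not part of any database tooling.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence (assumption: frame starts at base 1).

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_residues(cds_bp):
    """Residue count for a CDS of `cds_bp` bases that ends in one stop codon."""
    return cds_bp // 3 - 1


if __name__ == "__main__":
    # Length check against the values stated in the record.
    print(expected_residues(966))  # 321

    # Toy fragment: the first five codons of DPOGS210890-TA plus its stop codon.
    print(translate("ATGCAGAGCGTAATA" + "TGA"))  # MQSVI-
```

To check the whole record, pass the full DPOGS210890-TA sequence to `translate` and compare the result with the DPOGS210890-PA sequence.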