Monarch geneset OGS2.0

DPOGS205833
TranscriptDPOGS205833-TA5004 bp
ProteinDPOGS205833-PA1667 aa
Genomic positionDPSCF300081 - 112017-124024
RNAseq coverage1146x (Rank: top 11%)
Annotation
HeliconiusHMEL0099290.061.22% 
BombyxBGIBMGA010888-TA0.067.13% 
DrosophilaCG9485-PB0.054.22% 
EBI UniRef50UniRef50_E2BPT90.056.74%Glycogen debranching enzyme n=18 Tax=Coelomata RepID=E2BPT9_HARSA
NCBI RefSeqXP_394961.30.055.54%PREDICTED: similar to CG9485-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838647550.057.03%PREDICTED: glycogen debranching enzyme [Megachile rotundata]
NCBI nr blastxgi|3838647550.057.03%PREDICTED: glycogen debranching enzyme [Megachile rotundata]
Group
Gene OntologyGO:00041350amylo-alpha-1,6-glucosidase activity
GO:00059780glycogen biosynthetic process
GO:00038242e-47catalytic activity
GO:00431691.2e-30cation binding
GO:00059751.2e-30carbohydrate metabolic process
KEGG pathwayame:4114880.0 
 K01196 (AGL)maps-> Starch and sucrose metabolism
InterPro domain[44-1665] IPR0104010Amylo-alpha-1,6-glucosidase
[1204-1656] IPR0089282e-47Six-hairpin glycosidase-like
[483-572] IPR0137811.2e-30Glycoside hydrolase, subgroup, catalytic core
[198-580] IPR0178534.2e-24Glycoside hydrolase, superfamily
Orthology groupMCL13992 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205833-TA
ATGCTCGCAAGAGAGATGACCGAGTACGCGAGGAAGGCGGAAAAAATCGCCATCGAAAAGAGGCAGATGGCAGTGGAGAGTCGCGAGGCGGAGGGGGGGTCGGCGCCCGAGGCCAGCGGCGAGGGGGCGGGGGAGGGGCAGGTCGTCCGTGCGATCACGCTCAATCACGGAGAACATCAAGACGCTACTCTCTATAGGTTTGAAAAAGGATGCCGTTTACAGTTCAGCCCCGGTCCGAGCATGCTCGGTCGGAAGGTGTTCCTCTACACAAATTATGTCGTCTCAGAAAATTCGGAAGACAAGAGCGAGCCAGCGTTCGTACGCAACCAGTACTATGCTCTCGAGTGGAGGAAGGACGAGGATTCGGAGTCTCTCGGTACCGGCCTCCTGGTCACTGACACGGAGTTCTACTGCGAGCTGAAACTGGCTAAGGCTGGCTCATTCCACTACTACTTCGTTTACGACAGCCCCGAGTCCCGGGTGGGTCCTCAAGGCTCGGGCTGGTTCCACGTGGCTCCGAGCCTCTCCGCGGGGGGAGTTCAGGTTCCGCTGGACGGCGTCATGTGTCAGACGGTGCTGGCCAAAAGTTTGGGACCCCTGTCACGGTGGATGAAAACCCTGAGAGTGGCTCACGAAGCTGGTTTCAACATGATACACTTCACTCCCGTTCAGGAGCTGGGCGCGTCTAATTCAAGCTACAGCCTGGCGAACCAGCTCAAGCTGAACCCTCGCTTCAACGACATCAATTCTGGCAGGGATGCCACTTTCGCTGACGTGGAAAACATCATCGCCAAAATGCGCAACGATTGGAAGATGCTGTCGATATGTGACGTCGTGTTGAACCACACGGCCAATGAGAGTGAATGGCTGACGTCACATCCGGAAGCCACTTATAATTGCATCACATGTCCCCACCTGAGGCCCGCCGCCCTCCTCGATGCTGTACTGGCCAAGTTGGGGGAAGACATCGCATCGGGACGGGAAACTCGCCTGCCCACTAAGATTAACACACATCAACAAATTGAGATGATCCGCGACATCCTGCTGAACGAGCGTCTCCCGGAAGCGAAGCTGCACGAGATGTACATCTGTAACGTGGACGAGACGGTCGAGAGGTTCTACCACATGGCGAGGAATAAGGCCGACTGCTACGACGAGGATGCTCGTGTGAAGCGTTGCTGTGAGGAGTTCCGCGTGAAACTGGAACAGTTGAACGAGGCCGCCATACACACCGTCAACGATCACCTGAGAGCCGCTGTGCAGAACTGTGTAGCGGGAATGAGTTACTTCCGTCTTCAGTCAGACGGCCCTAAGATAGAGGAAGTTAGCGAGAAAAATCCTCTCGTACCAAGATATTTCACGTTCCCTGGTCCGCTGGGTGGTGTCGCGGACATGGAAGGCGTGATATACGGCGAGGCGGGCCGCCTGGTGATGGCGCACAACGGCTGGGTCATGAACTCCGACCCGCTGCAGGACTTCGCTGACAAGGAACACGACGGACGGGTTTACTTCAGGAGGGAACTCATCGCTTGGGGAGACAGTGTCAAACTCAGGTACGGCGAGAAGCCGGAAGACAGTCCATTCCTTTGGCGCCACATGCGGCAATACGTAGAACTAACAGCCGAGGTCTTCGATGGAGTCAGACTGGACAACTGTCACTCAACGCCATTACATGTTCACGGCCCTAAGATAGAGGAAGTCAGCGAGAAAAATCCTCTCGTGCCAAGATATTTCACGTTCCCTGGTCGGCTGGGTGGTGTCGCGGACATGGAAGGCGTGATATACGGCGAGGCGGGCCGCCTGGTGATGGCGCACAACGGCTGGGTCATGAACTCCGACCCGCTGCAGGACTTCGCTGACAAGGAACACGACGGACGGGTTTACTTCAGGAGGGAACTCATCGCTTGGGGAGACAGTGTCAAACTGAGGTACGGTGAGAAGCCGGAAGACAGTCCATTCCTGTGGCGCCACATGCGGCAGTACGTAGAACTAACAGCCGAGGTCTTCGATGGAGTGAGACTGGACAACTGTCACTCAACGCCATTACATGTCGCGGAGTACATGCTGGACTGTGCTCGGAACGTCAAACCCGACTTGTATGTAGCGGCGGAGCTGTTCACTAACTCCGACCACGTCGACAACATATTTGTCAATAGACTTGGTATAACGTCGCTGATACGAGAAGCCCTATCAGCATGGGACTCCCACGAGCAGGGTCGACTGGTCCATCGTTTCGGAGGTCGCGCCGTGGGGTCGTTTTTCACCCCTCAAGTGACCCGCGCCCAACCTCAAGTGGCCCACGCTTTGTTCCTTGACCTCACACACGACAACCCATCGCCGATCGATAAGAGGAGCGTGTTCGATCTACTCCCGTCCGCGGCGCTGGTGTCGATGGCCAGCTGCGCGATCGGTTCCACTAGAGGATACGACGAGCTCGTGCCCCACCATATCCACGTGGTGGACGAGGCTCGTTTGTACGCGGAGTGGGAGGAGGGGGAGGGGGAGGGGGTGAACGCGTCCACCGGCCTTATAGCAGCTAGGCGGGCGCTCAACGATCTACATTTACATCTAGCAGCCTCCGGATACTCGGAGGTGTACGTCGATCAAATGGATGCTGACGTGGTGGCTGTGACGAGACACGAACCACATTCGAGAAAGTCTGTTATTTTGGTCGCCTTCACCGCCTTCAAGGCCCCCGACGAGTCTTCCACCGGCCGCTACGTGAAGCCGTTACGGTTCGAGGGACAACTGGAGGAAATTATTTTGGAGGCGGTGCTGAGACACAAGGACCACAGAAGCACGGGTCGCCCGTTCCAGACGTGCGGAGGGTTCTCTCGTCACCCGCAGCACATCAACGGTCTCAGCGACTACGAGGCCAGCGTCCGCTGCGGCGTTCCGTTGGCGCAGTCCAACGTGTTTGTTAGTGAACGCCGTGACGGTCCCTATACTGTGCTGGAGTTCGGCGTGCTACCCCCCGGGACGGTGGTCGCGGTGCGTGTGGCTCCACATTCCTCGCAAGCGGGAGCGTTGGCGGCTCTCAGACGAGTGACCTGCCAGCATCCAGCCACAGATCCCCTGGGTCTATCGCCGGCACTCACTGACCTCGATCTAGGAGATTTCAACGCGTTGCTATATTGCTGTGACGCGGAGGAACGTGAGCGCTGTGGGGGCGGAGTGTACGACGTGCCCGGTCACGGTCCGCTCGTATACGCTGGGCTGCAGGGCGTGGCTTCACTGCTGGAGGAAGTCGCCCCACGTGACGACCTCGGACACGCGCTATGTGACAACCTCCGCGCTGGCGACTGGCTGCTCGACTACCAATGGCGGCGCTTAGAGTCTGACCCGCGACTGTCCGCCCTGGCGACGATCTATAGAGAAGCGTTGAGGCCGGTCGGTGAATTACCGCGTTTTCTGGTGCCGGCCTACTTCGAGGTCACCGTCCGTTGTATGATAGCTGCCGTGAAGCGCGCGGCCCTGGCTCGTCTGGGAGGCGTCGCGTTGAGTTCGGGAGTCGCTCGTGAGTTGTCACTGACCGGGGTCCAGCTGGCCGGTGCGGTGTCGTCCGCTCGTTTGCCGGCCATGTCTCCCTCTCTGCCGATGCTCCGCCCGGCTCGTCCTTTGTCCCTGTCGGCCGGTCTGCCTCACTTTGCCGTGGGCTACATGAGATGCTGGGGTCGAGACACCTTCATCGCTCTGAGAGGGATGTTCCTACTCACGGGCCGTTATCAAGACGCGCGCTTCCATATACTAGGATTCGCTGCTTGTTTAAGACACGGTCTGATACCTAATTTATTGGACGGAGGCCGCAACGCTAGATTCAACTGTCGCGATGCGGTGTGGTGGTGGCTGCAGAGCATCAAACAGTACTGCACGGAGGCCCCTCAAGGCTACTCTATCCTGACGGACCCGGTGTCTCGTATATTCCCGAAAGACGACAGTGAGCCTGCACCCCCAGGGGCTGCGGACCAACCGCTGCATGACGTTATGCAGGAAGCCTTGGACGTTCACTTTCAGGGTCTCGTCTTTCGTGAGCGCAACGCCGGCCGACAGATAGACGCACACATGTCCGACAAGGGTTTCAACATCCAGATCGGCGTGGACCCCGAGACCGGGTTCCCGTTCGGAGGAAACGAAGCCAACTGCGGGACGTGGATGGATAAGATGGGTTCGTCGGAGACGGCCGGCACGCGGGGGAAGCCCGCCACGCCGCGCGACGGCAGCGCGGTGGAGCTCGTCGCGATGGCGTACTGCGTCGCGTCCTGGCTGGCGGCACAGCATCGCTCCGCTAAGTATCCGTACCCGGGCGTGGCGAGGCGGCACCGAGACGGATCCCTCACCGCCTGGACGTACTCCCAGTGGGCGGATCGAATACGACGCTCCTTCGAACGACACTTCTGGGTCCCGGCCGCGCCCTCGGCCGCCGACCAGCGCCCAGACCTCGTGCACCGCCGCGCCATCTACAAGGACACGCACGGCGCCTCGCAGCCCTGGGCCGACTATCAACTGCGCTGCAACTACGTCGTCGCCATGGCTCTGGCGCCGGAATTGTTCGACCCGCGACACGCCTGGCTCGCCCTGGACAATGTCGAGAAACTGCTGGTCGGGCCTCTCGGTCTCAAAACCCTCGACCCCGAGGACTGGGCCTACCGTCCCAACTACGACAACTCGGACAACAGCTCGGACCCGAGCGTTGCACACGGCTTCAACTACCACCAGGGCCCGGAGTGGACGTGGCCGCTCGGCTTCTACCTCAGAGCGCGGCTCGCCTTCGCTCACGACAACGGTCAGTTCTCCAAGACCGTGGCGGCGGCATACGCGGCCCTCGCCCCGCTTGTGGCAGAGATGCGTTCGTCCCCGTGGCGCGGCCTGCCCGAGCTGTCCAATGCCGGAGGCGCATTCTGTAAGGACTCGTGTCGCACCCAGGCCTGGAGTTCGTCCTGCATGCTAGAGGTGCTCCATGATCTCGAGCTGTCGCGCCGAGCTCGACCGCTCCCTCTCGACTGA

Protein sequence:

>DPOGS205833-PA
MLAREMTEYARKAEKIAIEKRQMAVESREAEGGSAPEASGEGAGEGQVVRAITLNHGEHQDATLYRFEKGCRLQFSPGPSMLGRKVFLYTNYVVSENSEDKSEPAFVRNQYYALEWRKDEDSESLGTGLLVTDTEFYCELKLAKAGSFHYYFVYDSPESRVGPQGSGWFHVAPSLSAGGVQVPLDGVMCQTVLAKSLGPLSRWMKTLRVAHEAGFNMIHFTPVQELGASNSSYSLANQLKLNPRFNDINSGRDATFADVENIIAKMRNDWKMLSICDVVLNHTANESEWLTSHPEATYNCITCPHLRPAALLDAVLAKLGEDIASGRETRLPTKINTHQQIEMIRDILLNERLPEAKLHEMYICNVDETVERFYHMARNKADCYDEDARVKRCCEEFRVKLEQLNEAAIHTVNDHLRAAVQNCVAGMSYFRLQSDGPKIEEVSEKNPLVPRYFTFPGPLGGVADMEGVIYGEAGRLVMAHNGWVMNSDPLQDFADKEHDGRVYFRRELIAWGDSVKLRYGEKPEDSPFLWRHMRQYVELTAEVFDGVRLDNCHSTPLHVHGPKIEEVSEKNPLVPRYFTFPGRLGGVADMEGVIYGEAGRLVMAHNGWVMNSDPLQDFADKEHDGRVYFRRELIAWGDSVKLRYGEKPEDSPFLWRHMRQYVELTAEVFDGVRLDNCHSTPLHVAEYMLDCARNVKPDLYVAAELFTNSDHVDNIFVNRLGITSLIREALSAWDSHEQGRLVHRFGGRAVGSFFTPQVTRAQPQVAHALFLDLTHDNPSPIDKRSVFDLLPSAALVSMASCAIGSTRGYDELVPHHIHVVDEARLYAEWEEGEGEGVNASTGLIAARRALNDLHLHLAASGYSEVYVDQMDADVVAVTRHEPHSRKSVILVAFTAFKAPDESSTGRYVKPLRFEGQLEEIILEAVLRHKDHRSTGRPFQTCGGFSRHPQHINGLSDYEASVRCGVPLAQSNVFVSERRDGPYTVLEFGVLPPGTVVAVRVAPHSSQAGALAALRRVTCQHPATDPLGLSPALTDLDLGDFNALLYCCDAEERERCGGGVYDVPGHGPLVYAGLQGVASLLEEVAPRDDLGHALCDNLRAGDWLLDYQWRRLESDPRLSALATIYREALRPVGELPRFLVPAYFEVTVRCMIAAVKRAALARLGGVALSSGVARELSLTGVQLAGAVSSARLPAMSPSLPMLRPARPLSLSAGLPHFAVGYMRCWGRDTFIALRGMFLLTGRYQDARFHILGFAACLRHGLIPNLLDGGRNARFNCRDAVWWWLQSIKQYCTEAPQGYSILTDPVSRIFPKDDSEPAPPGAADQPLHDVMQEALDVHFQGLVFRERNAGRQIDAHMSDKGFNIQIGVDPETGFPFGGNEANCGTWMDKMGSSETAGTRGKPATPRDGSAVELVAMAYCVASWLAAQHRSAKYPYPGVARRHRDGSLTAWTYSQWADRIRRSFERHFWVPAAPSAADQRPDLVHRRAIYKDTHGASQPWADYQLRCNYVVAMALAPELFDPRHAWLALDNVEKLLVGPLGLKTLDPEDWAYRPNYDNSDNSSDPSVAHGFNYHQGPEWTWPLGFYLRARLAFAHDNGQFSKTVAAAYAALAPLVAEMRSSPWRGLPELSNAGGAFCKDSCRTQAWSSSCMLEVLHDLELSRRARPLPLD-