Monarch geneset OGS2.0

DPOGS215494
Transcript: DPOGS215494-TA (2037 bp)
Protein: DPOGS215494-PA (678 aa)
Genomic position: DPSCF300098 + 560103-566075
RNAseq coverage: 1901x (Rank: top 7%)
Annotation
Heliconius: HMEL008356; 0.0; 80.71%
Bombyx: BGIBMGA007451-TA; 0.0; 81.48%
Drosophila: CG33138-PA; 0.0; 67.45%
EBI UniRef50: UniRef50_UPI00021A7B13; 0.0; 63.89%; UPI00021A7B13 related cluster n=2 Tax=unknown RepID=UPI00021A7B13
NCBI RefSeq: XP_002423057.1; 0.0; 66.81%; 1,4-alpha-glucan branching enzyme, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242004347; 0.0; 66.81%; 1,4-alpha-glucan branching enzyme, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|194754711; 0.0; 67.35%; GF11944 [Drosophila ananassae]
Group
Gene Ontology:
GO:0043169; 5e-65; cation binding
GO:0005975; 5e-65; carbohydrate metabolic process
GO:0003824; 5e-65; catalytic activity
GO:0004553; 3.2e-09; hydrolase activity, hydrolyzing O-glycosyl compounds
KEGG pathway:
phu:Phum_PHUM033940; 0.0
K00700 (E2.4.1.18, glgB) maps-> Starch and sucrose metabolism
InterPro domain:
[14-676] IPR015902; 0; Alpha amylase
[159-580] IPR017853; 5.5e-88; Glycoside hydrolase, superfamily
[163-575] IPR013781; 5e-65; Glycoside hydrolase, subgroup, catalytic core
[580-676] IPR013780; 2.2e-24; Glycosyl hydrolase, family 13, all-beta
[581-675] IPR006048; 2.5e-24; Alpha-amylase, C-terminal all beta
[201-275] IPR006047; 9.9e-16; Glycosyl hydrolase, family 13, catalytic domain
[52-144] IPR013783; 2.5e-14; Immunoglobulin-like fold
[51-161] IPR014756; 4.3e-14; Immunoglobulin E-set
[61-119] IPR004193; 3.2e-09; Glycoside hydrolase, family 13, N-terminal
[192-546] IPR006589; 6.4e-07; Glycosyl hydrolase, family 13, subfamily, catalytic domain
Orthology group: MCL11929; Single-copy universal gene

Nucleotide sequence:

>DPOGS215494-TA
ATGGACCCGATGGACGTACCAGTCCCCGATTTAAAACTTTTATTCCAAAGAGACGGCTATTTAAGACCATATGAACGTGAAATTCGACGACGTTTCGCCTGCTTCCAAGATCTGTGGGATAAGATAGAGTCATGGGAGGGTGGCGTGGAAGGTTTCACTACCGGTTACCGTTATTATGGACCACAGTTCTGCGTCGACGGATCAGTGGTGTGGAGAGAATGGGCTCCTGGAGCACACTCGCTTCATCTTCAGGGCGACTTCAATGGTTGGAACCCAAAGAGTCATCCGTTCAGGAAGCTGGAATATGGAAAGTGGGAGCTTTATATACCTGGAAATGAGGATGAATCGTGTCCTATCAAGCATTTGAGTCGAGTCCAGCTTATTGTTAACGAACACCTGTACCGAGTGTCTCCCTGGGCGAGTTACGTTAAGCCATACGAAGGATTCACTTACCAACAATTCATTTACAAGCCGGAGCAGCCGTACCAGTTCAAGCACAGAAAAGTTAAGAGGCCAGCGTCGTTACGCATCTATGAGTGCCACGTGGGGATCGCTACCAACGAGGGAAGAGTTGGCACTTACCTGGAGTTCAAGGACAATGTGCTACCAAGGATTAAAGATTTAGGCTACAACGCTATACAGTTGATGGCTATAATGGAACATGCCTACTACGCCTCTTTCGGTTACCAGGTCACAAGCTTTTTTGCAGCCAGCAGTCGATATGGAACCCCCTGTGAGTTAAAGCAGTTGATCGACCGTGCCCATGAGCTCGGTATCTACGTGCTATTAGACGTCGTCCACTCCCACGCCTCCAAGAACACGTTGGATGGTCTCAACGAGTTCGACGGCACCAACTCCTGCTACTTCCACGACGGCGCCAGAGGAACCCACTCGCTCTGGGACAGCAGATTGTTCAACTATTCCGAGACGGAGGTGCTACGTTTCCTACTTTCTAACCTGAGATGGTATCAAGAGGAATATCAGTTTGACGGATTCAGGTTTGATGGCGTGACGTCGATGTTGTACCACAGTCGTGGCATTGGGGAAGGCTTCTCTGGAAATTATGACGAGTACTATGGATTGAACGTGGACACGGAGGCGCTCGTCTACCTGATGGTGGCCAACGAGCTCGTGCACTCCATAGACAGCCAGGCCATAACTATAGCCGAGGATGTATCAGGAATGCCGGCCTCCGGGCGACCTGTTCGTGAAGGCGGTACGGGCTTTGACTACCGCCTGGGTATGGCCATCCCGGACATGTGGATCAAATTGCTGAAGGAGGAACGCGACGAGGACTGGAAGATGGGGCACATCGTCCACACCCTCACCAACAGACGGTGGATGGAGGGGACTGTCGCTTACGCCGAAAGCCATGACCAGGCTCTGGTCGGGGATAAGACGATCGCGTTCTGGTTGATGGACGCCGCCATGTACACCCACATGAGTACCCTAAGCGAGCCCAACCCGGTCATCGAGCGAGGACTCGCTCTACACTGCATGATACGACTCATCACCAACGCGCTCGGAGGAGAGGCCTACCTCAATTTCATTGGTAACGAATTCGGACACCCCGAATGGTTGGATTTCCCTCGAGCTGGCAACAATTCCTCGTACCACTACGCCAGGAGACAGTGGCATCTCGTTGACGACCAGCTTCTCAAGTACAAATATCTGAACGAATTCGATAAAGACATGCACGCCTTGGAGAACAAATACGGATGGCTCGCATCAAATCCGGCGTACGTGTCGTGTAAGCATGAAGGCGACAAAGTGATAGCGTTCGAGCGCGCGGGCCTGCTGTTCGTCTTCAACTTCCATCCCAACCAAAGCTTCACGGACTACCGCGTTGGCGTCGACGTCGCTGGAAAATATCAGGCTGTATTGTGTTCGGATAGTAAGAAATACGGCGGTTTCGGTCGTGTGGAGCCGGATGGGGAATACCATCTCACTCAGAACATGCCCTGGGGTGACAGAAAGGATTCCGTTCAGCTCTACATCCC
GTGCCGTACAGCTCTAGTTTACGCTCGATGTGAATGA

Protein sequence:

>DPOGS215494-PA
MDPMDVPVPDLKLLFQRDGYLRPYEREIRRRFACFQDLWDKIESWEGGVEGFTTGYRYYGPQFCVDGSVVWREWAPGAHSLHLQGDFNGWNPKSHPFRKLEYGKWELYIPGNEDESCPIKHLSRVQLIVNEHLYRVSPWASYVKPYEGFTYQQFIYKPEQPYQFKHRKVKRPASLRIYECHVGIATNEGRVGTYLEFKDNVLPRIKDLGYNAIQLMAIMEHAYYASFGYQVTSFFAASSRYGTPCELKQLIDRAHELGIYVLLDVVHSHASKNTLDGLNEFDGTNSCYFHDGARGTHSLWDSRLFNYSETEVLRFLLSNLRWYQEEYQFDGFRFDGVTSMLYHSRGIGEGFSGNYDEYYGLNVDTEALVYLMVANELVHSIDSQAITIAEDVSGMPASGRPVREGGTGFDYRLGMAIPDMWIKLLKEERDEDWKMGHIVHTLTNRRWMEGTVAYAESHDQALVGDKTIAFWLMDAAMYTHMSTLSEPNPVIERGLALHCMIRLITNALGGEAYLNFIGNEFGHPEWLDFPRAGNNSSYHYARRQWHLVDDQLLKYKYLNEFDKDMHALENKYGWLASNPAYVSCKHEGDKVIAFERAGLLFVFNFHPNQSFTDYRVGVDVAGKYQAVLCSDSKKYGGFGRVEPDGEYHLTQNMPWGDRKDSVQLYIPCRTALVYARCE-
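
Consistency check (illustrative sketch, not part of the original record):

The transcript length (2037 bp) and protein length (678 aa) reported in this record can be cross-checked against each other, and the two ends of the nucleotide sequence can be translated to confirm they match the ends of the protein sequence. The Python sketch below assumes that the transcript is coding sequence only (no UTRs) and that the standard genetic code applies. The helper names `translate` and `expected_protein_length` are hypothetical and chosen for this example. The code writes the stop codon as `*`, whereas the record writes it as `-`.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # 2037 bp = 679 codons = 678 residues + 1 stop codon
    print(expected_protein_length(2037))
    # First 18 bases of DPOGS215494-TA
    print(translate("ATGGACCCGATGGACGTA"))
    # Last 9 bases of DPOGS215494-TA
    print(translate("TGTGAATGA"))
```

To check the whole record, pass the full DPOGS215494-TA sequence (with the line break removed) to `translate` and compare the result with DPOGS215494-PA after replacing `*` with `-`.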