Monarch geneset OGS2.0

DPOGS214663
TranscriptDPOGS214663-TA2481 bp
ProteinDPOGS214663-PA826 aa
Genomic positionDPSCF300321 - 41089-50981
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0118940.073.44% 
BombyxBGIBMGA001876-TA0.079.63% 
DrosophilaAmy-p-PA2e-17159.47% 
EBI UniRef50UniRef50_P816419e-17059.47%Alpha-amylase B n=313 Tax=Mandibulata RepID=AMYB_DROME
NCBI RefSeqNP_001166624.10.071.03%alpha-amylase [Bombyx mori]
NCBI nr blastpgi|331510280.077.15%alpha-amylase 2 [Diatraea saccharalis]
NCBI nr blastxgi|2195230220.080.00%alpha-amylase [Ephestia kuehniella]
Group
Gene OntologyGO:00431693.3e-114cation binding
GO:00059753.3e-114carbohydrate metabolic process
GO:00038243.3e-114catalytic activity
KEGG pathwaydan:Dana_GF188432e-173 
 K01176 (E3.2.1.1, amyA, malS)maps-> Starch and sucrose metabolism
InterPro domain[19-528] IPR0159021.8e-191Alpha amylase
[33-417] IPR0137813.3e-114Glycoside hydrolase, subgroup, catalytic core
[32-420] IPR0178537.8e-105Glycoside hydrolase, superfamily
[42-415] IPR0065891.9e-96Glycosyl hydrolase, family 13, subfamily, catalytic domain
[424-512] IPR0060482.1e-49Alpha-amylase, C-terminal all beta
[727-814] IPR0137809.2e-40Glycosyl hydrolase, family 13, all-beta
[89-106] IPR0060462.1e-33Glycoside hydrolase, family 13
[67-361] IPR0060471.4e-22Glycosyl hydrolase, family 13, catalytic domain
Orthology groupMCL10115 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214663-TA
ATGCACTTGGACACAACACTGGCACAGATAGCCTCATTGCAGGCTGGGTTAACAATTCTAGTTGTATTAGCTTTGGCAACCGGACTAAATGCTTATAAGAATCCACACTATGCGCCGAATAGATCTGTGAACGTTCATTTGTTTGAATGGAAATGGGATGACATAGCGGCTGAATGTGAACGGTTCTTAGGACCTAAAGGGTTCGGAGGTATTCAGGTATCACCGCCCAATGAGAACGTCGTTCTTCGTAACAACAATCGTCCCTGGTGGGAGCGGTATCAGTCCATGTCATATAAGCTGGTGACCAGATCTGGCAATGAACAGCAGTTTACCAACATGGTCAGACGGTGTAACGCGGCTGGAGTTAGGATTTACGTGGATGCTGTCATCAACCACATGACAGGAGAGCCAGTGGAGAATGTAGGGACAGGGGGAAGCACTGCTGTGTTCCGGGATTTTTACTACCCAGCTGTGCCTTACACCAGGGATCACTTCAACTGGCCAACGTGCGGTATTAATGGGGAAGACTATATGAACAATGCGTGGAGGGTCCGCAACTGTGAGCTGGTCGGTCTGAAGGATTTGGATCAGAGCAACGAACATGTCAGACAAATGATAGTCAATTATATGAACAAACTCATAAGCCTCGGTGTCGCTGGATTCAGAATCGACGCAGCGAAGCACATGTGGCCGGAAGACTTGAGAGTCATTTTCAGCAGACTCCGAAATCTGAACACTGAGCACGGTTTCGCCCCGAACTCCCGACCATACATATACCAGGAGGTTATCGACTATGGCGGTGAGGCCGTCAGCAGGGACGAATACACACCCATAGGGGCTGTCACAGAATTCAAAGCTGGCTTGGAACTTAGTAACGCCTTCAGAGGAAGCAACCAACTGAGATGGCTTTCCTCGTGGGGCCCACAATGGGGTTTGCTAGCTAGCGGTGACGCTTTGACATTTATAGACAACCATGATAACGAGAGAGGTCACGGAGGCGGTGGGGGGATATTGACGTACAAGGAGCCCAGAGCATACAAGGCGGCTATAGCGTTCCTCCTGGCGCATCCTTACGGGGAACCACAGATAATGAGCAGTTTTCAATTCCTGGACTCCGAAATTGGACCACCGATGGACTTTAACCAGAATATTATATCACCCTCTATCAATTCCGATGGATCTTGCGGCAACGGCTGGGTCTGTCAGCACCGTTGGAGACAGATCTACGCGATGGTAGGATTCAGGAACGCGGCTGGCAACAGTGGCATCAACGATTGGTGGGACAACGGCTCTAATCAGATCGCGTTCTGCCGCGGAAATAAGGCTTTCATCGCATTTAACAATGACAACTGGACTTTAAATCAAAATTTGCAGACCTGTCTGCCCGCGGGTACTTACTGCGACGTCATCTCGGGAGACAAAGTGAACAACTCCTGTCGCGGTAAGACTGTGAACGTGGACGGTAACGGTCGCGCTAACATTATACTTGGCAACAATGAGTACGACATCATGATGGCTATACATGTTGGTCCGGAGGCTACAGTGAAACGTTTTGAAATAATCATTTTCGTCACCAGAATCGACGCAGCGAAGCACATGTGGCCGGAAGACTTGAGAGTCATTTTCAGCAGACTCCGAAATCTGAACACTGAGCACGGTTTCGCCCCGAACTCCCGACCATACATATACCAGGAGGTTATCGACTATGGCGGTGAGGCCGTCAGCAGGGACGAATACACACCCATAGGGGCTGTCACAGAATTCAAAGCTGGCTTGGAACTTAGTAACGCCTTCAGGGGAAACAACCAACTGAGATGGCTTTCCTCGTGGGGCCCACAATGGGGTTTGCTAGCTAGCGGTGACGCTTTGACATTTATAGACAACCATGATAACGAGAGAGGTCACGGAGGCGGTGGGGGGATATTGACGTACAAGGAGCCCAGAGCATACAAGGCGGCTATAGCGTTCCTCCTGGCGCATCCTTACGGGGAACCACAGATAATGAGCAGTTTTCAATTCCTGGACTCCGAAATTGGACCACCGATGGACTTTAACCAGAATATTATATCACCCTCTATCAATTCCGATGGATCTTGCGGCAACGGCTGGGTCTGTCAGCACCGTTGGAGACAGATCTACGCGATGGTAGGATTCAGGAACGCGGCTGGCAACAGTGGCATCAACGATTGGTGGGACAACGGCTCTAATCAGATCGCGTTCTGCCGCGGAAATAAGGCTTTCATCGCATTTAACAATGACAACTGGACTTTAAATCAAAATTTGCAGACCTGTCTGCCCGCGGGTACTTACTGCGACGTCATCTCGGGAGACAAAGTGAACAACTCCTGTCGCGGTAAGACTGTGAACGTGGACGGTAACGGTCGCGCTAACATTATACTTGGCAACAATGAGTACGACATCATGATGGCTATACATGTTGGTCCGGAGGTTAAATATCAGGCAGACCTGAATCACAACTGA

Protein sequence:

>DPOGS214663-PA
MHLDTTLAQIASLQAGLTILVVLALATGLNAYKNPHYAPNRSVNVHLFEWKWDDIAAECERFLGPKGFGGIQVSPPNENVVLRNNNRPWWERYQSMSYKLVTRSGNEQQFTNMVRRCNAAGVRIYVDAVINHMTGEPVENVGTGGSTAVFRDFYYPAVPYTRDHFNWPTCGINGEDYMNNAWRVRNCELVGLKDLDQSNEHVRQMIVNYMNKLISLGVAGFRIDAAKHMWPEDLRVIFSRLRNLNTEHGFAPNSRPYIYQEVIDYGGEAVSRDEYTPIGAVTEFKAGLELSNAFRGSNQLRWLSSWGPQWGLLASGDALTFIDNHDNERGHGGGGGILTYKEPRAYKAAIAFLLAHPYGEPQIMSSFQFLDSEIGPPMDFNQNIISPSINSDGSCGNGWVCQHRWRQIYAMVGFRNAAGNSGINDWWDNGSNQIAFCRGNKAFIAFNNDNWTLNQNLQTCLPAGTYCDVISGDKVNNSCRGKTVNVDGNGRANIILGNNEYDIMMAIHVGPEATVKRFEIIIFVTRIDAAKHMWPEDLRVIFSRLRNLNTEHGFAPNSRPYIYQEVIDYGGEAVSRDEYTPIGAVTEFKAGLELSNAFRGNNQLRWLSSWGPQWGLLASGDALTFIDNHDNERGHGGGGGILTYKEPRAYKAAIAFLLAHPYGEPQIMSSFQFLDSEIGPPMDFNQNIISPSINSDGSCGNGWVCQHRWRQIYAMVGFRNAAGNSGINDWWDNGSNQIAFCRGNKAFIAFNNDNWTLNQNLQTCLPAGTYCDVISGDKVNNSCRGKTVNVDGNGRANIILGNNEYDIMMAIHVGPEVKYQADLNHN-