Monarch geneset OGS2.0

DPOGS210044
TranscriptDPOGS210044-TA1737 bp
ProteinDPOGS210044-PA578 aa
Genomic positionDPSCF300017 - 1220821-1224267
RNAseq coverage4570x (Rank: top 3%)
Annotation
HeliconiusHMEL0138615e-16749.22% 
BombyxBGIBMGA003057-TA3e-17349.30% 
DrosophilaMal-B2-PD6e-15651.09% 
EBI UniRef50UniRef50_D8KY557e-17149.30%Alpha amylase n=3 Tax=Obtectomera RepID=D8KY55_BOMMO
NCBI RefSeqNP_001182391.11e-17149.30%alpha amylase [Bombyx mori]
NCBI nr blastpgi|3065186602e-17049.30%alpha amylase precursor [Bombyx mori]
NCBI nr blastxgi|3065186601e-17050.36%alpha amylase precursor [Bombyx mori]
Group
Gene OntologyGO:00431696.6e-131cation binding
GO:00059756.6e-131carbohydrate metabolic process
GO:00038246.6e-131catalytic activity
KEGG pathwaydme:Dmel_CG149355e-154 
 K01187 (E3.2.1.20, malZ)maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain[3-541] IPR0159021.8e-165Alpha amylase
[30-430] IPR0065896.6e-131Glycosyl hydrolase, family 13, subfamily, catalytic domain
[18-483] IPR0178531e-121Glycoside hydrolase, superfamily
[196-483] IPR0137812.1e-105Glycoside hydrolase, subgroup, catalytic core
[45-401] IPR0060472.5e-89Glycosyl hydrolase, family 13, catalytic domain
Orthology groupMCL10053 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210044-TA
ATGCGTTTCCTGATCCTCTCTCTCGCTGTCTTCACGTCAGCGGTTGCCGCTTCTGATACGGAGTGGTGGAAGACCGCCTTGATCTACCAAATCTATCCGCGATCCTTCAAGGACAGTAATGGCGACGGCATCGGCGATCTTAATGGTATCACGGAGAAGCTGGTTTATCTGAATCAGACGGGAGTTGACGCGATCTGGCTCTCACCGATCTACCTCTCGCCGATGTATGACTTTGGGTACGACATTACGGACTACAGGAAAATAGCCCCCGAATACGGTACTATGGACGATTTCAAGACGCTCATGACAGAAGCACGGAGACTTGGTATCCGTGTAATAATGGACTTGGTCCCCAACCACACGGGCAATGAGAGCGAATGGTTTCAGAAGTCCATCCGACGCGAGCCAGGATACGAGGATTACTATATATGGGCGGACGGCATCAAGACCGAAGGATCCAACGACACTAAGCCACCGAGCAATTGGGTAAGCACTTTCCGGAAGAGTGCGTGGGAATACAATTCTGTGCGCGGTCAATACTACCTCCACAAATTTGTAATCGGACAACCAGATCTTAATTATCGCAGTACAAGAGTTCAACAGGAAATGAAGGATGTCCAGAAATTTTGGCTCGATTTGGGAGTATCCGGTTTCCGTGTGGACGCAATCAATCATCTGTACGAATCTAATCCCGCTAATTTCGGTGGTCGCTACCCAGACGAGCCTTTATCAGGAAACCCCAACACCAATCCCGACGACTACGAGTACCTGAACCACATTCATACCGAAAACCTGAACGAAACCTATGAAGTGGTTTACGACTGGAGAGATCTTCTCGACGAGTACATAGAACTGCAGGGGGAATACAAGATCATGATGACGGAGGCTTACGCGGACTTGGACAGCATGATGCGGTACTACGGCACCAGCACCAGGAACGGATCTATTCCCTTCAACTTCAGCTTTTTGGGAGACATCACCAAGGATTCCGACGCGAGACATATTAAGACTGTCATCGATAAATGGATGACGTACATGCCGAGTGGAAGAACTGCCAACTGGGTGAACGGTAACCACGATCAAAGCAGGATGGCTAATCGTCAGGGGGTCGACAGAGTTGATGCTATGAACATGATAGCACTGTTGTTACCTGGTGTTGCCATCACATACCAGGGTGAGGAAATAGGAATGACAGATGGAGAGGTCAGCTGGGAAGAGACGAAGGACCCGCAGGCTTGTAACACTGACGACCCCGTGAACTACTGGAAGAAGTCGAGAGACCCCAACCGTACGCCCTTCCACTGGGATAACAGCACTAATGCTGGATTCTCTACCGGAAAGACTTGGCTACCGGTTGCTAGTAACTACCACAAAGTAAACTTGGCTGAACAAATCAACAACACCAAAAGTCACTACCAGTTCTACAAGGATCTCGCAGCAATAAGAAAGATGGCAGCTGTGAAATATGGAGATGTAGACACAAGAGCTCTGTCAGAAACGGTATTAGTCGTCACAAGGTTACTACCGGGCGAGCAGGGAGTATTGGGCATTGTGAACTTATCAGATGAGGACCAATATGTTGATCTGACCTCGCTGCGTTTAATACCGAGAGTGATTAAAGTTAGGGCTGTTGGAGCCAATTGTGATAATGTGAAGGGGACTCTTCTTATCAAGAACAAAATACCAGTAAATGCTCACTGCGCCTTAGTTCTACAAACTATCCGACACTGCTGTTGA

Protein sequence:

>DPOGS210044-PA
MRFLILSLAVFTSAVAASDTEWWKTALIYQIYPRSFKDSNGDGIGDLNGITEKLVYLNQTGVDAIWLSPIYLSPMYDFGYDITDYRKIAPEYGTMDDFKTLMTEARRLGIRVIMDLVPNHTGNESEWFQKSIRREPGYEDYYIWADGIKTEGSNDTKPPSNWVSTFRKSAWEYNSVRGQYYLHKFVIGQPDLNYRSTRVQQEMKDVQKFWLDLGVSGFRVDAINHLYESNPANFGGRYPDEPLSGNPNTNPDDYEYLNHIHTENLNETYEVVYDWRDLLDEYIELQGEYKIMMTEAYADLDSMMRYYGTSTRNGSIPFNFSFLGDITKDSDARHIKTVIDKWMTYMPSGRTANWVNGNHDQSRMANRQGVDRVDAMNMIALLLPGVAITYQGEEIGMTDGEVSWEETKDPQACNTDDPVNYWKKSRDPNRTPFHWDNSTNAGFSTGKTWLPVASNYHKVNLAEQINNTKSHYQFYKDLAAIRKMAAVKYGDVDTRALSETVLVVTRLLPGEQGVLGIVNLSDEDQYVDLTSLRLIPRVIKVRAVGANCDNVKGTLLIKNKIPVNAHCALVLQTIRHCC-