Monarch geneset OGS2.0

DPOGS200704
TranscriptDPOGS200704-TA1461 bp
ProteinDPOGS200704-PA486 aa
Genomic positionDPSCF300274 + 195725-205719
RNAseq coverage335x (Rank: top 34%)
Annotation
HeliconiusHMEL0118943e-16659.70% 
BombyxBGIBMGA001876-TA4e-16558.44% 
DrosophilaAmy-d-PA8e-15151.90% 
EBI UniRef50UniRef50_P091071e-15156.44%Alpha-amylase (Fragment) n=48 Tax=Coelomata RepID=AMY_TRICA
NCBI RefSeqNP_001166624.13e-16257.77%alpha-amylase [Bombyx mori]
NCBI nr blastpgi|331510304e-16659.45%alpha-amylase 3 [Diatraea saccharalis]
NCBI nr blastxgi|331510302e-16559.45%alpha-amylase 3 [Diatraea saccharalis]
Group
Gene OntologyGO:00431698.5e-111cation binding
GO:00059758.5e-111carbohydrate metabolic process
GO:00038248.5e-111catalytic activity
KEGG pathwayame:4061145e-157 
 K01176 (E3.2.1.1, amyA, malS)maps-> Starch and sucrose metabolism
InterPro domain[11-474] IPR0159021.4e-176Alpha amylase
[10-382] IPR0137818.5e-111Glycoside hydrolase, subgroup, catalytic core
[2-382] IPR0178533.2e-106Glycoside hydrolase, superfamily
[12-379] IPR0065892.3e-95Glycosyl hydrolase, family 13, subfamily, catalytic domain
[388-476] IPR0060488.1e-36Alpha-amylase, C-terminal all beta
[386-475] IPR0137803.3e-32Glycosyl hydrolase, family 13, all-beta
[37-325] IPR0060475.2e-28Glycosyl hydrolase, family 13, catalytic domain
[61-78] IPR0060462.8e-27Glycoside hydrolase, family 13
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200704-TA
ATGTGTATTAAGACAAATCGTCTATTAAATAGAACAACAATAGTGCATCTATTTGAATGGAAGTGGCTCGACGTTGCCGAGGAGTGTGAAAGGTTTCTGGCCCCCAAGGGTTTTGGTGCTGTGCAGATTTCGCCGCCGTCAGAGAATTTGATCATAAAAACACAAAATGGTCTAAGACCCTGGTACGAAAGGTATCAAGTGATGTCATATAATTTAGAAACACGCTCCGGCAACCAAGATGACTTCTTGGACATGACAAAGAGGTGTAATAAAGTTGGTGTTAGGATATACGCAGACGTTGTTATAAACCATATGACTGGTTCACATAAGAGCAATAAAGGAACTGGTGGAAGCACTGCAGACTTTGACAAGTACAGTTACCCATCCGTCCCATACACTGCTGAGAATTTTCATTCATCCTGCATAATCAATAACTATTACAATGCTACTGAGGTTCGTAACTGTCAATTATTAGGACTGAAAGATCTCAATCAGACCGAAGAATATGTAAGACAAAATATTGCAAATTACATAAATAATCTCATAAAACTCGGCGTGGCTGGCTTAAGAATAGATGCCGCCAAACACATGTGGCCAAGTGATCTCAGAGAAATTTATACAAGGTTAGACGATTTGAACCCTGAATTCGGTTTTCCACCAAACACAAGACCGTATATCTATCAAGAGGTAATTTATTATGGAAGCGAGCCGATACGACCTGAAGAGTACACTCCTCTTGGAGACGTCACTGAATTTAGGGTTGGCAATGAACTGAAGAACGTCTTTCGTGGAATTAACTCTATGAAGTGGTTAGTGAATTGGGGTGAGAAATGGGGACTGTCACCTGCTAAAACTGCCTTAGTTTTCATTGACAACCACGACACACAACGTAGCAGCAACATGTTGACATACAAAGAGGCGAGAGCTTATAAGGCCGCCATAGCATTCATGCTAGCACATCCTTATGGCCGACCTCGGATTATGAGCAGTTACTTCTTCACAAATAATGAAATGGGACCTCCTAGTGATGATAGTGAGAATATACTGTCACCGATAATACACGAAGATGACACATGTGGCAATGGCTGGGTGTGTGAACATCGCTGGCGTCAGATATACCAAATGATCGGATTCAGGAATGCTGTCCTTAATACAAGCATCAAAAACTGGTGGAATAATGACAACAAACAAATTGCTTTTGGGAGAGATGGAAAGGGATTTATAGTATTTAATGGAGACGAAGTTGAACTTAACGTTACTCTGCAGACTGGACTTCCCCCCGGGGAGTATTGCGATGTAATATCGGGTTCGAGAGTTGAATCTCATTGCACAGGGAATAAAATATACGTAAATAATGATGGTAGAGCGCATTTTTATAAGTCACAATATGCTGAAGATATGCATATAGCGATCCATGTTGGCAAAGAGTCAAGAGAACATCATAGAAAACGAGATAAATGA

Protein sequence:

>DPOGS200704-PA
MCIKTNRLLNRTTIVHLFEWKWLDVAEECERFLAPKGFGAVQISPPSENLIIKTQNGLRPWYERYQVMSYNLETRSGNQDDFLDMTKRCNKVGVRIYADVVINHMTGSHKSNKGTGGSTADFDKYSYPSVPYTAENFHSSCIINNYYNATEVRNCQLLGLKDLNQTEEYVRQNIANYINNLIKLGVAGLRIDAAKHMWPSDLREIYTRLDDLNPEFGFPPNTRPYIYQEVIYYGSEPIRPEEYTPLGDVTEFRVGNELKNVFRGINSMKWLVNWGEKWGLSPAKTALVFIDNHDTQRSSNMLTYKEARAYKAAIAFMLAHPYGRPRIMSSYFFTNNEMGPPSDDSENILSPIIHEDDTCGNGWVCEHRWRQIYQMIGFRNAVLNTSIKNWWNNDNKQIAFGRDGKGFIVFNGDEVELNVTLQTGLPPGEYCDVISGSRVESHCTGNKIYVNNDGRAHFYKSQYAEDMHIAIHVGKESREHHRKRDK-