Monarch geneset OGS2.0

DPOGS200705
TranscriptDPOGS200705-TA1506 bp
ProteinDPOGS200705-PA501 aa
Genomic positionDPSCF300274 + 208371-213301
RNAseq coverage1025x (Rank: top 12%)
Annotation
HeliconiusHMEL0118940.085.63% 
BombyxBGIBMGA005240-TA0.082.04% 
DrosophilaAmy-p-PA3e-17660.00% 
EBI UniRef50UniRef50_P816411e-17460.00%Alpha-amylase B n=313 Tax=Mandibulata RepID=AMYB_DROME
NCBI RefSeqNP_001166624.10.081.64%alpha-amylase [Bombyx mori]
NCBI nr blastpgi|2905608750.081.64%alpha-amylase precursor [Bombyx mori]
NCBI nr blastxgi|1569682850.084.63%alpha-amylase [Helicoverpa armigera]
Group
Gene OntologyGO:00431694.3e-115cation binding
GO:00059754.3e-115carbohydrate metabolic process
GO:00038244.3e-115catalytic activity
KEGG pathwayaga:AgaP_AGAP0023172e-179 
 K01176 (E3.2.1.1, amyA, malS)maps-> Starch and sucrose metabolism
InterPro domain[1-498] IPR0159021.9e-198Alpha amylase
[18-403] IPR0137814.3e-115Glycoside hydrolase, subgroup, catalytic core
[17-402] IPR0178538.8e-104Glycoside hydrolase, superfamily
[27-401] IPR0065891.2e-97Glycosyl hydrolase, family 13, subfamily, catalytic domain
[410-498] IPR0060483.4e-51Alpha-amylase, C-terminal all beta
[408-497] IPR0137801.3e-40Glycosyl hydrolase, family 13, all-beta
[74-91] IPR0060463.8e-35Glycoside hydrolase, family 13
[52-321] IPR0060475.1e-23Glycosyl hydrolase, family 13, catalytic domain
Orthology groupMCL10115 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200705-TA
ATGATGCGTTTCATCCTTTTGCTGTGTGCCGTGTCTTTGGCTGTTGCGTACAAAAACCCGCACTATGCCTCTGGACGTACCACAATGGTCCACTTGTTCGAGTGGAAGTGGGACGACATAGCAGCTGAATGTGAAAGATTCCTAGGTCCAAGGGGATTTGGTGGTGTTCAGATTTCTCCACCCAATGAAAACCTTGCTATCTGGTCTCACAACCGGCCATGGTGGGAGAGATACCAGCCCATATCCTATCGTCTCGTAACCAGGTCTGGCAATGAACAACAGTTTTCGAATATGGTGCGAAGATGCAACAATGCTGGTGTTCGGATTTATGTAGACGCGATCATTAATCACATGACTGGAACATGGAATGAGAACGTTGGAACCGGAGGCAGTACAGCCAACTTTGGTAACTGGCATTATCCTGGTGTTCCCTATGGCAGAAACGACTTCAACTGGCCTCAGTGCGTCATTAATGGCAATGATTACAGAAATAATGCAGCCAGAGTCCGCAACTGTGAGCTCTCAGGTTTGAAAGATTTGAACCAGGGATCGGAATACGTCCGCACTCAAATTGTTAATTACATGAATCGCCTCATCGACTTGGGAGTTGCTGGATTCAGAATTGACGCCGCAAAACATATGTGGCCGAATGATCTGAGGATCATCTACGGCAGGCTAAAGAATCTTAACACCAGACATGGTTTCCCATCTGGTGCTCGTCCCTACATCTATCAAGAAGTAATTGATCTCGGAGGAGAAGCTGTCAGACGTGATGAATATACCCCTCTCGCTGCGGTAACTGAGTTCAAATATGGTATGGAGCTTAGCCGAGCTTTCTCTGGTCGCAATCAACTTAGATGGTTGGTGAACTTTGGACCTCAATGGGGTATGTTGGGTTCTTCAGACGATGCTCTTACTTTCATCGATAACCATGACAATCAAAGAGGTCATGGCGCTGGTGGAAATATTCTGACCCACAAAACAGCTCGCAATTATAAGGGAGCCATTGGCTTTATGTTAGCTCACCCATATGGACGACCACAGCTTATGAGCAGTTTTGGTTTCCATAATACTGAGGCTGGACCACCAATGGACAACCGAGGCAATATTATTTCTCCATCCATCAATTCGGACAACAGTTGTGGCAATGGCTGGATCTGCGAGCACAGATGGCGTCAGATCTATGCCATGGTTGCTTTCCGTAACGTCGCTGGAAACACTCGTGTTTCTAATTGGTGGGACAACGGTAGCAACCAGATCGCTTTCTGTCGAGGAGGACAAGCCTTTATAGCTTTCAACCGAGATGGCTGGGATTTGAATCAGAACTTACAGACTTGCCTCCCGGCGGGTAACTACTGTGACGTTATCTCCGGAGAGAAGAGAAACAATCGTTGCACTGGCAAAACTATAACAGTCGGAGGTGACGGCCGCGCCCGTATTCACGTTGGAGCCAACGACTACGACATGTTCCTTGCCATTCACAGGGGCAGCGAGTCAAGACTGTAA

Protein sequence:

>DPOGS200705-PA
MMRFILLLCAVSLAVAYKNPHYASGRTTMVHLFEWKWDDIAAECERFLGPRGFGGVQISPPNENLAIWSHNRPWWERYQPISYRLVTRSGNEQQFSNMVRRCNNAGVRIYVDAIINHMTGTWNENVGTGGSTANFGNWHYPGVPYGRNDFNWPQCVINGNDYRNNAARVRNCELSGLKDLNQGSEYVRTQIVNYMNRLIDLGVAGFRIDAAKHMWPNDLRIIYGRLKNLNTRHGFPSGARPYIYQEVIDLGGEAVRRDEYTPLAAVTEFKYGMELSRAFSGRNQLRWLVNFGPQWGMLGSSDDALTFIDNHDNQRGHGAGGNILTHKTARNYKGAIGFMLAHPYGRPQLMSSFGFHNTEAGPPMDNRGNIISPSINSDNSCGNGWICEHRWRQIYAMVAFRNVAGNTRVSNWWDNGSNQIAFCRGGQAFIAFNRDGWDLNQNLQTCLPAGNYCDVISGEKRNNRCTGKTITVGGDGRARIHVGANDYDMFLAIHRGSESRL-