Monarch geneset OGS2.0

DPOGS201891
TranscriptDPOGS201891-TA1821 bp
ProteinDPOGS201891-PA606 aa
Genomic positionDPSCF300191 + 419545-423760
RNAseq coverage65x (Rank: top 67%)
Annotation
HeliconiusHMEL0147420.068.80% 
BombyxBGIBMGA006066-TA0.070.15% 
DrosophilaMal-A1-PA2e-15849.72% 
EBI UniRef50UniRef50_O160985e-15949.91%Maltase 1 n=30 Tax=cellular organisms RepID=MAL1_DROVI
NCBI RefSeqXP_002052293.11e-15949.91%maltase 1 [Drosophila virilis]
NCBI nr blastpgi|25764049e-15947.91%maltase 1 [Drosophila virilis]
NCBI nr blastxgi|25764043e-16048.05%maltase 1 [Drosophila virilis]
Group
Gene OntologyGO:00431692.1e-116cation binding
GO:00059752.1e-116carbohydrate metabolic process
GO:00038242.1e-116catalytic activity
KEGG pathwayaag:AaeL_AAEL0006421e-158 
 K01187 (E3.2.1.20, malZ)maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain[24-606] IPR0159025.8e-164Alpha amylase
[54-458] IPR0065892.1e-116Glycosyl hydrolase, family 13, subfamily, catalytic domain
[42-517] IPR0178536.4e-115Glycoside hydrolase, superfamily
[222-517] IPR0137811.5e-99Glycoside hydrolase, subgroup, catalytic core
[69-428] IPR0060471.2e-81Glycosyl hydrolase, family 13, catalytic domain
Orthology groupMCL10053 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201891-TA
ATGGGGTCGTGTAAAGCGTTGGGCATAACTGCTGCCCTTTTCTTGGGCTTAATACTGGTTGGAGGTATTATCACAGTAGCAGTGTTGTGGTCTCAAGATAATGCAGTTGACCCTCCGATTATTATTCCCACTGACTGGTGGGAGCACTGCGTTCTCTATCAGATTTATCCTCGCTCTTTCAAAGACACCGACGGAGATGGCATCGGAGATTTAAAAGGTATCACCCAAGAGCTCGAGCATTTCGTGGATGCCGGAGTGGACGCTATATGGATGTCTCCGATATTTGCCTCCCCGATGGTGGACTTCGGATATGACATCAGCAACTTTTACGAGATACATTACGAGTACGGAACCATGGAGGACTTCGAGGCTCTCCTTGAGAAGGCGCATCGTTTAGGAATAAAGGTGTTATTGGATTTCGTGCCTAACCATGCAAGCAATGAATCGGATTACTTTAAGAAATCAGAAGCCAGAGATCCCGAATATGAAGATTTTTTCGTTTGGGCGGACGGGATCCCGGATCCAAACAATGCCAGTAACATTTTACCTCCATCTAACTGGGTAAGCCAATTTGATGGATCAGCATGGCAATGGAGCCCAATTCGTCAGCAGTTTTACCTTCACCAATTTGCAGTTCAGCAAGCTGACTTTAACTTTAGGAACGAGTCGGTCAGACAAGAGATGAAGAACATCATGAAATTCTGGCTCGACAAAGGGGCAGACGGATTCAGAGTCGACGCTCTGCCTTTTCTCATGGAAGCTAATCCTGATGACTACGGCGGTAGATATCCTGATGATCCCCTTAGCGGAAAAATTGGACTTGAACCTCATCAACTAGGATACACCATTCCTCTGTACACTAAAGATCTCATTGAATTATACGATGTAGTTTACGAATGGCGAGAATACGTGGATCAATATTGGAAGGAAAATGGCGGAGACACTCGAGTGCTGTTGTCCGAAGGTTACGCAAATATCTCCATGACGATGCTTTACTATGGTAACAAACAGGGAAAATTCGGTGCCCACTTCCCCTTCAACTTTGATTTCATTACCGATGTCTCTAATAATTCAAATGCAAGGGACTTCGTTTACACCATTCAGAAATGGCTCACGTACAAGCCCTTCGCAGCAACAGCTAACTGGGTGTTTGGCAATCATGACAATAATAGGATGGCAACTCGATTCCGAGAAGACATGGTGGATGGTCTTAACGCCCTGGCAATGATACTACCAGGTGTAGCTGTCACCTACCAGGGAGAAGAGATCGGTATGCAAGATGGGTACGTGAGCTGGGAGGATACTGTTGATGTAGAAGCCCTCAACAGAGGCGACAACGAAACCTACATGCTTTACTCGCGAGACCCAGCAAGAACCCCATACCAATGGAACGGTTCGCTCAATGCCGGTTTCTCAACCGCCAACAAAACATGGCTACCGGTGGCTGATAACTATAAGGAACTAAACCTACAAGCTCAAAAGGCAGCTAATGTTAGCCATTTTAAAGTTTATCAAAAATTAACAGCTCTTCGCAAGGAGATGTCTATGATCCATGGAGATTACGAAGTGAGAGCGTTTTCCGATCGCTCCTTCTACGTAGTACGAAACTTCAGGACCTACGACACATTTGTCTTATTGTTCAACGTCGCCGATACAGCAGATATTATCAATCTAACTAGAATCCAAGACATAAAAGTGCCCTCCACTGTTGAAGTAGCCAGTATTCATTCCAGTAGGAGAGCAGGTGACGTCATCGAAGAAAACCTGATACAACTAGAAGCAGGAGAGGCGCTAGTGCTTCGAGATGCGCCATTGGAATAA

Protein sequence:

>DPOGS201891-PA
MGSCKALGITAALFLGLILVGGIITVAVLWSQDNAVDPPIIIPTDWWEHCVLYQIYPRSFKDTDGDGIGDLKGITQELEHFVDAGVDAIWMSPIFASPMVDFGYDISNFYEIHYEYGTMEDFEALLEKAHRLGIKVLLDFVPNHASNESDYFKKSEARDPEYEDFFVWADGIPDPNNASNILPPSNWVSQFDGSAWQWSPIRQQFYLHQFAVQQADFNFRNESVRQEMKNIMKFWLDKGADGFRVDALPFLMEANPDDYGGRYPDDPLSGKIGLEPHQLGYTIPLYTKDLIELYDVVYEWREYVDQYWKENGGDTRVLLSEGYANISMTMLYYGNKQGKFGAHFPFNFDFITDVSNNSNARDFVYTIQKWLTYKPFAATANWVFGNHDNNRMATRFREDMVDGLNALAMILPGVAVTYQGEEIGMQDGYVSWEDTVDVEALNRGDNETYMLYSRDPARTPYQWNGSLNAGFSTANKTWLPVADNYKELNLQAQKAANVSHFKVYQKLTALRKEMSMIHGDYEVRAFSDRSFYVVRNFRTYDTFVLLFNVADTADIINLTRIQDIKVPSTVEVASIHSSRRAGDVIEENLIQLEAGEALVLRDAPLE-