Monarch geneset OGS2.0

DPOGS201066
Transcript: DPOGS201066-TA | 1866 bp
Protein: DPOGS201066-PA | 621 aa
Genomic position: DPSCF300185 | 359947-363709
RNAseq coverage: 415x (Rank: top 29%)

Annotation
Heliconius: HMEL008000 | 0.0 | 49.11%
Bombyx: BGIBMGA007199-TA | 0.0 | 68.34%
Drosophila: tobi-PA | 4e-169 | 53.77%
EBI UniRef50: UniRef50_Q7PJP4 | 3e-174 | 54.10% | AGAP003995-PA n=5 Tax=Culicidae RepID=Q7PJP4_ANOGA
NCBI RefSeq: XP_001867055.1 | 8e-177 | 54.53% | alpha-glucosidase [Culex quinquefasciatus]
NCBI nr blastp: gi|170063325 | 2e-175 | 54.53% | alpha-glucosidase [Culex quinquefasciatus]
NCBI nr blastx: gi|170063325 | 1e-176 | 54.84% | alpha-glucosidase [Culex quinquefasciatus]

Group:

Gene Ontology
GO:0004553 | 1.1e-194 | hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975 | 1.1e-194 | carbohydrate metabolic process
GO:0008152 | 9.8e-19 | metabolic process
GO:0003824 | 9.8e-19 | catalytic activity

KEGG pathway: dme:Dmel_CG11909 | 3e-167
K01187 (E3.2.1.20, malZ) maps-> Starch and sucrose metabolism
                                Galactose metabolism

InterPro domain
[86-615] | IPR000322 | 1.1e-194 | Glycoside hydrolase, family 31
[215-552] | IPR017853 | 7.8e-77 | Glycoside hydrolase, superfamily
[338-476] | IPR013785 | 9.8e-19 | Aldolase-type TIM barrel

Orthology group: MCL10426 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201066-TA
ATGAATCAGCCGGGCTTACGATTACATTTGGAGACGAATATTAACGGAGGCATTAGTATTATATCAGAGAGAAGAGGTGTGGCATCCGTGCTCTCAGTAATAGGCTATTTTACGGGCCCTGAAATCGAAGTTGATCCGCGCTTAAATAATGTTGCTATAAAATTCACTAATAACACCTCAGTCAAAATAACAGCCACGAACGTTATGCGACCCAGAAAAGGGGTTGTAGTATTTGTTGACTGGGAAGCGCCTAGTGATATACGCTTAGGAGACTGCATTAATTTAGATTCAAACCATTGGTACGGTGGACCTCAGCAAAAGCGTCAATTTTGGCCGATTGAGAAGCTCGTTCTTCCAGACTATTCTTATATAACGAAGGAAGCAGACAATTGCGGTGTCGCAGAGCCATATTGGCTCAGCTCTAGCGGAACATTCTTCTTCTTTGACAGAAATGTACCATTATTCGTTGATCAAAATACGATTGTAAGAAATGCAGCATGTTTCATAGCGGAAGTTAAGCCTCCTTACACAAAACGGAGGAATCGCAATGATTTGGATTACGTGATCGGTATATTCGACGATGTAAGACAGGCCCATGAGTACGCCGTGGATGTAATTCTGAAGAAACCAAAGGGTCATCCCGACGAAAGAATGTTAACGCACCCGATCTGGTCGACGTGGGCGAGACACAAGCGGAACATCAGCCACGATGTCGTTCTGAAATTTGCTGATGAAATAACTGAACATGGATTTCCTAACAGTCATATTGAAATTGATGATTTGTGGGAAAAATGTTACGGATCTCAAACGGTGGACGCGCGGCGTTTCCCCGACATGAAGAGCACCGTCGACGCTCTCAAAGACAAAGGTTTCCGGGTGACACTGTGGACGCATCCATTTATTAACAAAGACTGTGAGCCCTGGTACACCGAAGCTAAAGAAAAGGGGTATTTAGTGGTGTCCGAGTCGGGTTCAGTAGAAACTAGCTGGTGGAACGATAATGGTACAACCACTGCTTACATTGACTTCTCTAACGAAGAGGCTCGCAAGTGGTATGTCGACAGATTGAAGATACTTCAGCAGGCTTATGGCATTGACAGCTTTAAATTTGATGCTGGGGAATCGAGCTGGTCCCCTCAGATTCCAGTCCTAAACGGCAATGTTCTAGAGCAACCCGTCAGCATCACAGAAGATTATGTCAGAACAGTAGCCGAGTTCGGAGATCTAGTTGAAGTACGGTCGGGTTACAGAACCCAGGACCTTCCCGTGTTCGTTAGAATGATCGACAAGGACTCATACTGGACCTTCGAGAACGGTCTCCCTACGGTGGTCACGACTCTGCTTCAGATGAACTTGAACGGCTACCCGCTGGTGTTACCCGACATGATTGGAGGAAACGGATATAACGCCCCTCCCACCAAGGAACTGTTCATACGCTGGCTGCAGGCCAACACATTCATGCCCAGCATGCAGCTTTCTTACGTACCGTGGGACTTTGACAACGAGACAATCGCCATTAGTAAGAAGTTCATCGACCTTCACGCAAAATACGCTCCGGCGATAATAGCAGCGTGTCGCCGTGCGGTGACGATGGGCAGTCCGGTCAACACGCCAGTGTGGTGGGTGGCACCGCGAGACCCCACCGCGCAGGAGATATGGGACGAGTACATGTTGGGGGAAGACATCCTCGTGGCGCCGGTGCTGGCGAGGGAGGCTCGCGTCCGCGACGTGTACCTGCCCCCGGGCTCGTGGTGGGCTCAAGGGGACCCGAGCCGGGTGTACCCGGGCGGAGAATGGATCAGAGACTACCCCGCGCCGCTTGACACTCTACCTTACTTCGTGAGATCGACCGTCCCCGCGATATAA

Protein sequence:

>DPOGS201066-PA
MNQPGLRLHLETNINGGISIISERRGVASVLSVIGYFTGPEIEVDPRLNNVAIKFTNNTSVKITATNVMRPRKGVVVFVDWEAPSDIRLGDCINLDSNHWYGGPQQKRQFWPIEKLVLPDYSYITKEADNCGVAEPYWLSSSGTFFFFDRNVPLFVDQNTIVRNAACFIAEVKPPYTKRRNRNDLDYVIGIFDDVRQAHEYAVDVILKKPKGHPDERMLTHPIWSTWARHKRNISHDVVLKFADEITEHGFPNSHIEIDDLWEKCYGSQTVDARRFPDMKSTVDALKDKGFRVTLWTHPFINKDCEPWYTEAKEKGYLVVSESGSVETSWWNDNGTTTAYIDFSNEEARKWYVDRLKILQQAYGIDSFKFDAGESSWSPQIPVLNGNVLEQPVSITEDYVRTVAEFGDLVEVRSGYRTQDLPVFVRMIDKDSYWTFENGLPTVVTTLLQMNLNGYPLVLPDMIGGNGYNAPPTKELFIRWLQANTFMPSMQLSYVPWDFDNETIAISKKFIDLHAKYAPAIIAACRRAVTMGSPVNTPVWWVAPRDPTAQEIWDEYMLGEDILVAPVLAREARVRDVYLPPGSWWAQGDPSRVYPGGEWIRDYPAPLDTLPYFVRSTVPAI-
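
The lengths and coordinates reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database. The function names are hypothetical. It assumes the transcript is coding sequence only, ending in a single stop codon (shown as the trailing "-" in the protein), that the InterPro spans are 1-based residue positions, and that the genomic coordinates are 1-based and inclusive.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1866
PROTEIN_AA = 621
GENOMIC_START, GENOMIC_END = 359947, 363709
INTERPRO_SPANS = [(86, 615), (215, 552), (338, 476)]


def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """True if a CDS-only transcript of this length encodes this many residues.

    Assumption: three bases per residue, plus one stop codon when has_stop is True.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def domains_within_protein(spans, protein_aa: int) -> bool:
    """True if every (start, end) span lies inside residues 1..protein_aa."""
    return all(1 <= start <= end <= protein_aa for start, end in spans)


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


print(cds_length_matches(TRANSCRIPT_BP, PROTEIN_AA))       # 3 * (621 + 1) == 1866
print(domains_within_protein(INTERPRO_SPANS, PROTEIN_AA))  # all spans fall within 1..621
print(genomic_span(GENOMIC_START, GENOMIC_END))            # 3763 under the stated assumption
```

Under these assumptions the genomic interval (3763 bp) is longer than the 1866 bp transcript, which would be consistent with intronic sequence, although the record itself does not list the exon structure.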