Monarch geneset OGS2.0

DPOGS203031
TranscriptDPOGS203031-TA3417 bp
ProteinDPOGS203031-PA1138 aa
Genomic positionDPSCF300068 + 616002-623064
RNAseq coverage469x (Rank: top 26%)
Annotation
HeliconiusHMEL0095340.069.70% 
BombyxBGIBMGA012267-TA0.067.71% 
Drosophilaalpha-Man-IIb-PA0.042.48% 
EBI UniRef50UniRef50_O184970.067.53%Alpha-mannosidase II n=1 Tax=Spodoptera frugiperda RepID=O18497_SPOFR
NCBI RefSeqXP_970968.20.047.83%PREDICTED: similar to mannosidase alpha class 2a [Tribolium castaneum]
NCBI nr blastpgi|22455680.067.53%alpha-mannosidase II [Spodoptera frugiperda]
NCBI nr blastxgi|22455680.067.53%alpha-mannosidase II [Spodoptera frugiperda]
Group
Gene OntologyGO:00038249.5e-103catalytic activity
GO:00302469.5e-103carbohydrate binding
GO:00059759.5e-103carbohydrate metabolic process
GO:00159234.7e-82mannosidase activity
GO:00060134.7e-82mannose metabolic process
GO:00045591.4e-77alpha-mannosidase activity
GO:00045531.7e-30hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00082701.7e-30zinc ion binding
GO:00431693.5e-15cation binding
KEGG pathwaytca:6595830.0 
 K01191 (E3.2.1.24)maps-> Other glycan degradation
InterPro domain[544-1138] IPR0110139.5e-103Glycoside hydrolase-type carbohydrate-binding
[33-427] IPR0113308.4e-98Glycoside hydrolase/deacetylase, beta/alpha-barrel
[582-1135] IPR0116824.7e-82Glycosyl hydrolases 38, C-terminal
[172-427] IPR0006021.4e-77Glycoside hydrolase, family 38, core
[435-526] IPR0153411.7e-30Glycoside hydrolase, family 38, central domain
[572-704] IPR0137803.5e-15Glycosyl hydrolase, family 13, all-beta
Orthology groupMCL15988 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203031-TA
ATGTTGGACCTCTCGCCACGTTCTATATCTTTGGAAACGATGCATGAAGCCATTTCGTCGGCATCAGCGCCCCTTTTCCCGAGTTGGTTACGAAGTAAGGAGTTTTGGGACAAATCCTTTGAAGATCGCTACGAGAAAATGAAGAACGACTCCCGCCGTCCGAGACTCAAGGTGATCGTGGTGCCGCACTCCCACAACGACCCCGGCTGGCTGAAAACCTTCGAGCAGTACTTCGAGTGGAAGACCAAGAACATCATCAACAACATGGTCACCAAGTTGCACCAGCTCCCCAACATGACTTTCATCTGGAGCGAGATATCGTTCCTTAACGAGTGGTGGGAACGGTCGCATCCCGTTAAGCAGAAGATAGCGCTGGTGGCTGGCGGCTGGTGGCTGGCGGCTGGCGGCTGGCGGCTGGTGGCTGGTGGCTGGCGGCTGGCAGGCTGGCGGCTGTCGGCTGGCGACCGGTCGCCGGTGGAGGGTTTAGTGACGGTAGCCGTGGTAGCGTGCACTGCGCTGAAGAAGTTAGTTAAAGAAGGTCGGCTGGAGATAACGAGCGGCGGCTGGGTCATGCCGGACGAGGCCTGCACACATATATACGCACTCGTCGATCAGTTCATTGAAGGGCACCAGTGGGTGAAGACGAACCTGGGGACTGTTCCGCGGATAGGATGGTCCATCGACCCGTTCGGTCACGGCCCCACGGTCCCTCACCTCCTGGAGCTGAGCGGGCTGGAGGGAGCTATCATACAACGGATCCACTACGCCTGGAAGCAGTGGCTGGCGAGGAGGCAGATAGAGGAGTTCCACTGGGCACCGGGCTGGAGCTCGCGCAGGCCCACGCTGGTGGTCCACAACCAGCCGTTCGACATCTACTCCATCAAGAGCACGTGCGGACCTCACCCCGCCGTCTGTCTCGGCTACGACTTCCGCAAAATCCCCGGCGAGTACTCCGAGTACACGGCCAAGTACGAAGAGATCACCGACCAGAACGTGCAGAGCAGAGCGCGCACGCTCCTCGAGGAGTACGAACGCGTCGGCTCCCTGACGCCTCACAACGTGGCGCTGGTGCCGCTGGGGGACGACTTCCGCTACGAGCACGCCTCGGAGTTTGACGCCCAGTACAACAACTACATGAAGATGTTCAACTACATCAACGACCGTAAAGACATCTTCAACGCCGACGTCTCCTTCGGAACTCCCCTCGACTACTTCAACGCTATGAAGGAGAGACACGACAACATCCCCGTCCTCAAAGGAGACTTCTTCGTCTACTCCGATATATTTAGTGAAGGCAAACCGGCCTACTGGTCGGGCTACTTCACGACGCGTCCTTATCTCAAAATTTTGACGCGACAGTTCGAGCATCATTTACGAACGGCGGAGATTCTGTTCACTCTCGTCTCGAACTACGTGTCGCAATCTAAAAACAAGAAACTCATTGCCTCCGAGAAACGACTGGAGAAGCATTACGAGCAGTTGGTGACCGCGCGCAGGAACCTCGGCCTCTTCCAACACCACGACGCCATCACCGGCACCTCCAAGTCCACGGTGATGACCGACTACGGCACCAAGCTGCTCACCAGCCTCTACCACTGCATCCGCCTTCAGGAGACGGCTCTCACCACTCTCATGCTGCCCGACGAGTCCCTGCACTCCCAGAGCGTGCTACAGAGTCAGATGGAGTGGGAGTCGTACGGCAAGCAGCCGCGTCAACTGCAAGTGTCGCACGTGGACAAGAATCAGGTGATTTTGTTCAACCCTCTGACCGAAGAGAGGACGGACGTCATATCGCTCAGGTCCAACACCACCAACATACGGGTGTACGACACTCGCAGGAAGGAATACGTCCAGTATCAAATTATGCCCAATATAGAGATCCGTGAAAACAAAAAGTTCGTCATCAGCGACATGAACTTCGACATCCTGTTCGTGGCGACTCTCCCGGCGCTGACGGCGGTGACCTTCCGCCTGGAGGATCACAGCAACATCTCCCAGCACGCGGTGGTGTTCTGCAACAGTTGTGACCACCGCGCCACCTCGCAGCGACCCTCAAACTTCGCCTACAAGAAGATGATGCCCGGCGACATCCAGCTCGAGAACTCAGTGCTGAAGCTGCTCATCGACCGCAACTCCGGCCTCCTGAGGCAGCTCTACAGAAAAGACATACGCAAGAGAAACGTAGTCGAGATACAGTTCGGGGCCTATCAAAGCGCGCAACGACATTCTGGAGCTTACCTGTTCATGCCTGACTATGATTCTCCCGAGAAAAACGTTTTGAACTCTTACACGAACGGTGAGAGCTTGCAGGACGACAACATAGTCATCATCTCGGGGCCGGTCTCCACTGAAATAACAACTTTTTACTTGCCCTTCTTAGTTCACACTTTGAGGATTTACAACGTAGACGACCCCGCTCTGCTGCGTGCTGTACAGATTGAGAACATCGTGGATTTCGAAAGCCCGCCCAAGAACAGGGAGACCGAGCTGTTCATGAGGTTCCAGACCAACATACAGAACGGGGAAGTACCAGAGTTCTATACGGATCAGAACGGGTTCCAGTACCAAAAGAGAGTCAAGGTGGACAAGTTGGGCATCGAGGCCAACTATTACCCCATCACAACAATGGCTTGGCTGCAGGACGAGGAGAGCCGTCTGACGGTGGTGACCGACCACGCGCAGGGAGCTTCCGGCTTCGAGCCCGGGCGGCTCGAGCTGATGATGGACCGCCGCACCTTGTACGACGACCATCGCGGCATAGGAGAAGGCGTCGTCGACAACAAGCCCACCGTGTTCAGGAACTGGCTGCTGGTGGAGCCGACAACCTCAACCCCCCCCGACTCGAGCGACCGCGCCAACACTGCACGCGACAAGCGAGACGCTCACGTGTCCGGCGTCCTCAGCGAGCGTCACTTCCGGCCGGGGCAGGTGGGCAACGAGTACGAGCTGCCGTCGCTCGCCGCCGGGAAACTCAGTCGTCACCTCAACTACCCGGTGAACGTGTACCTGGTGGACTCCAGCGAGGTGGAGGGCGGCGATGTGGTCACCCGACACGAGCACGTGTTCGTCCAGGATTTCCCTCGAGACATCCACCTCCTCACCTTGCGCACCTTGTCGGACCCTGCCCTCGACCAGCTGCCGACCGACACCGCGCTCCTGGTGCTGCACCGCCCGGCCCACAGCTGCTCGGTGGGGGAGCGCCCCCCCTACTCCTCCCCCTCCACCGGCGACACGCCCACTCCCCGCTCCCCCCCAGCGAGCGCACGCTTCACCCGCGCCACGCGCTTCCCTCCCCTGCGCGTGTCTAACGTGACGTCTGTGAGCCTGACGGGTGTGGTGGAGCGCCGTGTGCTGACAGGCCTGCAAGACCTGCACGTGGAGCCGCTCGAGATCAAAACATATAAGATACGCTTCTAA

Protein sequence:

>DPOGS203031-PA
MLDLSPRSISLETMHEAISSASAPLFPSWLRSKEFWDKSFEDRYEKMKNDSRRPRLKVIVVPHSHNDPGWLKTFEQYFEWKTKNIINNMVTKLHQLPNMTFIWSEISFLNEWWERSHPVKQKIALVAGGWWLAAGGWRLVAGGWRLAGWRLSAGDRSPVEGLVTVAVVACTALKKLVKEGRLEITSGGWVMPDEACTHIYALVDQFIEGHQWVKTNLGTVPRIGWSIDPFGHGPTVPHLLELSGLEGAIIQRIHYAWKQWLARRQIEEFHWAPGWSSRRPTLVVHNQPFDIYSIKSTCGPHPAVCLGYDFRKIPGEYSEYTAKYEEITDQNVQSRARTLLEEYERVGSLTPHNVALVPLGDDFRYEHASEFDAQYNNYMKMFNYINDRKDIFNADVSFGTPLDYFNAMKERHDNIPVLKGDFFVYSDIFSEGKPAYWSGYFTTRPYLKILTRQFEHHLRTAEILFTLVSNYVSQSKNKKLIASEKRLEKHYEQLVTARRNLGLFQHHDAITGTSKSTVMTDYGTKLLTSLYHCIRLQETALTTLMLPDESLHSQSVLQSQMEWESYGKQPRQLQVSHVDKNQVILFNPLTEERTDVISLRSNTTNIRVYDTRRKEYVQYQIMPNIEIRENKKFVISDMNFDILFVATLPALTAVTFRLEDHSNISQHAVVFCNSCDHRATSQRPSNFAYKKMMPGDIQLENSVLKLLIDRNSGLLRQLYRKDIRKRNVVEIQFGAYQSAQRHSGAYLFMPDYDSPEKNVLNSYTNGESLQDDNIVIISGPVSTEITTFYLPFLVHTLRIYNVDDPALLRAVQIENIVDFESPPKNRETELFMRFQTNIQNGEVPEFYTDQNGFQYQKRVKVDKLGIEANYYPITTMAWLQDEESRLTVVTDHAQGASGFEPGRLELMMDRRTLYDDHRGIGEGVVDNKPTVFRNWLLVEPTTSTPPDSSDRANTARDKRDAHVSGVLSERHFRPGQVGNEYELPSLAAGKLSRHLNYPVNVYLVDSSEVEGGDVVTRHEHVFVQDFPRDIHLLTLRTLSDPALDQLPTDTALLVLHRPAHSCSVGERPPYSSPSTGDTPTPRSPPASARFTRATRFPPLRVSNVTSVSLTGVVERRVLTGLQDLHVEPLEIKTYKIRF-