Monarch geneset OGS2.0

DPOGS202980
TranscriptDPOGS202980-TA3735 bp
ProteinDPOGS202980-PA1244 aa
Genomic positionDPSCF300068 - 627859-631995
RNAseq coverage7x (Rank: top 86%)
Annotation
HeliconiusHMEL0095360.058.33% 
BombyxBGIBMGA012267-TA1e-16433.02% 
Drosophilaalpha-Man-IIb-PA1e-15432.21% 
EBI UniRef50UniRef50_O184972e-16632.76%Alpha-mannosidase II n=1 Tax=Spodoptera frugiperda RepID=O18497_SPOFR
NCBI RefSeqXP_001602695.10.035.00%PREDICTED: similar to ENSANGP00000010944 [Nasonia vitripennis]
NCBI nr blastpgi|1565516050.035.00%PREDICTED: alpha-mannosidase 2-like [Nasonia vitripennis]
NCBI nr blastxgi|1565516058e-17934.91%PREDICTED: alpha-mannosidase 2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00038245.2e-72catalytic activity
GO:00302465.2e-72carbohydrate binding
GO:00059755.2e-72carbohydrate metabolic process
GO:00045597.2e-58alpha-mannosidase activity
GO:00159231.4e-44mannosidase activity
GO:00060131.4e-44mannose metabolic process
GO:00045535.5e-08hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00082705.5e-08zinc ion binding
GO:00431697.7e-08cation binding
KEGG pathwaynvi:1001188090.0 
 K01191 (E3.2.1.24)maps-> Other glycan degradation
InterPro domain[693-1244] IPR0110135.2e-72Glycoside hydrolase-type carbohydrate-binding
[238-589] IPR0113305.9e-62Glycoside hydrolase/deacetylase, beta/alpha-barrel
[239-583] IPR0006027.2e-58Glycoside hydrolase, family 38, core
[728-1066] IPR0116821.4e-44Glycosyl hydrolases 38, C-terminal
[595-667] IPR0153415.5e-08Glycoside hydrolase, family 38, central domain
[728-794] IPR0137807.7e-08Glycosyl hydrolase, family 13, all-beta
Orthology groupMCL34474 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202980-TA
ATGGTGCAGAGACCTTACCACTACCACCTCAGCTTCACCAACGACTCTCGACCCATGCAACAACATTTCGTGTACCCAGGTTTTACACAAATGAAATCTATTATACATAAAATGCAGCGTCCAAAAATGTCAACAATCCTAGCGTCACTGGACACGGGAAGAAGTAAATCAACGTACTCAGGAATGAATATTGCCAAAAAATCCGGAATCCATCCGTTCGAGCTGAGACTCACAAGAAATACCTTTAGTAACACCTCCAACCTTCCTCGCAATAAAGACGGTCATTTAATTTTTCCTGATTTAAACACTCTTGAACATTTCAATTATAACAATAGTAATCCGGTAGATACTTATGAGAACCACATTGGCTCGGTTCCAACCTCTAACTTTTTTCCCCTCGTTCAAAACAACCCTTCACGGACTACTGCACCAGAAGCTAGCGTAAGCGTCTTTCCTCTAGTAACTTCGTCGACCACGGATGTAACCGAGGATAGCGGTCTACTTATCAAAAGTATCTACAACTCTAGCACTAATAAAACCGCTAATGATTTCGAAGTCCCGAAACAAAAATACACGATAGTAGATTTCGATCCGCAAGAAGAAAAAGCGGACTTTTTAATAGAGGAACGTCCCATCGAGACGCTGGTGTCATCCCACAATTACAATACGAGTATGTCGACAGCACCCTTCATTTGTACACATAACTATGAGGCTAAAGCTGACATAGATGCTCAAGAAAAATTTTCTGAATTTAACATAGAGGTGATACTGGTTCCGCGGTCGGACGTTCATTCGATTTGGAAAAAACCATTTGAAAAGTTACATAAAAATTCCGTCAGATTTATTATATCAAATATAGTAAAAAAATTACAGTTTTATCCAAATCTAACATTTACTTGGAACGAGGTATCACACCTTAGTCAATGGTGGAAAAGTGCTCGTCAAAAAAGTCGCACCGCACTTCGTAAACTAGTAAAAGAAGGAAGATTAGAAATTACAACAGGCGCCTGGGTAGAAACAGACGAGGCTACCTCACACTTGTTTGGGATTGTTCACCAATTGATGGAAGGGCACCAGTGGTTGCAGTATAACTTAAATTATTCGCCCGACGTGGCGTGGCTTACGAATAGCGTAACCCACAGCCCCACTCTGCCCTATCTTCTATCAGCATCCGGGATAACCAGTTTGGTTGTAACAAATTTACATTTCGCTTGGCAGCAATATTTAACAGAGTATCAAGAAACCAACTTCATGTGGATTCAAAACTGGGATACCGACAAAACGACTCAGACAACTCTTAACGAAGCCCTTAAAAAAATAGGCAACGACCGGTTCCAAAAACATTCTGTTTTAACACACTATCTACCATTTAATTCTGCCGGAGTCAGAGCTTCTTGTCCTCAAGGTGATATTTGCAGCGAGGAATTTAATTTTGTGAATTCCGACAACCATCTGGATATCAATTCTTTCAACGTTAAAGAAAGGTCTGAGAAAATACTTGAACAGTATTCTAAAACTGGAACAACGTCATCTCACAACGTGGTGCTGGCGCCTATAGGCAGCTCCTTCAGTTATGAATTGCAATCCGAATTTGACTTACAGTATAATAATTACCAGAAAATTTCGGAATTTGTGAATGCGAATCAAGATATTTATAAAGCAACGATTGATTTTGGAACACCGAAAAATTATTTTGAAAGTTTGTTTTCTAGTCCAACATCTTATCCCACTTTAAAAGGAGACTTCTTGAATTTTGCTGATATCAGTGACGGCAGCCCAGCTTACTGGACGGGGTTCTTTACCACTAGACCTCAATTTAAGATTTTGCTGAGACGTCTTCAGGCAACATTACGCAGCTCAGAAATTTTATTTACCTTCGCAATGAGCTATAACGTGTTGAAAAAAAATGACGTGTCTACATTGTTCGGCCGATTGGTGAATGCTCGCGAAACTGTAGCCCGTCTCCAGGACAGGAACGTCGTCGGCGGCACTTTAAAGGCAGTGGCGCTGCGATACGCTCACAGAGAGATAGTTAAGACAGCACAAGACTGCTGGTACATACAAGAGGTAGCGGCCAGCTTGCTCAGCTCTAAACCTGACCAAAACACAACGTATCTGAAAAAATACGTCTATAGAGAGGGAGAATTTATTTCTTCTTTTAAGTCCGTCACGTCAGGAGATCAAATATATATATTTAATTCTCTCAGTCACGAAAGAACTGAAATAGTAGAATTGGTAAGTAGGTACTCTGGCATAAGAATTTTGGATCACAATAAGAAAGACGTTAGCCTTCAAATAAAACCAACCTTTAAGTATGGCTTTCAAAATGTCGTTAAGATATCCAAGCATTTTTTTCGTATCATATTTGTTGCTGTCATTCCTCCGTTTTCATTTCAACTTTTTAAGATAAAAGATACCTTTGATACGACACAGAGTCTCTCTACTTTATATTGTACGGCTTGCGTTGCCGAGGAGGACGATGTCACTCCACTGTCTCCATTCACCTTGCATCCGGTCGAGACGGGAGACGTACAGCTGGAGAACTATAAGTATCGTCTTATTTTTGATGAATACACAGGTTTTTTTAAGACGGTCACTGATAAGTCTACTAATATTGAAAAACAAATTTCGATTGAGTTCGGTGCTTTCAGAAGTTCACATATAAACTCTGGTATGTTTTTATTTAATACAAATGTTTCAAAACCACTGGAGGATATCTTATCTTCTTATAAACGAAATAATGGTTCAAAAGTTGTGATGATCATATCCGGATTCATTACCACCGAATTCATATTATTTTATGGCAAATTTTTATACCATAGTGTTACAATTTATAATTTAGTGCACAGTCCTTTGTCCAGCGCTATAAGAGTAGAAACAAAAATCGATTACGACCTGTCACCGAAACATCGGGAACTGGAGGTGTTTATGTCGATACAGACAGATATAAACAACGGCAACCCTCCGGAGATCGTTATTGATAATAATGGTTTTCAATACACTGCACGAACTATCAACATGAGCAGAAGGGTGGAATCCAACATGTACCCTATGACGAGTATGGCCTTTATACAGGATCACAAAAATCGTTTAACTATTATAACTGATCACGCACAAGGTGTGACAGCGTTTCAAGAAGGTCAACTGATAATTATGATGGATCGAAGAATACTCTTCGACGACGGTCGAGGGTCGAATGAAGGCCTCGCAGACAACACCGCCGCCTGGCAGACACATTACATACTGCTGGAGACCTTTACCGCACCTTACACCAGCTATCAAAAAGAAGAATTGAAGATGTCCTTGATGTTACCCAGCTTTTCAGCAATATATTTAGCCAACATTTTAAATTTTTTGATAGATATATACTTTATAGATAATAACAGAACTCATTCTTGTCAATTTGCATTCTTGCCACTGGTTAAGATATCATTTCCTTGTGACGTCACTGTCCTTAATTATAGAGCGATTCTAAATAGAGGAACTCCCGATTATTATATACCCAATATTGCATTATTGACACTACACAAACAAAGCTTCTCATGTCTAATAGAACACAATAGTTTTATTGATTGCAACGGGGATAGTTCGTTTATTTTGCAACAGATTTTACGTAATGCCAAAGCTGTTTACCAGACCAACTTAGTGGGTACATCGGAGGGTGTACCTATAAGTATTTTAAATAAAGCCAACTTTCCACCCATGGAAATTTCAACTTTTAGAATACACTTTTAA

Protein sequence:

>DPOGS202980-PA
MVQRPYHYHLSFTNDSRPMQQHFVYPGFTQMKSIIHKMQRPKMSTILASLDTGRSKSTYSGMNIAKKSGIHPFELRLTRNTFSNTSNLPRNKDGHLIFPDLNTLEHFNYNNSNPVDTYENHIGSVPTSNFFPLVQNNPSRTTAPEASVSVFPLVTSSTTDVTEDSGLLIKSIYNSSTNKTANDFEVPKQKYTIVDFDPQEEKADFLIEERPIETLVSSHNYNTSMSTAPFICTHNYEAKADIDAQEKFSEFNIEVILVPRSDVHSIWKKPFEKLHKNSVRFIISNIVKKLQFYPNLTFTWNEVSHLSQWWKSARQKSRTALRKLVKEGRLEITTGAWVETDEATSHLFGIVHQLMEGHQWLQYNLNYSPDVAWLTNSVTHSPTLPYLLSASGITSLVVTNLHFAWQQYLTEYQETNFMWIQNWDTDKTTQTTLNEALKKIGNDRFQKHSVLTHYLPFNSAGVRASCPQGDICSEEFNFVNSDNHLDINSFNVKERSEKILEQYSKTGTTSSHNVVLAPIGSSFSYELQSEFDLQYNNYQKISEFVNANQDIYKATIDFGTPKNYFESLFSSPTSYPTLKGDFLNFADISDGSPAYWTGFFTTRPQFKILLRRLQATLRSSEILFTFAMSYNVLKKNDVSTLFGRLVNARETVARLQDRNVVGGTLKAVALRYAHREIVKTAQDCWYIQEVAASLLSSKPDQNTTYLKKYVYREGEFISSFKSVTSGDQIYIFNSLSHERTEIVELVSRYSGIRILDHNKKDVSLQIKPTFKYGFQNVVKISKHFFRIIFVAVIPPFSFQLFKIKDTFDTTQSLSTLYCTACVAEEDDVTPLSPFTLHPVETGDVQLENYKYRLIFDEYTGFFKTVTDKSTNIEKQISIEFGAFRSSHINSGMFLFNTNVSKPLEDILSSYKRNNGSKVVMIISGFITTEFILFYGKFLYHSVTIYNLVHSPLSSAIRVETKIDYDLSPKHRELEVFMSIQTDINNGNPPEIVIDNNGFQYTARTINMSRRVESNMYPMTSMAFIQDHKNRLTIITDHAQGVTAFQEGQLIIMMDRRILFDDGRGSNEGLADNTAAWQTHYILLETFTAPYTSYQKEELKMSLMLPSFSAIYLANILNFLIDIYFIDNNRTHSCQFAFLPLVKISFPCDVTVLNYRAILNRGTPDYYIPNIALLTLHKQSFSCLIEHNSFIDCNGDSSFILQQILRNAKAVYQTNLVGTSEGVPISILNKANFPPMEISTFRIHF-