Monarch geneset OGS2.0

DPOGS214574
TranscriptDPOGS214574-TA3465 bp
ProteinDPOGS214574-PA1154 aa
Genomic positionDPSCF300050 - 609001-621928
RNAseq coverage368x (Rank: top 32%)
Annotation
HeliconiusHMEL0056922e-16360.48% 
BombyxBGIBMGA005142-TA0.070.98% 
Drosophilaalpha-Man-II-PA0.041.02% 
EBI UniRef50UniRef50_D6WTA90.044.54%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WTA9_TRICA
NCBI RefSeqXP_972030.10.044.54%PREDICTED: similar to mannosidase alpha class 2a [Tribolium castaneum]
NCBI nr blastpgi|910862130.044.54%PREDICTED: similar to mannosidase alpha class 2a [Tribolium castaneum]
NCBI nr blastxgi|910862130.044.41%PREDICTED: similar to mannosidase alpha class 2a [Tribolium castaneum]
Group
Gene OntologyGO:00045593.3e-132alpha-mannosidase activity
GO:00059753.3e-132carbohydrate metabolic process
GO:00038241.3e-130catalytic activity
GO:00302464.2e-86carbohydrate binding
GO:00159238.2e-45mannosidase activity
GO:00060138.2e-45mannose metabolic process
GO:00045532.6e-24hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00082702.6e-24zinc ion binding
GO:00431697.4e-20cation binding
KEGG pathwaytca:6607280.0 
 K01231 (MAN2)maps-> N-Glycan biosynthesis
InterPro domain[126-498] IPR0006023.3e-132Glycoside hydrolase, family 38, core
[120-499] IPR0113301.3e-130Glycoside hydrolase/deacetylase, beta/alpha-barrel
[617-1150] IPR0110134.2e-86Glycoside hydrolase-type carbohydrate-binding
[646-1145] IPR0116828.2e-45Glycosyl hydrolases 38, C-terminal
[501-584] IPR0153412.6e-24Glycoside hydrolase, family 38, central domain
[627-771] IPR0137807.4e-20Glycosyl hydrolase, family 13, all-beta
Orthology groupMCL11433 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214574-TA
ATGAGAATTAGAAGATTATCATATTTTTTTTGGGCAACTTGCATAGTAGCATTTATATTTATTTTATACGTGGTAACGGATCTTAGCTTCAAGCTGCCAAGCATAAAACCGGCTATGGTGGAGCTGGATGATAATAAATGGACAGCATTTGAATCAAAACTTCAGAAAATAGAAAAGGAGCTTGACCAACATCATGCCGTTGTTGGTGAGATTAAGAACGCCATGGATCAGATTGTGGAACAATCCCAAGAGTTTTACCCACCACAGAAGAGACCAAAGGAGCGCAAGGAAAGTAAAATACGAGAGTTCAAGGTTGTGGGAGATGAACCACCGAGAATGCATAAAAAGATTAACATGTCCTGTCCAGCCATACAGACCTCAGCTAAGTCTGATATACAGATGCTCTCGATGTATGACCGAATCATGTTCGATAATGTTGATGGCGGAGTGTGGAAACAGGGCTGGAACATTGAATACAAGGATAACCAGTGGAGTAGCAAAAATAAGCTGAAGGTGTTCATTGTACCTCATTCACATAATGATCCTGGTTGGTTGAAGACGTTTGAGAACTACTACAAGACTCAAACAAGAGCTATCTTCACTAACATGGTTGAAAAGCTGAATGAAGGGGTCGGAAGGAAGTTCATATGGGCGGAAGTGAGCTACCTCGCGCTCTGGTGGGCGAGTGACGCTACTGACAAAGAGAAACTTGCTTTCCAGAAATTACTTAAATCTGGACAGATTGAGATCGTGACAGGAGGATGGGTTATGAACGACGAGGCCAACTCACACTGGTTGTCGATAGTACAGCAACTGACTACGGGTCACCAGTGGCTGATGGATAACGTGGGATATATACCGAAAAACCATTGGTCGATTGACCCCTTCGGTTACTCTAGTACCCAGCCGTACCTGTTAAAACTGTCCGGTCTAGAAAACTCTGTGATACAAAGAGTACATTACAGGGTCAAGAAGGAACTCGCCATGAATAGACAATTGGAGTTTAAGTGGAGGCAGTTATGGGACGGTGTTGGTAAGACTGATATGTTCACTCACATGATGCCGTTCTACTCATACGACATCCCACACACATGCGGCCCTGATCCTAAGATATGTTGCCAGTTCGACTTTAAAAGGTTGCCAGGTAATGGCGTAACCTGTCCCTGGGGAATACCGCCGAGAAAGATTATACAGAAGAATGTTAACGAACGGTCGTCCATAATCCTGGACCAATGGCGGAAGAAGGCTCAGCTGTATAGAAGCAACGTCCTGCTGGTGCCGCTGGGAGATGACTTCAGATATGACCGCGCCAATGAATGGGACAACCAGTACTCCAACTACGACATGATCATCAGCCATATCAATGAGAACGACTCCTGGAATGCTGAGGTACAATTCGGCACTCTATCAGACTACTTCAAGGCGTTGCACGAGGAGGTGAAGCTGTCAGACTTCCCGGTGCTGTCCGGAGACTTCTTCACGTACGCCGACCGCAACCAGCACTATTGGAGCGGATACTACACCTCGCGACCCTTCTACAAGAGAATGGATAGAGTGCTACTGGCCTACGTCAGGGCGGCGGAAACAATCAGCATGCAGGTGTTCCTGTCGTCATCGACCAGGCAGCTGGTGTCTCTGCAGCTGGAGGAGCGCGTGGACGCCGCACGGAGAGCCCTGGCGCTGTTCCAACATCACGACGGAGTCACCGGCACAGAGAGGGACGAGGTGCGAGAGGACTACGCCAAGAAATTGTTACAAGCTATAAAGTACTGTCAGTCGGCGATCCAGCAGTCCGCCTACCACCTGCTGCGGGAGCCGGTCCTCAAGGACCAGAAACAGGAGGATGTTTATTTCGACGTGGACGACATCTGGCGGAGACATGACGAGATACCTTCTAGAATAACCATCACCTTGGACGCGATGTTCCCTTCTAGAAGAATAGTGCTCTACAACGCGTTACCGTTCAGAAGATACGAAGTCCTAACGCTGATCGTCTCGTCACCACATGTTGAGGTGTTCGACCAGGAAGGTTCGCCCCTGATGTCGCAGGTGTCCCCGGTGGTGGCGGGAGAGCGTCGCCTGGGCTTCGCTGCCAACAAGTTCCAGCTGTCGTTCCCCGTCAGCGTGGGCTCCCTGGGCCTGGCCGTCTACAGCGTGGCCTTGAGAGACGCCGCCTCCATCAACAAATACACGTCGTACTCCCACGTGCGTATCTACAACGCGGACTACTGGTCGGTGGACCTGCCGAGGATGTTCGCGGTGGAGCAGCCGGCCGGTCGCCTGGCTGATGATGTCACACTCCGTGCCAACAACACGCGCCTCGTGGTCACCAAGGACGGACTACTGAAGGCTCTGGTGGGGCCGAACGGACGGACCACGCCCATACACATGGACTTCGTACAATACGACACCCAGAAAACACCTGATAACAACAGCGGCGCGTACCTGTTCATGCCGGCGGGACCCGCGACGGACCTCAACACGGACCCCTACCCTGAGATAGTGATCATAGAGGGACCCTACAAGGCCACCGTGTACACGGGACTCGTGGGTCCAAAGGAGGCCGAGATAGTCCTGGCTATGTCAGTGTACACCAACCCGTCGTTGGGTCACAGCGAGGTGGAGCTGGACAACACCTTCCAGCTGGACCAGGCGGTCGACGACCTGGAACTGGCGATAAGACTCTCCACTAACATTAAGAACGGAGACACCTTCTACACGGACCTCAACGGCATGCAGATGATACGGCGACGGTACTTCGATAAACTTCCCCTGCAAGCGAACTTCTACCCTCTGCCGGCGGCGGCCTACATCGAGGACGCTGCGACCAGGCTGACAGTAGTGACGTCCACACCGCTCGGCACCGCCGCCCTGCAACCCGGACAGATAGAGATAATGCAGGACCGTCGGCTGAGTCGCGACGACAACCGCGGCGTGAACCAGGGCGTCCTCGACAACGTGCGCACAAGACACGTGTTCAGGGTCATCGTAGAACACTCGCAACCCAACTGCCAGTCTACAGCCGCGGACCGTACTAGCGGGCACCTGTCTCTGGGCGCGGCGGTGTCTCAGCGGACGCTACAGCAGCCCCTGGTGGTGATGCAGTTCACATCCGAGGAGGTTCCCCCCACCGCAGCGCCCCCGCACGGCGCCGCCGACGTCGAACTGGCCTCTATACGACCAGCTCGAGGGATGAAGGACGGCTCTAAACTGCAAGGTGTTGGGGCGACGTTCAGACGACTACATTTTGACAGCTGCTATGGAAATGACATCGTCAGCAAATGGTACCGCGTAGGAGACGGACAGATCTCACTCAACGATATGTTTGAAGTCCAACCTGACAAAGTGTTCGAAAGTTCACTGACCTTCAACACCATAGGGAATCCCATCCCCGACGGCATCCTGACGTTATGTCCCATGGAAGTGAGATCGGTATTCATCAATCAGACCATAAAACACGGGTGA

Protein sequence:

>DPOGS214574-PA
MRIRRLSYFFWATCIVAFIFILYVVTDLSFKLPSIKPAMVELDDNKWTAFESKLQKIEKELDQHHAVVGEIKNAMDQIVEQSQEFYPPQKRPKERKESKIREFKVVGDEPPRMHKKINMSCPAIQTSAKSDIQMLSMYDRIMFDNVDGGVWKQGWNIEYKDNQWSSKNKLKVFIVPHSHNDPGWLKTFENYYKTQTRAIFTNMVEKLNEGVGRKFIWAEVSYLALWWASDATDKEKLAFQKLLKSGQIEIVTGGWVMNDEANSHWLSIVQQLTTGHQWLMDNVGYIPKNHWSIDPFGYSSTQPYLLKLSGLENSVIQRVHYRVKKELAMNRQLEFKWRQLWDGVGKTDMFTHMMPFYSYDIPHTCGPDPKICCQFDFKRLPGNGVTCPWGIPPRKIIQKNVNERSSIILDQWRKKAQLYRSNVLLVPLGDDFRYDRANEWDNQYSNYDMIISHINENDSWNAEVQFGTLSDYFKALHEEVKLSDFPVLSGDFFTYADRNQHYWSGYYTSRPFYKRMDRVLLAYVRAAETISMQVFLSSSTRQLVSLQLEERVDAARRALALFQHHDGVTGTERDEVREDYAKKLLQAIKYCQSAIQQSAYHLLREPVLKDQKQEDVYFDVDDIWRRHDEIPSRITITLDAMFPSRRIVLYNALPFRRYEVLTLIVSSPHVEVFDQEGSPLMSQVSPVVAGERRLGFAANKFQLSFPVSVGSLGLAVYSVALRDAASINKYTSYSHVRIYNADYWSVDLPRMFAVEQPAGRLADDVTLRANNTRLVVTKDGLLKALVGPNGRTTPIHMDFVQYDTQKTPDNNSGAYLFMPAGPATDLNTDPYPEIVIIEGPYKATVYTGLVGPKEAEIVLAMSVYTNPSLGHSEVELDNTFQLDQAVDDLELAIRLSTNIKNGDTFYTDLNGMQMIRRRYFDKLPLQANFYPLPAAAYIEDAATRLTVVTSTPLGTAALQPGQIEIMQDRRLSRDDNRGVNQGVLDNVRTRHVFRVIVEHSQPNCQSTAADRTSGHLSLGAAVSQRTLQQPLVVMQFTSEEVPPTAAPPHGAADVELASIRPARGMKDGSKLQGVGATFRRLHFDSCYGNDIVSKWYRVGDGQISLNDMFEVQPDKVFESSLTFNTIGNPIPDGILTLCPMEVRSVFINQTIKHG-