Monarch geneset OGS2.0

DPOGS212946
TranscriptDPOGS212946-TA1029 bp
ProteinDPOGS212946-PA342 aa
Genomic positionDPSCF300057 - 100631-108373
RNAseq coverage839x (Rank: top 15%)
Annotation
HeliconiusHMEL0128806e-2646.36% 
BombyxBGIBMGA014566-TA7e-14079.65% 
DrosophilaCG6206-PA4e-11558.20% 
EBI UniRef50UniRef50_B3N5F12e-12060.49%GG23643 n=3 Tax=Schizophora RepID=B3N5F1_DROER
NCBI RefSeqXP_002036526.12e-12157.83%GM18459 [Drosophila sechellia]
NCBI nr blastpgi|1953398434e-12057.83%GM18459 [Drosophila sechellia]
NCBI nr blastxgi|1948623044e-11860.49%GG23643 [Drosophila erecta]
Group
Gene OntologyGO:00059751.4e-105carbohydrate metabolic process
GO:00045591.4e-105alpha-mannosidase activity
GO:00038244.6e-100catalytic activity
KEGG pathwaydse:Dsec_GM184596e-121 
 K12311 (MAN2B1, LAMAN)maps-> Lysosome
    Other glycan degradation
InterPro domain[18-308] IPR0006021.4e-105Glycoside hydrolase, family 38, core
[21-311] IPR0113304.6e-100Glycoside hydrolase/deacetylase, beta/alpha-barrel
Orthology groupMCL10107 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212946-TA
ATGCTGCTCCTTTTCTTGCTGATAGGTCTCTCCTCTGGAGTGCCGCTTGACGAGAAATGCGGTTACGATAGTTGCACGGCCGCGGAGGCCGGGGTGCTGAACGTTCACATAGTACCGCACACACACGACGACGTCGGCTGGCTCAAGACCGTGGACCAATACTACTATGGAAGCAAGAACGACATTCAGAAAGCTGGCGTCCAGTATATCTTGGACTCTGTCATTAAAGAACTGTGGCAGGATCCTAAACGAAGGTTTATATATGTGGAGACAGCATTCTTTTGGAAGTGGTGGACTCTTCAGAGTGACGACACTCGCCACAAGGTCCACACCCTGGTCCGCCAGGGCCGGCTGCAGTTTGTAGGGGGAGCCTGGAGTATGAACGACGAAGCCGCCTCGCACTATCAGAGCACCATCGACCAGTTTACTTGGGGCCTCAAGAAGCTGAACGATACGTTCGGCGCGTGTGGGATCCCACGCGTGGGATGGCAGATAGATCCGTTCGGCCACTCCAGGGAGTTCGCGTCTCTTTTATCCCAGATGGGATACGACGGCTTGTTCCTGGGACGGATCGACTACCAGGATAAAGGATCTCGTCTGAGAGACAAGCGGATGGAGATGGTGTGGAGGGGAGACGACCAGCTCGGTAACTCGTCGGACATATTCACGGGTGTTCTGTTCAATACCTACTCGCCGCCGGCCGGCTTCTGTTTCGACGTGCTCTGCAACGACGAGCCGATCATAGATGACGTGAACTCGCCCTTATATAACGTCGAAGATAGAATGGCAGCGTTCATAAGGAAAGTGTACGGTATGTCGGAATCGTATAAAAGCAACAACATCTTGGTGACTATGGGAGACGATTTCCAGTACCAAGACGCTAACATGTGGTTCAGTAACCTCGATAAACTGATAGCAAATATGTCGGAAGCGTACGGAACCGATAACGTCTTGGTGACGATGGGAGAGGACTTCCAGTACCAGGACGCCAGCATGTGGTTCATTAATTTAGACAAACTCATACAGTGA

Protein sequence:

>DPOGS212946-PA
MLLLFLLIGLSSGVPLDEKCGYDSCTAAEAGVLNVHIVPHTHDDVGWLKTVDQYYYGSKNDIQKAGVQYILDSVIKELWQDPKRRFIYVETAFFWKWWTLQSDDTRHKVHTLVRQGRLQFVGGAWSMNDEAASHYQSTIDQFTWGLKKLNDTFGACGIPRVGWQIDPFGHSREFASLLSQMGYDGLFLGRIDYQDKGSRLRDKRMEMVWRGDDQLGNSSDIFTGVLFNTYSPPAGFCFDVLCNDEPIIDDVNSPLYNVEDRMAAFIRKVYGMSESYKSNNILVTMGDDFQYQDANMWFSNLDKLIANMSEAYGTDNVLVTMGEDFQYQDASMWFINLDKLIQ-