Monarch geneset OGS2.0

DPOGS212963
TranscriptDPOGS212963-TA1437 bp
ProteinDPOGS212963-PA478 aa
Genomic positionDPSCF300057 + 286479-293872
RNAseq coverage2913x (Rank: top 4%)
Annotation
HeliconiusHMEL0032700.071.93% 
BombyxBGIBMGA011607-TA2e-17757.31% 
DrosophilaGNBP3-PA3e-8839.38% 
EBI UniRef50UniRef50_Q9NL899e-15355.00%Beta-1,3-glucan-binding protein n=13 Tax=Obtectomera RepID=BGBP_BOMMO
NCBI RefSeqNP_001128672.10.061.32%beta-1,3-glucan recognition protein 3 [Bombyx mori]
NCBI nr blastpgi|527827400.061.33%beta-1,3-glucan recognition protein [Plodia interpunctella]
NCBI nr blastxgi|527827400.062.90%beta-1,3-glucan recognition protein [Plodia interpunctella]
Group
Gene OntologyGO:00045538.9e-11hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059758.9e-11carbohydrate metabolic process
KEGG pathwaymmr:Mmar10_02475e-13 
 K01199 (E3.2.1.39)maps-> Starch and sucrose metabolism
InterPro domain[150-478] IPR0089851.8e-43Concanavalin A-like lectin/glucanase
[165-477] IPR0133201.8e-31Concanavalin A-like lectin/glucanase, subgroup
[265-412] IPR0007578.9e-11Glycoside hydrolase, family 16
Orthology groupMCL25872 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212963-TA
ATGGAGAAATTATATTATGTTGCTGTTTTAGCTGCGTTCTGCCTCGTCCAGGCTAAGGGATACAAGGTTCCTGACGCCAAGCTAGAAGCCATTTATCCCAAAGGACTGAGAGTATCAGTACCGGACGATGGCTTCACCTTAATGGCGTTCCATGGTAATCTAAATCAGGAGATGGAAGGTTTGGAAGCTGGCCAGTGGGCTCGTGATATCACCAAGCCCAAAAACGGCCAGTGGATCTTCAGAGACAGAAATGCACAGTTGAAAATTGGGGACAAGATTTACTTTTGGACTTACGTCATCAAAAATGGCTTAGGCTACAGACAGGATAATGGAGAATGGACTGTTACTCATTTCGTAACTGAAGAGGGAGTACCGATCAACGAAACATATCCAGACCCTATGCCAATCCCAATTCCTGACAGCAGCACTTCATCACCACCACCGATCGCTTGTGAGGTTTCTCCAACAGTCGTTCAGGGTTATCGTTCCCTGTGCAAGGGAGTGCTAATCTTTACTGAAGATTTCAAAAAACCGAATATCAAGAGCCTCACCTCATGGGATCCAGAGATTATGTTCCCTCAAGAACCGGATTATCCCTTCAACGCATATATCACGGATGCTGTTAGCATAGACAACGGTGCCTTAAAGATTAAACCTGAATTGACTGAATCTCGCTTCCACGAAGGCTACCTCAACGAACAATGGGACCTATCCAACATTTGCACGGGACAAGTAGGAACGAGAGAGTGCAGTCTCGTAGCAAATGGAGCCCAAATCCTTCCCCCTGTCATAAGTGGAAAAATTACAACACGCAACAAATTCAACTTCAAGTACGGTAGGATAGAAGTACGCGCAAAATTACCAGCAGGAAGTTGGCTTTTACCAGAAATATTGTTGGAGCCACGTGATAACGTATATGGGAAGCAGCGGTATGAATCTGGTATTATGAAAATTGCGTTTGTGAAAGGGAACGCAGCATTCGCAAAGAAGTTATACGGAGGTCCAGTTCTATCAGACACAGAACCTTACAGGACATTCCTCCTCAAAGAGAAACTTGGTTTCGATAATTTCAATAAAGATTTCCATAACTATACTCTGGTTTGGAAGCCTGATGGCCTCCAGATGTATGTAGATGGAGAACAGTATGGTGACGTAACTCCTGGAGAAGGTTTCTACTTCAGTGCCAGAGACCACGCGGTTCCTCACGCATCGATGTGGCTTAAGGGATCAATCATGGCACCTCTGGATCAATTATTCTACATATCATTGGGGGTACGCGTTGGTGGTATCCATGATTTCGATGATGCGAGAGACAAACCCTGGATAAACAGACACACTAAGGCTCTCTACAAATTCTGGGAACACAAAGACTTATGGTACCCTACGTGGTACTCACCAGAACTGATTGTAGAATCAGTTAAAGTTTATGCTCTTTAA

Protein sequence:

>DPOGS212963-PA
MEKLYYVAVLAAFCLVQAKGYKVPDAKLEAIYPKGLRVSVPDDGFTLMAFHGNLNQEMEGLEAGQWARDITKPKNGQWIFRDRNAQLKIGDKIYFWTYVIKNGLGYRQDNGEWTVTHFVTEEGVPINETYPDPMPIPIPDSSTSSPPPIACEVSPTVVQGYRSLCKGVLIFTEDFKKPNIKSLTSWDPEIMFPQEPDYPFNAYITDAVSIDNGALKIKPELTESRFHEGYLNEQWDLSNICTGQVGTRECSLVANGAQILPPVISGKITTRNKFNFKYGRIEVRAKLPAGSWLLPEILLEPRDNVYGKQRYESGIMKIAFVKGNAAFAKKLYGGPVLSDTEPYRTFLLKEKLGFDNFNKDFHNYTLVWKPDGLQMYVDGEQYGDVTPGEGFYFSARDHAVPHASMWLKGSIMAPLDQLFYISLGVRVGGIHDFDDARDKPWINRHTKALYKFWEHKDLWYPTWYSPELIVESVKVYAL-