Monarch geneset OGS2.0

DPOGS215599
TranscriptDPOGS215599-TA1014 bp
ProteinDPOGS215599-PA337 aa
Genomic positionDPSCF300097 + 296136-299046
RNAseq coverage622x (Rank: top 21%)
Annotation
HeliconiusHMEL0169182e-17484.87% 
BombyxBGIBMGA000353-TA3e-16881.90% 
DrosophilaGNBP1-PA6e-3332.08% 
EBI UniRef50UniRef50_Q17E764e-12160.77%Gram-negative bacteria binding protein n=174 Tax=Pancrustacea RepID=Q17E76_AEDAE
NCBI RefSeqNP_001159614.12e-16681.90%beta-1,3-glucan recognition protein 4 [Bombyx mori]
NCBI nr blastpgi|3598014662e-17285.16%beta-1,3-glucanase [Heliconius melpomene cythera]
NCBI nr blastxgi|3598014660.085.16%beta-1,3-glucanase [Heliconius melpomene cythera]
Group
Gene OntologyGO:00045532.9e-21hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059752.9e-21carbohydrate metabolic process
KEGG pathwaysde:Sde_31211e-27 
 K01199 (E3.2.1.39)maps-> Starch and sucrose metabolism
InterPro domain[324-336] IPR0133201e-71Concanavalin A-like lectin/glucanase, subgroup
[1-337] IPR0089852.7e-64Concanavalin A-like lectin/glucanase
[100-236] IPR0007572.9e-21Glycoside hydrolase, family 16
Orthology groupMCL18477 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215599-TA
ATGATATTCGCGGACGACTTCGAAGAATTTGACCTTGAGAAATGGCAACACGAAAACACTTTGGCTGGTGGTGGTAACTGGGAGTTCCAATACTACAATAACAATAGAACTAATTCGTTCACTCAAGACGGCAAGTTGTTTATTCGTCCGTCCCTGACTTCTGACCAATTTGGAGAAAACTTCCTACACACTGCCACTCTCAATATCGAAGGTGGAGCACCAGCTGATAGATGCACCAACCCACAATGGTATGGTTGTGAACGTACTGGAACTCCAAACAATATTATAAACCCCATAAAAAGCGCTCGCATCCGAACAGTTAACTCCTTCAGCTTCCGTTATGGAAGAGTGGAGGTGCGCGCTAAAATGCCTGCAGGAGACTGGTTATGGCCTGCTATTTGGTTAATGCCAGCGTACAATTCATATGGCACGTGGCCAGCGTCTGGAGAAATTGATCTTGTGGAATCCCGCGGTAACCGTAACATGATTCAGAACGGACTTCACATTGGAACACAAGAAGCTGGGTCTACTTTACATTTCGGACCGTACCCTGATTGGAACGGTTGGGAAACAGCTCACTGGGTTAGACGTAACCAAGGAGGATATGATTACGACTTTCATCGCTACCAGCTCGAGTGGACTCCAGATTACATAAAATTCAGCCTTGACGATGTTGAGCTAGGGCGAGTAACGCCTGGAAACAAGGGCTTTTGGGAATACGGTGGATTTAGCAAGAATCCCAACGTACTTAATCCCTGGCGATACGGTTCGAAAATGGCGCCATTCGACCAAAAGTTTTACATAATCATCAACTTGGCTGTCGGTGGTACCAATGGATTCTTTCCCGACAACGTATTTAATCCTACACCAAAACCCTGGTCCAACAATTCTCCTCGTGCCGCAACTGACTTCTGGAATGGCCGCAATTCCTGGCTGCCAACCTGGATGCTCGATCAAAACGAGGGACATTCGGCTTCTTTACAAGTGGATTATGTGAGAGTCTGGGCATTGTAG

Protein sequence:

>DPOGS215599-PA
MIFADDFEEFDLEKWQHENTLAGGGNWEFQYYNNNRTNSFTQDGKLFIRPSLTSDQFGENFLHTATLNIEGGAPADRCTNPQWYGCERTGTPNNIINPIKSARIRTVNSFSFRYGRVEVRAKMPAGDWLWPAIWLMPAYNSYGTWPASGEIDLVESRGNRNMIQNGLHIGTQEAGSTLHFGPYPDWNGWETAHWVRRNQGGYDYDFHRYQLEWTPDYIKFSLDDVELGRVTPGNKGFWEYGGFSKNPNVLNPWRYGSKMAPFDQKFYIIINLAVGGTNGFFPDNVFNPTPKPWSNNSPRAATDFWNGRNSWLPTWMLDQNEGHSASLQVDYVRVWAL-