Monarch geneset OGS2.0

DPOGS206219
Transcript        DPOGS206219-TA   1320 bp
Protein           DPOGS206219-PA   439 aa
Genomic position  DPSCF300334 - 202383-207829
RNAseq coverage   1523x (Rank: top 8%)
Annotation
Heliconius        HMEL011224         7e-134   54.03%
Bombyx            BGIBMGA009695-TA   9e-129   53.81%
Drosophila        Cht2-PA            4e-89    40.89%
EBI UniRef50      UniRef50_D6WQ48    5e-96    47.24%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQ48_TRICA
NCBI RefSeq       XP_970191.2        1e-96    47.24%   PREDICTED: similar to AGAP005634-PA [Tribolium castaneum]
NCBI nr blastp    gi|189239365       2e-95    47.24%   PREDICTED: similar to AGAP005634-PA [Tribolium castaneum]
NCBI nr blastx    gi|189239365       1e-94    47.24%   PREDICTED: similar to AGAP005634-PA [Tribolium castaneum]
Group
Gene Ontology     GO:0006032   7.5e-109   chitin catabolic process
                  GO:0004568   7.5e-109   chitinase activity
                  GO:0043169   5.7e-89    cation binding
                  GO:0005975   5.7e-89    carbohydrate metabolic process
                  GO:0003824   5.7e-89    catalytic activity
                  GO:0004553   1.1e-87    hydrolase activity, hydrolyzing O-glycosyl compounds
KEGG pathway      aga:AgaP_AGAP005634   3e-86
                  K01183 (E3.2.1.14) maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain   [31-378]    IPR011583   7.5e-109   Chitinase II
                  [357-385]   IPR013781   5.7e-89    Glycoside hydrolase, subgroup, catalytic core
                  [31-385]    IPR017853   2.2e-88    Glycoside hydrolase, superfamily
                  [31-378]    IPR001223   1.1e-87    Glycoside hydrolase, family 18, catalytic domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206219-TA
ATGTCACTGTCATGGAAGGGAGTGCTTTTCTTCATAGCATCATGTTCAGCTATCGCCGCGGGACAGGGTGAGAGCGGCCCAAAGCACGGAAAGAATATCGTATGTTTTCTGGCCTCCTGGTCTCATTATAGGCTGGACCCGATCAAGTTCCACTTATCCGACTTGGACCCATCGTTATGTACTCACCTAGTTTACTCGTTCGCTGTTCTCGATGAGAAAACAAATGAAATTAAAAGTTCAGACGTAGGTCTGGATGTAGAGAATAAAGATACGACAGTTGGTTACAAAGGGTTTGTTGATCTCAAGAAAAAGAATCCTCATCTCAAAGTGACCTTGTGTATCGGAGGCTGGAATGAGGGGTCCCAAAAATTTTCCCTAATGGCTAAAAGTCCACATTCAAGAAAACAGTTTATACAGAGTGTTATCAAGTTTTTACAAACGTATAATTTCGATGGTCTCGACATTATGTGGAAGTATCCGACGACGAGGGGCGGCGACAAACAGGACAAAGATAACTTCGTTATTTTAGTTAAGGAACTAAAGGAGGCTTTCAGTCCACACGACTTTATTTTAACAGCATCTTTGTCTGGGATTAAACATGTGATGGAACCAGCATACGACTTGGTCCAGTTGAACAAATACTTAGACATGATCCATGTTTTGGGCTACGACTACCATGGTCCGTGGAATGGCATACTTGGGGCAAATTCACCGCTGTCTTCCACCTCTCAAGATAACTTCCGTAGTGTGGAATACACTATAAGATACATGATAGCTTTTGGCGTGAGTCCTGAGAAAATAAACCTAGAGTTGTCCCTGTTTGGAAGAACTCTCCTCTTAAGCAACCCAAAAGAGGAACGAGTGAAATTCGGTCAAACGAAAGTTCAGGGCGTCGGATTCCCTGGACCAATAATAAAAGGAATACATTACTACGCATACAATGAGATCTGTATGGAACTGACCAACAAATCCATACCCTGGGATTATCATTGGGATGAAGAGTCTTCCACACCATATCTCCGGGATAAAGATCGTATCATATCGTACGACAACCCTCGCTCCATAGCAAATAAAGTTAAACTAGCTATAGACTATAACCTGGGAGGCTTCATGGTGTGGAGCGTGGACACGGACGATTTCAAAGGTCTTTGCGATCTTCGCAATGATACTTACGATGACTACGAGGCGAGGATTAACAGGATATCAGACGATCCCATGCTACAGGACGCCATAGACAAACTAGATCTGTCAGACTTGGTTTTAAACGACGGTTATTACATTAAGATGGATAAGAAATTGAGCTTCATCAAACTAGACTAG

Protein sequence:

>DPOGS206219-PA
MSLSWKGVLFFIASCSAIAAGQGESGPKHGKNIVCFLASWSHYRLDPIKFHLSDLDPSLCTHLVYSFAVLDEKTNEIKSSDVGLDVENKDTTVGYKGFVDLKKKNPHLKVTLCIGGWNEGSQKFSLMAKSPHSRKQFIQSVIKFLQTYNFDGLDIMWKYPTTRGGDKQDKDNFVILVKELKEAFSPHDFILTASLSGIKHVMEPAYDLVQLNKYLDMIHVLGYDYHGPWNGILGANSPLSSTSQDNFRSVEYTIRYMIAFGVSPEKINLELSLFGRTLLLSNPKEERVKFGQTKVQGVGFPGPIIKGIHYYAYNEICMELTNKSIPWDYHWDEESSTPYLRDKDRIISYDNPRSIANKVKLAIDYNLGGFMVWSVDTDDFKGLCDLRNDTYDDYEARINRISDDPMLQDAIDKLDLSDLVLNDGYYIKMDKKLSFIKLD-
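
The transcript and protein entries above are related by translation: 1320 bp corresponds to 439 codons plus one stop codon (440 x 3 = 1320). The sketch below shows one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1) applies and that the transcript entry is a complete coding sequence; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database. The stop codon is written as "-" to match the convention used in the protein entry.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table applies to the nuclear gene in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Enumerate all 64 codons in the same order as AMINO_ACIDS.
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are emitted as `stop_symbol` rather than ending translation,
    so a complete CDS yields a protein string with a trailing stop symbol.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(protein)


# First five codons and last four codons of DPOGS206219-TA.
print(translate("ATGTCACTGTCATGG"))  # MSLSW
print(translate("AAACTAGACTAG"))     # KLD-
```

Passing the full DPOGS206219-TA sequence to `translate` and comparing the result with the DPOGS206219-PA entry would confirm whether the two records agree over their whole length.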