Monarch geneset OGS2.0

DPOGS205012
TranscriptDPOGS205012-TA1869 bp
ProteinDPOGS205012-PA622 aa
Genomic positionDPSCF300123 + 401742-414069
RNAseq coverage246x (Rank: top 42%)
Annotation
HeliconiusHMEL0146322e-10078.32% 
BombyxBGIBMGA010240-TA0.079.56% 
DrosophilaCht5-PA0.059.48% 
EBI UniRef50UniRef50_P363620.078.84%Endochitinase n=46 Tax=Endopterygota RepID=CHIT_MANSE
NCBI RefSeqNP_001037480.10.079.42%chitinase isoform 2 [Bombyx mori]
NCBI nr blastpgi|99716090.079.64%endchitinase [Spodoptera litura]
NCBI nr blastxgi|101197840.080.26%chitinase precursor [Bombyx mori]
Group
Gene OntologyGO:00060322.4e-149chitin catabolic process
GO:00045682.4e-149chitinase activity
GO:00038241.3e-109catalytic activity
GO:00431691.3e-109cation binding
GO:00059751.3e-109carbohydrate metabolic process
GO:00045534.8e-108hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00060301.3e-15chitin metabolic process
GO:00080611.3e-15chitin binding
GO:00055761.3e-15extracellular region
KEGG pathwayxtr:5489452e-87 
 K01183 (E3.2.1.14)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[100-452] IPR0115832.4e-149Chitinase II
[431-470] IPR0137811.3e-109Glycoside hydrolase, subgroup, catalytic core
[101-452] IPR0012234.8e-108Glycoside hydrolase, family 18, catalytic domain
[100-478] IPR0178535.1e-98Glycoside hydrolase, superfamily
[566-623] IPR0025571.3e-15Chitin binding domain
Orthology groupMCL15937 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205012-TA
ATGTTCAAACAACAAAATAACCCACAATCCAAGCACCCTTTGCATATCAAGTGTGCGCACGCGCATATAACCGCGGTGCAGTGCGGTTCGGACGCAATCGCACGCACGCCGCGGCAAAGACACACATGTCTTCTTCGTCATCCGAAGGACTCTCTCAAATCCAATAATCTTTACTCGACAAAGGTTTTACTTTCGGGCGCTAACCTCCGCCACAAAGGATCGCAAGTCAGAATGCGAGTGCTACTATTCACGTTGGCCGTCCTGGCTGTTTTCACAGCAGTTAAATCGGACAGCAAAGCGCGTGTAGTATGCTATTTCAGCAATTGGGCGGTGTACCGACCAGGTATTGGACGTTATGGAATCGAAGATATCCCAGTCCACATGTGCACCCACCTTATATACTCATTTATAGGAGTCACTAAGAAATCTAACGAAGTACTCGTCATTGATCCTGAGTTGGATATCGATAAAAATGGTTTCCGTAATTTCACGTCATTAAAGAAATCTAATCCAGACGTTAAGTTCATGGTGGCCGTGGGTGGCTGGGCGGAAGGTGGTTCCAAGTACTCCCACATGGTTGCTCAGAAGACCTCTAGAATGACATTTGTTAGGAGCGTCGTCGATTTCTTGAAGAAATATGACTTTGATGGCCTTGATCTCGACTGGGAGTACCCCGGGGCAGCCGACCGTGGTGGTTCATTCTCAGACAAAGACCGCTTCTTGTTTTTGGTACAGGAATTGAGGAGAGCCTTCATCAGAGCTGGCAAGGGTTGGGAACTGACTGCTGCTGTTCCACTCGCAAACTTCAGACTTATGGAAGGTTATCATGTACCGGACTTGTGCCAGGAATTGGACGCTATACATGTAATGTCCTACGATTTGCGTGGGAATTGGGCCGGCTTCGCAGATGTGCACTCGCCTTTATACAAGCGTCCACATGACCAGTGGGCGTATGAAAAACTTAATGTTAACGATGGTCTTAATCTATGGGAAGAGAAGGGCTGTCCATCTAACAAATTGGTAGTAGGAATCCCCTTCTACGGTCGCTCTTTCACGTTGTCCGCTGGGAACAATAACTACGGTCTGGGAACATACATCAACAAGGAAGCTGGAGGTGGCAACCCTGCACCATATACGAACGCCACTGGATTCTGGTCGTATTATGAGATTTGTATGGATGTGGACAAGCCAGGTTCAGGGTGGACCAAGAAATGGGATGACCATGGAAAATGTCCCTACGCCTATAAAGGAACCCAATGGGTTGGTTATGAGGACCCTGAGAGTGTGGAGATTAAGATGAAGTGGATTAAAGAAAAAGGCTATTTGGGAGCCATGACTTGGGCCATCGATATGGATGACTTCAGAGGAATCTGTGGCGAAAAGAATCCGTTGATGAACTTGCTGTACAAATATATGAAATCGTACAGAGTACCGCCTCCGCGTACTGGAAACACCACACCTACTCCTGAATGGGCAAGACCGCCTTCCACCCCATCCGATGCTTCTGAAGGCGCTCCCATCCCCACAACGACTCCGGCCGCCACGCAGGCCCCCACAAAGAAACCATCTGTCACAAGTACTACAAAGAAGCCGTTGACTACAACCACACAGGCTTCAAACTCCGGCGATGAGTCAGAGTTACCAGATAGACCCAGTGACGTAGAGCAACCAGTGGAGAACGAGATAGACAACCCTGAAATATGCAGCTCCCAGGATGACTACGTCCCTGACAGGAAGCATTGCGATAAGTACTGGAGATGCGTTAACGGCCAAGGAGTACAGTTCACGTGCCAGCCAGGTACCGTGTTCAATTTCAATCTGAATGTCTGCGACTGGCCCGGGAACGCTGACCGCGACGAATGCCTTTAG

Protein sequence:

>DPOGS205012-PA
MFKQQNNPQSKHPLHIKCAHAHITAVQCGSDAIARTPRQRHTCLLRHPKDSLKSNNLYSTKVLLSGANLRHKGSQVRMRVLLFTLAVLAVFTAVKSDSKARVVCYFSNWAVYRPGIGRYGIEDIPVHMCTHLIYSFIGVTKKSNEVLVIDPELDIDKNGFRNFTSLKKSNPDVKFMVAVGGWAEGGSKYSHMVAQKTSRMTFVRSVVDFLKKYDFDGLDLDWEYPGAADRGGSFSDKDRFLFLVQELRRAFIRAGKGWELTAAVPLANFRLMEGYHVPDLCQELDAIHVMSYDLRGNWAGFADVHSPLYKRPHDQWAYEKLNVNDGLNLWEEKGCPSNKLVVGIPFYGRSFTLSAGNNNYGLGTYINKEAGGGNPAPYTNATGFWSYYEICMDVDKPGSGWTKKWDDHGKCPYAYKGTQWVGYEDPESVEIKMKWIKEKGYLGAMTWAIDMDDFRGICGEKNPLMNLLYKYMKSYRVPPPRTGNTTPTPEWARPPSTPSDASEGAPIPTTTPAATQAPTKKPSVTSTTKKPLTTTTQASNSGDESELPDRPSDVEQPVENEIDNPEICSSQDDYVPDRKHCDKYWRCVNGQGVQFTCQPGTVFNFNLNVCDWPGNADRDECL-