Monarch geneset OGS2.0

DPOGS200835
TranscriptDPOGS200835-TA5313 bp
ProteinDPOGS200835-PA1770 aa
Genomic positionDPSCF300071 - 300732-351663
RNAseq coverage628x (Rank: top 20%)
Annotation
HeliconiusHMEL0126420.071.78% 
BombyxBGIBMGA009891-TA8e-12055.18% 
DrosophilaCht6-PC0.053.71% 
EBI UniRef50UniRef50_B0WYC90.056.68%Brain chitinase and chia n=3 Tax=Coelomata RepID=B0WYC9_CULQU
NCBI RefSeqXP_001862401.10.056.68%brain chitinase and chia [Culex quinquefasciatus]
NCBI nr blastpgi|1700528330.056.68%brain chitinase and chia [Culex quinquefasciatus]
NCBI nr blastxgi|1571326390.038.84%brain chitinase and chia [Aedes aegypti]
Group
Gene OntologyGO:00060325.3e-143chitin catabolic process
GO:00045685.3e-143chitinase activity
GO:00038241.2e-110catalytic activity
GO:00431691.2e-110cation binding
GO:00059751.2e-110carbohydrate metabolic process
GO:00045533.1e-107hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00060302.2e-16chitin metabolic process
GO:00080612.2e-16chitin binding
GO:00055762.2e-16extracellular region
KEGG pathwaydme:Dmel_CG29890.0 
 K01183 (E3.2.1.14)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[37-415] IPR0115835.3e-143Chitinase II
[394-435] IPR0137811.2e-110Glycoside hydrolase, subgroup, catalytic core
[38-415] IPR0012233.1e-107Glycoside hydrolase, family 18, catalytic domain
[37-442] IPR0178532.3e-102Glycoside hydrolase, superfamily
[510-589] IPR0025572.2e-16Chitin binding domain
Orthology groupMCL19578 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200835-TA
ATGGGAAATATTGACAGAAGAAAGAACTTTATAAAAATGGAACTCGCGACATGGTTGTTTTTGTTAATCGCTGTCGTCGCACTGGCCACACCGGCTCAATCAGCGTCTCCGCGGGTGGTTTGCTACTACACAAACTGGTCTGTGTATCGTCCGGGGACCGCTAAGTTCAACCCCCAGAATATCAACCCCTACCTTTGTACTCACCTGGTCTACGCGTTCGGAGGCTTCACCAAGGACAACACCCTGAAACCTTTTGATAAATATCAGGATATAGAAAAAGGTGGATACGCCAAGTTTAACGGCCTGAAGACGTACAATAAAAACCTCAAGACGTTGCTGGCTATCGGAGGCTGGAACGAGGGATCCTCGCGTTTCTCCCCCATGGTCGCCGCCAAAGATCGGCGAAGGGAGTTCGCGAGGAATGCCATAAAGTTTTTAAGACAAAATCAGTTCGATGGATTGGATCTTGACTGGGAGTACCCCGCCTCCAGAGAGGGAGGGAAACCCAAGGATAGAGAAAACTACGTCAAGTTCGTGAAGGAACTGCGCCAGGAGTTTGAGAAGGAATCGGAAAAGACCAGCAAACCCAGGCTGCTGCTCACGATGGCTGTCCCCGCGGGCATCGAATACATCGAGAAGGGATTTGACATTAAGACTTTGACACGTTACTTGGACTGGATGAACCTTCTCACATACGACTACCACTCCGCGTTCGAGCCGGCGGTGAACCACCACGCTCCACTCTACCCTCTAGAAGAACCCAATGAATACAGCGTTGACAACGAGCTGAATATAGACTACACGATCAAATTCTATCTTGAAAATGGTGCGGACCCGGAGAAGTTGGTGCTGGGTATACCAACGTATGGTCGTTCCTATACTCTGTTCAACGCTGATGCTGTGGAAATAGGCTCACCCGCTGATGGACCCGGGGAACAAGGAGTGGCGACGAGAGAGAAGGGATACTTGGCCTATTATGAGATCTGTGAAGCGCAAATATCCAAGACAAAGAAGCGCGCCATCGCGTCGGACGAGGACTCTGAAGAAGAATCAGAAGAAGAAGATGAAGAAGAAGAAGAAGAGAAATGGACGATCATGTATCCCAACCCTAACGCTATGGGCCCTGTAGCATTCAAAGGGAACCAGTGGGTTGGTTACGATGACGTGGAGATCGTCAAAAAGAAAGCACATTATGTTGTCGAAAACGGGCTCGGAGGTATCATGTTCTGGTCTATAGACAACGATGATTTCCGCGGCGTGTGCAACGGCAAACCCTACCCGCTTATTGAAGCGGCCAAGGAGGCTTACCTCACTAAATTAGAATCTTCCAAAAACTCGGTTAGCAGTCCGAAAGAAAGTTCGAAACCATCGAGGGGTGGGAACCGCAGACGAAACCGTCCGAAGACGACACCCACCACCACCACCACCACCACTACTACCACTCCCAAACCCCCGAAAAGCAACAAGCGGAAGTCCACTAGCTCGGTGAGCACCACACCAGCCTGGAACATTATCACACCTGAACCCCCCACCACTCCCGATCCCGGATCTGACTTCAAATGTACCGACGAGGGTTTCTTCCCCCACCCGCGTGATTGTAAGAAGTATTTCTGGTGCCTGGACTCCGGACCCTCGGACCTTGGGATCGTTGCGCACGCCTTCACCTGCCCCTCCGGTCTATATTTCAACAAGGCCGCCGATTCCTGTGATTTTGCAAGAAATGTGCTCTGCAAGAAATCATCGTCTACCACAAAAGCTGTCACCAAAACCACAACAACAAAAACAACTCCAACAACCACAACAACAACGACCACGACCACCAGACGACCCATCAGACTGACTTCAAGGAGCTCTTTGCTGTTCAGGACTTCAACTACTACAACTACAACCACACCAGAACCTGAGCTCAGTGAAGAAGACGAGGAAGAAGCCGATGACGCCAGTGACGTGGAAGCTGAAGACCCCAAGGTCATCAAGGAACTTATTGACCTCATAAAGAAAGTTGGCGGCGTCGAACAGCTGGAAAAGCAGCTTAAGCTGTCGGAGTCGTCAGGATCTACGGATGGAGTCGCCACAACCACGCCGACATCGTTCAACACTAAGCTGTATCAAAAAGTACTGGAGAGAGCTCGGGGAAAAAACAAAGTTTCTAACCCACCAAATCTTAGGTTCGTCGGAAATAGCATTACTGAAAGCAGTGTGCAAAACAGCCGTCGGGGACCACAGAACGAGGGACTCGAACCAGCTGTTGATAAAGATCGCTTGTTGAGGAGAGATAGGCCACAGTATGTCACCATCAACCGGGCAAGGTCATCCACTACACCAGAATCTCTAGAAAGTGAAGAGGCCGAGGACGAATCGGAGGAAATTCAGGAGACAGTTCAGGAACAGCGATCAGAGGTACCCGCAGCAAGAGTGGCGACGACTCCTAAACCTCTCCAATACGTTAACATCAGACGGACGAGACCGACTACTGCTGCCACAGAGACTCCTGATGACTCCAGGAATGCTCTGTTTGAACGCGAATCGTCCGAGTCAGAGGAGCGTCTAACCGCTGTTGAGGATGCACAGCGCGTGGACCGCGGGGACTCCCGGCGCGACACTCCAGAATACGTCACTATAAGACGCGGCAGACCGACCACTGAGGCCACCACACTACCATATCACAGCGCTGAAGAAGAAGATAAATCGCAGGAAGTTGCTTTAGTGAAAGAGATCACGTCTCAGTCTTCGTCACCACAATATAACTCTATAGTCAGATTTCGATCTACTACACAGTCACCAGCCGAGGAGTTGACTAACCCAGCTCCGACTACAGTCCTTTCTGTTCAGATATCTTCATTATTGAATTCCCCAAGTTCCGATGAAACCGCAAGCCCTCGAACAGATAGTACGACCGCTCATGTAACCGAAGCTTCTGAACCCGAAGTAACGACGGCATCCACTACGGTTGTGACAACCACAACCACACCAGTAACGACCACGACGACTACGACGCCCTTACCTCCAAGCACGACCACACGCCGCAACTTATTAAGACGACGGGGCTCCACAACACCGACCACGCCAACCACGGCCGCGGCAGTTTCGACTACGCAGGCTGCAAAGGAACGCCGAACGTTCCCGCGTCGCACGAAAGCCACCGCGCCGCCGGAAACAACAGGAGAGGTAATAAATTCACAGACTACAACCAGCAAATATCCGAGACGAGGCGAGAACAAATTCAAGATACAAAAAACGGAAAAAGTCGAGAAATCGCGGGAAAGTAACTCGACAGAGAGTCAGCCATCTTTGAACGGCACCGCGGCGAGCAATGACAGGCCCAACCGCAACTTTGTTCGCAGACGCTTTGGAGGGGCTAACACTTCTACGACCCAATCGTCTACTATACAATTATCATCATCTGTAACAAGACGTCCGTTCCGTGTGGCTAACCGTCGCAGACTATTTTCTACCACAACCACCACAACAACAACCTCCCCCAGCACCACAGAGCTGGAGAGTGACGAATCTTTACAGGATATAGGAGACACAGATGCGATCGAAGACCCGTCTCTCCAACCTCAACCCCGAGCGAGGAAAGTCTCTAACGGCCCGCGGAGACGACCATTAGTCCAACTGAAGAACGAAAACGAAGATCAAAACTCTTCCCCCACTAACGAAGACGAGAAGACGAGACAGAGCAAGAAATACAGCGCCAGCTTCAAACAAAACCAGCTCGAGGAAATACTGAAGATACGAGCTAGCGCTGAAGAAATCGACGTAACTACGGAAGGGAGATCCACTCTTGATGATACAAGTGCTGAAACAGCAGTAGCTCTAGCAGCCCACCAGCTCCTATCAGCACCGATACCGATCATCCCCGACTACGATGACGAATCGAAACCAGCGCGGTCTTCTCAAACTATCGTAGACTATAAATTTACAAGCCCAGAGTACAACGACCTCACTAAAACACAAACGTACACAGAAGACTATCAGAGAACTCCATCTTACACTTCGACTGGACAAAGATTTGAGTCAACGACCCCTTACACTCTTCGAACCGAAGGAAATGTTCGATCGACGACAGGATCAGTTAACTCTGAGACTAATATTCCATCTGGGTTCACTACACCTGGCGCATTCACAGGCAGGTTCACGAGCTCAACTACCGGAAATACAATCAACCCGACCTTCTCTGGTATAACTCTCAGCCTCGGTTCCGAAGGCTCTGGCGAGTCGACAGCTCGTTACACCTCAAAGTTCCCCAAAGAATCGAGCCCTACAGCTTACACGATCAGCAACTATGAGACCAGAACCCTGAGGCCCGGTTTATCGACCAACATCGTCAATCCAACTTCGTTAAATTGGAGGGAATCTACAGCTAGAATATACGCGAGCCTGGATCGGAGTGTGCAACCGACATATTCGACTGAATTCACAACAAAAATTTCGAGGCCAGCAGGATTTTCACCCAATTCAGTTAAGATAGATGAAATAGAAAACGATAAGACGACAGAGAAGTATACGGGTTTCATCGAGAGGGGTCCATCTACTGCTAGATACGAGGGTTCTAGCGAAAAGATTTCGGTCCCGGTCGCCGTGGGGTACTCGTCTGGTAGCCAGGGCTTGCAAGAGCCTTCGTATTTCACCAGAGAATACTTATTGGAATCGCCGGTCACTAGAACATACGACGATGAATACCAATATTTGTCCCCGGCGACGACGCCACAACCAACAACCAAGAAACCGCTCAGAAGGAAAACTATCTATCGTAGAATATCATCTACAGCTGCACCAAGCTCACAGATCACTCAAAGCCTGTCATCAATCCGCACGTCACCCACCACATTGACGACACCCACACAACCGATAACACAGACAACGGTCAAACCACGTCGAACCAGCAGGAAACCGTTTCAGAGGATAGCGGTGAAAAAAGGTCCCCTTCAGAAACAACCAGTACAGCCAGAGATCAAGGACTCGGTTCCGAAGGAAGTCCAGAAAACTCTAGTTCTAAAGATCAATAACAACGCCGTCAAGTCGTCGAGGCCACTTTCAGACTACGATTACTACGATGACAGTCACGAAGGTGTGAAATATGAAGATGGATCCAAAGTACTTCTGCACGGAAAAGGCGACATCGAATGTTTGGACATCGGAAACTTCGCGCATCCATCGTCATGTAAGAAGTTCATATCGTGCGCGCGGATGGAGAGCGGCGCGTTAGTGGGCTGGGAGTATATTTGTCCAAAGGGACTGTCCTTCGACCCCGTAGGAGGCATTTGTAATTGGTCCGCCGGGTTAGGTTGTACTGAAAAGGACGCGTGA

Protein sequence:

>DPOGS200835-PA
MGNIDRRKNFIKMELATWLFLLIAVVALATPAQSASPRVVCYYTNWSVYRPGTAKFNPQNINPYLCTHLVYAFGGFTKDNTLKPFDKYQDIEKGGYAKFNGLKTYNKNLKTLLAIGGWNEGSSRFSPMVAAKDRRREFARNAIKFLRQNQFDGLDLDWEYPASREGGKPKDRENYVKFVKELRQEFEKESEKTSKPRLLLTMAVPAGIEYIEKGFDIKTLTRYLDWMNLLTYDYHSAFEPAVNHHAPLYPLEEPNEYSVDNELNIDYTIKFYLENGADPEKLVLGIPTYGRSYTLFNADAVEIGSPADGPGEQGVATREKGYLAYYEICEAQISKTKKRAIASDEDSEEESEEEDEEEEEEKWTIMYPNPNAMGPVAFKGNQWVGYDDVEIVKKKAHYVVENGLGGIMFWSIDNDDFRGVCNGKPYPLIEAAKEAYLTKLESSKNSVSSPKESSKPSRGGNRRRNRPKTTPTTTTTTTTTTPKPPKSNKRKSTSSVSTTPAWNIITPEPPTTPDPGSDFKCTDEGFFPHPRDCKKYFWCLDSGPSDLGIVAHAFTCPSGLYFNKAADSCDFARNVLCKKSSSTTKAVTKTTTTKTTPTTTTTTTTTTRRPIRLTSRSSLLFRTSTTTTTTTPEPELSEEDEEEADDASDVEAEDPKVIKELIDLIKKVGGVEQLEKQLKLSESSGSTDGVATTTPTSFNTKLYQKVLERARGKNKVSNPPNLRFVGNSITESSVQNSRRGPQNEGLEPAVDKDRLLRRDRPQYVTINRARSSTTPESLESEEAEDESEEIQETVQEQRSEVPAARVATTPKPLQYVNIRRTRPTTAATETPDDSRNALFERESSESEERLTAVEDAQRVDRGDSRRDTPEYVTIRRGRPTTEATTLPYHSAEEEDKSQEVALVKEITSQSSSPQYNSIVRFRSTTQSPAEELTNPAPTTVLSVQISSLLNSPSSDETASPRTDSTTAHVTEASEPEVTTASTTVVTTTTTPVTTTTTTTPLPPSTTTRRNLLRRRGSTTPTTPTTAAAVSTTQAAKERRTFPRRTKATAPPETTGEVINSQTTTSKYPRRGENKFKIQKTEKVEKSRESNSTESQPSLNGTAASNDRPNRNFVRRRFGGANTSTTQSSTIQLSSSVTRRPFRVANRRRLFSTTTTTTTTSPSTTELESDESLQDIGDTDAIEDPSLQPQPRARKVSNGPRRRPLVQLKNENEDQNSSPTNEDEKTRQSKKYSASFKQNQLEEILKIRASAEEIDVTTEGRSTLDDTSAETAVALAAHQLLSAPIPIIPDYDDESKPARSSQTIVDYKFTSPEYNDLTKTQTYTEDYQRTPSYTSTGQRFESTTPYTLRTEGNVRSTTGSVNSETNIPSGFTTPGAFTGRFTSSTTGNTINPTFSGITLSLGSEGSGESTARYTSKFPKESSPTAYTISNYETRTLRPGLSTNIVNPTSLNWRESTARIYASLDRSVQPTYSTEFTTKISRPAGFSPNSVKIDEIENDKTTEKYTGFIERGPSTARYEGSSEKISVPVAVGYSSGSQGLQEPSYFTREYLLESPVTRTYDDEYQYLSPATTPQPTTKKPLRRKTIYRRISSTAAPSSQITQSLSSIRTSPTTLTTPTQPITQTTVKPRRTSRKPFQRIAVKKGPLQKQPVQPEIKDSVPKEVQKTLVLKINNNAVKSSRPLSDYDYYDDSHEGVKYEDGSKVLLHGKGDIECLDIGNFAHPSSCKKFISCARMESGALVGWEYICPKGLSFDPVGGICNWSAGLGCTEKDA-