Monarch geneset OGS2.0

DPOGS210864
TranscriptDPOGS210864-TA6546 bp
ProteinDPOGS210864-PA2181 aa
Genomic positionDPSCF300027 + 829624-849878
RNAseq coverage1021x (Rank: top 12%)
Annotation
HeliconiusHMEL0050280.057.42% 
BombyxBGIBMGA006989-TA0.052.36% 
DrosophilaCht6-PC7e-1722.18% 
EBI UniRef50UniRef50_D6WX173e-18032.69%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WX17_TRICA
NCBI RefSeqXP_391993.28e-12643.50%PREDICTED: similar to K06A9.1b [Apis mellifera]
NCBI nr blastpgi|2700116081e-17932.69%hypothetical protein TcasGA2_TC005652 [Tribolium castaneum]
NCBI nr blastxgi|2700116080.032.51%hypothetical protein TcasGA2_TC005652 [Tribolium castaneum]
Group
Gene OntologyGO:00045531.8e-19hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059751.8e-19carbohydrate metabolic process
GO:00060304.1e-14chitin metabolic process
GO:00080614.1e-14chitin binding
GO:00055764.1e-14extracellular region
GO:00038247.7e-13catalytic activity
GO:00431697.7e-13cation binding
GO:00060322.8e-07chitin catabolic process
GO:00045682.8e-07chitinase activity
KEGG pathwayisc:IscW_ISCW0129863e-19 
 K01183 (E3.2.1.14)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[85-297] IPR0012231.8e-19Glycoside hydrolase, family 18, catalytic domain
[23-297] IPR0178533e-18Glycoside hydrolase, superfamily
[442-517] IPR0025574.1e-14Chitin binding domain
[276-317] IPR0137817.7e-13Glycoside hydrolase, subgroup, catalytic core
[23-297] IPR0115832.8e-07Chitinase II
Orthology groupMCL18060 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210864-TA
ATGTGGCTTTGGTTGCTCTGTGCGGCTGCGGCCGTGTACTCCGCGGAGGGCGCAGGCCCGGGCGCCGCCCGCCTCGTTTGTTATGTCGAGGGCGCGCGCGCATCTGATGTATCTGAGTGCACGCACCTCGTGTACGCCGGCGACTCTAGAGGTGAGGCACTGGACGATCTGCTGAAGGACTACAGGAAAAATAACCCAAGGATCAAGATCATCCTGCGCGTCAACGAAGGAGATAAGGATCTAACAGAAGTCCTCAAATCTAAGAACGTGCAAGGTTTGGAAATATACGATGCCCACAAAGCCTTCAATAAGACCAAAGTCCTGGAAATGGTGGAATCAGCTAGAGCTGCCCTCACCGCAGCGGGAGGTGGGCCATTGTTCCTGGCGCTGCCTTCGCACCCAGAACTCCTGGCCAAGTATTACGACCTCAGGGTCTTAGTGAAGAAGACTGACCTCATGACCGTCCAGACACATGCTCTGGGTTTGGTGAAAAAGATGACCTACCATCCCAGTCGGCTGAGTGGATTGTGGGACATGATGAATACTGATTCCGTAGTGGATCTAGTCATCGGCTTAGGAGTCCCCGCTTCGAAGGTGGTGATCAGTTTACCAGCTACGGCTCGACAGTTCCAACTCCTTAACGAGACCCTCAGTACGCCCGGTAGTCCGACTACCGACGACGATCCGAAAGAAATCGACCAACTAGAACTCTGCAGACAGTTACGCAAAGGCAGATGGACCTTAGAAAGAGACCAGGACCTGTCAGCGCCTTACGCATTCAAGGACAAGACGTGGATATCATTCGAAGACGCATCCTCGGTAGATGTGAAGGGTAAGTACGCGAGGGTTCGAGGTCTCGCTGGCCTCGCTCTGTACAACACTGAGAAAGACCTGGACTCGCCCTGTGCTCCAACTCTCAGGACGGCTTTGAATAAAGTGCTGAACCAGCAGAGTAGAGCGCCGAGGGCAGCCGTTTTAAGATCTTTGGAAAACGAGATCCTTTCCGCTCCACACAACGCGCGTGTCCTGGACGCTCTCCAAGTGTCACCTTACCGGATCACTCAGGTTGTTGATGCTGACGGTGTCATACATTCCATCAGAGAGGATACGAGGACAGAATTCTCCTGTTCCCGCCAAGGCTACTTCGTACATCCCCGTTCTTGCGCTCGTTTCTATCGTTGCGTTAAATTCGATCAACTGTCTCCCGAATATACAGTTTTCGAATTTGACTGTCCCGCTGGGCTGGCGTTTGACGCGCGGTATGAGGTGTGTGTATGGCCAGGGTCTTTGCCTCACGCCGCCGCCTGCCCAGGGTCATCAGAGATCGCGCCAGTTCCTACCACCCGGTTCATTTGTCCGGACCATGAGGGTTACTACGCGGATCCAGAGAACTGTCGTTGGTTCTTCGCGTGTCTGGACCATGGTAAAGCGCCGCTAACTGCCTACGAGTTCCGCTGCCCATTCGGTCTAGGGTTCGACGCTGCCCGCCTTAAGTGTGATTGGCCGTGGTTGGTACCGGCTTGCGGGAACATTGCCCGATACGAAGCAGAAGCCTTTGGATACTCAGGCGCCGCTCTTAGTGGTGCTACTGGTTTCGAAGGAAAAACAGCAGATTCAGTTAACATAGCCGCTCATCAGAGTTTGGTTTCCGGAGCATCTCTTAACAACCTAGTAGGAATTCAAAACGGACTTCTCACTAATGACGATATTCTAGACGCTAACTACATATCATCCCAAGAAGCTGCCAATGGGGGTTTGGCCGGAGCAGGAACTTATGACAGCTTTTCGAACATAGGACTCAGTTATCAAGGCGAACCAAGCGGAGTCTTCTACAATTACGTGTCAGCGGAAGACATTAACAAGGGTCTTGTTAGGGGCGTTAATAGTGAGGGAGAGGAGAAATACACGAGTGGATCCATAATTTTAGATGATTACAGACTACCAAGTAATAAAGTAAACACTTACTCGGTATATAATAAAGGTCAAAGTGGTGTATCAGGCTCTGCCAGAAAGCCACTTCGTTACAATGATGGTAAATACCGCCCTGATAATTCTGGGAAGTACGTCCACAACCCTGCCGGTGACCGAGCCAAGCCCTATGAGCACATTGGTGTCCCTCCCGTTCCGTACGCGCACAAGGATTTTAAATATAAACAATCAAACGAGAATGACGCCAGCAAATATTCTGGTAGCGTCTCTGGAGGTTACGAGTACCCAAGACCAAAAATTGAATTCCACGAAGGATTTGGGGTGAATGATAATATTCATAGTGTTAGTGGATACAACAGTGTAGGAGTCGAAGGTAGTGTGACAAATACATATCAAGGCAGTTACACAAACGCAGATGTAGGAAAATACGTAACCGACGACAGAGTATACACAGGCGCTTCACATACTCTTACTGCTGGAGCAGTGAATGTTGGATTGGTCGGGAAGACAACACTATTAGACGCTGATTACGCGAATCAAGGAAACTATCAGGGTCATTCATACAGCAGTGGGATTGTTCATCATGTTTCTCAGCCTGCAGTCTCTGTAAGTCACGTCAGCACGGCCGGTCAAAACTTAGACGGTTACAGCTATACAACTTCAACCTCATCGGGAGTCCCCACAACAGTGAGTCCATTGTCGTACACAACATACAAAACTACATACGTTCCAGAAGTGCCTAAAACTAGTATTAAACAAGTTTTCGGTTTCACTCAACCAGCTGTAACTTACGTCCAAACTGCTGTAGTACCTGTAAAACAAGTAACGCCACAAATTCAAGTTACTAATTACAATCAAGGCGTCAACTATCAGTACGAAAGTAAAGATTATTCGTCCTCCAATGCCAAGTCAAATGAAAACTCTGAGTTTACAGGTTACGAATACAAACAACCTAGTATTAAATTTGAAAATGTCCCCACTGCAACTCCAGCCGTGACTGTAGTATCCCAAAAACCACAAAACCAACAAACCTTTCACACAATCGAATCAGTTGGTTTTGAATATTCAACACCAGTACCTGTCAGTGAACCACCATTTAAAAAAATAGTGGCTTATACAACGGGTTCACCAGTTAGCAGTTACGTTCAATCAACATCACCGGCTGTCAGTCATCAAACAATACATAAAGTGGAAACATCAAAAGTACAAATTCCATTCGTCGAAGTCAGTGGCCAAGATGTTGCAAGCACAGGTGTCAACTATGAAGTGCCCCAGGGTTTGATACAGTACACTACCCCACAACCGTCAATTTCAGTTCAACCGGAAATATCATATCAACCACAAGCATTTAGTCAACAAACTGTACATACAGTCGGTGCTAACAAAGTTAATTCGTTCACACAAGGCCATTCGCAAGTAGGATACGAGTACCAAGGACCGTCAACAATCAGTTTTAATACCAACGATGAAAATGGGTATAACTACAAACAACCTAACGTTCGATTAGAAGACGCAGCAAGAGTAGTTCATTATTCTACCCCAGCACCGACAGTGTTATACACTCCTCAAAGTGACAGTTCGCAAAGCATACAGAGATCCAGGACGCAATACGTTTCGTCAACTGCTCGCCCATTCGTGTCAACAACGCCTGTAACTTTATACGAGAATCCCCTTCTACAATACACAGCAAAATCTACGGAAATAACATATGACGCTCCTAAGTACACTTCACACTCATACAGTCAACAAAACCATAACAGAGGTGTTTCTTCAAACTCGTTCGCATCCAATGGATACTCAGCAGCACTGGATCAACAAACTACAGTCTATTCAACTCCTCAGCCGGCACTTTCAGTTCAATCCACAACAACAACATACCCTCAATATCAAGGTCAGCAGAATCTTCATCAAGTCGATACAAGTCGACTTAATACATATGAAGGTGCTTCCATCAATGTTGATTACTCTCAGCCGTCGTTTAGTGTACAACAAGCTAAAATAGAAACATCCACTCCTGCATCAATTACTTCTTCAACGTACAAACCTCAATCATATAGCCAACAAACAATCCATAAATATGAGAAACAAGTAACTCCTTCAACCACGCCAATTTCTACCATACATTTATCTTATCAACAGCCCGAGGTGTCTGTTTACGAAAACCCAATCTTGAAATACACTCAAAGGGTTACTCCTGTTACCTATGTAAAGCCTACAGCATCTATATTTACTCAAACATATAGGCCTGAAGTCAAGGTAAGTCACGTTACCAGACCAGCACAATACGAAACATCAGACCAACAATCTTATATAAATCAAGAAGCTTATACTTATAATCAACCGGAAATACAAAGAAATATTGAGTCATCAAAAAGCCAAACTTACTCTGACGCTAGTGTTGTGTCAAGCCAATACTTCCATCAAGGGAAAGGAATAACACAAACAAATGGAAATGAACAGTACGAATCTGGTAAGAACGTTGTGTTTGTATCAAGTACCCCTGCAACACTCGCTTACGAGGATCACTATACAGAATATGAAAATCAAGGGAAGGTTAATGCATACAAAGCTCCTGAGTATATACCTCCGAAGGAAGAAATACCACAAACATACATAGTACCAACTGTCTCCACACCAGCCGTGAAGCAGCAGGGTTCGATACTTTATTCTCAAGATCAATACGATTATCAACCTGTGGGCTATTCTCAACAATATCAAGGTGAATATACAAATAGTAAGGCCAGTACATCAATTAATAATTATGAATATACTCAAAATAATTACCAATCTCCTGAAGTACAAGTGGAGTATAAGGCTGAAGAATATTTGCCACCAGTACCATCTACTGCAAGACCCGCTGTCTCATCTACATACCGGTCACGCATCTATTCTACAACAACCAATGCTCCAGAATACCTGCCTCCTGAGTCTGAAACTAAATTCCGCGCCGATGAATATTTACCACCAGTGAAAAATAGTTACCAATCAATTGTTAAATCTACAGACAATGTTGAATACTTACCTCCTGCAGAAAGTACAGCAGCACGCGTGGCTACTTTCCAACGATTTGGTTACAACGATGAGGATGAATCCAGTAAAGGATATAGCACATATCAAGATTATTCACTCTCAACTGAAGCGCCCATTATAGCGGCTGGTGTGACAGGTAGAAAACAGAACATAGTAGTTGAGAAGGCGAAGTCAAATTTACTTGGTTTTGGAACTGTTGGACCAGAGGCTGGCTTGGTGTCAACTACGGCCCGGCCGTTTATATCTACTACGGATAACTCTTACTTACCTCCAAAGACGGCAACGTTCACGTATACAACTGAAACTCCGGAAGTAACTACTACAATAAGAAGGACGAAACCTAAATATTTTAGAGTACCATCTTCTTCTACTACACCGGTATACGTAGAGTCAACGACAGCGTATCAGGCGCCGGAATACTTGCCACCGTCAGAAAAAATCTGGCAAGGCTTAGTGGAGACGTCAGCATATAGTAGTACAGTAGCTCCAAAAATAAGACTTAATCCTTATCAAGGAAACACATATGATAATTCCCAAATAATTGTTTCCACAACAACGGCACCGATAAGAAAACAAAATGTTGTCGTTGAAAATGCTAAAGCACAATTATTAGGATTCGGTGCCGTTAGTTCTGAAGCTGGTTTAGTTTCACCAGCATCGTATAGTGAAATAGAGCCTATTGAAGTATCTCATGATGGCACCTACTCTACTGTGCTACCAGTTCAACAACAAGTGGAAGTCACAAGCGCAAGAGTCCCGATACGAAGAGTTAAGCCAAAGGTAGCGATCGTTACAAAAATAAATGACTTTAATCCTCTTCTTGTGAAAAAATTAGGCGCTGTTTGTAGTTGTCAATCTCCTGTATTAATTCTTAAGGGCAAAAGACCGAATATTCAAGAAGAGGACGTTGATTACGACAATGGTTACGAAAGTGGACGAGGGGATCTCGGCAGTTATAAATTAAAACCGCAGTCAGCACGAGTAACTACCGTTCCATTACTGAATACTGTGACGACCGCATCACCGGTAGTATCAACTTTCAATCCAATAATTGTCCCCGATGATTCATACTATCAAGACTACCAAGAGGCTAGTAACGATAATGTAGTGGTAAATGCCGTTGGAAAGGATAATTCTCAAAGTTACGTTTCAAGCGTACCCGTTGTTTCAACAACTGAAAGAGTAGTTAGAATAAGACCAAGAGTAAAATTAGTGACTGAGGCGCCTACTTATAAAACTGTTGTATTAAATAAACAAGTAGGCCCAAGCATTCCCCAGACAGCGGAATTGATCGAATCTGTTGGAGTTAATTCCCCATCTTTTGATCGTTATGGACCAGGCGGTTGGAGAGACAGGGATGAAACTCTACAGGGCTCTATAGATTGTCAGCGAGCTGGGCTATTCCGTCATCCAAAACAATGTAACAAATTCTATGCATGCAGATGGGACTGTACAAAACAAAGATTTACGCTTCACGTCTTTAACTGCCCTGTTCAACTGAGCTTCGACCCTAACATTGGAGCCTGTAACTGGCCAAGTCAAGGACCCGCTTGTCAAGGGGATACCCTCCTCACAAACGCTCTTTGA

Protein sequence:

>DPOGS210864-PA
MWLWLLCAAAAVYSAEGAGPGAARLVCYVEGARASDVSECTHLVYAGDSRGEALDDLLKDYRKNNPRIKIILRVNEGDKDLTEVLKSKNVQGLEIYDAHKAFNKTKVLEMVESARAALTAAGGGPLFLALPSHPELLAKYYDLRVLVKKTDLMTVQTHALGLVKKMTYHPSRLSGLWDMMNTDSVVDLVIGLGVPASKVVISLPATARQFQLLNETLSTPGSPTTDDDPKEIDQLELCRQLRKGRWTLERDQDLSAPYAFKDKTWISFEDASSVDVKGKYARVRGLAGLALYNTEKDLDSPCAPTLRTALNKVLNQQSRAPRAAVLRSLENEILSAPHNARVLDALQVSPYRITQVVDADGVIHSIREDTRTEFSCSRQGYFVHPRSCARFYRCVKFDQLSPEYTVFEFDCPAGLAFDARYEVCVWPGSLPHAAACPGSSEIAPVPTTRFICPDHEGYYADPENCRWFFACLDHGKAPLTAYEFRCPFGLGFDAARLKCDWPWLVPACGNIARYEAEAFGYSGAALSGATGFEGKTADSVNIAAHQSLVSGASLNNLVGIQNGLLTNDDILDANYISSQEAANGGLAGAGTYDSFSNIGLSYQGEPSGVFYNYVSAEDINKGLVRGVNSEGEEKYTSGSIILDDYRLPSNKVNTYSVYNKGQSGVSGSARKPLRYNDGKYRPDNSGKYVHNPAGDRAKPYEHIGVPPVPYAHKDFKYKQSNENDASKYSGSVSGGYEYPRPKIEFHEGFGVNDNIHSVSGYNSVGVEGSVTNTYQGSYTNADVGKYVTDDRVYTGASHTLTAGAVNVGLVGKTTLLDADYANQGNYQGHSYSSGIVHHVSQPAVSVSHVSTAGQNLDGYSYTTSTSSGVPTTVSPLSYTTYKTTYVPEVPKTSIKQVFGFTQPAVTYVQTAVVPVKQVTPQIQVTNYNQGVNYQYESKDYSSSNAKSNENSEFTGYEYKQPSIKFENVPTATPAVTVVSQKPQNQQTFHTIESVGFEYSTPVPVSEPPFKKIVAYTTGSPVSSYVQSTSPAVSHQTIHKVETSKVQIPFVEVSGQDVASTGVNYEVPQGLIQYTTPQPSISVQPEISYQPQAFSQQTVHTVGANKVNSFTQGHSQVGYEYQGPSTISFNTNDENGYNYKQPNVRLEDAARVVHYSTPAPTVLYTPQSDSSQSIQRSRTQYVSSTARPFVSTTPVTLYENPLLQYTAKSTEITYDAPKYTSHSYSQQNHNRGVSSNSFASNGYSAALDQQTTVYSTPQPALSVQSTTTTYPQYQGQQNLHQVDTSRLNTYEGASINVDYSQPSFSVQQAKIETSTPASITSSTYKPQSYSQQTIHKYEKQVTPSTTPISTIHLSYQQPEVSVYENPILKYTQRVTPVTYVKPTASIFTQTYRPEVKVSHVTRPAQYETSDQQSYINQEAYTYNQPEIQRNIESSKSQTYSDASVVSSQYFHQGKGITQTNGNEQYESGKNVVFVSSTPATLAYEDHYTEYENQGKVNAYKAPEYIPPKEEIPQTYIVPTVSTPAVKQQGSILYSQDQYDYQPVGYSQQYQGEYTNSKASTSINNYEYTQNNYQSPEVQVEYKAEEYLPPVPSTARPAVSSTYRSRIYSTTTNAPEYLPPESETKFRADEYLPPVKNSYQSIVKSTDNVEYLPPAESTAARVATFQRFGYNDEDESSKGYSTYQDYSLSTEAPIIAAGVTGRKQNIVVEKAKSNLLGFGTVGPEAGLVSTTARPFISTTDNSYLPPKTATFTYTTETPEVTTTIRRTKPKYFRVPSSSTTPVYVESTTAYQAPEYLPPSEKIWQGLVETSAYSSTVAPKIRLNPYQGNTYDNSQIIVSTTTAPIRKQNVVVENAKAQLLGFGAVSSEAGLVSPASYSEIEPIEVSHDGTYSTVLPVQQQVEVTSARVPIRRVKPKVAIVTKINDFNPLLVKKLGAVCSCQSPVLILKGKRPNIQEEDVDYDNGYESGRGDLGSYKLKPQSARVTTVPLLNTVTTASPVVSTFNPIIVPDDSYYQDYQEASNDNVVVNAVGKDNSQSYVSSVPVVSTTERVVRIRPRVKLVTEAPTYKTVVLNKQVGPSIPQTAELIESVGVNSPSFDRYGPGGWRDRDETLQGSIDCQRAGLFRHPKQCNKFYACRWDCTKQRFTLHVFNCPVQLSFDPNIGACNWPSQGPACQGDTLLTNAL-