Monarch geneset OGS2.0

DPOGS206237
TranscriptDPOGS206237-TA1497 bp
ProteinDPOGS206237-PA498 aa
Genomic positionDPSCF300334 + 209499-214638
RNAseq coverage3130x (Rank: top 4%)
Annotation
HeliconiusHMEL0112248e-14651.56% 
BombyxBGIBMGA009695-TA3e-14450.97% 
DrosophilaCht2-PA1e-10242.15% 
EBI UniRef50UniRef50_B4MMQ42e-10142.86%GK17559 n=8 Tax=Endopterygota RepID=B4MMQ4_DROWI
NCBI RefSeqXP_002060284.12e-10644.22%GJ16075 [Drosophila virilis]
NCBI nr blastpgi|3442271625e-10843.43%chitinase [Bactrocera dorsalis]
NCBI nr blastxgi|3442271622e-10744.09%chitinase [Bactrocera dorsalis]
Group
Gene OntologyGO:00060322.3e-116chitin catabolic process
GO:00045682.3e-116chitinase activity
GO:00431691.1e-90cation binding
GO:00059751.1e-90carbohydrate metabolic process
GO:00038241.1e-90catalytic activity
GO:00045531.3e-89hydrolase activity, hydrolyzing O-glycosyl compounds
KEGG pathwaycqu:CpipJ_CPIJ0045642e-102 
 K01183 (E3.2.1.14)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[33-380] IPR0115832.3e-116Chitinase II
[359-394] IPR0137811.1e-90Glycoside hydrolase, subgroup, catalytic core
[35-380] IPR0012231.3e-89Glycoside hydrolase, family 18, catalytic domain
[33-387] IPR0178532.1e-86Glycoside hydrolase, superfamily
Orthology groupMCL16529 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206237-TA
ATGTTTGTTAAATCGTTATATTTAGTCGTATTTCTGTTAAACCTCACACAACTTAATTGCCAGAACGTACCAAAAGGAAAACCCCCTCCACACGACAAGCTGATCGTGTGCCTCGTATCAAGCTGGGCCGTGTACAGACCTGGAGCCGGTGCCTTCAACATCGAAGACATTGAACCCTCGCTGTGTACACACCTCGTGTACTGTTTCGCTGGATTTGATGAAGAGACCAACAAGATCAAGAGTCTTGACCCTTGGCAAGACTTAGAAGATAATTATGGCAGGGCAGCCTACAAGAGAGTTGTGGCCTTCAAAGATAAGCATCCCCACCTTAAAGTGACGATCTCTGTGGGCGGGTGGAACGAGGGATCCACGAAATACTCCAAGTTGGCCGCAGACCCAGCAGCCAGAAAAACTTTTATAGATAGTGTCATGGAATTCTTGGCCAAATACAAGTTTGATGGTCTCGACTTGCACTGGGATTATCCAACAGGGAGGGGGGGGCAGAAAATAGATAGGACAACATACGTGACACTTCTCAAGGAGCTATCGGAAGCTTTCGAACCAAAGAACTACTTACTGACCGCGGCCATCAGAACTACTAAGGAGGATATGAATGCTGTCTATGATCTTGACCAACTCAATTACTACCTGGACTTCATCTATCTCATGACGTATGACTACCACGGCCCGTGGGATGGAATTATAGCTCCCCATGCTCCAATCAAAGGAAGAACTGTCGGTGACATTCTCAGTGTGGATTACACTATACGATACCTGAGAGATCGCGGCATGACAATGGGTAAATTGATCCTTGGCCTGCCGATGTACGGAAGGACATACAACCTCGTCCACGGAGACATTAAAAATGTCGAATATTATTCCACTGCCACACAGACTATTGGTTTTTCTGGACCCTTGAGTAAAGAACCAGCTTTTATGGGATACAACGAGATATGTTCAGCATTCAGTAACAGAACCTCTGGTTGGACCAAGGGTTGGCATGAGAAGTCCAGTACAGCCTACCTCAGGAATGGCGACAAGTTTATAAGTTATGACAGTCCACGGACCATTGCGGATAAAGTGAAGCTGGGTCTAGACTATGAGCTGGGAGGCTTCTCATCTTGGAGCATAGTGACTGACGACTTCCGTGGAGCCTGTGACGAGGAACATGACACGTATGCTGATTACATCGCAAGGTATAAGAAGTTTTCGGACGAAGCCACCTTGAAGCAAGCTTTGGAAAACTTAGCTGAAGCTGAAAATAAGATCAGTTTCTACACCATCATAGACAACAAGCCAACAGTGACATTACCGAAGGCCAACTTCTCGAACTACCCTCTACTGAGGACCATAAACAATGCGATAAGATTGATCTCGGAAGAAAACAAAGTCGTTGAAGAAATCGATCGCATTAAATTAAAAAGAACCGACACTATAGAGGAGGGCAGCTCCCCGTGTATACGGACGTGCTCAGAAGTTTGTTTCTATGGGACATAA

Protein sequence:

>DPOGS206237-PA
MFVKSLYLVVFLLNLTQLNCQNVPKGKPPPHDKLIVCLVSSWAVYRPGAGAFNIEDIEPSLCTHLVYCFAGFDEETNKIKSLDPWQDLEDNYGRAAYKRVVAFKDKHPHLKVTISVGGWNEGSTKYSKLAADPAARKTFIDSVMEFLAKYKFDGLDLHWDYPTGRGGQKIDRTTYVTLLKELSEAFEPKNYLLTAAIRTTKEDMNAVYDLDQLNYYLDFIYLMTYDYHGPWDGIIAPHAPIKGRTVGDILSVDYTIRYLRDRGMTMGKLILGLPMYGRTYNLVHGDIKNVEYYSTATQTIGFSGPLSKEPAFMGYNEICSAFSNRTSGWTKGWHEKSSTAYLRNGDKFISYDSPRTIADKVKLGLDYELGGFSSWSIVTDDFRGACDEEHDTYADYIARYKKFSDEATLKQALENLAEAENKISFYTIIDNKPTVTLPKANFSNYPLLRTINNAIRLISEENKVVEEIDRIKLKRTDTIEEGSSPCIRTCSEVCFYGT-