Monarch geneset OGS2.0

DPOGS214075
TranscriptDPOGS214075-TA1464 bp
ProteinDPOGS214075-PA487 aa
Genomic positionDPSCF300171 + 419364-434288
RNAseq coverage1781x (Rank: top 7%)
Annotation
HeliconiusHMEL0202714e-15276.97% 
BombyxBGIBMGA010573-TA0.085.68% 
DrosophilaCda4-PA0.071.74% 
EBI UniRef50UniRef50_G6CJP60.099.78%Chitin deacetylase 4 n=9 Tax=Endopterygota RepID=G6CJP6_DANPL
NCBI RefSeqNP_001103903.10.074.06%chitin deacetylase 4 [Tribolium castaneum]
NCBI nr blastpgi|1603337850.074.06%chitin deacetylase 4 precursor [Tribolium castaneum]
NCBI nr blastxgi|1603337850.074.06%chitin deacetylase 4 precursor [Tribolium castaneum]
Group
Gene OntologyGO:00059751.9e-31carbohydrate metabolic process
GO:00038241.9e-31catalytic activity
GO:00080615.9e-16chitin binding
GO:00060305.9e-16chitin metabolic process
GO:00055765.9e-16extracellular region
GO:00168106.3e-08hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
KEGG pathway 
InterPro domain[113-410] IPR0113301.9e-31Glycoside hydrolase/deacetylase, beta/alpha-barrel
[32-106] IPR0025575.9e-16Chitin binding domain
[331-404] IPR0025096.3e-08Polysaccharide deacetylase
Orthology groupMCL14748 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214075-TA
ATGTTTGTGTACCGAGTCAATATGAAACGTTGGACAAATATGACGGCCGCTTTTCGAATATTTATCTTCTGTGCCATATTTATTAACGTAAATTGTCAGAAAGAAGAAGAGGAATTCATTTGTCCCGGAGGTACTCAAGGCAACGGCAATTTTGCTGATCCGGCCACTTGCCGCCGCTTCTACCAATGCGTCGACGGTTTCCCTTATTTGAACAGATGTCCTTCAGGTCTTTACTTTGACGATATAAGCAAATTTTGCACTTTCAAAGCGGAAGCCAGATGTGGACCAATACCAACAACCGTAGCTCCTGTGACTGAAATCCCTCAGGACATAGTTACGAATTGCGACCCATCAGAATGTCAGCTGCCATATTGTTTCTGCTCCAAAGACGGCACACTCATCCCGGGTGGGCTAACGCCTCAGATGATAATGCTCACGTTTGACGGCGCCGTCAATTTGAATAACTTCGACTTATACAAGAAAGTTTTTAATGGGAAATTACGTAATCCAAACGGCTGTCCGATTCGGGGCACTTTCTTCTTATCTCACGAGTACAGCAACTACGTCATGGTTCAGAGCCTCGCACACGATGGTCACGAGATCGCTACAGGCACAATATCACAGCAGCAAGGATTACAAGACAAAGGCTACGAAGAATGGGCTGGTGAGATAATTGGGATGCGGGAAATTCTTAACAAGTTCGCTAACATTTCACGCAGCGAAGTTGTAGGGACGCGTGCACCATTCCTCAAACCTGGAAGAAATACACAGTTCAAGGTGCTAGAAGATTTCGGCTACATATACGACAGTTCTATCGGAGTGCCGCCCCTTCCTCAACCCGTGTGGCCCTACACCCTCGACTACAAGATTCCACACGAGTGCAAATCCGGAACCTGCCCTACTAAGGCCTTCCCCGGTCTTTGGGAAGTTCCTTTTAATGCTCACTACGTCGAATCTTACGAAGGTGGTCATTGCCCTTACTTGGATCAATGTGTTCTTCACAATCATGATGCTGACGATGTACTAGAATGGCTTCAGGAAGACTTCACCAGACATTACGAACAAAACCGGGCACCATACATGATGCCGTTCCATACCAATTGGTTCCAGATCAAGCCACTGGAGCGAGGTCTCCATAAATTCTTAAACTGGGCTGCTAACTTAGACGACGTTTGGTTCGTAACAATGACACAGTCCCTGACGTGGATGACGGACCCTCGCTCCGTGAAATCCTTGAACAACTACGAGCCCTGGAAGTGCGATAAGAAGGAGGGGCCCAAACCCTGTAACCTTTCAAACAAGTGCGCGCTGCCGTTCAAACTGCCAGAGACTAATTTCACAGATACCAGATATATGGAAACATGTGTCGACTGTCCTAAGCAATACCCATGGCTCGGTGATTCTGGCGGTACGGGTATCGCTGGAACTGATAATTACATCCCTGAAAGTCTTAGCAGAAAGTAA

Protein sequence:

>DPOGS214075-PA
MFVYRVNMKRWTNMTAAFRIFIFCAIFINVNCQKEEEEFICPGGTQGNGNFADPATCRRFYQCVDGFPYLNRCPSGLYFDDISKFCTFKAEARCGPIPTTVAPVTEIPQDIVTNCDPSECQLPYCFCSKDGTLIPGGLTPQMIMLTFDGAVNLNNFDLYKKVFNGKLRNPNGCPIRGTFFLSHEYSNYVMVQSLAHDGHEIATGTISQQQGLQDKGYEEWAGEIIGMREILNKFANISRSEVVGTRAPFLKPGRNTQFKVLEDFGYIYDSSIGVPPLPQPVWPYTLDYKIPHECKSGTCPTKAFPGLWEVPFNAHYVESYEGGHCPYLDQCVLHNHDADDVLEWLQEDFTRHYEQNRAPYMMPFHTNWFQIKPLERGLHKFLNWAANLDDVWFVTMTQSLTWMTDPRSVKSLNNYEPWKCDKKEGPKPCNLSNKCALPFKLPETNFTDTRYMETCVDCPKQYPWLGDSGGTGIAGTDNYIPESLSRK-