Monarch geneset OGS2.0

DPOGS200442
TranscriptDPOGS200442-TA1185 bp
ProteinDPOGS200442-PA394 aa
Genomic positionDPSCF300236 + 456030-458555
RNAseq coverage550x (Rank: top 23%)
Annotation
HeliconiusHMEL0115990.081.47% 
BombyxBGIBMGA008900-TA0.076.14% 
DrosophilaCG8460-PA1e-8240.41% 
EBI UniRef50UniRef50_E0VQE27e-9246.21%Spore germination protein yaaH, putative n=1 Tax=Pediculus humanus corporis RepID=E0VQE2_PEDHC
NCBI RefSeqXP_001869617.13e-9545.25%chitinase domain-containing protein 1 [Culex quinquefasciatus]
NCBI nr blastpgi|1700705445e-9445.25%chitinase domain-containing protein 1 [Culex quinquefasciatus]
NCBI nr blastxgi|1700705444e-9345.25%chitinase domain-containing protein 1 [Culex quinquefasciatus]
Group
Gene OntologyGO:00431692.5e-20cation binding
GO:00059752.5e-20carbohydrate metabolic process
GO:00038242.5e-20catalytic activity
GO:00045538.4e-17hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00060322.2e-06chitin catabolic process
GO:00045682.2e-06chitinase activity
KEGG pathway 
InterPro domain[77-393] IPR0178536.5e-32Glycoside hydrolase, superfamily
[366-385] IPR0137812.5e-20Glycoside hydrolase, subgroup, catalytic core
[73-385] IPR0012238.4e-17Glycoside hydrolase, family 18, catalytic domain
[77-386] IPR0115832.2e-06Chitinase II
Orthology groupMCL12115 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200442-TA
ATGAAATATTTGACTATAGTTTTACAAGTCCTGTGTATTTCAAGTTTATCCCTTGCAACATTGTCACCTCCCTCCGATAAAAAATCACAAAAGGAAGTAAAACCGCAAGAAGGTTCTAGGAAAAATAATGTTTTGGATCGAAAATTGGTAGCTGAGACTCCGTACGTGAAAGATATACTTAAATATCATGCAACTTATCATCAGGACGTACATGTAAAAAATTTTAATAATCTGGTGTTAGGCTTTGTTACACCGTGGAACAATAAAGGTTATGATGTAGCCAAGAGATGGGCGTCAAAATTTAATTACATTTCCCCTGTATGGCTGCAAGTTAAAAGGCAAAGCTCCAACATATACATCATTTCTGGTCTTCACGACGTGGACAATGCATGGATGAAGGCGGTCAAACAGAAAGGAACTGACACTGGCTTAAGAATTGTACCGAGATTATTGTTTGAAAACTGGCAGCCATCAGATTTGAAGGCGTTTTTCATTGAACCATCATCATACAGTGAACAGAAAGCGTTGATTGAAGAAATCAAGAAAGTCTGTAAGCAATGGGGTTTTGACGGGATCGTGTTAGAAATGCTTTCTCAAATCGGAAAGTACATCGACAAATCAGTGAAGTTTATACAACACTTCGGTCTTGAGATGAGCGAGAACGGCTACCACCTTATTCTCGTGTATCCACCATTTAGAGGTTATCCAAGCGATGACTTCTTTGTTCAAGCCTTCAATGAAATCCATCCTTATGTAGATGCTGTCTCCGTCATGACATATGATTTTTCAAATCCTCAAAAACCGGGTCCGAACGCACCATTCTATTGGCTGAGATTGTGCATTGAAAAACTAATAGGCGATGATGAAAATCCGACAAAGAGATCAAAAATATTACTCGGTTTAAATTTCTATGGTAACTCGTATACCGCGAACGGTGGCGGTCCCATTGTTGGCACGGAGTATATTGAATTGTTGAAAAATGCGAAACCGAACCAGCTTATATCTTACAACAATAATACTGCTGAGAATTATCTTGAAGTCAGGACATTACAAGGTACAAAGAAGATTTTCTTCCCCACATTGTACTCAATCCATAAAAGACTGGAGCTCGCTAGGGAATACCGAACTGGTGTTGCCATTTGGGAACTCGGTCAGGGATTGGACTATTTCTATGACCTATTTTAA

Protein sequence:

>DPOGS200442-PA
MKYLTIVLQVLCISSLSLATLSPPSDKKSQKEVKPQEGSRKNNVLDRKLVAETPYVKDILKYHATYHQDVHVKNFNNLVLGFVTPWNNKGYDVAKRWASKFNYISPVWLQVKRQSSNIYIISGLHDVDNAWMKAVKQKGTDTGLRIVPRLLFENWQPSDLKAFFIEPSSYSEQKALIEEIKKVCKQWGFDGIVLEMLSQIGKYIDKSVKFIQHFGLEMSENGYHLILVYPPFRGYPSDDFFVQAFNEIHPYVDAVSVMTYDFSNPQKPGPNAPFYWLRLCIEKLIGDDENPTKRSKILLGLNFYGNSYTANGGGPIVGTEYIELLKNAKPNQLISYNNNTAENYLEVRTLQGTKKIFFPTLYSIHKRLELAREYRTGVAIWELGQGLDYFYDLF-