Monarch geneset OGS2.0

DPOGS207191
Transcript: DPOGS207191-TA (1305 bp)
Protein: DPOGS207191-PA (434 aa)
Genomic position: DPSCF300001 + 5563123-5570685
RNAseq coverage: 13821x (Rank: top 1%)
Annotation
Heliconius      HMEL012202        0.0     84.56%
Bombyx          BGIBMGA000648-TA  0.0     82.95%
Drosophila      CG5210-PA         7e-131  52.25%
EBI UniRef50    UniRef50_G6CSZ4   0.0     100.00%  Hemocyte aggregation inhibitor protein n=2 Tax=Endopterygota RepID=G6CSZ4_DANPL
NCBI RefSeq     NP_001036847.1    0.0     80.50%   chitinase-like protein EN03 precursor [Bombyx mori]
NCBI nr blastp  gi|259493819      0.0     86.18%   hemocyte aggregation inhibitor protein precursor [Manduca sexta]
NCBI nr blastx  gi|259493819      0.0     86.18%   hemocyte aggregation inhibitor protein precursor [Manduca sexta]
Group
Gene Ontology:
  GO:0006032  2e-99    chitin catabolic process
  GO:0004568  2e-99    chitinase activity
  GO:0043169  2.5e-82  cation binding
  GO:0005975  2.5e-82  carbohydrate metabolic process
  GO:0003824  2.5e-82  catalytic activity
  GO:0004553  5.2e-81  hydrolase activity, hydrolyzing O-glycosyl compounds
KEGG pathway:
  dya:Dyak_GE12162  5e-48
  K01183 (E3.2.1.14) maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain:
  [7-433]    IPR015520  5.2e-169  Imaginal disc growth factor
  [23-413]   IPR011583  2e-99     Chitinase II
  [392-432]  IPR013781  2.5e-82   Glycoside hydrolase, subgroup, catalytic core
  [24-413]   IPR001223  5.2e-81   Glycoside hydrolase, family 18, catalytic domain
  [23-434]   IPR017853  3.8e-74   Glycoside hydrolase, superfamily
Orthology group: MCL11543  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207191-TA
ATGAAAGTCCTGGTTGCTCTGGTTGCTCTGCTGGCAGTGGCGGTTGCCACGCCGGCCGTCCCGCCCAGCAAGGTTGTCTGCTATTACGACAGCAAGAGCTACGTACGAGAATCTCAAGCACGTATGCTGCCATTGGATCTGGATCCAGCTTTGTCGTTCTGCACTCACCTTGTATATGGATATGCAGGAATTCAACCTGACACCTATAAAATGGTCTCACTCAATGAGAACTTGGACATTGACCGCAACCACGCCAATTATCGTGCTCTCACCAACTTTAAGACCAAATACCCAGGACTTAAGGTCTTGCTCTCAGTGGGTGGTGATGCCGACACCGAGGAAGAACAGAAGTACAATTTACTGCTGGAGTCTCCACAAGCCCGTACAGCTTTCGTAAACTCTGGTGTACTCTTGGCAGAGCAGCACGGCTTTGATGGTATTGATCTTGCTTGGCAGTTTGCTAGAATCAAACCCAAAAAGATCCGCTCAACTTGGGGATCAATCTGGCACGGAATAAAGAAAACGTTTGGCACTACCCCCGTCGATGAGAAGGAAGCGGAGCACAGAGAAGGCTTTACTGCTCTTGTTCGTGAAATGAAGGCTGCTCTCAACCTAAAACCAAACATGCAATTGTCTGTCACTGTGCTTCCCAATGTCAACGCTACCATCTACTACGACGTACCTGCTATTATAAACTTAGTGGACATTGTGAATATCAATGCGTTTAACTACTACACTCCAGAAAGAAATCCAAAGGAGGCTGACTATACTGCACCTATTTACAAACCTCAAGGCCGCAATGAACTTTTAAATGTTGATGCCGCTGTAAATTACTGGCTTCAAGCAGGTGCCCCTAGTTATAAAATTGTTGTGGGTGTCGCCACCTACGGCCGCACATGGAAACTAAACTCCGACAGCGAAATTTCTGGAGTCCCACCTATCCATGCTGAAGGACCTGGGGAAGCCGGTCCCTACACGAAAATTGAAGGATTGTTGAGCTATCCCGAAGTTTGTGCTAAACTCATTAATCCCAATCATCAAAAGGGTATGCGTCCTCATTTAAGGAAGGTCTCTGATCCAAGCAAGCGTTTTGGAACTTATGCTTTCCGTGTTCCCGACGACAATGGTGAAGGTGGTTTTTGGGTAAGCTATGAAGATCCTGACACTGCAGGACAGAAGGCTGTGTATGCTAAATCAAAGAATCTTGGAGGTGTTTCAATTTCTGATCTCTCTATGGATGACTTCCGGGGTCTATGTACAGGTGACAAGTATCCCATTCTTCGCGCTGTAAAATACCGTGTATAA

Protein sequence:

>DPOGS207191-PA
MKVLVALVALLAVAVATPAVPPSKVVCYYDSKSYVRESQARMLPLDLDPALSFCTHLVYGYAGIQPDTYKMVSLNENLDIDRNHANYRALTNFKTKYPGLKVLLSVGGDADTEEEQKYNLLLESPQARTAFVNSGVLLAEQHGFDGIDLAWQFARIKPKKIRSTWGSIWHGIKKTFGTTPVDEKEAEHREGFTALVREMKAALNLKPNMQLSVTVLPNVNATIYYDVPAIINLVDIVNINAFNYYTPERNPKEADYTAPIYKPQGRNELLNVDAAVNYWLQAGAPSYKIVVGVATYGRTWKLNSDSEISGVPPIHAEGPGEAGPYTKIEGLLSYPEVCAKLINPNHQKGMRPHLRKVSDPSKRFGTYAFRVPDDNGEGGFWVSYEDPDTAGQKAVYAKSKNLGGVSISDLSMDDFRGLCTGDKYPILRAVKYRV-
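
The transcript and protein records above should agree with each other: 434 residues plus one stop codon correspond to 435 codons, or 1305 bp. The following is a minimal sketch for checking that, not part of the record itself. It assumes the standard genetic code (NCBI translation table 1), that the nucleotide sequence is a complete coding sequence in frame, and that the trailing "-" in the protein sequence marks the stop codon. The helper names `read_fasta_sequence` and `translate` are hypothetical.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def read_fasta_sequence(text):
    """Return the sequence from a single-record FASTA string, dropping the '>' header."""
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons are written as stop_symbol.

    Raises ValueError if the length is not a multiple of 3, and KeyError
    on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First four codons of DPOGS207191-TA followed by its final stop codon.
    print(translate("ATGAAAGTCCTGTAA"))
```

To check the full record, pass the DPOGS207191-TA and DPOGS207191-PA entries through `read_fasta_sequence` and compare `translate(nucleotide)` with the protein string.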