Monarch geneset OGS2.0

DPOGS206218
Transcript: DPOGS206218-TA, 771 bp
Protein: DPOGS206218-PA, 256 aa
Genomic position: DPSCF300334 - 215570-222517
RNAseq coverage: 798x (Rank: top 16%)
Annotation
Heliconius       HMEL011224        1e-82  79.78%
Bombyx           BGIBMGA009695-TA  1e-66  71.24%
Drosophila       Cht2-PA           8e-58  59.35%
EBI UniRef50     UniRef50_D6WQ48   8e-62  62.86%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQ48_TRICA
NCBI RefSeq      XP_970191.2       1e-62  62.86%  PREDICTED: similar to AGAP005634-PA [Tribolium castaneum]
NCBI nr blastp   gi|189239365      3e-61  62.86%  PREDICTED: similar to AGAP005634-PA [Tribolium castaneum]
NCBI nr blastx   gi|189239365      1e-60  62.86%  PREDICTED: similar to AGAP005634-PA [Tribolium castaneum]
Group
Gene Ontology
GO:0043169  5.7e-58  cation binding
GO:0005975  5.7e-58  carbohydrate metabolic process
GO:0003824  5.7e-58  catalytic activity
GO:0004553  9.2e-49  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0006032  1.3e-21  chitin catabolic process
GO:0004568  1.3e-21  chitinase activity
KEGG pathway
aga:AgaP_AGAP005634  7e-55
K01183 (E3.2.1.14) maps to Amino sugar and nucleotide sugar metabolism
InterPro domain
[16-192]  IPR013781  5.7e-58  Glycoside hydrolase, subgroup, catalytic core
[31-183]  IPR017853  2e-51    Glycoside hydrolase, superfamily
[31-191]  IPR001223  9.2e-49  Glycoside hydrolase, family 18, catalytic domain
[31-248]  IPR011583  1.3e-21  Chitinase II
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206218-TA
ATGGTGGGTAGAAAGTGCTTATTGTTAACGATCTTCGCCGTCCTGGCGTCGGTTTTGGACGGACAGACCTTGGGAGGTCCTATGCATGGTAAGGTTGTGGTGTGCTACGTCGCGACCTGGGCAGTGTACAGACCTGGTGCTGGAAAGTTTGATTTATCTGACATCGATCCAACTTTGTGCACCCACCTCATATACTCCTTCGCTGGCTTGGATCAGAGTACTGGCGGAATAAAGAGTCTCGATCCCTGGCAAGATTTGGAAAAAGATTATGGTAAAGGTGGATACAAAAGGCTCGTGTCCCTCAAAGCGAAATATCCTCACCTGAAGGTCACTGTGGCTATCGGTGGATGGAACGAAGGTTCCAGCAAATACTCCGAGATGGCGTCCAAAAATGAAACTAGGGCGAAGTTTGTGCAAAGTGTGGTTCAGTTTTTGGACACCTACAACTTTGACGGACTGGATTTGGACTGGGAGTATCCGACTAAGAGAGGAGGAGCTCCTGAAGACAAAGCAAATTATGTAGCCATGGTCAAGAACCTCAGTAGACGCACGTCTTACGTGGCGGCCAACAACAAGCTCCACCTGCGTCTGCCGGAGCCAGAGTACTCCAACTACATCATAATGAGGACCATCAACGACGCGACCACCCTCGCGTTGGAGGAGAAGAGAATATTTGATGAGATGAAGAGAGTCACTAAAGAGAATGAGGTCACACCTGGAGACGATCCAGGTCTTGGTATGATCAGGGTGCTGTTGGGTGGAGCGCGTTGA

Protein sequence:

>DPOGS206218-PA
MVGRKCLLLTIFAVLASVLDGQTLGGPMHGKVVVCYVATWAVYRPGAGKFDLSDIDPTLCTHLIYSFAGLDQSTGGIKSLDPWQDLEKDYGKGGYKRLVSLKAKYPHLKVTVAIGGWNEGSSKYSEMASKNETRAKFVQSVVQFLDTYNFDGLDLDWEYPTKRGGAPEDKANYVAMVKNLSRRTSYVAANNKLHLRLPEPEYSNYIIMRTINDATTLALEEKRIFDEMKRVTKENEVTPGDDPGLGMIRVLLGGAR-
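
The transcript and protein lengths listed above (771 bp and 256 aa) are consistent with 257 codons, the last of which is a stop. The following minimal Python sketch shows one way to check that relationship and to translate a coding sequence. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; the function names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is an assumption
    chosen to match the convention seen in the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS that ends in a single stop codon."""
    return transcript_bp // 3 - 1


# First six and last four codons of DPOGS206218-TA as listed above.
print(translate("ATGGTGGGTAGAAAGTGC"))
print(translate("GGAGCGCGTTGA"))
print(expected_protein_length(771))
```

Passing the full DPOGS206218-TA sequence to `translate` and comparing the result with the DPOGS206218-PA record would be a simple consistency check between the two sequences.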