Monarch geneset OGS2.0

DPOGS206423
TranscriptDPOGS206423-TA1914 bp
ProteinDPOGS206423-PA637 aa
Genomic positionDPSCF300181 + 128759-136540
RNAseq coverage184x (Rank: top 49%)
Annotation
HeliconiusHMEL0180814e-5437.82% 
BombyxBGIBMGA013822-TA5e-13556.20% 
Drosophila% 
EBI UniRef50UniRef50_A0JCZ23e-7645.00%Hyaluronidase, putative n=4 Tax=Glyptapanteles RepID=A0JCZ2_9HYME
NCBI RefSeqXP_972926.11e-7845.56%PREDICTED: similar to hyaluronidase [Tribolium castaneum]
NCBI nr blastpgi|910845372e-7745.56%PREDICTED: similar to hyaluronidase [Tribolium castaneum]
NCBI nr blastxgi|910845371e-8045.85%PREDICTED: similar to hyaluronidase [Tribolium castaneum]
Group
Gene OntologyGO:00081521.7e-88metabolic process
GO:00038241.7e-88catalytic activity
GO:00069525.1e-80defense response
GO:00059755.1e-80carbohydrate metabolic process
GO:00044155.1e-80hyalurononglucosaminidase activity
KEGG pathwaytca:6616853e-78 
 K01197 (hya)maps-> Glycosaminoglycan degradation
InterPro domain[42-340] IPR0178531.7e-92Glycoside hydrolase, superfamily
[40-339] IPR0137851.7e-88Aldolase-type TIM barrel
[1-465] IPR0181555.1e-80Hyaluronidase
[1-465] IPR0013295.1e-80Glycoside hydrolase, family 56, allergen Api/Dol m 2
Orthology groupMCL10654 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206423-TA
ATGTATTATATATTACTTATGTTTGTATGTTGTGCTGTGAAATCTGAAGTTGTAAGTAACTATTATGTAGTTGAAATGCCGGAGTCGGATACGCCACAGCAGAATGTTAAAGATATCAAAAAAGACTTTAGGGTCTACTGGAACGTTCCGACGATGCAGTGCACTTCCAAAAAAATACTATTCGAGAACTTGTACGAAAAATTCGGGATAATACAGAATGATGGAGACAGATTTAATGGAGAAAAAATCACTATTCTATATGAACCAGGAGATTTTCCGGCGATTTTTAAAAATGAATCCAGTGGAAAATACAGATTGAGAAACGGAGGTGTACCTCAGGAGGGCAGCTTGGAAGAACATATAGATGCTTTTAGAATTGATTTAAATCAAACCATACCCGATCCGAAATTTGACGGAATCGGTATAATTGACTTCGAGTCGTGGAGACCGGTTTTTCGACAAAATTTCGGAGTACTCGTTCCTTACAAGGATGTTTCAATCGAGATTGAAAAGCAATTGCACTGGTGGTGGCCAAAGACATGGATACAGGCACAGGCTACCCAAAGATTCGAGGCAGCAGCCAGAAGGTTTATGCAGACGACTCTATCGATAGCAAAGCAAATGCGACCCAAAGCCTTATGGGGCTACTACGGATTTCCACACTGTTTCAACATGGCCAGCAATAATATGAAGGAAACATGCGCGAAGAATGTTCCAGAAGAAAACGATAGCTCCGTCTCACTCTCCTCTACTCAGCTCTCTTCGCTTATTAATGGGAGAGTGAAAGAGAGCGTCAGAGTGAGATTCAAAAACACTCCAGTGTTGCCGTATTTCTGGTTTAGATACCGCGATGCTGGTTTTATGAAACAGGAAGACCTTTCCGTAGCTCTCAGCACACTGTACCAGTCGAAAGCATCTGGTTTAATAATATGGGGCAGCTCAAATGACGTGAATACTGTTGACAAATGTAAGAAACTTTACAACTACGTGGAGACCATCCTTGGACCGAAAATAGCGAAATATACAAAACAGAATGTGTTTAAAGATGAAATTAATAACGAACTTAATAATACATTAACAACCGTGGAACTTTCTACTACAGAAGTTCCTGAAAATACAACTATTTCTATGAAAATAGGACAAATAGATCCTGAATATGATTGGATTCCACCCAAAAACTACACTGAGGACATATCGCAGCAAGTCGATGAAGAACTAACCAAAAAAGGCTTCAACAGAACTGAAAATAACGAAGTGGACGTTTTAAGTTCAGGGGCTGGTATAGATTTTTTATATGATGCTCTGCTAAATGTTGAAAGCAATGGAGAAAACGAAGATATTGAGCAAACGACTAGAAGTGCTGATAGTGATGAAAGTTCGCAAAGCACCGCTGTTACTAACAATGGCTTAAAAGACGACATGTTTGATGTATCCGAAGATTACACTCAAGAGACATCAACGATATTAGTAGAAATCACAACTGACGATCAAAAAAATATTCCCTACAACCATTCTACAACAAATGTTATTGAATATACAGAGAATTATACAAACGGAGAAATAACCAATGAATATGAAACAACACAGCTTAACACGGATAAAGAATCTGAAAATTTTCAATCAACAGAGGATAGTTTCTACGATCTTAGTAACTTTTTCAGTTCCAGTGAAGAGACATCCGATTACTTGATCAAAGTTGAAAAAATTAACGTTACCAGCGAGGAAACGTCGAGTGATTATTCTTATAAAGAGAACTCGAGTGACTACAGTGATTATTTCGTAGTTCTAAGATATTATAATGTGAATAAAACTAAATCACAAAAAAGAATGGTTTTTCAACAAATAGACAGAGAAGACATCAGTGAAGTGACCGAAAATTCAGATCAGGTTGTGACATACGTTTACGGCAAATGA

Protein sequence:

>DPOGS206423-PA
MYYILLMFVCCAVKSEVVSNYYVVEMPESDTPQQNVKDIKKDFRVYWNVPTMQCTSKKILFENLYEKFGIIQNDGDRFNGEKITILYEPGDFPAIFKNESSGKYRLRNGGVPQEGSLEEHIDAFRIDLNQTIPDPKFDGIGIIDFESWRPVFRQNFGVLVPYKDVSIEIEKQLHWWWPKTWIQAQATQRFEAAARRFMQTTLSIAKQMRPKALWGYYGFPHCFNMASNNMKETCAKNVPEENDSSVSLSSTQLSSLINGRVKESVRVRFKNTPVLPYFWFRYRDAGFMKQEDLSVALSTLYQSKASGLIIWGSSNDVNTVDKCKKLYNYVETILGPKIAKYTKQNVFKDEINNELNNTLTTVELSTTEVPENTTISMKIGQIDPEYDWIPPKNYTEDISQQVDEELTKKGFNRTENNEVDVLSSGAGIDFLYDALLNVESNGENEDIEQTTRSADSDESSQSTAVTNNGLKDDMFDVSEDYTQETSTILVEITTDDQKNIPYNHSTTNVIEYTENYTNGEITNEYETTQLNTDKESENFQSTEDSFYDLSNFFSSSEETSDYLIKVEKINVTSEETSSDYSYKENSSDYSDYFVVLRYYNVNKTKSQKRMVFQQIDREDISEVTENSDQVVTYVYGK-