Monarch geneset OGS2.0

DPOGS212990
Transcript: DPOGS212990-TA | 843 bp
Protein: DPOGS212990-PA | 280 aa
Genomic position: DPSCF300024 - 867292-868274
RNAseq coverage: 187x (Rank: top 49%)
Annotation
Heliconius: HMEL007697 | 8e-124 | 74.72%
Bombyx: BGIBMGA006905-TA | 2e-118 | 77.20%
Drosophila: Fuca-PB | 2e-91 | 56.74%
EBI UniRef50: UniRef50_D6WGK4 | 2e-101 | 60.00% | Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WGK4_TRICA
NCBI RefSeq: XP_971511.1 | 4e-102 | 60.00% | PREDICTED: similar to AGAP007285-PA [Tribolium castaneum]
NCBI nr blastp: gi|270003719 | 7e-101 | 60.00% | hypothetical protein TcasGA2_TC002989 [Tribolium castaneum]
NCBI nr blastx: gi|270003719 | 2e-106 | 62.45% | hypothetical protein TcasGA2_TC002989 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005975 | 5e-157 | carbohydrate metabolic process
  GO:0004560 | 5e-157 | alpha-L-fucosidase activity
  GO:0043169 | 1.1e-90 | cation binding
  GO:0003824 | 1.1e-90 | catalytic activity
  GO:0006004 | 1.6e-47 | fucose metabolic process
KEGG pathway: tca:660163 | 1e-101
  K01206 (E3.2.1.51, FUCA) maps-> Other glycan degradation
InterPro domain:
  [1-280] | IPR000933 | 5e-157 | Glycoside hydrolase, family 29
  [34-279] | IPR013781 | 1.1e-90 | Glycoside hydrolase, subgroup, catalytic core
  [35-279] | IPR017853 | 9.3e-80 | Glycoside hydrolase, superfamily
  [90-105] | IPR016286 | 1.6e-47 | Glycoside hydrolase, family 29, bacteria/metazoa/fungi
Orthology group: MCL10808 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212990-TA
ATGAGATGGTTATTGCTGTTACCTGTACTAATCGCTACAACTTCCCCAATAAGTGCTAGTATAAGAAGCTGGGAAGAAGCGGTTCAAGATGTAAAAGGAAAAAAATATTCTCCAAAGTGGCCTAGCCTTGATAGTCGACCTTTGCCAGAATGGTTTGATCGTGCTAAAATCGGCATCTTCCTTCACTGGGGATTGTACTCAGTGCCGGGCTTTGGGTCAGAATGGTTTTGGAGTAATTGGAAAGGTGGAGACAATAAAACAGTCAAGTTCATGGAGGCAAACTATCCACCTGGTTTTTCTTATCAAGAATTTGCGCCAATGTTCAAGGCAGAATTTTTTGATCCTGAAAAATGGGCATCTTTATTCCAGAAAGCGGGAGCTAAATATGTAATACTGACTAGTAAGCATCACGAAGGCTATACTTTATTCCCATCAAAGAGATCATTCAGCTGGAATGCAAAAGAAGTAGGTCCAAAAAGAGATCTAGTCAAAGATGTAGCAAATGCTGTGAGGAATAAGAACATGAAGTTTGGCGTGTACCACTCCCTGTATGAATGGTTCAATCCCATATACATTGAAGACAAAAAGAATTTATTTACAACACGGAATTATGTCAATGACAAGCTGTGGCCGGATTTGAAACAACTTGTACATGATTACCACCCGTCTGTTATATGGTCTGACGGGGACTGGGAGGCGTTTGACGTTTATTGGAACTCCACTGCCTTCCTCGCATGGCTTTACAACGACAGTCCAGTTAAGGATACAGTTGTGGTTAACGATAGATGGGGTATCGGTATTCCATGCCACCATGGAGATTTCTACAACTGCGCTGACAGATAA

Protein sequence:

>DPOGS212990-PA
MRWLLLLPVLIATTSPISASIRSWEEAVQDVKGKKYSPKWPSLDSRPLPEWFDRAKIGIFLHWGLYSVPGFGSEWFWSNWKGGDNKTVKFMEANYPPGFSYQEFAPMFKAEFFDPEKWASLFQKAGAKYVILTSKHHEGYTLFPSKRSFSWNAKEVGPKRDLVKDVANAVRNKNMKFGVYHSLYEWFNPIYIEDKKNLFTTRNYVNDKLWPDLKQLVHDYHPSVIWSDGDWEAFDVYWNSTAFLAWLYNDSPVKDTVVVNDRWGIGIPCHHGDFYNCADR-
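
The lengths and sequence ends listed in this record can be cross-checked with a short script. The sketch below is illustrative only and is not part of the database entry. It assumes the standard nuclear genetic code, and the helper name translate is hypothetical. It translates the first 15 and last 18 nucleotides of DPOGS212990-TA and compares the 843 bp transcript length with the 280 aa protein plus one stop codon. The sketch writes the stop as "*", whereas the record writes it as "-".

```python
# Standard genetic code (assumed), built from the canonical TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Sequence ends copied from the DPOGS212990-TA entry above.
head = "ATGAGATGGTTATTG"      # first 15 nt
tail = "AACTGCGCTGACAGATAA"   # last 18 nt, including the stop codon

print(translate(head))  # MRWLL
print(translate(tail))  # NCADR*

# Lengths as stated in the record.
TRANSCRIPT_BP = 843
PROTEIN_AA = 280
assert TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)  # 280 codons plus one stop

# Inclusive length of the genomic interval 867292-868274.
GENOMIC_SPAN = 868274 - 867292 + 1
print(GENOMIC_SPAN, GENOMIC_SPAN - TRANSCRIPT_BP)  # 983 140
```

The genomic interval is 140 bp longer than the transcript. That would be consistent with intronic sequence, but the record does not state the exon structure, so this remains an inference.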