Monarch geneset OGS2.0

DPOGS207163
TranscriptDPOGS207163-TA1197 bp
ProteinDPOGS207163-PA398 aa
Genomic positionDPSCF300001 + 4530535-4531803
RNAseq coverage146x (Rank: top 54%)
Annotation
HeliconiusHMEL0092510.070.85% 
BombyxBGIBMGA000620-TA1e-17366.51% 
Drosophila% 
EBI UniRef50UniRef50_Q8T1082e-17166.51%Heparanase-like protein n=2 Tax=Obtectomera RepID=Q8T108_BOMMO
NCBI RefSeqNP_001108471.13e-17266.51%heparanase-like protein [Bombyx mori]
NCBI nr blastpgi|1692347006e-17166.51%heparanase-like protein [Bombyx mori]
NCBI nr blastxgi|1692347003e-17266.51%heparanase-like protein [Bombyx mori]
Group
Gene OntologyGO:00160201.8e-80membrane
GO:00167981.8e-80hydrolase activity, acting on glycosyl bonds
GO:00431699.1e-08cation binding
GO:00059759.1e-08carbohydrate metabolic process
GO:00038249.1e-08catalytic activity
KEGG pathwaytca:6596392e-70 
 K07964 (HPSE)maps-> Glycosaminoglycan degradation
InterPro domain[55-398] IPR0051991.8e-80Glycoside hydrolase, family 79
[51-291] IPR0178536.1e-31Glycoside hydrolase, superfamily
[101-287] IPR0137819.1e-08Glycoside hydrolase, subgroup, catalytic core
Orthology groupMCL16821 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207163-TA
ATGTCTGATAGATTAATTTTCAGTGCTGAAGACTTACCTTCAGTCTCGTGTGATCACTGTCTGGCATCAAGTCACAATGAAACAGCCTGCGTTGCTCTTAAAAAATTATGTAAGAATAAGTTTTTGCCATTCTTTCTAATGACCGGCCGTAAATGGACAGAAATAAATGAATTTTGTCAAGCAACAAACACAAAGTTATTATTTACATTAAATTTACTGCTTCGCGATAGTCATGAATGGAATTCTCAAAATGCTGTTGAATTGATAAAATATTCCAAACAGAAGAAGTTTGATATTGACTGGCAACTTGGAAATGAGCCTAACTCTTTCAGACATGTATTCAATTTGACTGTTACCCCTCAAGAATTAGCTCATGACTTCAAAAAGCTTCGGAATCTTCTAAATCATCATGGATATAAAAAATCATTATTAGTAGGGCCTGACACCACTAGGCCCCAAGAACATCAACCAAACTGTCTGAAATATATGGTGGAATTCCTAGGCAATGGTTCACATTTTGTAAATGCTAGATCATGGCATCAGTACTACCTGAATAGTAGAACTGCTAAGTTACAAGATTTTTGGAATCCTGAAACACTTGACTTGCTTAAAGAACAAATCGAAACTATGCAAAATCACACCAAGAAATATCACAATATACCCATGTGGCTCAGTGAAACTAGTACTTCTTATGGCGGTGGGGCCCCTGGTTTGTCCAACACATATGCTGGTACTCCTCTATGGGTAGATAAGCTGGGCCTGTCTGCTAAATATAACATTTCCACTGTCATAAGGCAAAGCTTTTATGGAGGAAACTACAGCCTTGTAAATGAAGAACTCGAACCTCTTCCTGATTGGTGGAGAGTTTATGTTCATTGTGCTAATAAGAATTATACAAATGATTCAAGTGCAATAACAGTTTACGCCATTAATTTAGAAATGGAAAAAGTCCAATTTCTTCTCAATGGCACTGCCTTACATGGTGATAATATAATAATTGATGAATTCATAATAAGTGCTCCTTCAAATAACAGGCGAACAAAAACCATACTTTTAAATGGCTGGCCACTGCATTATGAGTCAGCTAGTCTTGACCTGCAACCCAATCACAAGAAATATAATAACCGTATATCTATGCCGCCATATTCCATAGGATTTTGGGTTATTAAAAATACGTCAATTAAAATATGTAAATGA

Protein sequence:

>DPOGS207163-PA
MSDRLIFSAEDLPSVSCDHCLASSHNETACVALKKLCKNKFLPFFLMTGRKWTEINEFCQATNTKLLFTLNLLLRDSHEWNSQNAVELIKYSKQKKFDIDWQLGNEPNSFRHVFNLTVTPQELAHDFKKLRNLLNHHGYKKSLLVGPDTTRPQEHQPNCLKYMVEFLGNGSHFVNARSWHQYYLNSRTAKLQDFWNPETLDLLKEQIETMQNHTKKYHNIPMWLSETSTSYGGGAPGLSNTYAGTPLWVDKLGLSAKYNISTVIRQSFYGGNYSLVNEELEPLPDWWRVYVHCANKNYTNDSSAITVYAINLEMEKVQFLLNGTALHGDNIIIDEFIISAPSNNRRTKTILLNGWPLHYESASLDLQPNHKKYNNRISMPPYSIGFWVIKNTSIKICK-