Monarch geneset OGS2.0

DPOGS202036
Transcript: DPOGS202036-TA, 1233 bp
Protein: DPOGS202036-PA, 344 aa
Genomic position: DPSCF300053 - 102654-104639
RNAseq coverage: 227x (Rank: top 44%)
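
The genomic position above can be turned into a span length with a few lines of Python. This is a minimal sketch: the function names `parse_position` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split a 'SCAFFOLD - START-END' string into (scaffold, start, end)."""
    match = re.fullmatch(r"\s*(\S+)\s+-\s+(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_position("DPSCF300053 - 102654-104639")
print(scaffold, span_length(start, end))  # DPSCF300053 1986
```

Under that assumption the gene spans 1986 bp on the scaffold, which is longer than the 1233 bp transcript. That would be consistent with intronic sequence, although the record itself does not list the exon structure.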
Annotation
Heliconius      HMEL009940        3e-159  73.35%
Bombyx          BGIBMGA000866-TA  6e-108  69.69%
Drosophila      CG1827-PC         4e-105  59.62%
EBI UniRef50    UniRef50_O02467   1e-125  65.94%  N(4)-(Beta-N-acetylglucosaminyl)-L-asparaginase (Fragment) n=37 Tax=Eumetazoa RepID=ASPG_SPOFR
NCBI RefSeq     NP_001037686.1    2e-140  66.57%  aspartylglucosaminidase [Bombyx mori]
NCBI nr blastp  gi|112982715      3e-139  66.57%  aspartylglucosaminidase [Bombyx mori]
NCBI nr blastx  gi|112982715      1e-135  67.86%  aspartylglucosaminidase [Bombyx mori]
Group
Gene Ontology    GO:0016787  5.5e-149  hydrolase activity
KEGG pathway     cqu:CpipJ_CPIJ012618  1e-109
                 K01444 (E3.5.1.26) maps-> Lysosome; Other glycan degradation
InterPro domain  [1-320]  IPR000246  5.5e-149  Peptidase T2, asparaginase 2
Orthology group  MCL13364  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202036-TA
ATGTTGAAAGTATTTATTTTTATTGCATTCACTTCTTTGGTCGATAGTGATGCAAAACTGCCGGTCGTAGTCACAACATGGAATTTCGTAAATGCGACGGTAACAGCTTGGGATATCCTTAAAAAAACTGGCTTTGCACTGGATGCTATTGAAAAAGGAACTTCAGTATGTGAAAAACAACAATGTGATGGAACTGTGGGGTATGGAGGAAGCCCAGATGAAGAATCTGAAACTACCCTGGATGCTCTTATTATGGACGGGTGCACTATGAATGTGGGTGCTGTTGGTGCACTCAGAAGAGTGAAACATGCCGCTTCAGTCGCGAGACATGTATTGGAACACACAAAACACTCTATTCTTGTTGGAGAGATGGCAACAAATTTTGCTAAACAAATGGGTTTTAAAGAGGAATCTTTAACAACTACAACATCAAAGAGGATGTGGTTAAAGTGGCACTACAGAGACCAGTGCCAACCAAATTTTTGGATGAATGTGGAGCCTGATCCGACCAAGTTTTGTGGACCTTATAAAAAAATAGACAACATTGTCAAGAGGAGTAAACACACAATTCCTATGAAAGTGAGCAGATTCAACCATGACACAATTGGAATGATAGCTGTGGATAAGCAGGGAAATGTAGCTGCTGGTACCTCAACAAATGGGGCAAAATTTAAAATACCTGGGAGAATTGGCGACTCCCCAATCCCTGGTTCAGGAGCTTATGCAGATAATGCCGTTGGTGGGGCAACAGCAACTGGTGATGGGGATATTATGCTCAGGTTTTTGCCGAGTTTTCTAGCAGTGGAGGAAATGCGACGCGGTGCTTCACCAACGGATGCCGCCAGAACTGCTGTTAATAGAATAGCAGAACATTACCCGGACTTCATGGGTGCTGTTATAGCTTTGAGAAACGACGGCGAATATGGTGCAGCTTGTCACGGCTTAGGCGACGAACCTTTTCCATTTGTTGTCAAGGACATAACTATGACGAAATTTAAAATTGAAAAAATTAATTGTTCTTGGCCCATTCGTTAAATATGTACCTACCTATATAATGAGATCCAAGACTTTCGCGAGTATTCATATAAATGAAACTAATATTTTTATTAACTATTTTATTATTTTTAAACCTATCTGCAAAGTAACGGAAACGTCGGTAGTATGTAGTTCTAAAAATAATAAAATAGCGCGTAGTAATCCGAAAATATTAGTTTCATTTACCTACCTATATAA

Protein sequence:

>DPOGS202036-PA
MLKVFIFIAFTSLVDSDAKLPVVVTTWNFVNATVTAWDILKKTGFALDAIEKGTSVCEKQQCDGTVGYGGSPDEESETTLDALIMDGCTMNVGAVGALRRVKHAASVARHVLEHTKHSILVGEMATNFAKQMGFKEESLTTTTSKRMWLKWHYRDQCQPNFWMNVEPDPTKFCGPYKKIDNIVKRSKHTIPMKVSRFNHDTIGMIAVDKQGNVAAGTSTNGAKFKIPGRIGDSPIPGSGAYADNAVGGATATGDGDIMLRFLPSFLAVEEMRRGASPTDAARTAVNRIAEHYPDFMGAVIALRNDGEYGAACHGLGDEPFPFVVKDITMTKFKIEKINCSWPIR-
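
The relationship between the two sequences above can be checked by translating the transcript in frame from its first base. The sketch below assumes the standard genetic code and that the coding region starts at position 1 of the transcript; `translate_cds` is a hypothetical helper, not part of any database tooling. It writes the stop codon as `-`, matching the trailing character of the protein record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(nucleotides):
    """Translate from the first base up to and including the first stop codon.

    The stop codon is written as '-'; codons containing characters
    other than A, C, G, T are written as 'X'.
    """
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            protein.append("-")
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of DPOGS202036-TA, copied from the record above.
print(translate_cds("ATGTTGAAAGTATTTATT"))  # MLKVFI
```

Applied to the full transcript, the translation should stop at the first in-frame stop codon, leaving the remaining bases as untranslated 3' sequence. A 344 aa protein plus a stop codon accounts for 1035 of the 1233 bases under these assumptions.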