Monarch geneset OGS2.0

DPOGS213232
TranscriptDPOGS213232-TA2976 bp
ProteinDPOGS213232-PA991 aa
Genomic positionDPSCF300394 + 50853-63203
RNAseq coverage314x (Rank: top 36%)
Annotation
HeliconiusHMEL0169640.070.86% 
BombyxBGIBMGA002235-TA0.061.16% 
DrosophilaOga-PA0.046.26% 
EBI UniRef50UniRef50_D6X0A60.050.84%Putative uncharacterized protein n=5 Tax=Tribolium castaneum RepID=D6X0A6_TRICA
NCBI RefSeqXP_966927.20.051.85%PREDICTED: similar to CG5871 CG5871-PA [Tribolium castaneum]
NCBI nr blastpgi|3811450110.069.53%O-GlcNAc hydrolase [Ostrinia furnacalis]
NCBI nr blastxgi|3811450110.070.75%O-GlcNAc hydrolase [Ostrinia furnacalis]
Group
KEGG pathwaydme:Dmel_CG58710.0 
 K01197 (hya)maps-> Glycosaminoglycan degradation
InterPro domain[15-333] IPR0178534.9e-108Glycoside hydrolase, superfamily
[18-301] IPR0114961.4e-100Beta-N-acetylglucosaminidase
[765-986] IPR0161812.2e-06Acyl-CoA N-acyltransferase
Orthology groupMCL13394 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213232-TA
ATGTCTGAAAGTTCACCAGATGAGTTGAATTCCATGCGCAAGGATTTCATTTGCGGCGTCGTCGAAGGCTTTTACGGCCGGCCTTGGACGACGGAGCAGAGGAAAGATTTATTTCAAAAATTGAAAAAATGGGGACTAGACATGTACGTTTACGCTCCGAAAGATGACTACAAGCACAGAGCCTACTGGAGGGAGCTGTACACAGTGGAAGAAGCAGAACATCTGACGTCACTAATATCGGAAGCTAAGTCTCACGGTATTACTTTCTGTTATGCCCTGTCTCCCGGACTGGACATCACATACAGCAGCCAAAAGGAAATAACAACATTGAAACGGAAACTAGAGCAGGTATCTCAATTTGGGTGTACATGTTTTGCTCTGCTGTTTGACGACATCGAGCCGGAAATGAGTGAAGCTGACAAGCAAATATTCCAGAGTTTTGCACATGCTCAGGTATCTGTTACTAATGAGATCCACCAGCATCTCGGCAGCCCCAAGTTCCTCCTATGCCCGACTCAGTACTGCTCCACGAGGGCTGTACCAACTGTGCACACATCGGAGTATCTCAATACATTGGGCACTAAACTCTCCCAGGAAATTGACATCATGTGGACGGGACCCAAAGTTATAAGTAAGACTCTAACCACAGAGTGCATTGGAGAGATAACGCAAGTGCTACGGAGACCTCCTGTTATTTGGGACAATTTACACGCCAATGACTACGACCAGAAGAGGATATTCTTAGGTCCATACTGCGGCCGCTCCCCCGAGCTGATCCCTCTCCTCCGGGGAGTGTTATCTAACCCCAACTGCGAGTACAACGCCAACATGATACCGATATGGACCCTCGCACACTGGGCCAGGTGCAGCCTGGATGCACCAGCACATATGGAAGCTGTGTCGTGGGACATTAAGCTGGAACGTGAGAGCGAGCAAGGTATATGTGAAGATGAAGTGCCTCTCACTCTCGGTAAACACGTATACCACCACAGACAAGCGCTGAGACAAGCCATAAACGAGTGGCTCCCAGAGTTTTCTATACCCAAAACAGCCCAAGGTCCAGTCATTAAACCTCAACCGCAGGTCGCCGCTCCTCCGGTACCTATCCTGCCGATCCTGCCGTCGGTGAACACGTGTATGTCGCTGACGGCCACCACCACGACCAGCTCGCGCGCCCCGGACCTGCCCATACCCACCGTCACCACCAGTCAGCTGCAGGCGCTGGCTGACCGGCCAGCACTCGCCACCGCTGTGACGTCAATCGAGCCATTCAATCCTGTCCCCAACCCTGTGATGAATTCCTTAGTATCACCAACTAAGGTGATCCTTAACGAGTCGATCCCAAACCCCATCATACCCATGGCCAGTTCCATCGCTCTGCCGCCCGAGCTGCCGGTGTCCACGCTGCCGGTACCCATAATGGGCATCAAGGCGATCGATGGTGACAAGATCGATTCGGAAATGGATAAAATCGATATCAACGAATCGAATGATAGTCTACTGACGCAGAGCTTCATAGACGATATGAAGAAGGACAAAGAGGACGACGACACTATCATAGTTGATGATCTGGAGCAGTCTGAACAGCAGCGGAACGGTGACATGAGTGTTGGAGATACGCCCCAGACGTTGAGTCCCAGCCGCGTCCCCGAGGGTGTGGAACCCCTGGACGTGGACCCTCCATCAACCGCAGCTGATTCTGATGTTGTCATGAACGATCAGCTCAGCGAGAACGGTTCTATGCAAGTTGAGCCTAGCAGCAGTCCGTTGAGCGGGGACATGATAGTTGAACAAGCCGAGGCCATCGATGACAGTGACTCTCGGCTGTCTCAAGATGACCTGCTGTTGCTCTGCGAGCTGTTCTACCTACCGTTCAGCCACGGTGGTAGAGGTCTCAGACTGTTGCACGACTACCACTGGCTCACCACCCACGCCACCAGCTGCCTAGCAAGAGGGAATAAACCGGAACCCAGCGAGTGGCGCCGTCGTCTGCGTCGCTTCTCGTGGTGGTCGTGTCGCGCGCGCCGCTTGTCCCGCCGCCTGTCCTTGTGCGCCAACCGAGAGCTACACGCGGAGCTGCACCCCTACCTATGGGACCTGTGCGCTGTGCTCGCTCTACTACAGGCCTTCCTGCGGTGGCTGGGGTTCTCCAAAGGCTGGAGGGAAGCCTTCGAAAGCGGCACCCAAGAGCCGTGGGTGTTCCGAGGAGGTCTGACCGCGGACCTGCGGAGACTACTGCCCGTGGAGTGCAGCGGGGACGCTCTCAGACCCCAGTGTATACCCAACAGCCTGCCGCTCACCGTGAGACCCTACACACTCGCAGACGAAGACGCGGTGTGTAATTTGTGCCAGAAAACCTGCCGGGACGGTTTGGACTGCAGTCACTTGTTCCCTGGGGAATTAATGTCGCTCCCCGTGGACAGACTGATCGCTCCATACTTGACGCTGTCCCCCGAGCTGTGTATGGTGATAGAGGATGATGGTGACATCATCAATGATGATGATGATGACAAGCCGGGCATCAATAACAACGACGCTAAACCGGAAATAGTCGGTTACGTGTGCGCGGCCGTCAACAGCGTGGACTTCTATAGGAAACAGGAAATAGCCTGGATACCGGAAATGTGTCTCAAATATCCCAAGGAGTTACTCGACAAAGACGACTTGAGTGATGCGGCCAAGGACTGCATCCGCTACTTCCACTCGTACTCCGCGGAGTCCATCATCACTTCCTCTTCCGGCGTCTACTCCTCTCACCCGTCCTTGATATCGATGGCTGCTGTGCCACGGTCCGACCCGCTGGCAACCTCGCGCCTGCTCACGTGCCTACTGGCTGCTCTCAGGGCTTACGGTGTTAACGGCGTCCACACCTGTGTGGCCATCAACGACCAGCACCTGCTGCAGTTCTACAGCAAGTTCGGCTTCACGGAGCACTCGCGTAACGAGGTCCACGTGTTCATGGCTAAACTGTTCTAG

Protein sequence:

>DPOGS213232-PA
MSESSPDELNSMRKDFICGVVEGFYGRPWTTEQRKDLFQKLKKWGLDMYVYAPKDDYKHRAYWRELYTVEEAEHLTSLISEAKSHGITFCYALSPGLDITYSSQKEITTLKRKLEQVSQFGCTCFALLFDDIEPEMSEADKQIFQSFAHAQVSVTNEIHQHLGSPKFLLCPTQYCSTRAVPTVHTSEYLNTLGTKLSQEIDIMWTGPKVISKTLTTECIGEITQVLRRPPVIWDNLHANDYDQKRIFLGPYCGRSPELIPLLRGVLSNPNCEYNANMIPIWTLAHWARCSLDAPAHMEAVSWDIKLERESEQGICEDEVPLTLGKHVYHHRQALRQAINEWLPEFSIPKTAQGPVIKPQPQVAAPPVPILPILPSVNTCMSLTATTTTSSRAPDLPIPTVTTSQLQALADRPALATAVTSIEPFNPVPNPVMNSLVSPTKVILNESIPNPIIPMASSIALPPELPVSTLPVPIMGIKAIDGDKIDSEMDKIDINESNDSLLTQSFIDDMKKDKEDDDTIIVDDLEQSEQQRNGDMSVGDTPQTLSPSRVPEGVEPLDVDPPSTAADSDVVMNDQLSENGSMQVEPSSSPLSGDMIVEQAEAIDDSDSRLSQDDLLLLCELFYLPFSHGGRGLRLLHDYHWLTTHATSCLARGNKPEPSEWRRRLRRFSWWSCRARRLSRRLSLCANRELHAELHPYLWDLCAVLALLQAFLRWLGFSKGWREAFESGTQEPWVFRGGLTADLRRLLPVECSGDALRPQCIPNSLPLTVRPYTLADEDAVCNLCQKTCRDGLDCSHLFPGELMSLPVDRLIAPYLTLSPELCMVIEDDGDIINDDDDDKPGINNNDAKPEIVGYVCAAVNSVDFYRKQEIAWIPEMCLKYPKELLDKDDLSDAAKDCIRYFHSYSAESIITSSSGVYSSHPSLISMAAVPRSDPLATSRLLTCLLAALRAYGVNGVHTCVAINDQHLLQFYSKFGFTEHSRNEVHVFMAKLF-