Monarch geneset OGS2.0

DPOGS204292
TranscriptDPOGS204292-TA3690 bp
ProteinDPOGS204292-PA1229 aa
Genomic positionDPSCF300046 + 296207-323840
RNAseq coverage732x (Rank: top 18%)
Annotation
HeliconiusHMEL0151910.075.95% 
BombyxBGIBMGA007570-TA0.058.81% 
DrosophilaSulf1-PA0.051.52% 
EBI UniRef50UniRef50_UPI00022479070.051.60%UPI0002247907 related cluster n=1 Tax=unknown RepID=UPI0002247907
NCBI RefSeqXP_001606010.10.051.60%PREDICTED: similar to CG6725-PA [Nasonia vitripennis]
NCBI nr blastpgi|3454956720.051.60%PREDICTED: extracellular sulfatase SULF-1 homolog [Nasonia vitripennis]
NCBI nr blastxgi|3454956720.044.55%PREDICTED: extracellular sulfatase SULF-1 homolog [Nasonia vitripennis]
Group
Gene OntologyGO:00081521.4e-100metabolic process
GO:00038241.4e-100catalytic activity
GO:00084847.2e-57sulfuric ester hydrolase activity
KEGG pathway 
InterPro domain[216-333] IPR0178491.4e-100Alkaline phosphatase-like, alpha/beta/alpha
[4-366] IPR0178504.7e-79Alkaline-phosphatase-like, core domain
[5-350] IPR0009177.2e-57Sulfatase
Orthology groupMCL10736 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204292-TA
ATGCCTCGTACTATGAAGGCAATCAGAAGTGCTGGCGCTGAGTTTCGGCACGCGTATACCACCACCCCAATGTGCTGCCCATCGAGGAGTTCCTTGCTGACTGGAGTGTATGTACACAATCACAATGTGTACACAAATAACGACAACTGTTCGTCACCGATGTGGCAGGCGAAGCATGAGACCAATACCTTCGCTACTTACCTTTCAAACGCTGGATATCGTACGGGTTATTTTGGCAAGTACTTGAACAAATACAACGGTTCATACATACCTCCGGGCTGGCGTGAATGGGGAGGCCTCATAATGAATTCAAAATACTACAATTACAGCGTTAACATGAATGGAAAGAGAATAAAACACGGAGATGATTATAATAAAGATTATTATCCGGATCTAATAGCGAATGATTCGATAGCGTTCTTGCGTGCTTCAAAGCGAAGATTTTCAAGAAAACCGGTCCTCCTCGTGATGTCTTTCCCCGCACCTCATGGACCCGAGGATTCAGCTCCGCAGTACTCTCATCTCTTCTTTAATGTTACAACCCATCACACACCAACTTACGATATGGCGCCAAATCCAGATAAACAATGGATCCTGCGAGTGACAGAGAAAATGAAACCTATTCATAGACAGTTCACGGACCTGTTAATGACAAAGCGTTTGCAGACTTTGCAAAGTGTTGATGTGGCTGTGGAACGAGTGTACCAGGAGCTTAAGGCTCTCGGGGAGTTAGATAACACCTATCTGGTGTACACATCAGATCACGGATACCACCTTGGACAGTTCGGACTGGTTAAGGGCAAGAGCTTTCCCTTCGAATTCGATATAAGAGTGCCGTTTTTAGTACGCGGCCCGGGAGTCGAACCTGGAACTGTCGTGGACGATATAATTCTCAACATCGATCTGGCGCCCACATTTCTGGATATGGGAGGAGTTCAGCCCCCGCCTCATATGGACGGCAGGTCGCTGCTGCCGCTGCTGCAGCCACGGAGGCGACGAGCGACAGCACATTGGCCAGATACATTCCTAGTCGAGAGCTCTGGACGTCGCGAGACCCAAGCTCATTTAATGGAAGAACGTTTGCGAGCACAAAAATACAGTAAAGAAATGAATGCAAGAACAACGACTATTATGCCGCTACAGTCGTCGTCCGAGAGCGGAGACTTCGAGGACGAGTCTGACGATGACTTCCTGGAACTTGATGATATTATGCCCCTACAGTCGTCGTCCGAGAGCGGAGACTTCGAGGATGAGTCTGATGATGACTTCCTGGAACTTGATGATGACGAAGATGATGAGGACAATGAGAGCACTGAGGATACATCGAACAAATCAAATCAACCTCTCATATCAAATGAAAGTCACAATCCCATACTGGAGGCGAGTCTCGATAAGATTCTTGGAGGTGACGGTGCTGTCAATAATCAATATAATTACCTCAGCCAATCAGAAATGGATGTCATTAATGGGAAGGCAGCACGTATAGCGGCTGAATGTTCCAAAGCTGAACTCCGGGCTCCCTGCTCCGTCGGACGGAAGTGGAAATGTGTGCTTGTTAATGGACGATGGAGGAAACACAAATGTAAATATGAGGATATAACTATTCCACAACCGAAAATGAGCACAAAGAAATGTGCTTGTTTCACTCCAAGTGGCCTTGTTTATACAAGACTGGAAACAGATGGTACAATCGCTAGACGACCCGCAGATTTACAGAAAGATAATAACACAAGATCACGGAGGTCTACAGATAATGATGTATTTGAACCGAACACTGTGGACACAATTCTTGAGGAAAATCCTAGTATTGGACATCTAAGTTTTAACAATGAGCCTATTGATGAAATAGAGAAGAGGAACATTGAAAACAAAGTCGATAAACTCATTAGGGAAACTGAAGCTTTCCTCGAGGCGTACGAACGAACCAAAGATAATATAGATCATAAGAGAAGTAAGAGGCGTGCTCAGCATTGGGGTCACAAACACAAACCACACAAAAACGACCCATTGTTGAACATGAATGAATCGTCTCTAGAATGTAAGATAGATAAAGACGGCACTGTTAATTGTTCGCAAGTTATATACAATGATTTGAAAGCTTGGCACACCAACAGACTGAGTCTAGAAGACCAAATAAGAGAATTGAAAACAAAGTTGGAAGACTTAAAAGAAATTAAGAGGCATTTAAAAATAAGCAAACCTGTTGTCGAAGTACAAACGGTAACGCCATCGTACGTCAACACGCATTTACACAATAAAACACAAACACCTGATAGCACGAAGGACAGCTTTAGGAGATCACGTTTCCATAGAATTAAAACCAAGCACAGGAACAGCACAGTGATTGATAAAAAATTCAGACAACTCAACGAATACATCCTACCGACTGTGAACGGTCACACCAGAGACGACATATTTAACACTCAACTCAGGAACGAAACGTCCACAGAAGCAGTCGTAAAACAATTGAGTACAATTGATCTGGTCGAAATTGATTCCAATCAGACATACGTTTTAGGAAAATTACCAAAACCACAAGTAACTACAATCATAACCGAAAGTAACTTCTATGATCAGAATTTTGCAGCGGAGAAGAGCACGCAGCAAGCAATTACCACGTCGACTGATGATACAGCGACTATATACAGCGATATTTCCGGGATAAATAACACATCCAGTAAAACACCACAAACGGAAAAACCGACGAGCCCGAGTCAAGAAACTTCCACGGACATTCTGTCCACATTGCAGTATTACAGTTCCGAAGCTAATAAAGTAATATTGACAATGACAACGACACCAACTCCTGTGACAAGACGGACAACAGCATCTCATCAAACATATAACCGGACATACCACACAAAACCATCGAACAGACCAAAGTCATCGTCTCTAGGACCAACAAGATTCGATGCGTCGGAATATGAACAAAGAAATCCTAATAAAGGAAATTCTAACAATCACGGAGTATTCAGCAAGCCGATGGACGTGTTCCAAAGAAGATTACATCCTTTGTTTATAGAGAATGAGGATAAACATGTCTGTTACTGTGAAGAGAGTCGCAAAATGAAACCAGTAGGTAACTCGTATTTGGAAGCCACTCAAAGAGCCAGAGAGGAACGAAGGAAATTGAAAGAACAGAGATTGAGAAAGAAGCTTAGGAAAGCGAAGAAGAAGGCGGAATTGGAAAGGTTATGTGAATCAGAGCGTATGAATTGCTTCCGACACGACAATGACCATTGGCGCACAGCCCCGCTATGGACCGCCGGACCTTTCTGTTTCTGTATGAGCGCCTCAAACAATACATACAATTGTGTGAGAACTATTAACTCGACCCACAACCTGCTCTACTGTGAGTTCGTCACTGGTTTGATAACGTACTACAATCTGCGTATAGATCCGTTTGAAACACAAAACAGAGTTAAATATTTATCGTCAGCTGAAAAGGAATATTTCCACAATCAGTTGCAACAGCTTTTGACATGTCGGGGACCGTCGTGTAGAAGATTCTCGCATTCAAATGTTGGAGGTATTAAAGATGATGTCAGCAGACGGACTGAAGATGACCAACTCATGTATAGAGGGGAGCCAATTGGTTACAGTGAAAGGGCATGGCGATGGAGTGGCTATGGTCGTAGATATGCAAGAGCCAGAGAGTTGCACCGGCGTCGACATACCGCGGCCTTCTAG

Protein sequence:

>DPOGS204292-PA
MPRTMKAIRSAGAEFRHAYTTTPMCCPSRSSLLTGVYVHNHNVYTNNDNCSSPMWQAKHETNTFATYLSNAGYRTGYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSVNMNGKRIKHGDDYNKDYYPDLIANDSIAFLRASKRRFSRKPVLLVMSFPAPHGPEDSAPQYSHLFFNVTTHHTPTYDMAPNPDKQWILRVTEKMKPIHRQFTDLLMTKRLQTLQSVDVAVERVYQELKALGELDNTYLVYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLVRGPGVEPGTVVDDIILNIDLAPTFLDMGGVQPPPHMDGRSLLPLLQPRRRRATAHWPDTFLVESSGRRETQAHLMEERLRAQKYSKEMNARTTTIMPLQSSSESGDFEDESDDDFLELDDIMPLQSSSESGDFEDESDDDFLELDDDEDDEDNESTEDTSNKSNQPLISNESHNPILEASLDKILGGDGAVNNQYNYLSQSEMDVINGKAARIAAECSKAELRAPCSVGRKWKCVLVNGRWRKHKCKYEDITIPQPKMSTKKCACFTPSGLVYTRLETDGTIARRPADLQKDNNTRSRRSTDNDVFEPNTVDTILEENPSIGHLSFNNEPIDEIEKRNIENKVDKLIRETEAFLEAYERTKDNIDHKRSKRRAQHWGHKHKPHKNDPLLNMNESSLECKIDKDGTVNCSQVIYNDLKAWHTNRLSLEDQIRELKTKLEDLKEIKRHLKISKPVVEVQTVTPSYVNTHLHNKTQTPDSTKDSFRRSRFHRIKTKHRNSTVIDKKFRQLNEYILPTVNGHTRDDIFNTQLRNETSTEAVVKQLSTIDLVEIDSNQTYVLGKLPKPQVTTIITESNFYDQNFAAEKSTQQAITTSTDDTATIYSDISGINNTSSKTPQTEKPTSPSQETSTDILSTLQYYSSEANKVILTMTTTPTPVTRRTTASHQTYNRTYHTKPSNRPKSSSLGPTRFDASEYEQRNPNKGNSNNHGVFSKPMDVFQRRLHPLFIENEDKHVCYCEESRKMKPVGNSYLEATQRAREERRKLKEQRLRKKLRKAKKKAELERLCESERMNCFRHDNDHWRTAPLWTAGPFCFCMSASNNTYNCVRTINSTHNLLYCEFVTGLITYYNLRIDPFETQNRVKYLSSAEKEYFHNQLQQLLTCRGPSCRRFSHSNVGGIKDDVSRRTEDDQLMYRGEPIGYSERAWRWSGYGRRYARARELHRRRHTAAF-