Monarch geneset OGS2.0

DPOGS210478
TranscriptDPOGS210478-TA1224 bp
ProteinDPOGS210478-PA407 aa
Genomic positionDPSCF300062 + 508899-510122
RNAseq coverage263x (Rank: top 41%)
Annotation
HeliconiusHMEL0077530.075.62% 
BombyxBGIBMGA002769-TA7e-17669.63% 
DrosophilaCG2061-PC3e-9242.96% 
EBI UniRef50UniRef50_D6WR422e-11449.26%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WR42_TRICA
NCBI RefSeqXP_975458.24e-11549.26%PREDICTED: similar to Lancl1 protein [Tribolium castaneum]
NCBI nr blastpgi|1892394848e-11449.26%PREDICTED: similar to Lancl1 protein [Tribolium castaneum]
NCBI nr blastxgi|1892394848e-11449.26%PREDICTED: similar to Lancl1 protein [Tribolium castaneum]
Group
Gene OntologyGO:00038245.3e-09catalytic activity
KEGG pathway 
InterPro domain[48-399] IPR0078221.8e-73Lanthionine synthetase C-like
[1-13] IPR0204647.5e-32LanC-like protein, eukaryotic
[235-391] IPR0089285.3e-09Six-hairpin glycosidase-like
Orthology groupMCL17084 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210478-TA
ATGGTGCGATATTTTCCTAATCCTTACGACGACTATAAGTCGGGAATTAATTTGAATATTGATAAGGACGAGGTGTTGTCCCAGATAAATGAACATATTAAGAATATTACCAAGCGCATTCAGCCAAACCGAAAAAATGTTGAAGGTGGCTTATATGTCGGCATCACGGGCATATCGTATATGTTTTATTACTTATCTCGAAATCCCTTACTTTCGGAGATGAAGTCTGAGTATATTGCAAAAGGACTCGACTATCTTGCGCCGGCATTAGAGACGTCAGCCGGCGATAAAACCTCATACCTTCTTGGCGACGCCGGCACCTTCGCACTGGCGACTGTTTTAAAAAAAGAAATGGGGGATGAGGAGTTCACCCATAGCTTAAAAACATATAAATCTTTGTACAATTATTATTTAAACCCAAAGTTTTTAAAGTGCGGAGGGGACGAGTTTTTTGTTGGTCGTGCCGGTTATCTTGCCGGAGCTTTGTGGATAAGCCGTGAATTGAAAACTGAAATATTCACTCCGGAGGAATTGTATAAAATTTGCGACATAATTGTTGCATCTGGCCGACAATTTGCATCGGCACACAATAGTCCTTCTCCATTAATGTATCATTATTACAATACTAAATATTTGGGAGCAGCTCATGGTAGCAGTTTCATTCTGCAAATGCTCTTATCTGTACCTGGTTATTTGGAATACAATAAATCTGCAGCCCAGGACATAAAGAGCACAGTTGAGTTCATAGCATCTTTACAAACTGAGGAAGGCAATTGGCCATGTTGCATGGAGGAGGTTGGTTTATCTGATCATAAGTTAGTCCACTGGTGTCATGGAGCTCCTGGGACTGTTTATTTGATGGCCAAAGCATACCTGGTGTTCAAAGACCAGAAGTATTACAATGCCTGTGTAAGGGCTGCAGAACTTGTATGGTCTAAAGGTCTCCTCCGTAAAGGTCCCGGTTTATGCCACGGTGTGGCCGGTAATGGTTATGTTTTCTTGCTTCTTCACAGATTATCAGGTGATGAGAAATATCTTTATAGAGCTAAGCTGTTTGCTGATTTTATGAATACTGAGGACTTCTTGCGGGATGCCCGACTGCCTGATAATCCAGAGAGTTTATATGAGGGTACGGCAGGAACAGTCTGCTTTTTATCGGACCTCCTTGTTCCAGACAAGGCAGAGTTCCCTTTCCAGGATGTATTCTCTACTTATGTTTATTAA

Protein sequence:

>DPOGS210478-PA
MVRYFPNPYDDYKSGINLNIDKDEVLSQINEHIKNITKRIQPNRKNVEGGLYVGITGISYMFYYLSRNPLLSEMKSEYIAKGLDYLAPALETSAGDKTSYLLGDAGTFALATVLKKEMGDEEFTHSLKTYKSLYNYYLNPKFLKCGGDEFFVGRAGYLAGALWISRELKTEIFTPEELYKICDIIVASGRQFASAHNSPSPLMYHYYNTKYLGAAHGSSFILQMLLSVPGYLEYNKSAAQDIKSTVEFIASLQTEEGNWPCCMEEVGLSDHKLVHWCHGAPGTVYLMAKAYLVFKDQKYYNACVRAAELVWSKGLLRKGPGLCHGVAGNGYVFLLLHRLSGDEKYLYRAKLFADFMNTEDFLRDARLPDNPESLYEGTAGTVCFLSDLLVPDKAEFPFQDVFSTYVY-