Monarch geneset OGS2.0

DPOGS202378
Transcript        DPOGS202378-TA   1074 bp
Protein           DPOGS202378-PA   357 aa
Genomic position  DPSCF300104 + 324373-326902
RNAseq coverage   79x (Rank: top 64%)
Annotation
Heliconius       HMEL002904       5e-135  64.57%
Bombyx           (no hit listed)
Drosophila       (no hit listed)
EBI UniRef50     UniRef50_F1KZ98  9e-18   25.10%  Mucosa-associated lymphoid tissue lymphoma translocation protein 1 n=3 Tax=Chromadorea RepID=F1KZ98_ASCSU
NCBI RefSeq      XP_002630458.1   4e-18   25.55%  Hypothetical protein CBG11191 [Caenorhabditis briggsae]
NCBI nr blastp   gi|324507555     3e-17   25.10%  Mucosa-associated lymphoid tissue lymphoma translocation protein 1 [Ascaris suum]
NCBI nr blastx   gi|307185467     4e-18   25.40%  Mucosa-associated lymphoid tissue lymphoma translocation protein 1 [Camponotus floridanus]
Group
Gene Ontology    GO:0006508    5.9e-10  proteolysis
                 GO:0004197    5.9e-10  cysteine-type endopeptidase activity
KEGG pathway     cbr:CBG11191  1e-17
                 K07369 (MALT1) maps-> T cell receptor signaling pathway
                                       B cell receptor signaling pathway
InterPro domain  [82-247]  IPR011600  5.9e-10  Peptidase C14, caspase catalytic
Orthology group  MCL34441  Lepidoptera specific
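
The genomic position and transcript length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the interval covers the whole gene model; the record states neither, and the helper name parse_position is hypothetical.

```python
import re

def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its four parts."""
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("DPSCF300104 + 324373-326902")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
genomic_span = end - start + 1
transcript_length = 1074  # bp, from the Transcript field of this record

print(scaffold, strand, genomic_span)    # DPSCF300104 + 2530
print(genomic_span - transcript_length)  # 1456
```

Under those assumptions the gene model spans 2530 bp of the scaffold, of which 1456 bp are not part of the 1074 bp transcript.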
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202378-TA
ATGTCTCTTTTACAAGAATTAAGTTACAAAGAATACAAGGAATTATGCTCCTTAAGCAGTGAACTCTGTCAAGTGATAGCTAATTTGGCAAATTTAAATATTGTTTTCAATGATAAAAAGAATCCGGGACAGTTACTATCAAAATTTTTAGATCGCAAGGGCTGTTCCTTAACTCAGTACAAAAATTATCTAAACAAGGCACTTAACACTAAAACAATAATCACATATTACAAACCTCAAACTAAAGTGGCAATATTGTTAGCTAATAATAAATATGAACATTTGAGTAAATTAGTGACGCCATCTATCGATTGTGATTCTTTGGCGTCGAATTTAAAGAGGCTAGGTTTTATATCTATCGTTGTTATAAATACGAGAAGTAAGGACTGCAAGGATATTTTATCGAAAATATTTAATGTTATACCAGAGGATTCATATTGTTTTATTTTCTATGCCGGTCATGGTTGTGAGCTTTGTAATACTAAATGCATCTTAAGCGTGGATTGTCCGACGGAAGACATAGATTTGAATCATTGCGTCACAGAGAACTGGTTGTTAAGTGAAGTGGAGAAATGTAAACCGGAAATGTGTGTCTTAATCATGGACATGTGCAGAAAGAATTTGGAGAGAGAAACAAATCCCAAAATTTACTCAAACATATCAAATTTAGAGAATTATTCGATTCATAGAAACTTAATAATATGTTATTCCACTCAATCATCACAATCAGCATATGAATTGCTGCAAATCGAGCATTCGGAGAGCATCGACAATGATTTGACCTACGAGTTGAGGACGGGAGATACTGACAAGATACTGCCGTTTGGCAGTCAATATGTTAATGTGTTGTGTTCTAGAATCGGTGATGATTTTGACATAAGTACATTGTTGGACAAGGTTCATGAAGATGTTGAAAATTCATCAAAAAAGCAGATACCTATTAAAGTCCAATGTGGTGTATCTAAGAGATCTCTGTATGATCCGGTGAAAGGTGACATGAAGGCGCTTTTGGATAACCTAACAAAACTTTTACAAGAATATGTGGATAATGATTTATTATACTTTGTACATTAA

Protein sequence:

>DPOGS202378-PA
MSLLQELSYKEYKELCSLSSELCQVIANLANLNIVFNDKKNPGQLLSKFLDRKGCSLTQYKNYLNKALNTKTIITYYKPQTKVAILLANNKYEHLSKLVTPSIDCDSLASNLKRLGFISIVVINTRSKDCKDILSKIFNVIPEDSYCFIFYAGHGCELCNTKCILSVDCPTEDIDLNHCVTENWLLSEVEKCKPEMCVLIMDMCRKNLERETNPKIYSNISNLENYSIHRNLIICYSTQSSQSAYELLQIEHSESIDNDLTYELRTGDTDKILPFGSQYVNVLCSRIGDDFDISTLLDKVHEDVENSSKKQIPIKVQCGVSKRSLYDPVKGDMKALLDNLTKLLQEYVDNDLLYFVH-
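
The two sequences above can be checked against each other by translating the transcript with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the function name translate is hypothetical, the standard (NCBI table 1) code is assumed, and only the first and last codons of DPOGS202378-TA are used to keep the example short. The stop codon is written as "-" to match the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; stops become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)

# First six and last four codons of DPOGS202378-TA, copied from the record.
head = "ATGTCTCTTTTACAAGAA"
tail = "TTTGTACATTAA"

print(translate(head))  # MSLLQE
print(translate(tail))  # FVH-

# 1074 bp is 358 codons: 357 residues plus the terminal stop codon.
print(1074 // 3, 1074 % 3)  # 358 0
```

Passing the full nucleotide sequence to translate should give the 357-residue protein followed by "-", which is how the protein record ends.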