Monarch geneset OGS2.0

DPOGS212093
TranscriptDPOGS212093-TA1509 bp
ProteinDPOGS212093-PA502 aa
Genomic positionDPSCF300038 - 792338-793846
RNAseq coverage146x (Rank: top 54%)
Annotation
HeliconiusHMEL0056051e-15754.89% 
BombyxBGIBMGA006726-TA5e-12445.88% 
DrosophilaDredd-PE1e-3025.59% 
EBI UniRef50UniRef50_F6K5S26e-12848.73%Caspase-6 n=1 Tax=Manduca sexta RepID=F6K5S2_MANSE
NCBI RefSeqNP_001108337.12e-12145.69%death related ced-3/Nedd2-like protein [Bombyx mori]
NCBI nr blastpgi|3333627512e-12748.73%caspase-6 [Manduca sexta]
NCBI nr blastxgi|3333627517e-13248.73%caspase-6 [Manduca sexta]
Group
Gene OntologyGO:00065082e-41proteolysis
GO:00041972e-41cysteine-type endopeptidase activity
GO:00082343.6e-39cysteine-type peptidase activity
GO:00069153.6e-39apoptosis
KEGG pathwayoaa:1000871053e-22 
 K02187 (CASP3)maps-> Colorectal cancer
    Amoebiasis
    Amyotrophic lateral sclerosis (ALS)
    MAPK signaling pathway
    Viral myocarditis
    Alzheimer's disease
    Apoptosis
    Huntington's disease
    Pathways in cancer
    Natural killer cell mediated cytotoxicity
    p53 signaling pathway
    Parkinson's disease
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[243-502] IPR0023982e-41Peptidase C14, caspase precursor p45
[255-500] IPR0159173.6e-39Peptidase C14, caspase precursor p45, core
[287-497] IPR0116002.1e-17Peptidase C14, caspase catalytic
Orthology groupMCL14751 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212093-TA
ATGGTTTCTTTGGTGTTTTTATTGTACGAAACCCCAGACACGGCTCTTCAGCGATTAATTGTTTATCAGAGGCTATCTAACGATGAAGCACGCCATAGTGTAAACTTGCTCTATGACTGGGCACTTCATGCACAATCACGCCAAACATGGAAGTATGAATTTCTGGAAGCTTTAACTATTTGTAGATTATACAATATTATAAGAAAACTTGGATTTGATGTCTCACAAATTAGGAAACACTACTTACCGGACAACCTCAATGTTAGCATTTATACAGATCCCATGAAGAAGGTTCTGTATAAACTTTGTGAAAGTTTAAATTCTGAAATATTTTCTAGATTGCAAAAGACACTTATATCATACAATTATAATGTATCTGAATATAGAACATGTGAATTAGTTTTACTGGAACTAATATCAAAAAGGTTTATTAAATTGGGATATTATGATAAAGACAAAAAATCTTACACCAATACCTATGATATTGAAGAGCTTGCGAAAATTATTGAAAAATTTTCAACGGTCAGTATATTAGCATCAGTCTTAAGGGAAATACAAGAGCAAATAAATTCACCAAACTTAAAAAATAAACCTCAGAACTTTACTAGTATAGAAACATACAACATTCCTGGACCATCAAAAATCAAAGAAGACAAATATCAAGAGGGCTTTGATGAGATATTTGAGCTGATGAACCAATGTCACAATGACGAAAATGAACCTGGGGCTAATTTCAAATCTGACACAATGCTATTAAAAGATGCATATGTTATAAAAAATAGAAAAAGAATCGGTGTGTGTTGCATAATAAACCAAGACAAGTTTTATCCTTCGAAACAAAGCCTACAAAACAATGAACACATTGATTTAGAAGACCGTTTGGGTTCTAGGTTGGATCTTATGGCTTTAGAGAGAACTATGACGTCATTGAATTTCAAAGTAAAAAGTAAATCTAACTTAAATCATGAAGAAGTTTTTCAATTTATAAAAGATGTCTTGAAATATCATGTCACAGCTGAAGATAGCGTTTTTATGCTGTGTATACTTTCTCATGGTGTAAGAGACCATGTTTATGCAGCAGACTCGGTAAGGGTGAAAATAGATGATATACAAAATCTATTAGACTCTGAACATGCCCATCATCTCCGTGGTATTCCAAAAGTACTCATCCTTCAAGCTTGTCAAGTAGAAACCGAGTCGGAGGTGAAAAATAAAATTGTTGCAGACAGTCCACCTTTAAACGATTTCTATTTAAAAAAATCACATTTCTTGGTCTATTGGGCTACTGCTCCTGAATATGAAGCGTTCAGAATAGAAGATAAAGGATCAATATTTATCCAATGTATCTGTGCACTTATAAAGAAAAGAGCCAAATACGAACATCTGATTGACATTTTTACTAAGGTGACCTATAATGTTACTGCTTTATGCACTAAACTACACAAACCTCAAGTACCACTATCTAAATCTACTTTGACAAAAAAATTGTACTTGAGTATTCCGGAATAA

Protein sequence:

>DPOGS212093-PA
MVSLVFLLYETPDTALQRLIVYQRLSNDEARHSVNLLYDWALHAQSRQTWKYEFLEALTICRLYNIIRKLGFDVSQIRKHYLPDNLNVSIYTDPMKKVLYKLCESLNSEIFSRLQKTLISYNYNVSEYRTCELVLLELISKRFIKLGYYDKDKKSYTNTYDIEELAKIIEKFSTVSILASVLREIQEQINSPNLKNKPQNFTSIETYNIPGPSKIKEDKYQEGFDEIFELMNQCHNDENEPGANFKSDTMLLKDAYVIKNRKRIGVCCIINQDKFYPSKQSLQNNEHIDLEDRLGSRLDLMALERTMTSLNFKVKSKSNLNHEEVFQFIKDVLKYHVTAEDSVFMLCILSHGVRDHVYAADSVRVKIDDIQNLLDSEHAHHLRGIPKVLILQACQVETESEVKNKIVADSPPLNDFYLKKSHFLVYWATAPEYEAFRIEDKGSIFIQCICALIKKRAKYEHLIDIFTKVTYNVTALCTKLHKPQVPLSKSTLTKKLYLSIPE-