Monarch geneset OGS2.0

DPOGS210848
Transcript: DPOGS210848-TA (1947 bp)
Protein: DPOGS210848-PA (648 aa)
Genomic position: DPSCF300027 + 480141-486641
RNAseq coverage: 2549x (Rank: top 5%)
Annotation
Heliconius: HMEL012762 | 3e-152 | 64.72%
Bombyx: BGIBMGA006976-TA | 5e-174 | 55.04%
Drosophila: stv-PE | 5e-28 | 56.52%
EBI UniRef50: UniRef50_Q9BLJ6 | 4e-163 | 54.20% | BAG domain-containing protein Samui n=2 Tax=Obtectomera RepID=BAGS_BOMMO
NCBI RefSeq: NP_001036843.1 | 7e-164 | 54.20% | BAG domain-containing protein Samui [Bombyx mori]
NCBI nr blastp: gi|112983960 | 1e-162 | 54.20% | BAG domain-containing protein Samui [Bombyx mori]
NCBI nr blastx: gi|112983960 | 0.0 | 56.88% | BAG domain-containing protein Samui [Bombyx mori]
Group
Gene Ontology: GO:0006915 | 1e-22 | apoptosis
KEGG pathway:
InterPro domain: [390-463] IPR003103 | 1e-22 | Apoptosis regulator, Bcl-2 protein, BAG
Orthology group: MCL25292 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210848-TA
ATGGAGTCACCCGTAGTTTTGGATAAGCCGCCCGAATATCGTTATGAGCGCGGTTTTCCGTTTGATGAGGAGGAAGGCAGTGGAGCGTGGAGCGAGCTCGCGTCCCGGCATCCGGATATAGCGGCGCGCCTGAGACAGCGGCCGGCCACGTGGGCCAGGAAACGAAGACCGTCTAGTCAGGACGGCATTGATGGTGTTTTAAAAAAACGTTTCGTTCTTGCAGATACAGCCGACGGCTTCGGTGGATTTGACAGATTCCCGTTTGACGATATACCGCCAGAGTTTAGAGAACACTTCCCTTCACATTGGAACAGAAGATTTGGTTCTAGAGACGAGCAGTTTCAGCCAGAACCACAAACTCATCAGCCACAATCACCGTCGCAACAGACAGCGGCCACACAGACTGAACAAGACGTAGGATCTCCAAAAGAGGAACAGGTTCCCTTGCCACAGTACGGTTTAAGAAATACTGTCGATCTAGGACAGAAGAGTGCGACTGATCCCAGTGTAGTTGATGCAGACGAGAGGAATCAGAGGTCAATGTCAGCACCTCCCGATCATCGCACTGTAAATCAGAATAGCCACAAGATGAGCGGTCAAAATCAACAGGATCAACATCATCCACAGTCTGAGCAGTCCTCTAATGTTAGACACATACCAATATTTGTAGAGGGTAGGGATGAACCAGTAATAAATAAAACTGTAGATCATGGAACTCATTTCGCTGATACAAAGCCTGCATACGTGCCACCCCCACAGCCGCATATCGATAGAGAAAACTATTTTGCAGATGATGGACCCGTAGGGTTCCCATTCGCAAAGAGCTTTGACAGACCTTTCTTTAGACAATCTGCTCAACCGTTCTCGAAACAGAAGATGTACCCTCAGTCTGCATTCGCCCGCGGCGCTTCGCCCCAGAGATCCCAGTCACCGAAACCACAGAATTATCAGGAGCAGACTTCTCCCAAAACTGAGACTCCGTCCAGGCAACAACAAAAACCATCTCCGCAGCAGAGGCAGGCTCCTCAACAGAAGGCGACACCATCTCAGCAACGCGCCACTTCACCACAGGCTCCCCCTCCCTGTAAAGAAAATCAACCACCTCCCCAGTGCAACCAAACACAACCGCCTCCCCAACAAGCTCATCCAGCCAATGACCCCATTTCCCAGATTCTCAGTATTCAAACTGACGTCTTGAACCTAATGACAGAAGTGGAAAACTTTAAAGGGGCCAAAAACGACAAACAATACCTATTCCTAGACGAGATGTTAACTAGGAATCTAATCAAATTAGATAATATTGAAACAGAAGGAAAAGAAAACATAAGGCTGGCTAGGAAAGAGGCGATCAAATGTATACAGAAATGCATAGCTGTCCTCGAAGCGAAGGCCGAGAGCAATGCTGCCGCTGCTAAAGCTGCCGCGCAACCTCAAGATGTTGAAATGAATAATCCTGAAAAACCAAATGTTCAAAATGGTGATGTCGAAATGAAAGAAACAACCAAAGAAGAAGCGATAGCAGAGCCTCAAACGGAACCCCCAGCACCTGCTCCAGAAGCACAAGTACATAACGAGGAAGAGATTAAAATAGAACAAAAACCTTTAGAGGACAACAAAGAAGAGGTGCAACCTGAGAAGGTAAAGGAAGCTGAGGAACAACCACAAGCAGCCGATACGCGACCCGAAGAACCTAAAGAGCAAGCGAATACAAAGCAACCAGAACCAGTTCACACTGAAAACGCGAAGAAAACTTCCCCTAAGAAAACTGTAAAGAAACGCGACAAAAGTAAAGAGAATAAAGAAAAGAAAAACGAAGAAGTGTTAGAACAGAAAAAAGTTGCAGAGAAAAACAAAGAGAACGTAGAAACAATGCACATTGACTCCAAGGGTGATTCAGACAAGGCTAACTCTCAAGAAATGGAAGTGGACGCTGTCGCTAGTCAATAA

Protein sequence:

>DPOGS210848-PA
MESPVVLDKPPEYRYERGFPFDEEEGSGAWSELASRHPDIAARLRQRPATWARKRRPSSQDGIDGVLKKRFVLADTADGFGGFDRFPFDDIPPEFREHFPSHWNRRFGSRDEQFQPEPQTHQPQSPSQQTAATQTEQDVGSPKEEQVPLPQYGLRNTVDLGQKSATDPSVVDADERNQRSMSAPPDHRTVNQNSHKMSGQNQQDQHHPQSEQSSNVRHIPIFVEGRDEPVINKTVDHGTHFADTKPAYVPPPQPHIDRENYFADDGPVGFPFAKSFDRPFFRQSAQPFSKQKMYPQSAFARGASPQRSQSPKPQNYQEQTSPKTETPSRQQQKPSPQQRQAPQQKATPSQQRATSPQAPPPCKENQPPPQCNQTQPPPQQAHPANDPISQILSIQTDVLNLMTEVENFKGAKNDKQYLFLDEMLTRNLIKLDNIETEGKENIRLARKEAIKCIQKCIAVLEAKAESNAAAAKAAAQPQDVEMNNPEKPNVQNGDVEMKETTKEEAIAEPQTEPPAPAPEAQVHNEEEIKIEQKPLEDNKEEVQPEKVKEAEEQPQAADTRPEEPKEQANTKQPEPVHTENAKKTSPKKTVKKRDKSKENKEKKNEEVLEQKKVAEKNKENVETMHIDSKGDSDKANSQEMEVDAVASQ-
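
The transcript and protein lengths in this record (1947 bp and 648 aa) can be cross-checked against each other: 1947 bases make 649 codons, which is 648 residues plus one stop codon. The sketch below is a minimal illustration, not part of the database. The function names `translate` and `cds_lengths_consistent` are hypothetical, and the code assumes the standard genetic code (NCBI translation table 1) and an uppercase, unambiguous DNA sequence. It prints the stop codon as `*`, whereas the record above marks it with `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an uppercase coding sequence, codon by codon, in frame 1."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def cds_lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Check that a CDS length equals 3 * (residues + one stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six codons and last four codons of DPOGS210848-TA, copied from above.
print(translate("ATGGAGTCACCCGTAGTT"))    # MESPVV
print(translate("GCTAGTCAATAA"))          # ASQ*
print(cds_lengths_consistent(1947, 648))  # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence (after replacing the trailing `-` with `*`) would extend the same check to the whole record.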