Monarch geneset OGS2.0

DPOGS202116
Transcript        DPOGS202116-TA  2376 bp
Protein           DPOGS202116-PA  791 aa
Genomic position  DPSCF300150 + 140143-149385
RNAseq coverage   1576x (Rank: top 8%)
Annotation
Heliconius      HMEL008024        0.0  82.78%
Bombyx          BGIBMGA006959-TA  0.0  94.29%
Drosophila      arm-PA            0.0  80.69%
EBI UniRef50    UniRef50_P18824   0.0  80.69%  Armadillo segment polarity protein n=65 Tax=Metazoa RepID=ARM_DROME
NCBI RefSeq     XP_001603109.1    0.0  83.58%  PREDICTED: similar to armadillo protein [Nasonia vitripennis]
NCBI nr blastp  gi|307212553      0.0  84.17%  Armadillo segment polarity protein [Harpegnathos saltator]
NCBI nr blastx  gi|307212553      0.0  84.17%  Armadillo segment polarity protein [Harpegnathos saltator]
Group
Gene Ontology   GO:0005488     6.1e-121  binding
                GO:0005515     2.7e-08   protein binding
KEGG pathway    nvi:100119214  0.0
                K02105 (CTNNB1) maps->
                    Basal cell carcinoma
                    Colorectal cancer
                    Prostate cancer
                    Pathogenic Escherichia coli infection
                    Thyroid cancer
                    Bacterial invasion of epithelial cells
                    Tight junction
                    Adherens junction
                    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
                    Pathways in cancer
                    Wnt signaling pathway
                    Endometrial cancer
                    Leukocyte transendothelial migration
                    Focal adhesion
                    Melanogenesis
InterPro domain  [147-675]  IPR011989  6.1e-121  Armadillo-like helical
                 [96-116]   IPR013284  1.2e-116  Beta-catenin
                 [143-677]  IPR016024  1.2e-81   Armadillo-type fold
                 [359-399]  IPR000225  2.7e-08   Armadillo
Orthology group  MCL10475  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202116-TA
ATGAGTTATCAGATACCATCATCTCAGAGCCGCACAATGTCTCATAGCAACTATGGTGGGTCTGATGTGCCGATGGCACCTAGTAAGGAGCAGCAGACCCTTATGTGGCAACAGAACTCGTACTTGGTGGATTCTGGTATCAATTCTGGTGCAGCTACTCAGGTACCGTCTCTTACCGGCAAAGAAGATGATGAAATGGAAGGAGATCAGCTCATGTTTGATCTGGACCAGGGCTTTGCCCAAGGCTTTACTCAAGAACAGGTTGATGATATGAACCAGCAGTTGTCTCAGACGCGGTCCCAGCGTGTTCGCGCTGCTATGTTTCCTGAGACTTTGGAGGAAGGTATAGAGATTCCTTCCACCCAATTGGATCCCGCTCAACCGACTGCTGTCCAACGCCTATCTGAGCCATCTCAGATGCTTAAACATGCTGTTGTTAACCTTATTAACTATCAGGACGACGCTGATTTGGCTACAAGGGCCATACCTGAATTGATCAAACTCTTGAATGATGAAGACCAAGTGGTTGTGTCTCAAGCTGCTATGATGGTACATCAGCTATCCAAGAAGGAGGCGTCCCGTCACGCCATCATGAATTCACCTCAGATGGTTGCCGCTTTAGTTAGAGCGATCTCTAACAGCAATGATTTGGAGACTACTAAAGGTGCAGTGGGAACCCTGCATAACTTGTCTCACCACAGACAAGGTTTACTTGCAATCTTCAAGAGCGGTGGTATCCCCGCTCTGGTGAAGTTGCTAAGTTCCCCTGTAGAGTCGGTGCTCTTCTATGCTATCACCACACTCCATAATCTCTTACTGCACCAAGATGGTTCAAAAATGGCGGTTCGCTTGGCTGGGGGACTACAAAAGATGGTTGCTTTATTGCAGAGGAATAACGTCAAGTTCTTGGCTATAGTGACTGATTGTCTCCAGATCTTGGCATACGGGAACCAAGAGTCCAAGCTAATCATCCTTGCGTCTCAAGGACCGATCGAACTTGTGCGTATCATGCGCTCTTACGACTATGAGAAGTTGCTGTGGACTACATCCAGAGTTCTGAAGGTGCTGTCAGTATGCTCTAGTAATAAGCCGGCCATAGTGGAAGCTGGTGGCATGCAAGCCCTCGCCATGCACCTCGGAAACCCAAGCGGCCGTTTGGTCCAAAACTGTCTTTGGACTCTAAGAAATTTGTCTGATGCTGCCACCAAGGTGGAAGGTTTGGAAGCTCTGCTGTCAAGCTTAGTGCAGGTGCTCGCTTCGACTGATGTCAACATTGTGACTTGTGCTGCTGGAATACTCTCCAATTTGACCTGCAACAATCAGCGCAATAAGGTGACGGTGTGTCAAGCTGGTGGTGTGGATGCTCTGGTCCGTACGGTGGTGTCAGCCGGGGACCGCGAGGAGATCACAGAGCCAGCGGTGTGTGCTCTCCGCCACCTCACCTCCAGGCACGTTGAGAGTGAGATGGCTCAGAACGCTGTCAGACTCCACTACGGACTACCGGTGATAGTGAAGCTGCTGCAGCCCCCGTCTCGCTGGCCGCTGGTGAAGGCTGTAGTGGGTCTAGTACGTAACCTGGCTCTGTGTCCAGCCAACCATGCCCCGCTACGAGAACATGGTGCTGTACATCATCTCGTTAGACTCCTGCTGCGCGCATTCAATGATACTCAGAGGCAACGTACATCCGTAACTGGAGGCGGGGGTGCTGGCGGAGCTTACGCAGACGGCGTCCGTATGGAGGAGATTGTGGAAGGCGCGGTCGGTGCCCTGCATATTCTCGCTAGGGAGGGACTAAATCGCACTCTCATAAGACAACAGAACGTTATTCCGATATTTGTGCAGCTTCTGTTCAATGAGATAGAAAACATACAGCGTGTAGCGGCGGGCGTGCTCTGTGAGCTCGCGGCTGATAAAGAGGGAGCGGAAATGATAGAAGCAGAGGGAGCCACCGCACCCCTCACTGAACTACTGCATTCAAGGAATGAAGGTGTGGCTACATA
CGCCGCAGCTGTTCTGTTCCGCATGTCAGAAGACAAACCTCACGACTACAAGAAGAGACTCTCTATGGAGTTGACGAATTCGCTGTTCAGAGACGATCATCAGATGTGGCCCAGCGACCTCGCCATGCAACCCGACCTACAGGACATGCTCGGACCGGAACAGGGCTACGAAGGTCTATACGGCACGAGACCATCTTTCCACCAACAAGGCTACGATCAAATCCCGATAGACTCAATGCAGGGGTTGGACATCGGAAGCGGTTTCAATATGGACATGGACATCGGCGAGGAGGGCGCCACTACCAACGAGCTAGCCTTTCCTGAACCGCCGCACGACAACAACAACGTAGCCGCCTGGTACGACACCGACCTCTAG

Protein sequence:

>DPOGS202116-PA
MSYQIPSSQSRTMSHSNYGGSDVPMAPSKEQQTLMWQQNSYLVDSGINSGAATQVPSLTGKEDDEMEGDQLMFDLDQGFAQGFTQEQVDDMNQQLSQTRSQRVRAAMFPETLEEGIEIPSTQLDPAQPTAVQRLSEPSQMLKHAVVNLINYQDDADLATRAIPELIKLLNDEDQVVVSQAAMMVHQLSKKEASRHAIMNSPQMVAALVRAISNSNDLETTKGAVGTLHNLSHHRQGLLAIFKSGGIPALVKLLSSPVESVLFYAITTLHNLLLHQDGSKMAVRLAGGLQKMVALLQRNNVKFLAIVTDCLQILAYGNQESKLIILASQGPIELVRIMRSYDYEKLLWTTSRVLKVLSVCSSNKPAIVEAGGMQALAMHLGNPSGRLVQNCLWTLRNLSDAATKVEGLEALLSSLVQVLASTDVNIVTCAAGILSNLTCNNQRNKVTVCQAGGVDALVRTVVSAGDREEITEPAVCALRHLTSRHVESEMAQNAVRLHYGLPVIVKLLQPPSRWPLVKAVVGLVRNLALCPANHAPLREHGAVHHLVRLLLRAFNDTQRQRTSVTGGGGAGGAYADGVRMEEIVEGAVGALHILAREGLNRTLIRQQNVIPIFVQLLFNEIENIQRVAAGVLCELAADKEGAEMIEAEGATAPLTELLHSRNEGVATYAAAVLFRMSEDKPHDYKKRLSMELTNSLFRDDHQMWPSDLAMQPDLQDMLGPEQGYEGLYGTRPSFHQQGYDQIPIDSMQGLDIGSGFNMDMDIGEEGATTNELAFPEPPHDNNNVAAWYDTDL-
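
The transcript and protein records above can be cross-checked against each other: 2376 bp is 792 codons, which is 791 residues plus one stop codon, shown as the trailing "-" in the protein sequence. The following is a minimal Python sketch of that check. It assumes the standard genetic code; the names `translate` and `CODON_TABLE` are illustrative and are not part of the database. For brevity it translates only the first and last few codons copied from the transcript above, not the full sequence.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as `stop_symbol` ("-" matches the record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# First six and last five codons of DPOGS202116-TA, copied from the record.
first_codons = "ATGAGTTATCAGATACCA"
last_codons = "GACACCGACCTCTAG"

print(translate(first_codons))  # start of the protein
print(translate(last_codons))   # end of the protein, including the stop

# Length consistency: transcript bp = 3 * (protein residues + 1 stop codon).
transcript_bp, protein_aa = 2376, 791
print(transcript_bp == 3 * (protein_aa + 1))
```

Running the same function over the full nucleotide sequence (with the line break removed) should reproduce the protein sequence if the record is a complete coding sequence under the standard code.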