Monarch geneset OGS2.0

DPOGS212904
Transcript: DPOGS212904-TA, 1152 bp
Protein: DPOGS212904-PA, 383 aa
Genomic position: DPSCF300799 + 204-5172
RNAseq coverage: 239x (Rank: top 43%)
Annotation
Heliconius: HMEL022513  8e-156  78.37%
Bombyx: BGIBMGA011888-TA  3e-96  69.73%
Drosophila: alpha-Cat-PA  8e-115  63.88%
EBI UniRef50: UniRef50_E3X5I8  5e-105  59.00%  Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3X5I8_ANODA
NCBI RefSeq: XP_625229.1  2e-124  66.86%  PREDICTED: similar to Catenin CG17947-PA [Apis mellifera]
NCBI nr blastp: gi|321479180  4e-127  69.76%  hypothetical protein DAPPUDRAFT_309973 [Daphnia pulex]
NCBI nr blastx: gi|321479180  2e-130  69.58%  hypothetical protein DAPPUDRAFT_309973 [Daphnia pulex]
Group
Gene Ontology:
    GO:0007155  3e-52  cell adhesion
    GO:0015629  3e-52  actin cytoskeleton
    GO:0005198  3e-52  structural molecule activity
KEGG pathway: ame:552850  6e-124
    K05691 (CTNNA) maps to:
        Pathways in cancer
        Endometrial cancer
        Leukocyte transendothelial migration
        Bacterial invasion of epithelial cells
        Tight junction
        Adherens junction
        Arrhythmogenic right ventricular cardiomyopathy (ARVC)
InterPro domain: [114-330]  IPR006077  3e-52  Vinculin/alpha-catenin
Orthology group: MCL10388  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212904-TA
GTGTACGTCCGTCACCCCGAGCTGGCCGCGGCCAAGGCCAACCGTGACTTCGTACTGCGCGCGGTGTGCTCCGCCGTGGACACCATCTCGTGCGTGGCGCAAGGAAGACCGCTCCCGCCGGCCGGGTCAAACCGGGTGCCCGTGGAAGGTCCGGGGGAACTGGCCCAAGCCTTGGATGACTTTGATGAGCGGATGGTGATGGAGCCCATGTCGTACTCCGAGCTGAGGACCAGGCCCTCGCTGGAGGAGCGTCTCGAGAGCATCATCTCCGGGGCGGCGCTGATGGCGGACAGCTCCTGCACCAGGGACGAGCGCCGCGAGCGCATCGTGGCGGAGTGCAACGCCGTGAGGCAAGCGCTGCAGGACCTGTTGCACGAGTACATGAGTAACGCCGGCAGACAGGAACAGTCTGAGGGTCTGGAGCGAGCCCTGGAACAGATGTGCCGCAAGACGAGGGACCTCCGGAGGCAGCTGAGGAAGGCCGTCGTGGACCACGTGTCCGACAGGAAACAGCGGCAAGTACCCGAGCGTCCCATATCCTGGACAGTTGACGCTAGGAAGTCTGTTCCAAAAACTAATTGCGCACGCGTTCACATGAAGTTGGCCACAACGATCTCTTTCCGTTCAGTCGCTAACCTGGTGTGCTCCATGTCCAACAACGAGGACGGCGTGAAGATGGTGCGGCACGCCGCGGGACAGATAGAGGCGCTGTGCCCCCAGGTGATAAACGCGGGTAGGGTGCTCGCCGCGCGCTGCAGGTCGCGGGTGGCCCAGGAGAACGCGTCGGCCTTCGCTCGCGCCTGGGAGGCCGCGGTGCGTCTGCTCACAGACGCGGTGGACGACATCACCACCATCGACGACTTCCTCGCCGTCAGCGAGAACCACATCCTGGAGGACGTCAACAAGTGCGTGGTGGCGCTGCAGGAGGCCGACCCTGACGGACTCGAGAGGACCGCAGCCGCTATACGGGGCAGGGCTACCAGACTGAGTGACCTTAATGACCAGACATTGCCAAGTCCGGCACATCACTCATCATTATATGCCTTCGCCCAGGGTCTGCTCGGTGGTGACTCAGGAGATGGACAACTACGAGCCCTGCATATACACCAAGAGAGTGTTGGAAGCCGTCACCGTGCTCAGAGACCAGGGTGA

Protein sequence:

>DPOGS212904-PA
VYVRHPELAAAKANRDFVLRAVCSAVDTISCVAQGRPLPPAGSNRVPVEGPGELAQALDDFDERMVMEPMSYSELRTRPSLEERLESIISGAALMADSSCTRDERRERIVAECNAVRQALQDLLHEYMSNAGRQEQSEGLERALEQMCRKTRDLRRQLRKAVVDHVSDRKQRQVPERPISWTVDARKSVPKTNCARVHMKLATTISFRSVANLVCSMSNNEDGVKMVRHAAGQIEALCPQVINAGRVLAARCRSRVAQENASAFARAWEAAVRLLTDAVDDITTIDDFLAVSENHILEDVNKCVVALQEADPDGLERTAAAIRGRATRLSDLNDQTLPSPAHHSSLYAFAQGLLGGDSGDGQLRALHIHQESVGSRHRAQRPG-