Monarch geneset OGS2.0

DPOGS203361
TranscriptDPOGS203361-TA2106 bp
ProteinDPOGS203361-PA701 aa
Genomic positionDPSCF300003 + 86334-92633
RNAseq coverage317x (Rank: top 36%)
Annotation
HeliconiusHMEL0225134e-1596.00% 
BombyxBGIBMGA011890-TA4e-14681.01% 
Drosophilaalpha-Cat-PA1e-15489.68% 
EBI UniRef50UniRef50_B0WEX38e-15188.26%Actin binding protein n=2 Tax=Endopterygota RepID=B0WEX3_CULQU
NCBI RefSeqXP_001965858.13e-15473.30%GF20570 [Drosophila ananassae]
NCBI nr blastpgi|1947675097e-15373.30%GF20570 [Drosophila ananassae]
NCBI nr blastxgi|1947675092e-14373.30%GF20570 [Drosophila ananassae]
Group
Gene OntologyGO:00071551.4e-93cell adhesion
GO:00156291.4e-93actin cytoskeleton
GO:00051981.4e-93structural molecule activity
GO:00457352.4e-13nutrient reservoir activity
KEGG pathwaydan:Dana_GF205701e-153 
 K05691 (CTNNA)maps-> Pathways in cancer
    Endometrial cancer
    Leukocyte transendothelial migration
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
InterPro domain[418-660] IPR0060771.4e-93Vinculin/alpha-catenin
[506-523] IPR0179971.9e-14Vinculin
[63-398] IPR0014192.4e-13HMW glutenin
Orthology groupMCL10388 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203361-TA
ATGTCCAAGTTCGCGGTCCGTGTGGACGCGGCCGTATCAGCCCTCGGTGGAGGGGCGGGTTCTGGTTTGGACGAGAACGACTTCATAGACGCGTCCCGTCTCGTTTATGATGCTGTTAGGGAGATACGGCGAGCTGTCCTCATGAACAGGGGGATAGGACGAACAGGGAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAGTGATAGAGTAGAGATAGGCCAAGGAGATAGGACGAACAGGGTGAATGATAGAGTAGAGATAGGCCAAGGGGATAGGATGAGCAGGGACGAGGAGGATCTGGACCCCGAGGACGTGGAGCTCGACGAGCATTACACCCTGGAAACGAGAAGCAAATGTGAGATCACAGCTCGTGACGAGCATACAAGCGACGATCTGGACACCGACACGGAGTTCGAACCTGTCGAAGATATGACCATGGAGACGAGGAGCAGATCGAGCGCCCACACCGGAGAGCACGGGGTCGACGAATATCCGGACATCAGTGGAATAACGAACGCAAGAGAAGCCATGCGGAAAATGACGGAAGAAGATAAGAGGAAGATACTCCAGCAAGTGGAGTTGTTCAGGCGGGAGAAGATGACCTTCGACAACGAGGTCGCTAAGTGGGACGATGCCGGAAACGACATCATAATGTTGGCCAAACACATGTGTATGATCATGCTCGAAATGACAGACTTCACCAGAGGCCGCGGTCCCTTGAAGACGACCATGGACGTCATTAACGCGGCCAAGAAGATATCGGAGGCTGGAACTAAACTGGACAAACTCACGAGAGAAATAGCCGAACAGTGCCCGGAGTCGTCGACCAAACAGGATTTGCTGGCCTACCTTCAGCGTATAGCGCTCTACTGTCACCAGATACAGATCACCAGCAAGGTGAAGGCGGACGTTCAGAACATATCCGGCGAGCTGATCGTTAGCGGGTTGGACAGCGCCACGTCTCTCATACAAGCTGCCAAAAACCTGATGAATGCTGTGGTGCTGACGGTCAAGGCCTCGTACGTCGCCTCTACGAAATACACCAGACAGGGCACCATCGCCTCACCCATAGTGGTGTGGAGGATGAAGGCCCCGGAGAAGAAGCCGCTCATAAGACCGGAGAAGCCGGAGGAGGTGCGCGCGAAGGTCAGGAGAGGGAGTCAGAAGAAACAACCCAGCCCCATACACGCGCTCGCCGAGTTCCAGAGCCCCGCCGAGAGTGTGTGGTGA

Protein sequence:

>DPOGS203361-PA
MSKFAVRVDAAVSALGGGAGSGLDENDFIDASRLVYDAVREIRRAVLMNRGIGRTGRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVSDRVEIGQGDRTNRVNDRVEIGQGDRMSRDEEDLDPEDVELDEHYTLETRSKCEITARDEHTSDDLDTDTEFEPVEDMTMETRSRSSAHTGEHGVDEYPDISGITNAREAMRKMTEEDKRKILQQVELFRREKMTFDNEVAKWDDAGNDIIMLAKHMCMIMLEMTDFTRGRGPLKTTMDVINAAKKISEAGTKLDKLTREIAEQCPESSTKQDLLAYLQRIALYCHQIQITSKVKADVQNISGELIVSGLDSATSLIQAAKNLMNAVVLTVKASYVASTKYTRQGTIASPIVVWRMKAPEKKPLIRPEKPEEVRAKVRRGSQKKQPSPIHALAEFQSPAESVW-