Monarch geneset OGS2.0

DPOGS203543
TranscriptDPOGS203543-TA1140 bp
ProteinDPOGS203543-PA379 aa
Genomic positionDPSCF300055 + 303979-307974
RNAseq coverage933x (Rank: top 14%)
Annotation
HeliconiusHMEL0132067e-9198.76% 
BombyxBGIBMGA004344-TA0.090.26% 
DrosophilaG-salpha60A-PB0.084.68% 
EBI UniRef50UniRef50_P630926e-17272.59%Guanine nucleotide-binding protein G(s) subunit alpha isoforms short n=40 Tax=Euteleostomi RepID=GNAS2_HUMAN
NCBI RefSeqNP_001093292.10.094.47%G protein alpha S subunit Gs1 [Bombyx mori]
NCBI nr blastpgi|1537919740.094.47%G protein alpha S subunit Gs1 [Bombyx mori]
NCBI nr blastxgi|1537919740.094.47%G protein alpha S subunit Gs1 [Bombyx mori]
Group
Gene OntologyGO:00071862.7e-271G-protein coupled receptor protein signaling pathway
GO:00190012.7e-271guanyl nucleotide binding
GO:00048712.7e-271signal transducer activity
GO:00055255.7e-43GTP binding
GO:00071657.2e-43signal transduction
KEGG pathwayaga:AgaP_AGAP0120950.0 
 K04632 (GNAS)maps-> Salivary secretion
    GnRH signaling pathway
    Amoebiasis
    Gap junction
    Vibrio cholerae infection
    Vasopressin-regulated water reabsorption
    Gastric acid secretion
    Dilated cardiomyopathy
    Vascular smooth muscle contraction
    Chagas disease
    Calcium signaling pathway
    Long-term depression
    Melanogenesis
    Taste transduction
InterPro domain[1-379] IPR0010192.7e-271Guanine nucleotide binding protein (G-protein), alpha subunit
[71-85] IPR0003675.7e-43G-protein alpha subunit, group S
[47-180] IPR0110257.2e-43G protein alpha subunit, helical insertion
Orthology groupMCL11808 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203543-TA
ATGGGGTGCTTCGGGTCACCGGGCGCCAAGAGTGGGGAGGATGACGCCAAGTCACAGAAACGCCGCAGCGACGCCATCACACGCCAGCTGCAGAAGGACAAGCAGCTGTACCGGGCAACCCACCGCCTGCTGCTCCTGGGGGCCGGTGAGTCCGGCAAGTCTACGATCGTGAAACAGATGCGTATACTCCACGTGAACGGCTTCTCGGATAAAGAGAGACGGGAGAAGATCGAAGATATTAAGAAAAACATACGTGATGCTATCCTTACAATAACTGGTGCTATGAGTACTCTGACACCTCCCATACCGCTTGAGAAGGTGGAGAACAAAGCTCGTGTTGATTACATCCAGGATGTCGCCTCGCAGCCTGACTTCGACTACCCTCCAGAGTTCTACGAGCATACGGAGGAGCTCTGGAAGGACCAGGGGGTGCAGAGGACCTACGAGAGGAGCAATGAGTACCAGCTCATAGACTGCGCCAAGTATTTCCTGGATCAGGTGCATATAATAAAGAGAGCTGACTACACGCCATCAGAGCAGGATATACTCCGCTGTCGAGTTCTCACCTCCGGGATATTCGAGACCCAGTTCGTCGTCGATAAAGTCAATTTTCATATGTTCGACGTGGGCGGGCAGCGTGACGAGCGCCGGAAGTGGATCCAGTGCTTCAACGATGTGACGGCCATCATCTTCGTGACCGCGTGCTCCTCATACAACATGGTGCTGAGAGAGGATCCCACACAGAACAGGCTCCGGGAGTCGCTGGACCTTTTCAAGAGCATATGGAATAACAGATGGCTCCGTACGATATCGGTGATCCTGTTCCTCAACAAGCAGGACCTGCTGGCTGAGAAGGTGTTGGCGGGGAAGTCTCGTCTCGAGGAGTACTTTGCGGAGTTCGCCCGCTACCAGACGCCGCCCGACGCGACCCCCGACACGAGGGACCACCCGGACGTCGCCAGGGCCAAGTACTTCATCAGAGATGAGTTCCTGCGCATCAGTACAGCGAGCGGTGACGGCAAGCACTACTGCTACCCTCACTTCACGTGCGCCGTCGACACTGAGAACATTAAGCGCGTCTTCAACGACTGCCGCGACATCATACAGCGCATGCACCTCAGACAGTACGAGCTGCTCTAA

Protein sequence:

>DPOGS203543-PA
MGCFGSPGAKSGEDDAKSQKRRSDAITRQLQKDKQLYRATHRLLLLGAGESGKSTIVKQMRILHVNGFSDKERREKIEDIKKNIRDAILTITGAMSTLTPPIPLEKVENKARVDYIQDVASQPDFDYPPEFYEHTEELWKDQGVQRTYERSNEYQLIDCAKYFLDQVHIIKRADYTPSEQDILRCRVLTSGIFETQFVVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVTACSSYNMVLREDPTQNRLRESLDLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSRLEEYFAEFARYQTPPDATPDTRDHPDVARAKYFIRDEFLRISTASGDGKHYCYPHFTCAVDTENIKRVFNDCRDIIQRMHLRQYELL-