Monarch geneset OGS2.0

DPOGS206625
Transcript        DPOGS206625-TA  1707 bp
Protein           DPOGS206625-PA  568 aa
Genomic position  DPSCF300048 - 760555-783169
RNAseq coverage   1378x (Rank: top 9%)
Annotation
Heliconius      HMEL012308        0.0     84.60%
Bombyx          BGIBMGA004018-TA  3e-41   36.16%
Drosophila      sgg-PD            9e-178  78.19%
EBI UniRef50    UniRef50_Q7QA46   0.0     82.79%  AGAP004443-PA n=2 Tax=Anopheles gambiae RepID=Q7QA46_ANOGA
NCBI RefSeq     XP_002427157.1    0.0     80.71%  mitogen-activated protein kinase ERK-A, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|156118310      0.0     95.62%  shaggy [Danaus plexippus]
NCBI nr blastx  gi|156118310      0.0     95.62%  shaggy [Danaus plexippus]
Group
Gene Ontology  GO:0005524  3.5e-85  ATP binding
               GO:0004674  3.5e-85  protein serine/threonine kinase activity
               GO:0006468  3.5e-85  protein phosphorylation
               GO:0016772  8.1e-82  transferase activity, transferring phosphorus-containing groups
               GO:0004672  1.6e-63  protein kinase activity
               GO:0004713  9e-10    protein tyrosine kinase activity
KEGG pathway  phu:Phum_PHUM299450  0.0
 K03083 (GSK3B) maps-> Axon guidance
    Prostate cancer
    Alzheimer's disease
    B cell receptor signaling pathway
    Hedgehog signaling pathway
    Pathways in cancer
    Chemokine signaling pathway
    Endometrial cancer
    Insulin signaling pathway
    Neurotrophin signaling pathway
    T cell receptor signaling pathway
    Melanogenesis
    Focal adhesion
    ErbB signaling pathway
    Basal cell carcinoma
    Colorectal cancer
    Wnt signaling pathway
    Circadian rhythm - fly
    Cell cycle
InterPro domain  [191-492]  IPR002290  3.5e-85  Serine/threonine-protein kinase domain
                 [190-523]  IPR011009  8.1e-82  Protein kinase-like domain
                 [193-492]  IPR017442  1.6e-63  Serine/threonine-protein kinase-like domain
                 [191-492]  IPR020635  9e-10    Tyrosine-protein kinase, catalytic domain
Orthology group  MCL11305  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206625-TA
ATGAGCGGACGCCCTAGGACCACGTCATTCGCCGAGGGCAGCAAATCCGTTCGTAAGACATTTGAAAATCAACCACCGAAACCACCTCTAGGAGGCGTAAAAATTAGCACGAGTGTCGGGGATCTCGCTATCTTGCGGTCGAGGCAAGAGCACGCCATCGTCCATCCTGCCATTCGCGTCATAATCGTGCCAACGTACTCTCGGCGCGGTCGCGTTCGTTACACCGCCAAGCTGGATCGTCCGCGGGGTCCGGTATGTTTAGACAGTATGCGAGTGACCGAAGGTCCGGCGATGGTACTGCGGCCCGTAGGCGAAGTCACCGCTCGCTCGGAGCCCTCGATACAGCGATTGCTTGACAACGATAAGACCGGCCAATTGAGTCGGGCTCGCGATCGCCTTTCTACATTTTGTAAGAAATTACGACGCACTATTAGCGACGTGGATACACCTAAAAAGAACGACCTGGCTTCGGAAAAGAAACCCGTCTCCTCGCACCGCAAGGACGGATCCAAAGTGACAACCGTGGTAGCGACGCCGGGGCAGGGCCCCGATAGACCCCAGGAAGTCAGCTATGCTGATATGAAGCTGATAGGTAATGGAAGCTTCGGCGTCGTGTACCAGGCCAAATTATGCGACACGGGCGAGCTGATCGCCATCAAGAAAGTGCTTCAGGACAAGCGGTTTAAGAACAGGGAGCTGCAAATCATGAGACGGCTGGAACATTGTAATATTGTCAAATTGAAATACTTCTTTTACTCCAGCGGTGAAAAGAAGGACGAAGTGTACCTGAATTTGGTGCTGGAATACATACCTGAGACAGTATACAAAGTCGCCCGTCACTACTCCAAAGATGAACAAACAATACCCATTAGTTTTATAAAGCTCTACATGTACCAGCTGTTCAGAAGCCTCGCATACATCCATTCGCTTGGCATCTGTCACAGGGACATCAAACCTCAGAACCTACTGCTGGACCCCAAGTCAGGGGTGCTAAAGCTGTGCGACTTCGGTTCAGCGAAACACCTCGTCCGCAGCGAACCGAATGTTTCTTACATCTGCTCGCGGTATTACCGAGCCCCTGAGCTGATATTCGGGGCCACAGATTATACAACTAAAATCGACGTATGGAGCGCGGGTTGCGTTGTGGCTGAACTTCTCTTGGGCCAGCCGATCTTCCCCGGGGATTCTGGTGTTGATCAGCTCGTTGAAATCATCAAGGTGTTAGGAACTCCCACACGAGAACAGATACGTGAAATGAATCCAAACTACACAGAATTCAAGTTCCCGCAGATCAAGAGTCATCCATGGGCGAAGTACATGTTGGAGAGAATACCACCCGCCCGCACCACTGATCACCGCATACGAGTGTTCCGCGCTTGCACTCCTCCGGACGCCATATCCCTGGTGTCCCGCCTGCTGGAGTACACCCCGGGCGCTCGTCTATCCCCTCTCCAGGCGTGTGCGCATTCCTTCTTCGACGAGCTCCGTGAACCCGCCGCACGCCTCCCCAACGGTCGCGCTCTGCCGCCGCTGTTCAACTTCACGGAATACGAGCTGGCCATACAGCCGAGCCTCAACGACTTCCTCAAGCCTCGTGCCGCCGTCGCTGACGCCGCCCCGGCCCAGACCGCCGCCTCCTCCGAACAACACGACGCGCAGGGAGAGACGGCGGCGAGCGGCCCCAGCGGTTCCCCGGGAGCGTCGTAG

Protein sequence:

>DPOGS206625-PA
MSGRPRTTSFAEGSKSVRKTFENQPPKPPLGGVKISTSVGDLAILRSRQEHAIVHPAIRVIIVPTYSRRGRVRYTAKLDRPRGPVCLDSMRVTEGPAMVLRPVGEVTARSEPSIQRLLDNDKTGQLSRARDRLSTFCKKLRRTISDVDTPKKNDLASEKKPVSSHRKDGSKVTTVVATPGQGPDRPQEVSYADMKLIGNGSFGVVYQAKLCDTGELIAIKKVLQDKRFKNRELQIMRRLEHCNIVKLKYFFYSSGEKKDEVYLNLVLEYIPETVYKVARHYSKDEQTIPISFIKLYMYQLFRSLAYIHSLGICHRDIKPQNLLLDPKSGVLKLCDFGSAKHLVRSEPNVSYICSRYYRAPELIFGATDYTTKIDVWSAGCVVAELLLGQPIFPGDSGVDQLVEIIKVLGTPTREQIREMNPNYTEFKFPQIKSHPWAKYMLERIPPARTTDHRIRVFRACTPPDAISLVSRLLEYTPGARLSPLQACAHSFFDELREPAARLPNGRALPPLFNFTEYELAIQPSLNDFLKPRAAVADAAPAQTAASSEQHDAQGETAASGPSGSPGAS-
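
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and not part of the database: the helper names `expected_cds_length` and `span_length` are hypothetical, and it assumes that the transcript is coding sequence only (it starts with ATG and ends with the stop codon TAG, shown as the trailing "-" in the protein) and that the genomic coordinates are 1-based and inclusive.

```python
def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1707
protein_aa = 568

# 568 residues + 1 stop codon = 569 codons = 1707 nt.
print(expected_cds_length(protein_aa) == transcript_bp)  # True

# Genomic footprint of the locus on scaffold DPSCF300048 (assumed inclusive).
print(span_length(760555, 783169))  # 22615
```

Under these assumptions the 1707 bp transcript corresponds exactly to the 568 aa protein plus a stop codon, while the locus spans roughly 22.6 kb on the scaffold, so most of the genomic interval would be intronic.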