Monarch geneset OGS2.0

DPOGS205053
TranscriptDPOGS205053-TA3720 bp
ProteinDPOGS205053-PA1239 aa
Genomic positionDPSCF300074 - 426414-457858
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0156010.070.46% 
BombyxBGIBMGA006799-TA0.075.22% 
DrosophilaCG10738-PD0.062.56% 
EBI UniRef50UniRef50_C7LAF50.052.93%Guanylate cyclase n=27 Tax=Neoptera RepID=C7LAF5_DROME
NCBI RefSeqXP_970405.10.054.28%PREDICTED: similar to guanylyl cyclase receptor [Tribolium castaneum]
NCBI nr blastpgi|910913000.054.28%PREDICTED: similar to guanylyl cyclase receptor [Tribolium castaneum]
NCBI nr blastxgi|910913000.054.48%PREDICTED: similar to guanylyl cyclase receptor [Tribolium castaneum]
Group
Gene OntologyGO:00091904e-101cyclic nucleotide biosynthetic process
GO:00355564e-101intracellular signal transduction
GO:00168494e-101phosphorus-oxygen lyase activity
GO:00167721.2e-44transferase activity, transferring phosphorus-containing groups
GO:00064681.6e-29protein phosphorylation
GO:00046721.6e-29protein kinase activity
GO:00055244.5e-12ATP binding
GO:00046744.5e-12protein serine/threonine kinase activity
GO:00047135.6e-10protein tyrosine kinase activity
KEGG pathwayphu:Phum_PHUM1000000.0 
 K12323 (NPR1)maps-> Purine metabolism
    Vascular smooth muscle contraction
InterPro domain[907-1102] IPR0010544e-101Adenylyl cyclase class-3/4/guanylyl cyclase
[586-891] IPR0110091.2e-44Protein kinase-like domain
[152-506] IPR0018288.3e-34Extracellular ligand-binding receptor
[619-868] IPR0012451.6e-29Serine-threonine/tyrosine-protein kinase
[573-872] IPR0022904.5e-12Serine/threonine-protein kinase domain
[606-868] IPR0206355.6e-10Tyrosine-protein kinase, catalytic domain
Orthology groupMCL10045 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205053-TA
ATGATTAATTGGAAGTGGGCTATGATAGGATGTCTATTTAGTGGGTCTGATTGCATTACTATGTATCGGGCGCGGCCACATGGCGAGAAGGCTGCGCGGGCGCCGGGATCAACGCTGTTTATTGATGCGGGTTCAAACCTTCTACACGCTGCGGGACGGACTGATTATAACAGAAGATATCTCGAGTTGGACTATAATTACCAACGCGAGATGTTTACTTTCACAATACAGTTCAAGGCATACTTTAACACGCACGAAAATCAAAAAAACCCCGTCCGAGCAGAGCTGTCTGAAGCAACTATATACCAAAAGTTAGAAGAGTGGCACTGGTGCTTCATATTACTAGCTACCCTGGACACATCCCATGGGGAGAAGTTCACACTCGGATACCTAACAGGGTCACAAAGACGGCCTGGCGACCTGAATTATCAAATGCCCGGCAGAGTTATTTCAGGAGCAATTTCCATGGCAGTAGATGAAGTGAACGAAAAACTTCTAGCTCCCATGGGCCATTCATTAGACTTCGTCGTCGCTGAGACATACGGTCAGGAAGAAGTTTCCATACGACAGGTCGCAGCGTTGTGGGCTGCCAACGTGTCAGCGTTCATTGGACCTCAAGAAACTTGCGTGCACGAGGGTCGTATGGCCGCAGCCTTCAACCTTCCCATGATTAGTTACTACTGCAGGGACGTGGCCAATTCCAACAAGCGTGTGTACTCAACGTTCGCCAGAACTCGTCCATCCGACATAGACATATCTCGGTCCGTGGTCGCCGTGCTAACGCATTTCAATTTCATGCATATAGCCATAGTGTACCTAAAGGCACAGAATCACGATTTCTCTCGTCTCGCGTACGCAATAATGACAGCAGCGAATGAAAAGAAAATAACCATCCGTACGGTTCAAGTCTACCGCCAGCCGTATTACTATGACAATAGACCAGATAATTTCTCGAGGAATCCATTCTTAGACATCGTGAGGGAGACGTATAAGGATACTAGGATTTACGTTTCTATCGGCTACTATCACGACCATATAGGTCTCATGTTGGCTTTGGACGCAATGAAGCTTCTTAATTCTGGGGAGTATGCCGCTATTGGAGTGGATGTGGACAGGTATGACCCTAACGAGGCCATCAGATATCTGTCTGGGCCGTTAAGGAAGGAGCCCGAGCCTCGTGTCATGAGAGCGTTCCGTTCGTATCTAGCCGTGGCGCCCTCCGCCGGACCCTCCTACTCCGCCTTCTCTAAGGAAGTCAACTCACGATTGACAATGCCTCCATTTAATTTTAACAATCCACTGTTAGCTCTGGGCGGTGGAAAAGTTATCCCAGTGGAGGCCGCGTGGCTTCATGACGCGGTTTGGTTGTATGCGCGTGCGCTGGCCGACTCGCTCCGGACAGGGGAGGCTCCGAGGGATGGACGCGCTATAGCCTCACGTATGAGGAACACTACATACCTTAGTGTGATGGGTTATTGGATCCACATGGATGAAAACGGGGATGCTGAAGGGAATTACACCCTACTATCCATTGACCCGTCTCGCCCTCCCGGACTCTACCCTCTCGCAGTCCTTCATAAAGTGGCCCCGGAGTTACGGCTCCTGAGAGAGATGAAATGGCCGGGCGGTAGAGTGCCGCTCTCGGAGCCCCCATGCGGATTCAGAGGAGAGAAGTGCGTCAATAGCAGCGGAATTAACAACACCAATGGTATGACAAACGAGAAGGGTTGTGCTGGCAGTACGGGGGTCTCAGGGGTGTCAAGATCCAGTCAAGTGTCTCTAAGCTCCAACCCCGACCTGGACTTTAGATACTCCGCTATTTTCACGGAGGTAGCCTTCTATAGAGGTCGTCTTCTCGCAGTGAAACGTGTCAGAAGAAGTCACATTGATATCACAAGAGAAGTCAAGAAAGAACTTAAAATTATGCGTGATTTACAACACGACAACGTGAACGGGTTCGTTGGAGCGTGTATCGAGCCTCCCAATGTGTGTGCACTATCCGAGTACTGTACTAGGGGGAGTCTCAAGGACATTATAGAGAATGAAGATATAAAACTCGATAACATGTTCACCGCGTCTCTCGTCGGTGACATTATAAGGGGTATGATCTTCATTCATGAATCACCACTTCAATATCACGGCGCACTCCGTCCGTCAAATTGTCTCGTTGACGCGAGGTGGGTCGTGAAACTAGCGGACTTCGGACTAAGGGAATTTCGCAGAGGGGAAATTACACCTTCAGAACCGAACGCACTGAGATCTCATATTGAGAGTCTAGTGTACCAGTCACCAGAGCAGTTACGAGCGGGGGGTTGGGGTGGGGAGTGTTTTCCCTCAAACTGGTCCTTGGGGTCGCAGGCGTCGGACGTGTTTTCATTTGCGCTGCTCCTGTACGAGCTGCACACCCGCCGTGGTCCATACGGTCCAGACATGTCCCCGCCAGCTGCCCTGCTGCGTCGCCTCGCTAGACCGCACCCAGCGCCCTACAGGCCCCCTCTAGAGGCGCTCTCTGGAGGGTTCGACTGTGTGCGTGAGTGTTGTACGGAGTGCTGGGCGGAGGATCCCGCCCTTAGGCCAGACTTTAAGACGATTCGTGCTAAACTGAGACCTTTAAGGAAGGGCATGAAACCAAATATATTTGATAATATGATAGCAATGATGGAGAAATACGCGAACAACTTAGAAGCGTTGGTTGACGAGAGAACGGATCAGTTGCAAGAAGAAAAAAAAAAGACAGAGGCGTTGTTGGAGGAGATGCTGCCAGCTCCCGTCGCTGAACAGTTGAAGCGCGGCCGGCGCGTGTTGCCTGAGAGCTACGACTCCGTGACCATATACTTCAGCGACATAGTCGGCTTCACGGCCATGTCCGCAGAGAGCACTCCGCTACAGGTCGTGGTCGACTTCCTCAACGACCTGTACACTTGCTTCGATTCAATTCTAGAAAATTTTGATGTTTATAAAGTAGAAACAATAGGAGACGCTTACATGGTGGTGTCAGGGTTGCCAATCCGTAACGGCATCCGTCACGCGGGCGAGGTGGCGTCCATGGCGCTGGCGCTCCTCGCCGCCACGAGGTCCTTCCGTGTGAGACATCGACCAGAACAGAGACTACTGCTCCGTATAGGAATACATTCCGGTCCTGTCTGCGCCGGAGTCGTCGGACTGAAGATGCCCCGCTACTGCTTATTCGGCGACACCGTCAACACAGCGAGTCGATTCGAATCTACTGGAGTCCCTTTAAAGATCCACTGCAGCGGCGCGTGCAAGTCTCTGTTGGATCAGCTGGGAGGGTATATCTTGGAGGAGCGCGGCGTGGTGTCAATGAAAGGCAAGCGGGACCAGCTGACGTGGTGGGTGTGTGGCGAGGAACCTCATGCCAGGAGACAGGCGCCACATCAACACCAACGAAGCTCACTCAAGGCGCGCAACTGGAACAACCAGGGCAGCCTGCATAGATGTTGCAGTTTAGAATCTCCAAAGAAACTTAGATTTGCTTCGGGCAGTCATTTAGAATCGCACACGGACAGTGTCCTTCATCATAGGAGTGATGAATATTTGATGGAGGTGGTTGGCGAGGGTGGACGACCAGCACTACTAGAGGCCCCTCTGCGGGATCACGCGGCCAGCGTGTCGTGTCCCGTCATCGAGGCGGCCGGGGACATCCAGCTCGTGGTGCCTCCCCGCGACCCCGCCGCCGTCCCACTCCTCGCCGACGCCTAA

Protein sequence:

>DPOGS205053-PA
MINWKWAMIGCLFSGSDCITMYRARPHGEKAARAPGSTLFIDAGSNLLHAAGRTDYNRRYLELDYNYQREMFTFTIQFKAYFNTHENQKNPVRAELSEATIYQKLEEWHWCFILLATLDTSHGEKFTLGYLTGSQRRPGDLNYQMPGRVISGAISMAVDEVNEKLLAPMGHSLDFVVAETYGQEEVSIRQVAALWAANVSAFIGPQETCVHEGRMAAAFNLPMISYYCRDVANSNKRVYSTFARTRPSDIDISRSVVAVLTHFNFMHIAIVYLKAQNHDFSRLAYAIMTAANEKKITIRTVQVYRQPYYYDNRPDNFSRNPFLDIVRETYKDTRIYVSIGYYHDHIGLMLALDAMKLLNSGEYAAIGVDVDRYDPNEAIRYLSGPLRKEPEPRVMRAFRSYLAVAPSAGPSYSAFSKEVNSRLTMPPFNFNNPLLALGGGKVIPVEAAWLHDAVWLYARALADSLRTGEAPRDGRAIASRMRNTTYLSVMGYWIHMDENGDAEGNYTLLSIDPSRPPGLYPLAVLHKVAPELRLLREMKWPGGRVPLSEPPCGFRGEKCVNSSGINNTNGMTNEKGCAGSTGVSGVSRSSQVSLSSNPDLDFRYSAIFTEVAFYRGRLLAVKRVRRSHIDITREVKKELKIMRDLQHDNVNGFVGACIEPPNVCALSEYCTRGSLKDIIENEDIKLDNMFTASLVGDIIRGMIFIHESPLQYHGALRPSNCLVDARWVVKLADFGLREFRRGEITPSEPNALRSHIESLVYQSPEQLRAGGWGGECFPSNWSLGSQASDVFSFALLLYELHTRRGPYGPDMSPPAALLRRLARPHPAPYRPPLEALSGGFDCVRECCTECWAEDPALRPDFKTIRAKLRPLRKGMKPNIFDNMIAMMEKYANNLEALVDERTDQLQEEKKKTEALLEEMLPAPVAEQLKRGRRVLPESYDSVTIYFSDIVGFTAMSAESTPLQVVVDFLNDLYTCFDSILENFDVYKVETIGDAYMVVSGLPIRNGIRHAGEVASMALALLAATRSFRVRHRPEQRLLLRIGIHSGPVCAGVVGLKMPRYCLFGDTVNTASRFESTGVPLKIHCSGACKSLLDQLGGYILEERGVVSMKGKRDQLTWWVCGEEPHARRQAPHQHQRSSLKARNWNNQGSLHRCCSLESPKKLRFASGSHLESHTDSVLHHRSDEYLMEVVGEGGRPALLEAPLRDHAASVSCPVIEAAGDIQLVVPPRDPAAVPLLADA-