Monarch geneset OGS2.0

DPOGS211445
TranscriptDPOGS211445-TA3414 bp
ProteinDPOGS211445-PA1137 aa
Genomic positionDPSCF300223 - 2720-9286
RNAseq coverage36x (Rank: top 74%)
Annotation
HeliconiusHMEL0074510.065.14% 
BombyxBGIBMGA002197-TA0.062.36% 
DrosophilaCG34357-PC0.059.94% 
EBI UniRef50UniRef50_Q8IT600.071.33%Guanylate cyclase n=2 Tax=Obtectomera RepID=Q8IT60_MANSE
NCBI RefSeqXP_972984.20.054.51%PREDICTED: similar to CG34357 CG34357-PA [Tribolium castaneum]
NCBI nr blastpgi|232686850.071.33%receptor guanylyl cyclase GC-II [Manduca sexta]
NCBI nr blastxgi|232686850.058.65%receptor guanylyl cyclase GC-II [Manduca sexta]
Group
Gene OntologyGO:00091903e-90cyclic nucleotide biosynthetic process
GO:00355563e-90intracellular signal transduction
GO:00168493e-90phosphorus-oxygen lyase activity
GO:00167721e-40transferase activity, transferring phosphorus-containing groups
GO:00064681.2e-31protein phosphorylation
GO:00046721.2e-31protein kinase activity
GO:00055245.4e-09ATP binding
GO:00046745.4e-09protein serine/threonine kinase activity
GO:00047136.2e-09protein tyrosine kinase activity
GO:00061821.9e-05cGMP biosynthetic process
GO:00043831.9e-05guanylate cyclase activity
KEGG pathwaydre:1404254e-164 
 K12321 (GUCY2D_E_F)maps-> Phototransduction
    Purine metabolism
InterPro domain[782-975] IPR0010543e-90Adenylyl cyclase class-3/4/guanylyl cyclase
[474-766] IPR0110091e-40Protein kinase-like domain
[506-741] IPR0012451.2e-31Serine-threonine/tyrosine-protein kinase
[474-743] IPR0022905.4e-09Serine/threonine-protein kinase domain
[440-743] IPR0206356.2e-09Tyrosine-protein kinase, catalytic domain
Orthology groupMCL10914 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211445-TA
ATGCTGCAGCCTCTCCTGCTGGCGCTGGCCACAGCTCGGCCTCTCGACACCGACCACACCAACGACACACAGCGGCTCTTGATAGTGGCACGACAAGCTTGTCACGCCGACGACCCTACCCTCTACTCTGCGGACCTGTTCTCCGCTCATCTCACACAGTATGCGCCGTCTCAGCTCCGGACTCGTCCTTTCCGTCTTCTTAATTACGAAGAGACCTGCGGCCCTCGCTGGTCTGAGGCGGTCCTGGTGGACGTGTTGCGGTCGGTGGGCGCGGGCGGGTCCGTGGTGGGCGCGGGTCGTGTGGGCGGTCGTGTGTGTGGTGGCGTGTCCCCGGCCCTGCTGTCTCTCGGACGGCCTCTCCCTGGGTCGTGCTCTGAGTCCCCGGCTCGTGCCGCGGCCGTCTCTCTCTTGGCTCGGTCTCTGGGCTGGCGGCGCGTCTCTCTGCTCGTGACCGGAGTCCTCGACGAATGCTCAGCCGCTCTTCTTCGGGTCCAAGAGTCTCTCGTTACGTCCGGTGTCCTCGTGAAACGCGCACTTCCTCGACACCTCGAGGACTCTTTCTCGATCCGTCCCCACGCTCTCATTATCTGTGGTTCGGCGATGACGGCCGCGATACTATCAAGAAATGTGAATAAAATTCAGATACTTATCTTAAATGTAATAGATCCTGAAGTAGAGACCTCTTTGAAAGGTAATGGAACGAAAACTATCGACAAGCAATCGGCTAATGTTACAGGCAAAGAGAGATCAATGTATCAGAAGTTTCATTTCGGCGAGCTTTCTAAAGACAGCCTTAATGTTATCAGAGAGAAGCCATTACTCCTGCTGAGTGACAGGCAGTTCGAAGTTCAGCACCTTGCCAAGGATAATTATGATGAAATAATTAAGGCCCTTTTAAACTTCAGGGAAGCGGAGATTACCAGAATGGAAAAATACCATCATGTTTTTCATTTATATAGTAGTATAACTAAGGAAAACGATTTGTTCTGGAAGCAAAGAGCAACACTTAATATTACAACTGAGAGATTGTTAGATAATTATACGTCACACTCCACACTCGTGAATATCGACGCCCTGAAAGAAACCGAACTCGAGGAATTTAAACTAGAGGGCGCATTTGACGAAACGGTCGACGCGAAGGTCTCCAGCGGTACGGACGCAGCCTTGCACATGACGGTGTTCACGTTGTTCGTCACCTGTATCTTGATCCTCATCCTCACCTTCGTCGTAACATGCCTCAAACGGGTGTTTGTCAACCGTATGCCGAAACGTTCTCGTGCAGTTCCGGTTCTTGCTTCCACCGACTTCCAGTTTCCATCGGATGAGGGACGACGTGTGGGCGAGGGCATGGAGACGATGCTCACGTGGCTTCAACAGCTGCACGAATTCGGAAGCTCGGAACCGGAGCGACCCGACTTACTGAAACGGCCCGAACCCTTGGGGCAGTCCGCACCCTCCTCTACGTGCAGCATTACACGTCTTCCATTGGACAATCGTACCAGATATAAGGGCGACCCAGTTCATATGAAATATTTACCTGCGGCTTCACTCGAGTTGCGTCGCAAATCAATAGACGTTTTACTCACGATGCAGAGCCTTCGTCACGAAAACGTAAACTCCTTCATAGGATGTTTGACGGAAACAAGACCAGCTCTCGTGTTCGAAGCTTGTGGGAGAGGTTCTCTAGAAGACGTCCTCATGGCCGACGACATCCGGTTGGATTGGACGTTCCGCCTGTCGCTGCTAACGGACCTGGTGCGAGGGATGCGCTACCTCCACTCATCGCCGCTGCGTGTTCATGGTCGACTCACTTCACGTAACTGTGTAGTCGACTCCCGATGGGTGTTGCGTGTCACCGATTATGGGATTCCATCGTTCACAAAGACGCAGTCACTGCCGCACCCTCCGCGTACCGCACGAGAGCTGCTGTGGACGGCGCCGGAGCTTCTTCGTGAAGCCGATTCAGGTAACGTCATCTGCGGAACGCAGCCGGCCGATGTGTTCTCTTTCGCTATAATCATGCAGGAGGTTATTGTCCGTGGAGAGCCTTACTGTATGCTTCCTTTTACGCCGGAAGAAATAATAGAAAAGTTGACTCATCCCCCGCCTCTTATAAGACCGTCCGTGTCCATGAGCGCTGCTCCCCCGGAGGCAGTGAGTGTGGCACGTCAGTGCTGGAGCGAACAGCCTCATCTACGACCAGACTTCATACAACTCTACGAGGTGTTCCGACACATGCATCGAGGACGAAAAGTCAACATTGTAGATTCTATGTTCGAAATGTTAGAGAAATACAGCAATAATTTAGAGGAATTGATAAAGGAACGAACCGAGCAGCTGGACATGGAGAAAAAGAAAACGGAGCAGCTCCTGAACAGGATGTTGCCCAGAACAGTAGCCGAGCGATTAATTTTGGGTCTCCGAGTGGAGCCGGAAGAGTTCGAGGAAGTATCCATCTACTTCAGCGACATCGTGGGTTTCACATCAATAGCAGCGCGCTCCACTCCCGTCCAGGTCGTTGACCTGCTCAATGACCTGTATACCACCTTCGATGCTACTATAGAAATGTACCGAGTCTACAAGGTGGAAACGATCGGTGACGCGTACATGGTGGTGGGCGGGCTGCCGATCCGCTCCAGCGACCACGCAGAGAGCGTGGCGACCATGGCGTTACATTTATTACACTTGGCGGGACAATTTCGAATTCGACATCTGCCAGCGTCTCCTCTACATCTCCGGATAGGGCTCCACACGGGCGCTTGTTGTGCGGGAGTGGTGGGACTCACTATGCCACGGTATTGTTTGTTTGGTGACACGGTCAACACGGCTTCCCGCATGGAATCCACAGGAGCGGCTTGGCGAATCCAGATCTCGTCGGCGACGGCAGAGAAACTCGCAGCGGCCGGCGGCTACAGGCTTCGCTCCCGAGGACTCACGCAGATTAAAGGCAAAGGCGTCATGCACACCTTCTGGTTACTCGGAAAAGAAGGCTTTGAAAAAACTCTGCCGACGCCACCGCCGTTGAAATCGGAAGAAGTTCTCTTTGAAGCAGAGAGCGAGAACGACTGTGACACTAACGAGGCGCCACCAAGCGAGGGACATAACACTAGCCTCACACAATCCGTGGAACGACAACGTTCCGATCCATCTCCGACAGTCGACAGATTTTCATGGCGAAGGTCTGGTGCAGTCTCCGCGGAGAGCAGCCCGCCAGGACCGACCTCATTGCGAAATCGATACTTACGTTCATCCGTGTCCACTGTGGCAGGCTTAGTCGATACACCGCGCCTATCGGATCGGTGGACTACATCAGGATCGCGCGTGCTGCGGCGTCAGTGGTCGCTGGAGCGCGGGGACGAGCCTCGTTACCGGACCCGGCGCGACCTCTCCACTCCCGACGCGTCCGCTCGCTGA

Protein sequence:

>DPOGS211445-PA
MLQPLLLALATARPLDTDHTNDTQRLLIVARQACHADDPTLYSADLFSAHLTQYAPSQLRTRPFRLLNYEETCGPRWSEAVLVDVLRSVGAGGSVVGAGRVGGRVCGGVSPALLSLGRPLPGSCSESPARAAAVSLLARSLGWRRVSLLVTGVLDECSAALLRVQESLVTSGVLVKRALPRHLEDSFSIRPHALIICGSAMTAAILSRNVNKIQILILNVIDPEVETSLKGNGTKTIDKQSANVTGKERSMYQKFHFGELSKDSLNVIREKPLLLLSDRQFEVQHLAKDNYDEIIKALLNFREAEITRMEKYHHVFHLYSSITKENDLFWKQRATLNITTERLLDNYTSHSTLVNIDALKETELEEFKLEGAFDETVDAKVSSGTDAALHMTVFTLFVTCILILILTFVVTCLKRVFVNRMPKRSRAVPVLASTDFQFPSDEGRRVGEGMETMLTWLQQLHEFGSSEPERPDLLKRPEPLGQSAPSSTCSITRLPLDNRTRYKGDPVHMKYLPAASLELRRKSIDVLLTMQSLRHENVNSFIGCLTETRPALVFEACGRGSLEDVLMADDIRLDWTFRLSLLTDLVRGMRYLHSSPLRVHGRLTSRNCVVDSRWVLRVTDYGIPSFTKTQSLPHPPRTARELLWTAPELLREADSGNVICGTQPADVFSFAIIMQEVIVRGEPYCMLPFTPEEIIEKLTHPPPLIRPSVSMSAAPPEAVSVARQCWSEQPHLRPDFIQLYEVFRHMHRGRKVNIVDSMFEMLEKYSNNLEELIKERTEQLDMEKKKTEQLLNRMLPRTVAERLILGLRVEPEEFEEVSIYFSDIVGFTSIAARSTPVQVVDLLNDLYTTFDATIEMYRVYKVETIGDAYMVVGGLPIRSSDHAESVATMALHLLHLAGQFRIRHLPASPLHLRIGLHTGACCAGVVGLTMPRYCLFGDTVNTASRMESTGAAWRIQISSATAEKLAAAGGYRLRSRGLTQIKGKGVMHTFWLLGKEGFEKTLPTPPPLKSEEVLFEAESENDCDTNEAPPSEGHNTSLTQSVERQRSDPSPTVDRFSWRRSGAVSAESSPPGPTSLRNRYLRSSVSTVAGLVDTPRLSDRWTTSGSRVLRRQWSLERGDEPRYRTRRDLSTPDASAR-