Monarch geneset OGS2.0

DPOGS207218
TranscriptDPOGS207218-TA2940 bp
ProteinDPOGS207218-PA979 aa
Genomic positionDPSCF300001 + 6207043-6218927
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0074516e-5850.00% 
BombyxBGIBMGA010713-TA0.088.49% 
DrosophilaGyc88E-PD0.080.62% 
EBI UniRef50UniRef50_Q16JH80.071.64%Soluble guanylate cyclase gcy n=7 Tax=Coelomata RepID=Q16JH8_AEDAE
NCBI RefSeqXP_002422715.10.076.80%Soluble guanylate cyclase gcy-31, putative [Pediculus humanus corporis]
NCBI nr blastpgi|35111750.076.89%soluble guanylyl cyclase beta-3 [Manduca sexta]
NCBI nr blastxgi|35111750.076.89%soluble guanylyl cyclase beta-3 [Manduca sexta]
Group
Gene OntologyGO:00168496.9e-89phosphorus-oxygen lyase activity
GO:00091906.9e-89cyclic nucleotide biosynthetic process
GO:00355566.9e-89intracellular signal transduction
GO:00061821.2e-71cGMP biosynthetic process
GO:00043831.2e-71guanylate cyclase activity
GO:00200379.3e-59heme binding
GO:00054881.8e-54binding
KEGG pathwaysmm:Smp_1499802e-157 
 K12319 (GUCY1B)maps-> Salivary secretion
    Purine metabolism
    Vascular smooth muscle contraction
    Long-term depression
    Gap junction
InterPro domain[410-602] IPR0010546.9e-89Adenylyl cyclase class-3/4/guanylyl cyclase
[197-431] IPR0116451.2e-71Haem NO binding associated
[1-167] IPR0116449.3e-59Heme-NO binding
[1-192] IPR0240961.8e-54NO signalling/Golgi transport ligand-binding domain
Orthology groupMCL15709 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207218-TA
ATGTACGGCTTGCTGTTGGAGAACATGGCGGAGTACATCCGTCAGACTTACGGAGAAGAAAGATGGGAGGATATACGGCGTCAGGCTGGAGTGGAACAGCCATCATTCTCAGTGCACCAAGTCTATCCTGAGAATTTAATTACAAGATTGGCTAAAAAGGCCCAGGAGGTGTTAGGCATATCAGAAAGAGAATTTATGGATCAAATGGGCGTATACTTTGTAGGTTTTGTCTCACAGTACGGCTACGACAGAGTTTTATCAGTTTTAGGTCGACATATGCGGGATTTTCTGAACGGTTTGGATAATTTACACGAATACTTAAAATTCAGTTATCCAAGAATGAGAGCCCCGAGTTTTATTTGTGAAAATGAAACAAGGCAGGGACTGACACTACACTACCGGTCAAAACGGAGGGGGTTCGTTTATTACGCTATGGGACAAATTAGAGAGGTAGCCCGTCACTTCTACCATAAGGAGATGCGTATAGAGTTGTTACGCGAGGAACTCCTTTTTGACACAGTTCATGTAACTTTCCAACTGACGTTCGACAATCGTGCATTCACCCTGGCCTCGCTGGCAATGACAAGGGAAGAAAAACATCTGCCTATTAGCGCTTCGGTCCTCTTTGAGATATTCCCGTTTTGTATTGTCTTTGGTTCAGACATGGTAGTTCGCAGCATCGGCAATTCCCTGATGGTGATTTTACCAGACCTAGTGGGGAAGAAGATCACCAACTGGTTTGATCTCGTGCGACCGCTCATAGCGTTTAAATTTCAAACCATACTAAACAGGACGAATAACATCTTCGAACTGGTGACAGTGGAAGCTGTGATGCATGAGAAGGCGCCTGACAAACGTAACGAACTCATCAGGCTGTCTGATGAATCTGATACAACTACTGAGAAGAATTTGCGGCTCAAAGGCGCCATCTCAAAGAAACGATTTGCATATGACGGAAGGCAGTTACCCGATGTTATAAATTTACATCTCAAAGGACAAATGATATACATGGACAACTGGCGCATGATGATGTATCTTGGCACGCCAGTGATGCCTGACCTGGCAGCGCTTGTGTCAACAGGGCTATACATCAACGATCTCTCGATGCATGACTTCAGCAGAGACCTTATGTTAGCTGGCACACAACAATCAGTCGAACTAAAGCTGGCCTTGGACCAGGAACAGCAAAAAAGTAAGAAGCTCGAAGAATCCATGAGGAAATTGGATGAAGAGATGAAGAGAACGGATGAGCTGTTGTATCAGATGATACCGAAACAGGTCGCTGATAGGTTGAGGAACGGAGAGAATCCCATTGACACTTGTGAGATGTTCCATAGTGTGTCCATATTATTCTCCGATGTTGTGACCTTCACTGAGATCTGTTCCCGCATCACTCCGATGGAAGTTGTCTCGATGCTTAATGCTATGTACTCCATATTCGATACGCTCACAGAACGTAATCGCGTTTATAAGGTTGAAACAATAGGTGACGCTTACATGGTAGTGTCAGGGGCACCAGAGAAAGAGGACAATCATGCTGAGAAGGTCTGCGACATGGCACTTGACATGGTAGACGCGATAACAGACCTTAAAGATCCCAGCACAGGTTCCCATTTATCGATTCGGGTGGGAGTACATTCTGGTGCAGTGGTCGCAGGCATCGTTGGTTTGAAGATGCCTCGCTACTGTCTTTTCGGGGACTCAGTGAATACAGCATCTCGTATGGAATCGACCTCAGAGGCGATGAGGATCCACATCTCACAGACAACGCAAGAGCTACTGTCGCCATCCTACAAGGTCACCGAACGAGGCGAAATACAAGTGAAAGGAAAAGGTGCTATGAAAACTTACTGGTTAGAGGGACGTGAATCCAGGCCATCGCTGACTAAACTAATTTCATCCCAAATTCAACCAGTATCGGAACTGGAATGGGAAAGGGCAGCCGATGTACGAGACAGCATCGCCGAATATTCAGCACAGCAACTGAATAATAAGGAAACAAATATCCATCTTCCTAACGCAATCAATTCTGGGCCCAATTCACTTAGCAACAACAACGCTGGTAATCCAACATTCCAACCATCCACTCCGACTGTCAAGAGCCCTACAGCCCCTACTATGATGTCACCAGCTGAAGAGAGACGGATGTATTCTCCTGTCACTTTCCAGGATGTCGCTAGACGGAGTATCGCAAACTCACCGAACAGAACAGAAAAGGATAAAGAATCAAGATCAACCACAGCGAGTGTGGGAGGTCAATGGACTGATGCGGAATCTTTGGACCCACAACGCACCCTCGACAGTTTAAACTCTTCTTTCTGTTCAACGTCCCCTTGTAGGGTCGGTACAGCACCAGCAACCAAATGTGATGACTTCTTTACAGAACCAATGACACGCGAATCTCCGGCACACTCTGCTCCAGTACTACCAGCATTACCAGCGCCAGCGCTCATGAGAACCAGCCTCGACGATATTGAAACTGATACAGAATATCAAGATGCACACACGGATCACATCTGCGCTTCAGAAAACACAGAACCTCCAAAACAAGGCAAGACACGCGAATCTCCGGCACACTCTGCTCCAGTACTACCAGCATTACCAGCGCCAGCGCTCATGAGAACCAGCCTCGACGATATTGAAACTGATACAGAATATCAAGATGCACACACGGATCACATCTGCGCTTCAGAAAACACAGAACCTCCAAAACAAGGCAAGGTCAGCAGATTCCGAGCTCGAATAGTACCAGGGCAGCATAAAATATGTGCGTTAAAAAATTCAACCAAGGATTCTGTCAAAGAAAAAGTCCAACCGCCGACTAACGTCCAGCCACACGGCCATCATCACACAAAAAATGTAAACCATCACCAATGTTGCGGTGCGTTCGGAAATCCGCATGTCCGTCACAAAACCAGTTCCAGCTGTCATTTGATTTAG

Protein sequence:

>DPOGS207218-PA
MYGLLLENMAEYIRQTYGEERWEDIRRQAGVEQPSFSVHQVYPENLITRLAKKAQEVLGISEREFMDQMGVYFVGFVSQYGYDRVLSVLGRHMRDFLNGLDNLHEYLKFSYPRMRAPSFICENETRQGLTLHYRSKRRGFVYYAMGQIREVARHFYHKEMRIELLREELLFDTVHVTFQLTFDNRAFTLASLAMTREEKHLPISASVLFEIFPFCIVFGSDMVVRSIGNSLMVILPDLVGKKITNWFDLVRPLIAFKFQTILNRTNNIFELVTVEAVMHEKAPDKRNELIRLSDESDTTTEKNLRLKGAISKKRFAYDGRQLPDVINLHLKGQMIYMDNWRMMMYLGTPVMPDLAALVSTGLYINDLSMHDFSRDLMLAGTQQSVELKLALDQEQQKSKKLEESMRKLDEEMKRTDELLYQMIPKQVADRLRNGENPIDTCEMFHSVSILFSDVVTFTEICSRITPMEVVSMLNAMYSIFDTLTERNRVYKVETIGDAYMVVSGAPEKEDNHAEKVCDMALDMVDAITDLKDPSTGSHLSIRVGVHSGAVVAGIVGLKMPRYCLFGDSVNTASRMESTSEAMRIHISQTTQELLSPSYKVTERGEIQVKGKGAMKTYWLEGRESRPSLTKLISSQIQPVSELEWERAADVRDSIAEYSAQQLNNKETNIHLPNAINSGPNSLSNNNAGNPTFQPSTPTVKSPTAPTMMSPAEERRMYSPVTFQDVARRSIANSPNRTEKDKESRSTTASVGGQWTDAESLDPQRTLDSLNSSFCSTSPCRVGTAPATKCDDFFTEPMTRESPAHSAPVLPALPAPALMRTSLDDIETDTEYQDAHTDHICASENTEPPKQGKTRESPAHSAPVLPALPAPALMRTSLDDIETDTEYQDAHTDHICASENTEPPKQGKVSRFRARIVPGQHKICALKNSTKDSVKEKVQPPTNVQPHGHHHTKNVNHHQCCGAFGNPHVRHKTSSSCHLI-