New model in OGS2.0 | DPOGS207218  |
---|---|
Genomic Position | scaffold2911:+ 2991-14875 |
See gene structure | |
CDS Length | 3048 |
Paired RNAseq reads   | 61 |
Single RNAseq reads   | 146 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010713 (0.0) |
Best Drosophila hit   | guanylyl cyclase at 88E, isoform C (0.0) |
Best Human hit | guanylate cyclase soluble subunit beta-1 (6e-90) |
Best NR hit (blastp)   | soluble guanylyl cyclase beta-3 [Manduca sexta] (0.0) |
Best NR hit (blastx)   | soluble guanylyl cyclase beta-3 [Manduca sexta] (0.0) |
GeneOntology terms    | GO:0008074 guanylate cyclase complex, soluble GO:0004383 guanylate cyclase activity GO:0020037 heme binding GO:0023034 intracellular signaling pathway GO:0006182 cGMP biosynthetic process |
InterPro families    | IPR001054 Adenylyl cyclase class-3/4/guanylyl cyclase IPR011644 Haem NO binding IPR018297 Adenylyl cyclase class-3/4/guanylyl cyclase, conserved site IPR011645 Haem NO binding associated |
Orthology group | MCL15835 |
Nucleotide sequence:
ATGTACGGCTTGCTGTTGGAGAACATGGCGGAGTACATCCGTCAGACTTACGGAGAAGAA
AGATGGGAGGATATACGGCGTCAGGCTGGAGTGGAACAGCCATCATTCTCAGTGCACCAA
GTCTATCCTGAGAATTTAATTACAAGATTGGCTAAAAAGGCCCAGGAGGTGTTAGGCATA
TCAGAAAGAGAATTTATGGATCAAATGGGCGTATACTTTGTAGGTTTTGTCTCACAGTAC
GGCTACGACAGAGTTTTATCAGTTTTAGGTCGACATATGCGGGATTTTCTGAACGGTTTG
GATAATTTACACGAATACTTAAAATTCAGTTATCCAAGAATGAGAGCCCCGAGTTTTATT
TGTGAAAATGAAACAAGGCAGGGACTGACACTACACTACCGGTCAAAACGGAGGGGGTTC
GTTTATTACGCTATGGGACAAATTAGAGAGGTAGCCCGTCACTTCTACCATAAGGAGATG
CGTATAGAGTTGTTACGCGAGGAACTCCTTTTTGACACAGTTCATGTAACTTTCCAACTG
ACGTTCGACAATCGTGCATTCACCCTGGCCTCGCTGGCAATGACAAGGGAAGAAAAACAT
CTGCCTATTAGCGCTTCGGTCCTCTTTGAGATATTCCCGTTTTGTATTGTCTTTGGTTCA
GACATGGTAGTTCGCAGCATCGGCAATTCCCTGATGGTGATTTTACCAGACCTAGTGGGG
AAGAAGATCACCAACTGGTTTGATCTCGTGCGACCGCTCATAGCGTTTAAATTTCAAACC
ATACTAAACAGGACGAATAACATCTTCGAACTGGTGACAGTGGAAGCTGTGATGCATGAG
AAGGCGCCTGACAAACGTAACGAACTCATCAGGCTGTCTGATGAATCTGATACAACTACT
GAGAAGAATTTGCGGCTCAAAGGACAAATGATATACATGGACAACTGGCGCATGATGATG
TATCTTGGCACGCCAGTGATGCCTGACCTGGCAGCGCTTGTGTCAACAGGGCTATACATC
AACGATCTCTCGATGCATGACTTCAGCAGAGACCTTATGTTAGCTGGCACACAACAATCA
GTCGAACTAAAGCTGGCCTTGGACCAGGAACAGCAAAAAAGTAAGAAGCTCGAAGAATCC
ATGAGGAAATTGGATGAAGAGATGAAGAGAACGGATGAGCTGTTGTATCAGATGATACCG
AAACAGGTCGCTGATAGGTTGAGGAACGGAGAGAATCCCATTGACACTTGTGAGATGTTC
CATAGTGTGTCCATATTATTCTCCGATGTTGTGACCTTCACTGAGATCTGTTCCCGCATC
ACTCCGATGGAAGTTGTCTCGATGCTTAATGCTATGTACTCCATATTCGATACGCTCACA
GAACGTAATCGCGTTTATAAGGTTGAAACAATAGGTGACGCTTACATGGTAGTGTCAGGG
GCACCAGAGAAAGAGGACAATCATGCTGAGAAGGTCTGCGACATGGCACTTGACATGGTA
GACGCGATAACAGACCTTAAAGATCCCAGCACAGGTTCCCATTTATCGATTCGGGTGGGA
GTACATTCTGGTGCAGTGGTCGCAGGCATCGTTGGTTTGAAGATGCCTCGCTACTGTCTT
TTCGGGGACTCAGTGAATACAGCATCTCGTATGGAATCGACCTCAGAGGCGATGAGGATC
CACATCTCACAGACAACGCAAGAGCTACTGTCGCCATCCTACAAGGTCACCGAACGAGGC
GAAATACAAGTGAAAGGAAAAGGTGCTATGAAAACTTACTGGTTAGAGGGACGTGAATCC
AGGCCATCGCTGACTAAACTAATTTCATCCCAAATTCAACCAGTATCGGAACTGGAATGG
GAAAGGGCAGCCGATGTACGAGACAGCATCGCCGAATATTCAGCACAGCAACTGAATAAT
AAGGAAACAAATATCCATCTTCCTAACGCAATCAATTCTGGGCCCAATTCACTTAGCAAC
AACAACGCTGGTAATCCAACATTCCAACCATCCACTCCGACTGTCAAGAGCCCTACAGCC
CCTACTATGATGTCACCAGCTGAAGAGAGACGGATGTATTCTCCTGTCACTTTCCAGGAT
GTCGCTAGACGGAGTATCGCAAACTCACCGAACAGAACAGAAAAGGATAAAGAATCAAGA
TCAACCACAGCGAGTGTGGGAGGTCAATGGACTGATGCGGAATCTTTGGACCCACAACGC
ACCCTCGACAGTTTAAACTCTTCTTTCTGTTCAACGTCCCCTTGTAGGGTCGGTACAGCA
CCAGCAACCAAATGTGATGACTTCTTTACAGAACCAATGACACGCGAATCTCCGGCACAC
TCTGCTCCAGTACTACCAGCATTACCAGCGCCAGCGCTCATGAGAACCAGCCTCGACGAT
ATTGAAACTGATACAGAATATCAAGATGCACACACGGATCACATCTGCGCTTCAGAAAAC
ACAGAACCTCCAAAACAAGGCAAGGTCAGCAGATTCCGAGCTCGAATAGTACCAGGGCAG
CATAAAATATGTGCGTTAAAAAATTCAACCAAGGATTCTGTCAAAGAAAAAGTCCAACCG
CCGACTAACGTCCAGCCACACGGCCATCATCACACAAAAAATGTAAACCATCACCAATGT
TGCGGTGCGTTCGGAAATCCGCATACACGCGAATCTCCGGCACACTCTGCTCCAGTACTA
CCAGCATTACCAGCGCCAGCGCTCATGAGAACCAGCCTCGACGATATTGAAACTGATACA
GAATATCAAGATGCACACACGGATCACATCTGCGCTTCAGAAAACACAGAACCTCCAAAA
CAAGGCAAGGTCAGCAGATTCCGAGCTCGAATAGTACCAGGGCAGCATAAAATATGTGCG
TTAAAAAATTCAACCAAGGATTCTGTCAAAGAAAAAGTCCAACCGCCGACTAACGTCCAG
CCACACGGCCATCATCACACAAAAAATGTAAACCATCACCAATGTTGCGGTGCGTTCGGA
AATCCGCATGTCCGTCACAAAACCAGTTCCAGCTGTCATTTGATTTAG
Protein sequence:
MYGLLLENMAEYIRQTYGEERWEDIRRQAGVEQPSFSVHQVYPENLITRLAKKAQEVLGI
SEREFMDQMGVYFVGFVSQYGYDRVLSVLGRHMRDFLNGLDNLHEYLKFSYPRMRAPSFI
CENETRQGLTLHYRSKRRGFVYYAMGQIREVARHFYHKEMRIELLREELLFDTVHVTFQL
TFDNRAFTLASLAMTREEKHLPISASVLFEIFPFCIVFGSDMVVRSIGNSLMVILPDLVG
KKITNWFDLVRPLIAFKFQTILNRTNNIFELVTVEAVMHEKAPDKRNELIRLSDESDTTT
EKNLRLKGQMIYMDNWRMMMYLGTPVMPDLAALVSTGLYINDLSMHDFSRDLMLAGTQQS
VELKLALDQEQQKSKKLEESMRKLDEEMKRTDELLYQMIPKQVADRLRNGENPIDTCEMF
HSVSILFSDVVTFTEICSRITPMEVVSMLNAMYSIFDTLTERNRVYKVETIGDAYMVVSG
APEKEDNHAEKVCDMALDMVDAITDLKDPSTGSHLSIRVGVHSGAVVAGIVGLKMPRYCL
FGDSVNTASRMESTSEAMRIHISQTTQELLSPSYKVTERGEIQVKGKGAMKTYWLEGRES
RPSLTKLISSQIQPVSELEWERAADVRDSIAEYSAQQLNNKETNIHLPNAINSGPNSLSN
NNAGNPTFQPSTPTVKSPTAPTMMSPAEERRMYSPVTFQDVARRSIANSPNRTEKDKESR
STTASVGGQWTDAESLDPQRTLDSLNSSFCSTSPCRVGTAPATKCDDFFTEPMTRESPAH
SAPVLPALPAPALMRTSLDDIETDTEYQDAHTDHICASENTEPPKQGKVSRFRARIVPGQ
HKICALKNSTKDSVKEKVQPPTNVQPHGHHHTKNVNHHQCCGAFGNPHTRESPAHSAPVL
PALPAPALMRTSLDDIETDTEYQDAHTDHICASENTEPPKQGKVSRFRARIVPGQHKICA
LKNSTKDSVKEKVQPPTNVQPHGHHHTKNVNHHQCCGAFGNPHVRHKTSSSCHLI