Monarch geneset OGS2.0

DPOGS200876
TranscriptDPOGS200876-TA1002 bp
ProteinDPOGS200876-PA333 aa
Genomic positionDPSCF300071 + 727123-733642
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0114801e-10766.44% 
BombyxBGIBMGA009865-TA1e-11268.44% 
DrosophilaCG33958-PA3e-8750.17% 
EBI UniRef50UniRef50_Q7Q9X32e-9453.63%AGAP004555-PA n=2 Tax=Endopterygota RepID=Q7Q9X3_ANOGA
NCBI RefSeqXP_001649945.13e-9455.41%guanylate cyclase [Aedes aegypti]
NCBI nr blastpgi|3479721217e-9453.63%AGAP004555-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479721218e-9053.29%AGAP004555-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00168495.6e-93phosphorus-oxygen lyase activity
GO:00091905.6e-93cyclic nucleotide biosynthetic process
GO:00355565.6e-93intracellular signal transduction
KEGG pathwaymmu:2301032e-72 
 K12324 (NPR2)maps-> Purine metabolism
    Vascular smooth muscle contraction
InterPro domain[92-286] IPR0010545.6e-93Adenylyl cyclase class-3/4/guanylyl cyclase
Orthology groupMCL20379 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200876-TA
ATGGATGACTGGGACTCCGTAGGGAATGATTACGTCGAAAAAACCTCAGGCTTCGTGGAGGTCTCCAAAACGATGCAGACGGGTTTATGGAAATATGTCAGGAAATCAGCTGATCATGAAGTGTGGTACGCTCGTCGGACCCTGGCGTTGGGCGTGTTTGTGCTGGTGATAGTGTTGCTTGCTTCACCAGTCCTTATACTGATACTGAGACATATTATAACCACTATACAGGCATTCACAGAATCTATACAACACAGTACGGAACAATTGATGGCTGAAAAGCAGAAGAGCGATTTGTTGCTGTCAAGAATGTTGCCACTACCGGTTCTGAAGAGGCTTCGAGCTCAACGGACGGTACCTGCTGAGGCTTTTGATGCTGTCACAATTTACTTTAGCGATATCGTTGGCTTCACTAACATTTCAGCGAACAGCACTCCGATGGAGATTATAAATATGCTTAATATGCTGTACAGATTGTTCGACGACAGAATAATGCAGTACAACGTTTACAAAGTGGAAACGATAGGAGACGCGTACATGGTCGTCTCTGGACTACCACAAAGAAATGGAAATCGTCACGTGTCGGAGATAGCGGACATGGCGTTATCCCTTTTGCGCTGCGTGGAGGGAGCTGTAGTACCCCACCGGCCGGAGGAGCCTCTGAGGGTTCGAGCCGGGGTCAATACAGGCCCGTGTGTGGCGGGGGTCGTCGGGGCTACCATGCCACGGTACTGCCTTTTCGGGGATGCCATTAACACTGCCAGTAGGATGGAGAGCACAGGGGAAGCGATGAAAATTCACATATCATCAAGCACTAAGGAGGCTTTAGACAAAATAGGAAATTACATTACAGAATCCCGAGGGATGATCGACGTAGCGGGTAAGGGTCTGATGGAGACTTTTTGGTTGGTGGGGAAAGTTGGGAATATGGTCTCCGAGAGCCCTTGTCAGCTGAAGCTGGAGGACTACGACCAGAACACACTCGAACTACTTATTAAATAA

Protein sequence:

>DPOGS200876-PA
MDDWDSVGNDYVEKTSGFVEVSKTMQTGLWKYVRKSADHEVWYARRTLALGVFVLVIVLLASPVLILILRHIITTIQAFTESIQHSTEQLMAEKQKSDLLLSRMLPLPVLKRLRAQRTVPAEAFDAVTIYFSDIVGFTNISANSTPMEIINMLNMLYRLFDDRIMQYNVYKVETIGDAYMVVSGLPQRNGNRHVSEIADMALSLLRCVEGAVVPHRPEEPLRVRAGVNTGPCVAGVVGATMPRYCLFGDAINTASRMESTGEAMKIHISSSTKEALDKIGNYITESRGMIDVAGKGLMETFWLVGKVGNMVSESPCQLKLEDYDQNTLELLIK-