Monarch geneset OGS2.0

DPOGS200819
Transcript: DPOGS200819-TA (1899 bp)
Protein: DPOGS200819-PA (632 aa)
Genomic position: DPSCF300071: 752368-781698
RNAseq coverage: 2x (Rank: top 91%)
Annotation
Heliconius: HMEL005700, 3e-102, 58.61%
Bombyx: BGIBMGA009865-TA, 1e-72, 49.09%
Drosophila: CG33958-PA, 3e-177, 52.70%
EBI UniRef50: UniRef50_F4X106, 0.0, 54.73%, Atrial natriuretic peptide receptor A n=5 Tax=Endopterygota RepID=F4X106_ACREC
NCBI RefSeq: XP_001868864.1, 0.0, 54.88%, guanylate cyclase [Culex quinquefasciatus]
NCBI nr blastp: gi|340713666, 0.0, 54.35%, PREDICTED: hypothetical protein LOC100646059 [Bombus terrestris]
NCBI nr blastx: gi|380025671, 0.0, 54.81%, PREDICTED: uncharacterized protein LOC100863861 [Apis florea]
Group
Gene Ontology:
  GO:0016849, 2.4e-94, phosphorus-oxygen lyase activity
  GO:0009190, 2.4e-94, cyclic nucleotide biosynthetic process
  GO:0035556, 2.4e-94, intracellular signal transduction
  GO:0006182, 2.6e-05, cGMP biosynthetic process
  GO:0004383, 2.6e-05, guanylate cyclase activity
KEGG pathway: dgr:Dgri_GH13987, 5e-80
  K12323 (NPR1) maps to: Purine metabolism; Vascular smooth muscle contraction
InterPro domain:
  [380-574] IPR001054, 2.4e-94, Adenylyl cyclase class-3/4/guanylyl cyclase
  [89-318] IPR013587, 7e-18, Nitrate/nitrite sensing protein
Orthology group: MCL17092, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200819-TA
ATGCATTACAATAAATTCAAAACATCTGTTTTCTTATCATTAGCAACACCGAAACGACTAAAACCGGATGATAACACCAGTTGCGTGTTCAAATTCCGTAAATGGTGCCGCAGTCGACGCTGTCAGCTGTGGAGGCTGCTACTTCTCCCCTTCATACCTATCCTGGCTCTGATTGTTCAAACCACCTTCTCCTTAAAGAACAGCCTCACTAATGGCATGGAAGTTGCTGATGTCGAAGAACAGGTCAGCAGAGCTACTGAGTTAGGGAAACTGGTGACCAGGTTGCAGCAGGAGAGGTCTGAAGTCGCATTCTTTATATTCACCAATGGAAGTACTCTGAGGTCAAATTTAACGCAACGCTTTGCTGGCACGGATAGCGCTATCGAGCAAATGGGAGAATTACCGCCTTTGACCTTCAAAAATAGAGCTGTTATTGACGGACAGGAGTTTGTAAATGAACTCGCTGATCTGAGAGCAAAAATAAATTCCGGAACGTCGATGACAGAGGTTGTGGAGTGGTATACGAACGCCAACGCCGCACTCCTCACACACCTCACCAAGGAAATCAAGGACACTGATAGCAGTACCATATGGAGATATTTGGTTGGTTTTAAAAATTTGCTTCGGAGTGTTGAATGTAAGGGCATCGCTTCTGTGTTGGGCATAAATTACTTCGCGAGGGGTTTTCTACAGCCACGAGCACATGAAAAATATATAAGCCACATGGTTTTAGGAAGGGATCTTCTAGACAATACTTTGAATCTGGTGCCATCCCTCATTCCGCTCCTCATAGACATAAAAGAGGACAGTCCAGAGTACCAGGTTCTAGAGAAGAAGAATCAACGAATTGTTGATAACAAGCCCCACGCTGGTAGTGTCGGCGAGGCAATAGAATACTTCGACCAAACTGCGACTTTGCTCGGAAAATTAAGAGCTGTTCAGAAGCAATTAAGAGAATACATACGTGAGGGCGTCAATGAAAGTCTCCGAGAAGCTCGCCGCAGTGAGGCTATATGTGGTGGTATATTGGTGCTAGTGTGCGTTGTCTCACCCATCATTATAGCGTTGGTGAGGAATGCCGTCAATACTATACAGATATACGCTCGCAACCTGTCAGAGAAAGCAAGGGAATTGGAATATGAAAAGGAATTGAGTGATTCTCTGCTTTACCAAATGCTGCCAGCCAGCGTTGCCAAACAGCTGAAGCAAACGCAACAGGTGCCCGCTGAATTCTTCGCGTCAGTGACCGTGTACTTCAGTGACATAGTTGGCTTCACAGCTATCGCTGCCGTGTCTACACCCTATCAGGTTATAAGCTTTTTGAACTCCGTGTACAAACTGTTCGATGAACGCATCGAGTGCTACGATGTTTACAAAATAGAAACAATTGGTGACTCCTACATGGTAGCGTCAGGACTTCCCGTCAGGAACGGTAACAAACATGCAACAGAGATCGCCAGCATGGCTTTGGAACTGCTTGAGGCGACCTCTATCTGCCGGCTCCCTCACCGGCCCGATCAGGCTCTTTGTATGCGGAGCGGTATCCACACGGGTCCTTGTGTAGCTGGGATCGTTGGCAGCAAAATGCCACGTTACTGTCTCTTCGGTGACACTATCAACACCGCCAGTAGAATGGAAAGCACAGGGGAGCCGATGAAAATTCAAATATCAGAAGACGTTAAGTTGGCGTTGGACAAAACTGGACTCTTCATCACGACACCTAGAGGAGTTGTTGACGTGAAGGGCAAAGGTGAAATGACGACCTACTGGCTGAATGGAAGGACTGGCCCATCTCCAGTCCGGCCTCCTGCTTCCTCGTTGGACTGTACTCCAAGCTTTCTCACCCGCATCCACTCACAGCGCCGCAGTTCACCTCGATACCAACCCAACCAGAGCTAG

Protein sequence:

>DPOGS200819-PA
MHYNKFKTSVFLSLATPKRLKPDDNTSCVFKFRKWCRSRRCQLWRLLLLPFIPILALIVQTTFSLKNSLTNGMEVADVEEQVSRATELGKLVTRLQQERSEVAFFIFTNGSTLRSNLTQRFAGTDSAIEQMGELPPLTFKNRAVIDGQEFVNELADLRAKINSGTSMTEVVEWYTNANAALLTHLTKEIKDTDSSTIWRYLVGFKNLLRSVECKGIASVLGINYFARGFLQPRAHEKYISHMVLGRDLLDNTLNLVPSLIPLLIDIKEDSPEYQVLEKKNQRIVDNKPHAGSVGEAIEYFDQTATLLGKLRAVQKQLREYIREGVNESLREARRSEAICGGILVLVCVVSPIIIALVRNAVNTIQIYARNLSEKARELEYEKELSDSLLYQMLPASVAKQLKQTQQVPAEFFASVTVYFSDIVGFTAIAAVSTPYQVISFLNSVYKLFDERIECYDVYKIETIGDSYMVASGLPVRNGNKHATEIASMALELLEATSICRLPHRPDQALCMRSGIHTGPCVAGIVGSKMPRYCLFGDTINTASRMESTGEPMKIQISEDVKLALDKTGLFITTPRGVVDVKGKGEMTTYWLNGRTGPSPVRPPASSLDCTPSFLTRIHSQRRSSPRYQPNQS-
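
Consistency check (illustrative sketch, not part of the original record): the transcript length of 1899 bp corresponds to 633 codons, that is, 632 amino acids plus one stop codon, which agrees with the listed protein length. The Python below assumes the standard genetic code and writes the stop codon as "-", following the convention in the protein sequence above. The function name translate and its stop_symbol parameter are hypothetical helpers introduced here for illustration. Only the first 18 and last 9 nucleotides of DPOGS200819-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, stop_symbol="-"):
    """Translate a coding sequence in frame 0; stop codons become stop_symbol."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# Lengths as stated in the record.
transcript_bp = 1899
protein_aa = 632
print(transcript_bp == 3 * (protein_aa + 1))  # True: 632 residues + 1 stop codon

# First 18 nt and last 9 nt of DPOGS200819-TA, copied from the record.
print(translate("ATGCATTACAATAAATTC"))  # MHYNKF
print(translate("CAGAGCTAG"))           # QS-
```

The two translated fragments match the start (MHYNKF) and the end (QS-) of DPOGS200819-PA. Passing the full nucleotide sequence to translate would allow a residue-by-residue comparison with the protein record.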