Monarch geneset OGS2.0

DPOGS214166
TranscriptDPOGS214166-TA3111 bp
ProteinDPOGS214166-PA1036 aa
Genomic positionDPSCF300014 - 283321-298556
RNAseq coverage72x (Rank: top 66%)
Annotation
HeliconiusHMEL0068080.087.08% 
BombyxBGIBMGA006212-TA0.083.98% 
DrosophilaAc3-PA2e-12955.77% 
EBI UniRef50UniRef50_E0VF290.048.50%Adenylate cyclase type, putative n=2 Tax=Neoptera RepID=E0VF29_PEDHC
NCBI RefSeqXP_001230685.20.050.37%AGAP009315-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582881210.050.37%AGAP009315-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1892372190.052.39%PREDICTED: similar to AGAP009315-PA [Tribolium castaneum]
Group
Gene OntologyGO:00168491.7e-66phosphorus-oxygen lyase activity
GO:00091901.7e-66cyclic nucleotide biosynthetic process
GO:00355561.7e-66intracellular signal transduction
KEGG pathwayaga:AgaP_AGAP0093150.0 
 K08043 (ADCY3)maps-> Salivary secretion
    GnRH signaling pathway
    Olfactory transduction
    Progesterone-mediated oocyte maturation
    Gap junction
    Vibrio cholerae infection
    Vasopressin-regulated water reabsorption
    Gastric acid secretion
    Dilated cardiomyopathy
    Chemokine signaling pathway
    Purine metabolism
    Vascular smooth muscle contraction
    Calcium signaling pathway
    Melanogenesis
    Oocyte meiosis
InterPro domain[807-1013] IPR0010541.7e-66Adenylyl cyclase class-3/4/guanylyl cyclase
Orthology groupMCL10063 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214166-TA
ATGGAGAAAACTGTACAAAATCGAACTACTGAAAAGTGTTCCTGCTGGATGGTGCCACCATGGAGATTTTCGGATTTATCAGACGAGAAATTATATCTGGCCTATACACAACAACAACGACAAAGGGCTGTTCTCCGATTGATCATTACAGGGGTTTTGTTCCAAGTGTTTGCATCGCTTGTGCCTGGAGAACGTGATTTGCATTTCGCCTATAAATCTGTCGCCATTTCCCTTGTACTGAATTTAATTTTGGCAGTAATCTACGCTCTGTTCCGCAAAGCAAGAACAGCTTTGAATCATATCGCATGGATTACTCTATGGGTTCAACTCCTTGTGAGCACTTCCAGACGTCTTGGAGATTCGTATAATGAATTGCTGGGTTGGGCTGTCGTACTGCAATATTTTACTCTTGCAACATTACCATTCCATTATATACTTCTTATATTATACAGTACACTGTCACTCACTGCGTATTTACTTATTCAATACTATAATGCTACAACTACTGAAAGCAGACTGGCCGGCGATTTTTACTTGCAGCAAATAGCGAACGGATGCCTCTTACTGGGAGCTACATTTCTGGGCGGAACTGCATATGCAATAAGCGAAAAACAACAGCGGAGCTCATTTCAAGAGACAAAACGGAGTTTACGCGATAAACTTACAATTGAACAACAAAGCAAAGAACAAGAAAGATTGTTGCTGTCCGTGCTTCCTGAACATGTGGCAGTACAAATGCGTAAAGATTTGGGACTTATCGATACTCAGTTTAAGAAAATTTACATGTCCCGCCATGAAAATGTTAGTATATTATATGCAGACATCGTCGGCTTTACGGCTATATCCTCAACCTATTCAGCGCAAGATCTAGTAGCGATATTAAACGAACTTTTTGCAAGATTTGATCGACTCGCGGAGAAATACCAACAACTAAGGATCAAGATTCTAGGCGATTGCTATTACTGCATCAGCGGCGCTCCGCTGGAGAGACCTGACCACGCTGTTTTATGTGTTCACATGGGACTGTCTATGGTCAAGGCGATTAAATATGTTCAACAAACAACAAATTCACCAGTGGATATGCGCGTGGGAATCCATACGGGTGCCGTGCTAGCGGGCGTGCTCGGACAACGACAGTGGCAGTTTGACGTTTACTCTCGGGATGTCGAACTTGCCAATAAAATGGAAAGCAGCGGGATGGCTGGTCGCGTTCACGTCTCGGAGGTGACCTTGGGTTTTCTAAACGATGAGTTCGAGGTGGAGCCGGCCCACGGTGAACGACGAGAAGAGATGCTGCGACAGGCCGGCATTAAAACATACTTCATAGTACGGGTATTGAAACCGTATCATGGCGAAGAAAAGAGTGCTGCTGGGGAAGCGGGTGAGGGAGATGGGAGCGACGCGCTCTCCGATCTGAAAGACGACGATGAGGACTCTGTTCTATCACCGCAAGACGAGAGCAAGGACAATGAGGATGTCAAAATACTCGCAATGTTAGAGGAAGAACTCGTCAATAGAGATGACAACAAGGAACTGGCGGATGCGACTACTTTATGCCTGACTTTCAAGACCCAAGCGGCGGAGGCTTGCTATGCCCGACGGGTTGATGCCTGCCCCCTGGCTGTGTGCGCCCCTCCGCTACTGTTGCTACCAGCTGTTGTGGCTCTTGCTAGCTTTGCTGTTCCATCAACATGGTCTTTACACGCCGTCTATCTTGCGCTGGTTCTTGAAACAGGACTCGTTAACGTTGCATACTACGCTTGTTATAGATATGCTCATTATAAGAAAAAAGCAATATCGCCCTACTGGAAGCTGACATGTGGAACGTTTAACATTGTGATGTTTGTCGCTGCCAATATTGCTCCATTGATTATATGTGGCAACTTCGAAACGATGCTAGCAGAAGACGGGCTGAGAGATGACGCCCCTTCACATAGCGGCGCCCGACGTTGCATTCACCCATCATATTATTATCACGTGGGTGCGGGTGCGTTATGTGGTGCGTTGTGGGTTTCTCCATTGGGTGGATGTGCACGAGCGGCTTTGTTGGGGTCACTAGCGGCGGCTCATGCTGTACCCGCAGCCCTACACCCCGCGGTACTAACATCTCGCCCTACACTCGACCGCGACCCCTTTCGTTTGCCATACAACGCCACCGACGACTGCGAAGAAGAATTATTAAACCTGCTAGCCCAGATTCACACTGGTAGACACATCCTGATGCTGTTGTTCATAGCGGTGGCACTCATCATATACAACAGATACAGCGAGTCTCTGGAGCGCGCACGGCACTCGCGCGGGGAGAGAATGCGCGCGCAGGCGGAGCAAGCGAGTGACTTACGGCGACGGAACCAGGCACTGGTCCACAACGTTCTGCCACCACACGTTGCCAGGCATTTCATGGGCGCACGTCATCATCACCGCGATTTATACAGCCAGAGTTACGCGGAAGTCGGCGTGCTTTTCGCCTCCATGCCAAATTTTACAGAGTTTTACTCGGAAGAAACAGTTAACAATCAAGGCCTAGAATGTTTACGATTTCTCAACGAAGTTATATCGGACTTTGATCTCCTGCTGGAAGATGCCAAATTCAGTAAGGACATCATCAAAATAAAGACGATTAGTTCAACTTACATGGCTGCGTCAGGCCTAAATCCTACGCGACAAATGCAGCCTTCTGATGGTGTGTTGGTTCGTTGGGCTCACCTGGCATGTTTGGTGGAGTTCGCGTTGGAGCTGCAGCGTGTTCTGGCCGCCATCAACGAGCAATCTTTCAACCATTTCGTACTTCGAATGGGTGTAAACCACGGGCCGATTACAGCAGGCGTCATTGGTGCCAGGAAGCCCCACTACGATATATGGGGGAACACTGTTAACGTAGCTTCAAGGATGGAAAGCACTGGCAAGGCTGGATGTATACAGGTGACCGAAGAGACCTGTCATATCCTTGAGGATTTCGGGTACTACTTCGAACAACGTGGCCTAGTCGCAGTTAAGGGCAAGGGGCAGCTTATGACGTATTACCTGCAAGGGAAGAAAGGCGACATAGATAATACCATTGTTGACGTAATACAACGAGCGATGAAAGATGTAACAGCATTGGATGGATAA

Protein sequence:

>DPOGS214166-PA
MEKTVQNRTTEKCSCWMVPPWRFSDLSDEKLYLAYTQQQRQRAVLRLIITGVLFQVFASLVPGERDLHFAYKSVAISLVLNLILAVIYALFRKARTALNHIAWITLWVQLLVSTSRRLGDSYNELLGWAVVLQYFTLATLPFHYILLILYSTLSLTAYLLIQYYNATTTESRLAGDFYLQQIANGCLLLGATFLGGTAYAISEKQQRSSFQETKRSLRDKLTIEQQSKEQERLLLSVLPEHVAVQMRKDLGLIDTQFKKIYMSRHENVSILYADIVGFTAISSTYSAQDLVAILNELFARFDRLAEKYQQLRIKILGDCYYCISGAPLERPDHAVLCVHMGLSMVKAIKYVQQTTNSPVDMRVGIHTGAVLAGVLGQRQWQFDVYSRDVELANKMESSGMAGRVHVSEVTLGFLNDEFEVEPAHGERREEMLRQAGIKTYFIVRVLKPYHGEEKSAAGEAGEGDGSDALSDLKDDDEDSVLSPQDESKDNEDVKILAMLEEELVNRDDNKELADATTLCLTFKTQAAEACYARRVDACPLAVCAPPLLLLPAVVALASFAVPSTWSLHAVYLALVLETGLVNVAYYACYRYAHYKKKAISPYWKLTCGTFNIVMFVAANIAPLIICGNFETMLAEDGLRDDAPSHSGARRCIHPSYYYHVGAGALCGALWVSPLGGCARAALLGSLAAAHAVPAALHPAVLTSRPTLDRDPFRLPYNATDDCEEELLNLLAQIHTGRHILMLLFIAVALIIYNRYSESLERARHSRGERMRAQAEQASDLRRRNQALVHNVLPPHVARHFMGARHHHRDLYSQSYAEVGVLFASMPNFTEFYSEETVNNQGLECLRFLNEVISDFDLLLEDAKFSKDIIKIKTISSTYMAASGLNPTRQMQPSDGVLVRWAHLACLVEFALELQRVLAAINEQSFNHFVLRMGVNHGPITAGVIGARKPHYDIWGNTVNVASRMESTGKAGCIQVTEETCHILEDFGYYFEQRGLVAVKGKGQLMTYYLQGKKGDIDNTIVDVIQRAMKDVTALDG-