Monarch geneset OGS2.0

DPOGS211074
TranscriptDPOGS211074-TA3399 bp
ProteinDPOGS211074-PA1132 aa
Genomic positionDPSCF300007 - 1408856-1419669
RNAseq coverage57x (Rank: top 69%)
Annotation
HeliconiusHMEL0093840.068.41% 
BombyxBGIBMGA002956-TA0.058.82% 
DrosophilaPhlpp-PA7e-10132.84% 
EBI UniRef50UniRef50_UPI00022C8E6F2e-15339.26%UPI00022C8E6F related cluster n=1 Tax=unknown RepID=UPI00022C8E6F
NCBI RefSeqXP_973398.21e-15240.85%PREDICTED: similar to adenylate cyclase [Tribolium castaneum]
NCBI nr blastpgi|3504014658e-15339.26%PREDICTED: PH domain leucine-rich repeat-containing protein phosphatase 2-like [Bombus impatiens]
NCBI nr blastxgi|1571151691e-15239.13%adenylate cyclase [Aedes aegypti]
Group
Gene OntologyGO:00038248.9e-24catalytic activity
KEGG pathway 
InterPro domain[634-882] IPR0019328.9e-24Protein phosphatase 2C-like
Orthology groupMCL11006 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211074-TA
ATGGTCTGTGACGACGGCCTCACTCCATCGCCTCTTATCTCACGCAAATCGCTTCGAAGACTAGCATCAACAAGAGGATTCAAACCGCGCTCTGAATCAACCTGGATACGCGTATTTGATGGTCTTGAACCTTATGCTGTGGATGCGCCGAGCAAGCTCGTAAAAGTGTCCCCCTATACTACCGTCGAAGACATAAACAAGAAACTTGGCTTCAATGAAGAATTGACGCTATGGGTGCAGATAGGAGGAGAAAATTCTCGACGGCTGGAATTGAACGAGTTCCCATTCCAAATACAAGAGAAGTTTTTAATTAACAATGGTTGGAAGTCAGAGGCTAGGCGACAGCGGCTTGCGGTAGATCCGGAGTTACGTCACAGTCTGCGCTGGTGTGCAGGACCTTCCAGTCGGTCTGGTGGTGTCCTGCGGTCAGGCACTGTTTATGTTTTAAAAGGGCACGTGTTCCCACAATGGAAGCCCCGACAGGCCCACATTATTGGATCGCAATTACATACACACGGTGTGTCCTGGGATATGTTGGAGCTCAGTGGAGGTAGTATTGAAATGTGTCAACCGAAAGCTCAGAAACTAGTCCTCTGCGTAAAGCTTCTTTGTCAAGGCAATGGTGTACTTGACACGGGAGTCAATCATTTATTTCTGGGGTTCAATACAATTTGGGAGCGTAATATGTGGTGCCGTTGGTTAAAAGAGAGTAATAAAATGAAATGTGAAGACGAGGAGATAGACAGCTTGGATGTTTTCTCTCAAGAAAATGAAGATGTGTTCTTGGATAGCTTGGAACCAGCTACGTATAGAGAGAAATATGAAAACTCGGGCACTCAAAGTACACAGCCACCAAATGTCTTAGACTTGAGCGGTGGCGGACGGTCCTGTCTGCCCGTAGCTCTAAACCAGCACGCCTCAGATGGGTTCGCGGTGAAAGTTTTAAGGATGCGGAGCAACACTCTACCAGCCTTGCCTCCTCAAACATGCAATCTGATAGCCCTAACTCATCTCGATGTGAGCGACAATAAAATCATTGAGCTACCAAAGGAGATTTCACATCTAACGCAATTAGAAGAAATAAACGTGAGCAACAACGAAATTAAGTCGCTGGACTGTCTCTTACGACTTCCTCGTCTACGAACTGTCGTAGCTGCTAGAAACCTGATCACGCAATTCGGAGTCAATGACACTAGCCAAATGGGTTTTCTAGAGGAAAATAAATCAGAATATCGTGCACCGCTAACGAATGTCGACCTCCGATACAATAAACTAAAAGGAAGCATAATTCTTGGTAATTATGAGCATCTGGTGACTCTTGATGTCTCTCAAAACTCCATTGAGGTTTTGGTGCTTTCATCGCTGCGTGGACTACGGGAGCTGTATGCTGCTCATAATTCTATCCAGCACTTGGCCTTGCATGGTGCTTCGTTACGAGTCCTACATGCTCCGTACAATAATATGGAGAATTTGACAACAATGGTGCCACCAATAAATTTAGTGGAGATGAACCTGACATACAATAAATTATCATCTTTACCACAGTGGATCAGTGGTTGTTCAGATCTGACCAAACTCTTTGCAAAAATTGAAGAACTGGTTCTTTCTGGTAATTCACTCTCGAAATTGCCAGACAATTTGCCACAGATGAATAACATAAAAATTGTGAGGGCACATTCAAATCGTCTTCGCTCAGTTCCAATGTTTGCTTGCAGTGCTAGCGTTAAAATTCTAGACTTTGCTCATAACGAACTGGACAGCATTGATCTGCGTCTTTTAGCACCGAAGCAATTAAAATTTTTAGACATATCATGTAATAAGAAGTTACAAATGAATCCCTCGCAGTTTAACGCTTATAAATGTCAACGACCTTTAAGCCTAGTTGATGTCACTGGACAACATGGAAATTCTTTATCGCAAAAAAATAATTTTCATGAAGAATTAAGTGGTGGGACCCCGTGGGTAACTGGTTTTTCGGAATGTCCAAATAAAAAACTTCTTCTATCTTGTGCACAAATACGACTTCCATCGTTTTGTAACAAGGAAGGCTTATTTGCTATAATTGACGGGGAAACAGATATCGAAGTCCCAAGAATACTACAGTCATGTCTTCCAGGACTACTACTTGAAGAAAAATCTATTAAGGAAACAGTCAATGAATATATGAAATATGTTATACTAGCTGCACATAGAGAATTGAAACAAAAAGGACAGACAAAAGGTGCATGTCTTGTTATGTGTCACTTGTCTCCTATTAGTACCCCCGATAACAGTTTTGGACAATCTATAAGACGATATAATATAAGATTAGCGAATGTCGGCAATACAAAAGCAGTGTTAAGTCGTCGTAATGGCCCTTTATGTTTAGGTATAGATGATAATAAGCGATTAGGTTATTCTTCAAGATACCCAGTTAATGTACCTGATCCCGATATTATACAAACTGTAATTAAAGAAGACGATGAGTTTTTAATATTAGGAAACGCTAAATTTTGGGAATCCGTTACAGTCGATACTGCAATATCAGAAGTGAGGGCTGAACGGAATCCAGTATTAGCAGCAAAGAGATTACAAGATTTGGCTCAAAGTTATGGAATAGAAGATTGTATATCGGTGGTAATCGTAAGATTTGATACAGTTCGTTCTGATGTAGATTTATTAATGCGAGAATTACGACATACGATCAACACAAACAAACCTGTATGTAATCCTGACTGCTGTTGCTCTCGTTTAGAACCATGTTGCCATTCTATCTCACCACCAAAATCAAATAGCGATAGATCTTCTCCAAGTGGACAAAGCGATCGACCTTCTAGTGAAACAGTTAGTCATCAACACTATGCCAGTGTACGTTCTCATAATAGGGCCTCAGAAAGAAAACCAAGAGGCGGAGTTGCACGAGCAATTCGAGTACGAGTTGAAGAAGATAAAGAGACTGAAAAAATTATTGACGATGTTCCCTCTTCAGATGAACAATTCAAATGTTGGGAATATATGCTGGAGCAAAATACACAAATGATATTTGATAAAGAGCTAGATAATCTTTCAAAAGGTATCAAATCAAATTCAAGTAGTTTAAGAAATTTAAAGGGACTCTCAGGAAGTAGTCCCCAACTACATCTAAATACGAAACAAACAAAACTACCGTTTCTCTCAAAACATTTCGGGAGTGCTAGATCTTTCGGTAGTAATATAAAGCCTGAGTTTCGTTTCGGTTCAGGAAGAATGCCTAATGGTGGTCCAAATGCTGCTTACTTTGGTTCACTTCAAAGGTTAATGCCTTATCATTTAGAATACGATTTCGCGGTTATTCAAGAAAAACAAACACAATCACAGGACTCTCTTGATCTCGAGGGCCGGATGCAACAATATTGGGGAGTTGCAACAACTGAACTTTAA

Protein sequence:

>DPOGS211074-PA
MVCDDGLTPSPLISRKSLRRLASTRGFKPRSESTWIRVFDGLEPYAVDAPSKLVKVSPYTTVEDINKKLGFNEELTLWVQIGGENSRRLELNEFPFQIQEKFLINNGWKSEARRQRLAVDPELRHSLRWCAGPSSRSGGVLRSGTVYVLKGHVFPQWKPRQAHIIGSQLHTHGVSWDMLELSGGSIEMCQPKAQKLVLCVKLLCQGNGVLDTGVNHLFLGFNTIWERNMWCRWLKESNKMKCEDEEIDSLDVFSQENEDVFLDSLEPATYREKYENSGTQSTQPPNVLDLSGGGRSCLPVALNQHASDGFAVKVLRMRSNTLPALPPQTCNLIALTHLDVSDNKIIELPKEISHLTQLEEINVSNNEIKSLDCLLRLPRLRTVVAARNLITQFGVNDTSQMGFLEENKSEYRAPLTNVDLRYNKLKGSIILGNYEHLVTLDVSQNSIEVLVLSSLRGLRELYAAHNSIQHLALHGASLRVLHAPYNNMENLTTMVPPINLVEMNLTYNKLSSLPQWISGCSDLTKLFAKIEELVLSGNSLSKLPDNLPQMNNIKIVRAHSNRLRSVPMFACSASVKILDFAHNELDSIDLRLLAPKQLKFLDISCNKKLQMNPSQFNAYKCQRPLSLVDVTGQHGNSLSQKNNFHEELSGGTPWVTGFSECPNKKLLLSCAQIRLPSFCNKEGLFAIIDGETDIEVPRILQSCLPGLLLEEKSIKETVNEYMKYVILAAHRELKQKGQTKGACLVMCHLSPISTPDNSFGQSIRRYNIRLANVGNTKAVLSRRNGPLCLGIDDNKRLGYSSRYPVNVPDPDIIQTVIKEDDEFLILGNAKFWESVTVDTAISEVRAERNPVLAAKRLQDLAQSYGIEDCISVVIVRFDTVRSDVDLLMRELRHTINTNKPVCNPDCCCSRLEPCCHSISPPKSNSDRSSPSGQSDRPSSETVSHQHYASVRSHNRASERKPRGGVARAIRVRVEEDKETEKIIDDVPSSDEQFKCWEYMLEQNTQMIFDKELDNLSKGIKSNSSSLRNLKGLSGSSPQLHLNTKQTKLPFLSKHFGSARSFGSNIKPEFRFGSGRMPNGGPNAAYFGSLQRLMPYHLEYDFAVIQEKQTQSQDSLDLEGRMQQYWGVATTEL-