Monarch geneset OGS2.0

DPOGS203378
TranscriptDPOGS203378-TA3075 bp
ProteinDPOGS203378-PA1024 aa
Genomic positionDPSCF300003 + 420130-428215
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0072840.069.33% 
BombyxBGIBMGA002043-TA0.058.78% 
DrosophilaPde6-PB1e-9835.17% 
EBI UniRef50UniRef50_B0WPM10.045.52%cAMP and cAMP-inhibited cGMP 3',5'-cyclic phosphodiesterase 10A n=8 Tax=Metazoa RepID=B0WPM1_CULQU
NCBI RefSeqXP_970904.20.046.15%PREDICTED: similar to camp and camp-inhibited cgmp 3,5-cyclic phosphodiesterase, partial [Tribolium castaneum]
NCBI nr blastpgi|2700033230.046.15%hypothetical protein TcasGA2_TC002545 [Tribolium castaneum]
NCBI nr blastxgi|2700033230.046.08%hypothetical protein TcasGA2_TC002545 [Tribolium castaneum]
Group
Gene OntologyGO:00071651.9e-92signal transduction
GO:00041141.9e-923',5'-cyclic-nucleotide phosphodiesterase activity
GO:00055152.1e-31protein binding
GO:00080815.7e-20phosphoric diester hydrolase activity
GO:00038241.4e-07catalytic activity
KEGG pathwaytca:6595100.0 
 K01120 (E3.1.4.17, PDE)maps-> Purine metabolism
InterPro domain[576-920] IPR0020731.9e-923'5'-cyclic nucleotide phosphodiesterase, catalytic domain
[391-546] IPR0030182.1e-31GAF
[636-649] IPR0230885.7e-203'5'-cyclic nucleotide phosphodiesterase
[638-809] IPR0036071.4e-07Metal-dependent phosphohydrolase, HD domain
Orthology groupMCL18783 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203378-TA
ATGACAGCGAGACCTCATTGCAGACCACAGGCCCACTTCACCACGGTTGATTCTTCGGCCCATCACAATGTTAGTAGCAGAGAGGCCTCTCTCCTCGCAAACTTATGTGGAATGGCTGGGAAGCAAAGCCCTATATATAATGAAGAGACAAGGGGTTCTGACGGACAAAACGTAAAGAAGACAGAACCGAATAGACGGGCGATGCGACCGCCGAGGTCTAATATACCAAAACCTTCGCGTCAGATTACTAAAACAATATTATCTACAGAAAACAAAGAAAAGAAGTCCTCGGATATTTTTAAAAACTCCGGTTCTGAATATTCCAACAAGGAACGGCGATGTTTAAGGGATGAATTAAAAAAATCACGGATGCTGGATGAAACATCTCTAAGTGAGCGATCAACTCCCGAGGGTTTGGATGCCAATGTGGTTGTATATCTGGATCAGAATAGGATGTCTTTGGAGTCCTACATCATGAATAGCGTTTCCATTAACCAACTGGAAAGATGGTTAGCCATAAAGAAACATAAGCATTCCGAATCAGGAAAGGCATCCTGTCACAGCGGTAGACTTAAAGAGAAAATATCTAATAAAAATATATTCCTGGAACTGTTCAAGGATTTGAAAAGGGATAAGAATGAACATTCTGTTATCATAGATATAGTACGTCTAATTACTATGACAATTGAAATAGATGCCTACAGAGTTTACAAGTTATTTCCAGATCACGATAGTTTTGTCCAATATTTTGTAGCAGACCAAAGCCAGCTTATATCTAAGCAGTTACGTCGTGTGGATTTAAATAAAATCGAAAACGTCCTCCGCGTTGCTAAAGATGGGACATGTTTACGTCTATCAAGAGAGAATACGGTGAATTTTCCGAGAATGTGTGAAACTTTAGTCGAGGCTTTCGGAATAAAAAATTTGGACGAAGCTAAATATGTTCTGTATCAGCCAATATTGACGGCAAGTGGCAAGACTGGATATGTTATAGAGATGTGGCGCACAAAAAAGGAATTTGATGACGGAGACGAACATTTAAGTTCTAACTATATTGTGTGGGGGAGTCTCATCTTACAATATTGTAACCTATGTCTTGACAAAATGCGCGAACGTAACATGACCGATTTTCTGTTAGACGTTGTTAAGGCTATTTTTGAAGAAATGGTGTCACTTGAACAACTTGTAAAAAAAATATTGGAATTCGCCCAGAAATTGGTTAGCGCCGATCGAGCTTCTCTTTTTCTGGTGGACCATAGAAATAATGAGCTAGTATCAACTGTGTTCGATTTAAAATTTGAACCGGGACAAGGGCGTGACGTGGAGAAAAAAGAAATCAGAATGCCCATCGACCGCGGTATAGCTGGTCACGTCGCCTTATCCGGCGAAACTATGAACATACCAGATGCATATTCGGACTATAGATTTAACAGGGACGTTGATGAGGTCACTGGTTATACCACAAATAGTATACTTTGTATGCCAGTTAAAGTACAAGGAAAGGTCATAGGTGTGGTGCAAATGGTGAATAAAATCAACGCCGACAACTTTGATCGCGAGGATGAGGTTGCGTTTGAAATATTTTCAACGTTTTTTGGGCTCGCTTTACACCATGCAAGGCTATATGACAAAATCATGAGAAAAGAGCAAAGATATAGGGTCGCTTTGGAGGTGCTTAGCTATCATAATACGTGTAAAGAAGACGAAGTGCGTAGAGTTCTCGAGAGCAATGAAGATAGAGAGGAGGATGTGAATTCATTGACAAATTTCTATTTCGATCCTTACAAATTGGGTGACATTCAGAAATGCAAAGCCGTTATCACCATGTTTGATGATCTTTTTGAGCTGTCAAAGTTCGATTTCATGACCATAACGAGATTCATTCTAACAGTCAAAAAGAACTATAGGTCAGTACCATATCACAATTTTGATCATGGATGGTCCGTGGCTCACGCTATGTACGTCATCATTAAGAAGGACCACAGGAAATGCCTTGACTACAAAATGAGATTGGCATTATTTGTCGCATGTTTATGTCATGATTTAGATCATAGAGGTTACACCAACAAATATTTAAATGAAATAGCGTCTCCACTGGCCGCTATGTATACTACATCGCCGTTGGAACATCATCATTTCAACATAACTGTAACAATTTTGCAACAGGACGGACACAATATATTCTCACATTTGTCCAGTCAGGAGTACAAAGACATTCTAGGCTACATAAGGAATTGTATATTAGCAACGGATCTCGCTGCTTTCTTTCCTAACTTAGCTAAATTACAAGAAATTCATAATATGGATTCCAAACGTCCTCATTTCAATTGGAATATTCCTAGTCACAGAGACTTAGGGATGGCGATTTCGATGACAGCAGCCGATTTGTCTGCTTCCGCTAAGCCCTGGGACATACAAATAAAAACCGTAAAGGTGATCTTCAAAGAGTTTTATGAACAGGGAGACAAAGAAAGGGAAGCCGGCAAAACTCCCATACCTATGATGGATCGGAATAAGCCAGAAGAACAACCGGCCAGTCAGGTCGGTTTTTTGAGTCAAATCTGCATACCTTGCTACACTATTTTGTGTAAAATACTCCCAAAGACTAAGCCTATGTATTTAATGGCTTTTAAAAATCTTAATAATTGGAAATCAAAAGTTACCAAAAAGATTGACGATTCTGATAAAGACGAGGATTCTAGGGAGGACACTTTAGAAGTTATATCGGATGCAGAGTTAGACGAAGACGAAGAACGCGAAGAGACTTACACCGAATTTGATAAACTACAAGTTAGTTCAAGCGTTCTTGCAGATCTAGATGAACTGCAAAAAGAGGAAGATGATAACAATATCGAAGAAATAAATAGCAAAATAACCGGCGGGTGGAAATGTATAAAATCTGATTCCGTGTCTTGGGAGGAAGATCAGGAATTAAAAATTGCCGAGACGAAAAAAATAAAAATTTGGAAGGATGTACACGGTAACTTAATCGACGAAGACGGTGTAGTGACGAGATCTAATAAACATTTAAAAGTGGAGATCGGGAGAGCATTACTTAGCAATGTAGAGAATTAG

Protein sequence:

>DPOGS203378-PA
MTARPHCRPQAHFTTVDSSAHHNVSSREASLLANLCGMAGKQSPIYNEETRGSDGQNVKKTEPNRRAMRPPRSNIPKPSRQITKTILSTENKEKKSSDIFKNSGSEYSNKERRCLRDELKKSRMLDETSLSERSTPEGLDANVVVYLDQNRMSLESYIMNSVSINQLERWLAIKKHKHSESGKASCHSGRLKEKISNKNIFLELFKDLKRDKNEHSVIIDIVRLITMTIEIDAYRVYKLFPDHDSFVQYFVADQSQLISKQLRRVDLNKIENVLRVAKDGTCLRLSRENTVNFPRMCETLVEAFGIKNLDEAKYVLYQPILTASGKTGYVIEMWRTKKEFDDGDEHLSSNYIVWGSLILQYCNLCLDKMRERNMTDFLLDVVKAIFEEMVSLEQLVKKILEFAQKLVSADRASLFLVDHRNNELVSTVFDLKFEPGQGRDVEKKEIRMPIDRGIAGHVALSGETMNIPDAYSDYRFNRDVDEVTGYTTNSILCMPVKVQGKVIGVVQMVNKINADNFDREDEVAFEIFSTFFGLALHHARLYDKIMRKEQRYRVALEVLSYHNTCKEDEVRRVLESNEDREEDVNSLTNFYFDPYKLGDIQKCKAVITMFDDLFELSKFDFMTITRFILTVKKNYRSVPYHNFDHGWSVAHAMYVIIKKDHRKCLDYKMRLALFVACLCHDLDHRGYTNKYLNEIASPLAAMYTTSPLEHHHFNITVTILQQDGHNIFSHLSSQEYKDILGYIRNCILATDLAAFFPNLAKLQEIHNMDSKRPHFNWNIPSHRDLGMAISMTAADLSASAKPWDIQIKTVKVIFKEFYEQGDKEREAGKTPIPMMDRNKPEEQPASQVGFLSQICIPCYTILCKILPKTKPMYLMAFKNLNNWKSKVTKKIDDSDKDEDSREDTLEVISDAELDEDEEREETYTEFDKLQVSSSVLADLDELQKEEDDNNIEEINSKITGGWKCIKSDSVSWEEDQELKIAETKKIKIWKDVHGNLIDEDGVVTRSNKHLKVEIGRALLSNVEN-