Monarch geneset OGS2.0

DPOGS212479
TranscriptDPOGS212479-TA837 bp
ProteinDPOGS212479-PA278 aa
Genomic positionDPSCF300222 - 336631-337467
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0093224e-8954.95% 
BombyxBGIBMGA009651-TA5e-8556.43% 
DrosophilaCG10927-PA2e-4734.19% 
EBI UniRef50UniRef50_D6WQH05e-5240.07%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WQH0_TRICA
NCBI RefSeqXP_974513.11e-5240.07%PREDICTED: similar to cytidine and deoxycytidylate deaminase zinc-binding region [Tribolium castaneum]
NCBI nr blastpgi|910870432e-5140.07%PREDICTED: similar to cytidine and deoxycytidylate deaminase zinc-binding region [Tribolium castaneum]
NCBI nr blastxgi|910870433e-4940.07%PREDICTED: similar to cytidine and deoxycytidylate deaminase zinc-binding region [Tribolium castaneum]
Group
Gene OntologyGO:00038245.6e-22catalytic activity
GO:00167874.8e-07hydrolase activity
GO:00082704.8e-07zinc ion binding
KEGG pathwaymgr:MGG_085813e-09 
 K01500 (E3.5.4.-)maps-> Atrazine degradation
InterPro domain[79-278] IPR0161935.6e-22Cytidine deaminase-like
[137-246] IPR0021254.8e-07CMP/dCMP deaminase, zinc-binding
Orthology groupMCL11944 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212479-TA
ATGACATTAAATGAAAAAATACCTCTCAAAGATCTCCAACATTTGAAAAGAGTGAAACGGCAGAATATAATTCTATGTCCGATCAGTTCTCTCAACAGCGAGACCATACAAGAGTACATAGAAAAAAATGTTGGCGAACTTAAAGATATATTTGATTATTTTCAAGTGTTAGAAGTGCCGTTCATAGCACCGAGAGTTACTAGACAATACCAGGAGACGAAAAAGTATTGGCCGTGTAATTTTCATCCAAATCACTACTGGGAGAAGCTTGTCAGGGATTCATTCTTTTCTGATCATGAACTATTAATTCACAAAAAATATATGGAAGTTGTGTTCGAAATTGTTAAGTGGCAGGCAACTTGCTTACAAATAAAATTATGTGATGCTGGATTACAAGATGTAAATGCAAGTGTTGTTGTTGATCCCGATATCAATGCAGTAGTTACTATCGCATTTGACCATAGACAAACACATCCTCTACAACACACCGCCATGATAGCCATAGACAATGTAGCTAGGACCCAAAATGGCGGTGTTTGGGATAGCAACATATCCGATGATTTAATGATTAATTTAAAAGAAAAATTCGGCATAAATTTTGGTGTTAGAGATAGTAATAAAAATAGCAAAGAAGGTCCGTACTTATGTACTGGGTATAACATGTATATATTGAGAGAGCCTTGTCATATGTGTTCTATGGCCTTAGTACATGCTCGAACTAAAAGAATCTTCTTCTGTATAGACAACGAAGAAAAAGGCGCTCTGAAGTCAACAGTGAAATTACAAACCATAAGCTCTTTGAATCATCACTTTGAAGTGTTCACTGGATTTTTATAA

Protein sequence:

>DPOGS212479-PA
MTLNEKIPLKDLQHLKRVKRQNIILCPISSLNSETIQEYIEKNVGELKDIFDYFQVLEVPFIAPRVTRQYQETKKYWPCNFHPNHYWEKLVRDSFFSDHELLIHKKYMEVVFEIVKWQATCLQIKLCDAGLQDVNASVVVDPDINAVVTIAFDHRQTHPLQHTAMIAIDNVARTQNGGVWDSNISDDLMINLKEKFGINFGVRDSNKNSKEGPYLCTGYNMYILREPCHMCSMALVHARTKRIFFCIDNEEKGALKSTVKLQTISSLNHHFEVFTGFL-