Monarch geneset OGS2.0

DPOGS200344
TranscriptDPOGS200344-TA882 bp
ProteinDPOGS200344-PA293 aa
Genomic positionDPSCF300026 + 521221-522741
RNAseq coverage138x (Rank: top 55%)
Annotation
HeliconiusHMEL0000434e-9780.68% 
BombyxBGIBMGA005639-TA9e-6072.41% 
DrosophilaDrep-4-PA3e-2331.71% 
EBI UniRef50UniRef50_UPI0002060B717e-3631.10%UPI0002060B71 related cluster n=1 Tax=unknown RepID=UPI0002060B71
NCBI RefSeqXP_001599808.14e-3437.84%PREDICTED: similar to Caspase-activated DNase [Nasonia vitripennis]
NCBI nr blastpgi|2613359532e-9279.81%putative caspase-activated DNase [Heliconius melpomene]
NCBI nr blastxgi|2613359531e-9079.81%putative caspase-activated DNase [Heliconius melpomene]
Group
Gene OntologyGO:00056221.8e-17intracellular
GO:00069151.8e-17apoptosis
GO:00056345.8e-17nucleus
GO:00167875.8e-17hydrolase activity
GO:00057375.8e-17cytoplasm
GO:00063095.8e-17DNA fragmentation involved in apoptotic nuclear change
KEGG pathway 
InterPro domain[2-77] IPR0035081.8e-17Caspase-activated nuclease CIDE-N
[207-287] IPR0153115.8e-17Apoptosis, DNA fragmentation factor 40kDa
Orthology groupMCL12347 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200344-TA
ATGAAAAAAGGATATAAGGTGACCGATGTTAAAAGGGAGAAAAAAATTGGTGTCGCTGCTGAGAACCTAGAGGAACTTATTGAGAAGTCCTGTAAAAAGTTAGGGTTCAATGTCAGCTGCGCAGAGTGCCGTCTGTATGTGGCTGAGGACGGGACTCAGGTAGACGACGAGGACTATCTCAGAACCTTAGCACCACAAACATTATTTATTCTGTTACAGGAAAATGAAACAATGGTTACTGACTTTGACTACTTCTACAATATGATTTCTTCTGTCAAAAAGGATTACCTTAACACAGGGAAGGCTGCTAAGGAATTTCTAAATGTCAATCTTAAAGAAAAATTTAAAGTACTTGAAAGATACATTTCAGCAGCTAACGATTCTAGGGCTATGATAAGTGAGAGGACTCAGGATCCAGACTGGTTTGAAGGTCTTGATTCAAGTGAGAAAACAAAAGAACAGTCAATGTCAAAAAGAGCCAAGGAACGCATGAGGGGTTATTATTACAAAACTAAAACAGCATTACAGTCTTCGCAACTGTATATACACACGAAAAACGGACAAAGCAAGAAACTCATAGACCTGTTCCTATTGGAATTACGTAGAAGACTTGAAAATAATCACAAGATTGAGTTGTCTCGGTCCATCATACCGAATATACAAGAGGCGATCAGGAATATTATTCATGCAAATGTAAAATGCGTGATGTGCAACGTAAGTTCGGGTAAAGGTCACATAGAAATTGACCGATACTATTTACAAATATTCACTAGCGATAATCTAAAATTAGTTCATATAGTTTGTCATGATAAAGGCAAACACAGTGCTGAATCGAGTGCTTACATTTTATGCCAAAAATGTTTTGGTAATTATAGAGTTTAA

Protein sequence:

>DPOGS200344-PA
MKKGYKVTDVKREKKIGVAAENLEELIEKSCKKLGFNVSCAECRLYVAEDGTQVDDEDYLRTLAPQTLFILLQENETMVTDFDYFYNMISSVKKDYLNTGKAAKEFLNVNLKEKFKVLERYISAANDSRAMISERTQDPDWFEGLDSSEKTKEQSMSKRAKERMRGYYYKTKTALQSSQLYIHTKNGQSKKLIDLFLLELRRRLENNHKIELSRSIIPNIQEAIRNIIHANVKCVMCNVSSGKGHIEIDRYYLQIFTSDNLKLVHIVCHDKGKHSAESSAYILCQKCFGNYRV-