Monarch geneset OGS2.0

DPOGS205935
Transcript        DPOGS205935-TA  1236 bp
Protein           DPOGS205935-PA  411 aa
Genomic position  DPSCF300156 - 218648-221438
RNAseq coverage   100x (Rank: top 61%)
Annotation
Heliconius        HMEL008161        0.0     96.95%
Bombyx            BGIBMGA002832-TA  0.0     93.20%
Drosophila        Drep-2-PB         3e-143  59.91%
EBI UniRef50      UniRef50_A8DY77   5e-141  59.91%  DNA fragmentation factor-related protein 2, isoform B n=17 Tax=Drosophila RepID=A8DY77_DROME
NCBI RefSeq       XP_001651877.1    5e-150  67.29%  hypothetical protein AaeL_AAEL006265 [Aedes aegypti]
NCBI nr blastp    gi|157112797      9e-149  67.29%  hypothetical protein AaeL_AAEL006265 [Aedes aegypti]
NCBI nr blastx    gi|189241153      8e-157  68.95%  PREDICTED: similar to AGAP005254-PA [Tribolium castaneum]
Group
Gene Ontology     GO:0005622  3e-39  intracellular
                  GO:0006915  3e-39  apoptosis
KEGG pathway      nve:NEMVE_v1g129698  8e-11
                  K02310 (DFFA, DFF45) maps-> Apoptosis
InterPro domain   [13-87] IPR003508  3e-39  Caspase-activated nuclease CIDE-N
Orthology group   MCL16394  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205935-TA
ATGATAGTCGAAATACAAGAGGAGGTTAGAGGGAAGAGGCCTTTTAAGATATGGGACAGTTCGAGGAATGTACGGAAGGGGCTGGTGGTGACCAGCTTCGAGGAACTCATACATAGAGGTAAAGAGAAGCTATCAGTAGCGGCGAGCGAGCCCGTGAGGCTGGTGCTGGAGAGTGACGGCACTCAGGTCGAAGACGGGGAGTACTGGAGGACGCTCCCACCCAACACTGTGCTGCTGCTGCTGAGGCAGGGGGAGAGATGGTATCCTACAGGGGTCGACGTCATCAAGGCTGCGATATCGGCGATCCCCAAGATTGTGTGCGAAACGATACACGCGCTCGAGTTACACGATGAGACGCCTTCGTGGAAGATCATGGACAACAAGGGGCGTGTCACAGTGGTGCTGCACTGGGACCAGCGACCCCAGGCCTCGCCCGCGGCGCGCTCGCCCTCCCGCCAGGTCAAGCCGGACCGCAGGCCCTCGCTCGTGATCCAGACGTCTCTGGAGCGACCGCAGCCGCCGCCGCCCCACATCACGGTCGTCAACCACGACGAACCTGGCCCGGCGCGAGCTCGTCTCTCCCGCGCGTCCTCGTCTCTCGACCACCATGTCCACACGGCGGAGTGTCGCGCCGCACCGCCGTCCCACGCCCCGCATCAGCACCCTCCGCACGCACCGCACCGCCCGCCCACAGACGAATGCGATTTCCACTGTTGTGCGCTGCACGAGGAAGGGCGGCGTATCGCGGTCCACAAGAGCGTTGCCACCTCTCCCATCCAGGACTCTCAGCCTCGCGCCTCTCCCCAGGGAAGGCCGAAGGGTCACGTTCGGTTTCTAGACGCGGAGTCGGCGCGGCGCGGGGACCGGGACTCGTCCGAGAGCGAAACGGAGAACACTCTCGTCGAGGACGAGGCCGTCACTTCGGAGAAGTTTCTACTCCTCATCGACCAACTGAGTGTCGACCAGAAACGACACCTCACCATCAAAGACATTGGCATCATATTGGAACGGTTAAGTTCGAAGATACTCGACGTGGAGCGCCTCGATCGCGAGTCTGAGTCGGACGACTGTTACAATTGGACGATAAAGGCCACCATCAGGGGGGACGCGTTGAGGGAGCTGGGTGTCATCTACAATGGGAACTACTACGCCATCAGCGAGCACCCGGGCTACCGGGAGGAGAACGAGGAGGCCGGAGACGAGGGCGAGGAGGAAGAGGAGGAGGACAGGCTCTGA

Protein sequence:

>DPOGS205935-PA
MIVEIQEEVRGKRPFKIWDSSRNVRKGLVVTSFEELIHRGKEKLSVAASEPVRLVLESDGTQVEDGEYWRTLPPNTVLLLLRQGERWYPTGVDVIKAAISAIPKIVCETIHALELHDETPSWKIMDNKGRVTVVLHWDQRPQASPAARSPSRQVKPDRRPSLVIQTSLERPQPPPPHITVVNHDEPGPARARLSRASSSLDHHVHTAECRAAPPSHAPHQHPPHAPHRPPTDECDFHCCALHEEGRRIAVHKSVATSPIQDSQPRASPQGRPKGHVRFLDAESARRGDRDSSESETENTLVEDEAVTSEKFLLLIDQLSVDQKRHLTIKDIGIILERLSSKILDVERLDRESESDDCYNWTIKATIRGDALRELGVIYNGNYYAISEHPGYREENEEAGDEGEEEEEEDRL-
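
The transcript and protein records above can be cross-checked against each other: a 411 aa protein plus one stop codon corresponds to 3 x 412 = 1236 bp, the transcript length listed. The following is a minimal sketch of such a check, assuming the standard genetic code applies to this nuclear gene. The function name `translate` and the use of "-" for the stop codon (following the trailing "-" in the protein record) are illustrative choices, not part of the database. Only the first and last codons of DPOGS205935-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if r == "*" else r for r in residues)


# First ten codons and last four codons of DPOGS205935-TA.
head = translate("ATGATAGTCGAAATACAAGAGGAGGTTAGA")
tail = translate("GACAGGCTCTGA")
print(head, tail)
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS205935-PA string would extend the same check to the whole record.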