Monarch geneset OGS2.0

DPOGS202979
TranscriptDPOGS202979-TA1383 bp
ProteinDPOGS202979-PA460 aa
Genomic positionDPSCF300068 - 637472-640427
RNAseq coverage1362x (Rank: top 9%)
Annotation
HeliconiusHMEL0095390.085.43% 
BombyxBGIBMGA012323-TA0.081.93% 
DrosophilaCG4420-PA6e-12049.27% 
EBI UniRef50UniRef50_G6DPI80.0100.00%DNA-damage inducible protein n=10 Tax=Endopterygota RepID=G6DPI8_DANPL
NCBI RefSeqNP_001040314.10.086.58%DNA-damage inducible protein [Bombyx mori]
NCBI nr blastpgi|1140514170.086.58%DNA-damage inducible protein [Bombyx mori]
NCBI nr blastxgi|1140514170.086.58%DNA-damage inducible protein [Bombyx mori]
Group
Gene OntologyGO:00065081e-61proteolysis
GO:00041901e-61aspartic-type endopeptidase activity
GO:00055157e-11protein binding
KEGG pathway 
InterPro domain[203-326] IPR0191031e-61Peptidase aspartic, eukaryotic predicted
[210-328] IPR0211091.6e-28Peptidase aspartic
[7-74] IPR0006267e-11Ubiquitin
Orthology groupMCL15207 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202979-TA
ATGAAAGTAACCGTAACTACCCTCAACGACGAGCTGTTCGTCTTAGATGTTTCCGAAGATTTGGAATTAGAAAACTTCAAGGCGTTTTGTGAAATCGAGTCAGGGTTTCCCGCCTCGGACATAACTTTGACTTTCAATGGGAAGCCTATGATGCACGATAAGAAGTCGCTGAAGGAGCTGGGCGTCCACGACGGAGATGTCATTGTGCTGCTTCATATGGTGCAGTCTTCATCCAATTTAAACATGAACGACGCCAGTCAAGCTCTACCAAGCGGTTTGGCCAATCTCGATTTTAGCAGCATACAAGTTCCTAGGGGGGCTGCAACTGCAGCAAGCACTTCAATGGCCGCCAGAAATGCTCCTGTAGAGGAGGACCCTCGTATCATCAGAGAGATGTTTCTAGCCAATCCCGACCAACTCGCTCTGCTCAAACAGAACAATCCAAGACTTGCAGATGCCTTGCTGAGTGGCAACCTCGATACATTTGCTTCAGTGCTTCGAGAACAAATTTCAGCTAGAACTGAGCGCCAACAACAAAGAATAAGGATGATGAATTCAGATCCGTTTGACACAGAGGCCCAGCGTATGATAGCGGAGGAGATCAGACAGAAGAACATCGAAGCCAACATGGAGGCGGCCATGGAGTACAACCCCGAGACATTTGGCACAGTAGTCATGTTGTACATCAACTGTCACGTCAACGGCTTCCCAGTGAAGGCGTTTATAGATTCTGGAGCTCAAACCACAATAATGTCAGCAGCTTGTGCAGAGAGGTGTAACATCATGCGACTAGTTGACACTAGGTGGGCGGGTATAGCCAAGGGTGTCGGAGTTCAGAGGATCATCGGCCGCATCCATATGGTGCAGATGCGCATAGAGAAAGATTTCCTCACCACCTCCTTCTCCGTCCTCGAGGAACAGCCCATGGACATGCTGCTGGGACTGGACATGCTCAAACGACATCAGTGCAACATCGACCTGAAGCGGAACGTTCTTCACATCGGCACGACGGGCACCGAGACGCCGTTCCTGCCGGAGGCCGAGCTGCCGGAGTGCGCCAGACTGTCGGGCTTCTCAGAGGACGAGCTGGTGGCCAGGGACGACCGCCTCGTGAGGGACAAGCGAGAATCTAAGGAGCAGTCACCGTCGTCACCGCCGGCCACCACCAGCGGACAGAGCGCTCCAAGAATGAACCCGGGTGTGAACACGCCGAGCCTCATCCGGCCAGAGATCCTCGCCACGGATACGTTCAGTGAAACAGATGTGGAGGAGATAGTAGCGCTGGGCTTCACCAGGGAACAGGCGATTGTAGAGCTGAGGCGTTTCAACGGCGACAAGACTCAGGCTACGGTGGCTTTGTTCGCCAAGTCGCTCAAGTTCTAG

Protein sequence:

>DPOGS202979-PA
MKVTVTTLNDELFVLDVSEDLELENFKAFCEIESGFPASDITLTFNGKPMMHDKKSLKELGVHDGDVIVLLHMVQSSSNLNMNDASQALPSGLANLDFSSIQVPRGAATAASTSMAARNAPVEEDPRIIREMFLANPDQLALLKQNNPRLADALLSGNLDTFASVLREQISARTERQQQRIRMMNSDPFDTEAQRMIAEEIRQKNIEANMEAAMEYNPETFGTVVMLYINCHVNGFPVKAFIDSGAQTTIMSAACAERCNIMRLVDTRWAGIAKGVGVQRIIGRIHMVQMRIEKDFLTTSFSVLEEQPMDMLLGLDMLKRHQCNIDLKRNVLHIGTTGTETPFLPEAELPECARLSGFSEDELVARDDRLVRDKRESKEQSPSSPPATTSGQSAPRMNPGVNTPSLIRPEILATDTFSETDVEEIVALGFTREQAIVELRRFNGDKTQATVALFAKSLKF-