Monarch geneset OGS2.0

DPOGS200023
Transcript: DPOGS200023-TA, 1047 bp
Protein: DPOGS200023-PA, 348 aa
Genomic position: DPSCF300337 - 157436-170407
RNAseq coverage: 16x (Rank: top 81%)
Annotation
Heliconius: HMEL003677, 5e-83, 62.50%
Bombyx: BGIBMGA013427-TA, 5e-28, 34.84%
Drosophila: Ank2-PU, 5e-28, 33.06%
EBI UniRef50: UniRef50_D6X569, 2e-60, 39.20%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X569_TRICA
NCBI RefSeq: XP_972943.2, 2e-60, 37.01%, PREDICTED: similar to ankyrin repeat and death domain containing 1A [Tribolium castaneum]
NCBI nr blastp: gi|270000721, 8e-60, 39.20%, hypothetical protein TcasGA2_TC004355 [Tribolium castaneum]
NCBI nr blastx: gi|189241839, 8e-61, 40.06%, PREDICTED: similar to ankyrin repeat and death domain containing 1A [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 6.8e-07, protein binding
KEGG pathway: ecb:100061567, 2e-28
    K08803 (DAPK) maps-> Pathways in cancer
    K08803 (DAPK) maps-> Bladder cancer
InterPro domain: [9-230] IPR020683, 4.4e-69, Ankyrin repeat-containing domain
InterPro domain: [43-75] IPR002110, 6.8e-07, Ankyrin repeat
Orthology group: MCL17876, Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200023-TA
ATGTCGGAGGTACGTCCTCGTGTTCAACAAAACGACCTTCTCCTCCACGAGGCTGTCATCAAAAATGAACCAGAAGCTGTAAGGGAGGCTCTCAGGCGTCCAACTGATGTCAATTGCAGGAACAATTACGGCCGCGCTCCTATCCATTGGGCGGCTAGCCGAGGCAACGTCGAAATCATAAATTTACTTATAGAAGCCAAATGCGACATAGAGGCCATAGACAAGTTTGGGATGCGACCGCTGTTAATGGCGGCCTGGCACGGTCACTTACCGGCTGTGAAGACGTTTGTACAAGCAGGGGCGTGCATAGCGGCCACAAACAAAGTAGGTCGTACAGCCCTGCATGTGGCGTGCGAGGGCGGCCACTGCGGCGTCGCTGGTCTCCTGGTAGCGCGCGGCGCGGCCCGGGAGGCGAGAGACAACTCGGGGCGCACCCCCCTCCACCAGGCCGCCGTCCACAGGCACACCGAACTCGTAAGGACGCTCCTCGACACCGGCTGCAATGTCGATGCCACGGATAATAATGGTGTAACGGCTTTACAAATGGCGTGTGCTCAAGGATGTCGAGGGATCGTCGAACACTTACTGGAGTATGGAGCCGATGTACACTTACAGAATAATGTTGGTTCCTCAGCATTACATGCAGCATGTGCCGCAGATGCTACTGATATCGTTGAACTGCTCCTGGCACACGGAGCTGATCCAGCTATGACAGATCAGTGGTCTCAATCTCCTCTGAGCGTCGCCGGTAGCGCGGCTGAGTCAATGCCGGAGGGTTTCCGTCGCGCCGCCCGGGGCTCGTTCTCCGCCATCCTTGACCTCCTCACGGCTACTGCGAACCTGGTACAGGAACCATCTGACGGATCACCATCACCTCGAGCGGAGGATAAATCTGGTGGCGTCTCACCTTCTCCTAATGAAGGATGTGCACGATATAGGGAATTGTGTTATCGTGCTGCGCAGGCGCACCTCCCAGCGGAGTCGCCGCCTCGGCCCTGCGACTGTGTCAACTGCTCCGCTAGAAGAGACTCCAACTCAGACGCCTGA

Protein sequence:

>DPOGS200023-PA
MSEVRPRVQQNDLLLHEAVIKNEPEAVREALRRPTDVNCRNNYGRAPIHWAASRGNVEIINLLIEAKCDIEAIDKFGMRPLLMAAWHGHLPAVKTFVQAGACIAATNKVGRTALHVACEGGHCGVAGLLVARGAAREARDNSGRTPLHQAAVHRHTELVRTLLDTGCNVDATDNNGVTALQMACAQGCRGIVEHLLEYGADVHLQNNVGSSALHAACAADATDIVELLLAHGADPAMTDQWSQSPLSVAGSAAESMPEGFRRAARGSFSAILDLLTATANLVQEPSDGSPSPRAEDKSGGVSPSPNEGCARYRELCYRAAQAHLPAESPPRPCDCVNCSARRDSNSDA-
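
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that translates the first and last codons of DPOGS200023-TA with the standard genetic code and checks that the listed lengths (1047 bp, 348 aa) agree. It assumes the standard code applies to this nuclear gene and that the trailing "-" in the protein record stands for the stop codon. The names translate, cds_start and cds_end are illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 36 nt and last 15 nt of DPOGS200023-TA, copied from the record above.
cds_start = "ATGTCGGAGGTACGTCCTCGTGTTCAACAAAACGAC"
cds_end = "AACTCAGACGCCTGA"

protein_start = translate(cds_start)
protein_end = translate(cds_end)

# Length check: 1047 bp minus one 3-nt stop codon, divided by 3 nt per residue.
transcript_bp = 1047
protein_aa = 348
expected_aa = (transcript_bp - 3) // 3

print(protein_start)              # compare with the start of DPOGS200023-PA
print(protein_end)                # compare with the end of DPOGS200023-PA
print(expected_aa == protein_aa)
```

Translating the full nucleotide sequence the same way and comparing it with the protein record, with "*" replaced by "-", extends the check to every residue.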