Monarch geneset OGS2.0

DPOGS204366
Transcript: DPOGS204366-TA, 1875 bp
Protein: DPOGS204366-PA, 624 aa
Genomic position: DPSCF300040 + 490546-496368
RNAseq coverage: 589x (Rank: top 22%)
Annotation
Heliconius: HMEL005294; E-value 0.0; 71.13%
Bombyx: BGIBMGA005889-TA; E-value 0.0; 69.65%
Drosophila: CG10703-PA; E-value 1e-64; 30.94%
EBI UniRef50: UniRef50_E9HAQ0; E-value 8e-65; 31.46%; Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9HAQ0_DAPPU
NCBI RefSeq: XP_002019770.1; E-value 7e-66; 31.07%; GL12027 [Drosophila persimilis]
NCBI nr blastp: gi|195157776; E-value 1e-64; 31.07%; GL12027 [Drosophila persimilis]
NCBI nr blastx: gi|195107409; E-value 5e-77; 31.47%; GI23889 [Drosophila mojavensis]
Group
Gene Ontology: GO:0005515; E-value 2.1e-14; protein binding
KEGG pathway: spu:591503; E-value 5e-61
 K02084 (APAF1) maps to:
    Small cell lung cancer
    Huntington's disease
    Amyotrophic lateral sclerosis (ALS)
    Alzheimer's disease
    p53 signaling pathway
    Parkinson's disease
    Apoptosis
InterPro domain: [576-620] IPR000237; E-value 2.1e-14; GRIP
Orthology group: MCL15018; Single-copy universal gene

Nucleotide sequence:

>DPOGS204366-TA
ATGGAATCGTTGTCTAAACAAGAGCTGATATCTACCATAACAAAACAAGCAGATCAAATCAAAAGATACGAAGCTCGTTTAAGAGATGTTGTTGCTGCGTACAAAGGCCTTGTAAAGGAAAAAGAGGCCTTGGAGATTAGTTTGAAGGCCTTAAATAAAACAGATGACGATGGAAGTGGAGCAGGATCTCCAGATTCAATTGCAGCTCTCACTCTATCTCTCTCAACTCTTACAGCAGAAAAGAATCGTATGGAAGAAGTCTTTCAAGCGGACAAGAAAGTTACAAGGGAGAAGTATGAGAACATGTTAGCATCAATGAGAGAGGAAACAAAGAGTCTGGTGCAGCAACATTTAGCGGAAGTCGCAAATCTGAAGACTAAGATAGCATACGAGATACAAGAAAGAGAAAAAGAAAGAGCCGATCATGCAGCTATGTTGAAAGAGCTGCATCTGAAGTTGAATACAGAAAGGAAAAACAAAGAAAAGTTGGAGGACAAAGTTGTTCAAAATACAGAGTCCGAAGCGTCGCAGGCCGACCTGGAGAAACGCGTCCGAGATCTGAGCGGGTATCTGGAAGCGAGTCAACGGCGACTGATGCGTGCTGAGGCGAGGACGGCGGAAACCCCGGCGCTGCTGGTGAGGCTGCAGCAGAAGCTGGCGCTGCTGGAGCAGACGCACGCGGTGGCTATACGAGAGGAACAGATTAAAGCGAAACGTGCCGAGGAATCAGCTCGGAAGATATGTGCCAGGCAGGAGGAACGAGTGGCGCTGCTTGAGGGTAAGGTGGCCGAGCTGTCACAGACGATAGGAGAGTATGACATGATGAGGAGACGAGACCAGAACACCATACAGCAACTGAGAGAGTCCTTTAATGGCAGGTTACTGGAAAAGATTGCCAGTAACGGCTCCGATGGCACTGACGGTGATAATGATGAAACGCAAAAAAGCGAGACGGACAATGAAAAATTAGCGGACAGTGAATATTTGCAAACTTTGATAGACAAAATACACATACTGAAGAAAGAGCTACTGGCCGAGAACGAGAAAGTAGGGAGTCCGGTCGACATTACGACCGTCTTCAAAGTGGACGGCTACGACGACATCCACAGGAAGTGCAGAGAGGAGTACGACAATTTGAAAATCGAATACGACAATTATAAGTTCCTGAACGTCAAAGTGACGGGAGGCGATGAGAGCGAGACGGAAGATCTGAAGAGCGAGATAGCGATATTGAAAGAGAAAGTTGACACTTACGTGTTACTGCTGGAGGAAGAGAAACAGGTTAAGGCTGAGCTGCTGCGGCTTCACGAAGAGAAACTAAGAGCCGAGAAGGAATATCACAAAGAGGTGACATCGGACCTCAAGAACAGAATACAAAGTTTGGAGAAACAGGTTCAGACGCAGAGAGAGAGATACGCGACTCTGTTAGAGGAGACCGACAGCTACATACGATCCAGACACGACAGGTCGCGGAAGGTCAGCAAGGAGGACGGCTGGAAGGACGAACATGTGATCAATGATGGCATGTCCCCGCACATGCTGCACTACGCCCACGAGTTGGCTCGGCGCGACCTCGACATCACTCAGCTGAGGAGAGAGAAACACGCGCTGGAGGGACTGCACAGAGATTGTCAACGTGAAGCTACCATAGAGAAGGAACGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTGCGAAGGACGCAGTCCCGTGAAGGCGCCAACCTCGAATATCTGAAGAACGTGGTGATATCGTATCTGATGTCTAGTGACTACGTCGGCCGCAGACACATGCTGAACGCCATAGCCGCTGTGCTCCACTTCACAGACAATGAAAGGAAGGCCGTGCTCAGCACGCTGTAA

Protein sequence:

>DPOGS204366-PA
MESLSKQELISTITKQADQIKRYEARLRDVVAAYKGLVKEKEALEISLKALNKTDDDGSGAGSPDSIAALTLSLSTLTAEKNRMEEVFQADKKVTREKYENMLASMREETKSLVQQHLAEVANLKTKIAYEIQEREKERADHAAMLKELHLKLNTERKNKEKLEDKVVQNTESEASQADLEKRVRDLSGYLEASQRRLMRAEARTAETPALLVRLQQKLALLEQTHAVAIREEQIKAKRAEESARKICARQEERVALLEGKVAELSQTIGEYDMMRRRDQNTIQQLRESFNGRLLEKIASNGSDGTDGDNDETQKSETDNEKLADSEYLQTLIDKIHILKKELLAENEKVGSPVDITTVFKVDGYDDIHRKCREEYDNLKIEYDNYKFLNVKVTGGDESETEDLKSEIAILKEKVDTYVLLLEEEKQVKAELLRLHEEKLRAEKEYHKEVTSDLKNRIQSLEKQVQTQRERYATLLEETDSYIRSRHDRSRKVSKEDGWKDEHVINDGMSPHMLHYAHELARRDLDITQLRREKHALEGLHRDCQREATIEKERXXXXXXXXXXXXXXLRRTQSREGANLEYLKNVVISYLMSSDYVGRRHMLNAIAAVLHFTDNERKAVLSTL-
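
The record lists a 1875 bp transcript and a 624 aa protein, which is consistent with 624 codons plus one stop codon (3 × 625 = 1875). The transcript also contains a run of N bases, and the protein a matching run of X residues. A minimal sketch of how such a transcript/protein pair could be sanity-checked is given below. The function name `summarize_cds` and the returned keys are hypothetical and chosen only for illustration; the sketch assumes the trailing `-` in the protein marks the stop codon and that the transcript is a complete coding sequence.

```python
import re

# Standard stop codons (assumption: the standard genetic code applies).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def summarize_cds(nt: str, aa: str) -> dict:
    """Report basic consistency checks for a coding sequence and its protein.

    nt: nucleotide sequence without the FASTA header line.
    aa: protein sequence without the FASTA header line; a trailing '-' or '*'
        is treated as a stop marker and is not counted as a residue.
    """
    nt = "".join(nt.split()).upper()
    aa = "".join(aa.split()).upper().rstrip("-*")

    # Runs of N mark unresolved bases; positions are 0-based, end-exclusive.
    n_runs = [(m.start(), m.end()) for m in re.finditer(r"N+", nt)]

    return {
        "nt_length": len(nt),
        "aa_length": len(aa),
        "in_frame": len(nt) % 3 == 0,
        "starts_with_atg": nt.startswith("ATG"),
        "ends_with_stop": nt[-3:] in STOP_CODONS,
        # One codon per residue, plus one stop codon.
        "lengths_agree": len(nt) == 3 * (len(aa) + 1),
        "n_runs": n_runs,
        "x_count": aa.count("X"),
    }


# Toy example (not the monarch sequence): ATG GAA NNN TAA -> M E X stop
toy = summarize_cds("ATGGAANNNTAA", "MEX-")
print(toy["lengths_agree"], toy["n_runs"], toy["x_count"])  # True [(6, 9)] 1
```

To apply the same check to this record, pass the sequence lines under `>DPOGS204366-TA` and `>DPOGS204366-PA` as `nt` and `aa`.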