Monarch geneset OGS2.0

DPOGS205780
TranscriptDPOGS205780-TA1059 bp
ProteinDPOGS205780-PA352 aa
Genomic positionDPSCF300144 - 321449-323400
RNAseq coverage332x (Rank: top 35%)
Annotation
HeliconiusHMEL0115801e-16978.35% 
BombyxBGIBMGA010577-TA4e-15571.43% 
DrosophilaCHORD-PA1e-11260.71% 
EBI UniRef50UniRef50_Q9VCC01e-11060.71%Cysteine and histidine-rich domain-containing protein n=15 Tax=Neoptera RepID=CHRD1_DROME
NCBI RefSeqXP_967567.11e-11861.09%PREDICTED: similar to CHORD CG6198-PA [Tribolium castaneum]
NCBI nr blastpgi|910770343e-11761.09%PREDICTED: similar to CHORD CG6198-PA [Tribolium castaneum]
NCBI nr blastxgi|910770344e-12061.09%PREDICTED: similar to CHORD CG6198-PA [Tribolium castaneum]
Group
KEGG pathwayvvi:1002464392e-25 
 K13458 (RAR1)maps-> Plant-pathogen interaction
InterPro domain[202-324] IPR0089786.4e-30HSP20-like chaperone
[8-70] IPR0070513.5e-26Cysteine/histidine-rich domain
[221-299] IPR0174474.2e-14CS domain
Orthology groupMCL12276 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205780-TA
ATGCCGGAAGATAAAAATTTAGTACAATGTTATAATCGTGGATGTGGTCAACTTTTTGATCCGAAAAATAACGATAAAGATGTATGTTGCCATCACCCTGGTGCTCCGGTTTTTCACGATGCTTACAAGGGATGGTCCTGCTGTAACAAGAAAAGTGTTGACTTCACAGAGTTTCTCAACATCAAAGGCTGTACATTATCTAAACATTCAAATGTGAAACCTCCAGAACCTGAGAAGAAAAGTTTAGATAAAGAATTAGAAAAGAAGGAAGTTATTGAAGTAAGAGCCCCATTAGTTGGACCTAAGTTAGACAGACCTCCTTTTGAATCACAATTAGTTACTTTAGAACCCCGAATAGCAGATGCCCTTAAGGAAGCAGTTTATAAAGCTAAGGAGAATGTGGCTGCACCAAACGATGGAACAATAGCTATTGGTACTTCATGTAAGAACGGAGGTTGTAATATATCTTATGAAGGGCCTCACAGTGACAACACCATATGTACATACCATCCCGGCTGCCCAGTCTTCCATGAAGGACTGAAGTTCTGGACCTGTTGCCAGAAAAGGACGACAGACTTTAACACATTTTTAAACCAACCTGGATGCACAACCGGCACTCATAAATGGTTAAAAGAGAGTGCTCCAGCTGGAACGGTGAAATGTCGTTGGGACTGGCATCAAACTCCGGAATATGTCATTGTCAGTGTGTATGCCAAGAAGTATGATCCATTCACAAGCCATGTTAAACTGAATCCTATACGCTTAAACACAAAACTAGTTTTTCAACAAGAAGGAAATGCTGTATTTGAACTCGATTTGGAATTAAGAGGAGTTGTGGATGTGAGTAAAAGTACTGTATCGATGTTGGGTACAAAGGTCGAGATCAAATTAAAGAAAGCAGAGCCAGGTGCTTGGGCTAAGTTGGATTTTCCGAGGAAAGAATCCAAAGAAGAAATGCAGCAGCCAGTAGTTCATACTGTCGAGGAAGATGATTCGGTCGATCTATCCACAGTTGAATCCATTCAGAGTATAGGAAATGTTAATGTAACAGATTCATAA

Protein sequence:

>DPOGS205780-PA
MPEDKNLVQCYNRGCGQLFDPKNNDKDVCCHHPGAPVFHDAYKGWSCCNKKSVDFTEFLNIKGCTLSKHSNVKPPEPEKKSLDKELEKKEVIEVRAPLVGPKLDRPPFESQLVTLEPRIADALKEAVYKAKENVAAPNDGTIAIGTSCKNGGCNISYEGPHSDNTICTYHPGCPVFHEGLKFWTCCQKRTTDFNTFLNQPGCTTGTHKWLKESAPAGTVKCRWDWHQTPEYVIVSVYAKKYDPFTSHVKLNPIRLNTKLVFQQEGNAVFELDLELRGVVDVSKSTVSMLGTKVEIKLKKAEPGAWAKLDFPRKESKEEMQQPVVHTVEEDDSVDLSTVESIQSIGNVNVTDS-