Monarch geneset OGS2.0

DPOGS208817
TranscriptDPOGS208817-TA1335 bp
ProteinDPOGS208817-PA444 aa
Genomic positionDPSCF300036 + 193230-201497
RNAseq coverage121x (Rank: top 57%)
Annotation
HeliconiusHMEL0150941e-13754.57% 
BombyxBGIBMGA007923-TA3e-3437.50% 
DrosophilaBzd-PB2e-9140.73% 
EBI UniRef50UniRef50_UPI00022463912e-10447.49%UPI0002246391 related cluster n=1 Tax=unknown RepID=UPI0002246391
NCBI RefSeqXP_970424.12e-9940.89%PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
NCBI nr blastpgi|3454803246e-10447.49%PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia vitripennis]
NCBI nr blastxgi|3454803243e-10447.03%PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia vitripennis]
Group
Gene OntologyGO:00082701.2e-08zinc ion binding
KEGG pathway 
InterPro domain[35-72] IPR0028931.2e-08Zinc finger, MYND-type
Orthology groupMCL13773 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208817-TA
ATGTATAAGCTTAAACATATAAACGCTGTAAAGACCGGTGATCTTATTCTATCGGAAGAACCATTTGCCTATGTTTTATCGTCCAAAGAAAAGGGAAGTCGGTGTGATTTCTGCTTAGAAAAAGGAAAGGTGTTGAAATGTTCTGGCTGCCAATTTGTCCACTACTGTAATAGAAGTTGTCAGAAAGATGCCTGGGAAGATCACAAGTGGGAGTGCGCTAATTTAAAGAGGATAGCTCCTAAAACTATACCTGATGCAGCTCGTCTTCTCGCCAGGATACTGAATCGTCTTCAACGTGGTAACGGCGGTGCTTACAAGGCTTTTTATACACCGACTTCATTCAGGGTGTGGAAGGATCTCATGTCGCATTATTCCGATCTTAAATCTGACAAGAAAAGGATGGATCATTTTTCTACCCTCAGTATGGTGCTGTTCGAATACTTGAAGGACATCTCATTACCGAACACAGCTGATCTCATGGGATTGTACGGAAGAATGGTGATAAATAGCTTTACAATATTGGATATTGAGATGAATTCTATAGGAACGGGGATCTATCTGGCCTCGTCTGTAATAGATCATAGCTGCAATCCAAATGCTGTAGCCGTCTTCGATGGAAAGACCATCAACATAAGAGCTTTGAAAGACATGAATTGCTTGGATTGGAAAAAGATTAGAATATCTTACATCGACCTGATGAAGACGCCTTATGAACGTCAAATGGAACTGCGTCAGAGCTACTACTTCCTGTGCCAGTGTGACAGGTGCTTGGACGAAAACCGGATAAAATACGTGCACGCGGCTAAATGTCTGAAAGACGGATGTGAACATCCTGTAAATATAAAATGGCGGGAAAATTTAAAATTAGTTGAAAAGATAGAAAAGGAGAAGGCAAAAGAAAATGGCGCCGGTGATACGATAGTGAGTAATGGATGCAGTGAACCGATATTTAATGGAATTCTGCCACTGCCGGATGGATCTGTGTACTGTGGGAAATGTGGTACTGAGTTTATGAGGAATGATCTCGACAAGTTCATGAGGACGATGGAGGAGAGTGAGGTCAACTTGGAGAATATGAAAGAGACTTCAATGTCGACATATGCAAGCATACTAATACCATGCTTCAGGTTTTATTATGGCGAAACGCATCCCCTGTTGGGACTGTTGCATATAAAGTATGGGAAAATATTGCTGTATAAAATGAACCTGGCGGGAGCGTTGAAGCAGTTCAAATGTGCCGAGAAGATAATAAAGATAACGCATGGAGAGAAACACCCGCTGTACAAAAACCATCTACTGCCGCTGATGTACCAAGCTATAGTCGAATCGGAGTAA

Protein sequence:

>DPOGS208817-PA
MYKLKHINAVKTGDLILSEEPFAYVLSSKEKGSRCDFCLEKGKVLKCSGCQFVHYCNRSCQKDAWEDHKWECANLKRIAPKTIPDAARLLARILNRLQRGNGGAYKAFYTPTSFRVWKDLMSHYSDLKSDKKRMDHFSTLSMVLFEYLKDISLPNTADLMGLYGRMVINSFTILDIEMNSIGTGIYLASSVIDHSCNPNAVAVFDGKTINIRALKDMNCLDWKKIRISYIDLMKTPYERQMELRQSYYFLCQCDRCLDENRIKYVHAAKCLKDGCEHPVNIKWRENLKLVEKIEKEKAKENGAGDTIVSNGCSEPIFNGILPLPDGSVYCGKCGTEFMRNDLDKFMRTMEESEVNLENMKETSMSTYASILIPCFRFYYGETHPLLGLLHIKYGKILLYKMNLAGALKQFKCAEKIIKITHGEKHPLYKNHLLPLMYQAIVESE-