Monarch geneset OGS2.0

DPOGS200118
TranscriptDPOGS200118-TA1299 bp
ProteinDPOGS200118-PA432 aa
Genomic positionDPSCF300044 + 888632-890115
RNAseq coverage274x (Rank: top 39%)
Annotation
HeliconiusHMEL0067560.071.59% 
BombyxBGIBMGA012559-TA8e-15059.63% 
Drosophilawcd-PA7e-6635.18% 
EBI UniRef50UniRef50_UPI00021A7EA75e-8537.71%UPI00021A7EA7 related cluster n=3 Tax=unknown RepID=UPI00021A7EA7
NCBI RefSeqXP_001604219.11e-8339.75%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3800209772e-8638.03%PREDICTED: U3 small nucleolar RNA-associated protein 18 homolog [Apis florea]
NCBI nr blastxgi|3800209772e-8638.03%PREDICTED: U3 small nucleolar RNA-associated protein 18 homolog [Apis florea]
Group
Gene OntologyGO:00055151.4e-34protein binding
KEGG pathwaybfo:BRAFLDRAFT_1255526e-50 
 K11422 (SETD1, SET1)maps-> Lysine degradation
InterPro domain[136-425] IPR0110461.4e-34WD40 repeat-like-containing domain
[136-430] IPR0159432e-31WD40/YVTN repeat-like-containing domain
Orthology groupMCL14129 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200118-TA
ATGAAACGGAAAAATTCCCGATTGGATGAAGAGGAAACGAGGCTGTCGGAATTGCTTTTCAACAAAACCAGAAAGTTTACTGAAAATATATCAGAAAACAAAAAAGACATTGATATAGACCGCAAACCAGCTTGGATAGATGACGACGATGAAAAAACTCCTATTAATATTATACCTCAAGTCAAATCTGAAGGACTTTATGTTCAAAAACTTAAACAAAAGTATGAAACACTTATTGGTACACCAAGTTGGGCTAAAGTCAAAGAGAAGAATGATGAAGATGACAAATTACTGAGAACCGTTGGTCACTTACAGAAGACAAAAACAAAGAGTCTGAAGAAAGATGTCTTAGAAATAAAAAAGTTCCCAAAAATTAATTCTGAGACAAGTAACGAAGGGCCCTTTATATCATCAGTAGAATTTCATCCTAACATGAGTGTGGTACTTGTCGGAGGCAAAGCTGGTATAGTATCATTATTTTCTATTGGCGGAGATGTTAACAGCAAACTTCATAGCTTTAAATTAAAAAAATGGAATGTCACCACTTCACAATTCTCTCCAGACGGTTCTGAAGCATATATTGCATCTAAATTATGTCACAGCTACTGTGTATATAATCTAGCAAAAGCCGAGCCCATGTTGGTCCAGTTGCCAAGGATAGTAAAAACAGCAAAAATATTCAAACTATCACCTGATGGGAAATTCATAGCAACAAGTGATGGTTTCGATGAAATCTACATCATATGTGCCAATTCAAAAGAATTATTAAGAGCACTGAAGCATAACACAAATGTAGAATCAGTAGTATTCAGTAATAACTCAGAAAATTTGTACTGCTATGGCATTCAAGGTGAAATTACTGTGTGGGATCTGTCTATGTTCAGATCAATAAAGAAATTTACGGACAACGGCTGTATTACCGCATCTAAAATAGCAATGAGTCATTGCGGACAATTATTAGCCACTGGAAGCGGGGAAGGTATAGTTAATATATATGAAACAAAAAATCTGGCAACACAAAATCCATTACCTCTTAAGACTATTATGAATTTGACAACAAAAATAACCGATCTGAAATTTAATTCTACAACAGAGATATTATCTATAACATCCAGTTATTTTCCTAATGCCCTCAAATTAATACATATACCATCTTATCACGTGTTCTGTAATTTCCCCAAACAAAACCTGTACCAAGTCGAAACTGTCAGCTTTTCACCCAACAGTGGATATATGGGCATTGGTAATAACAAAGGCTGTGCTTATTTGTATAGACTTAAACATTTTAAAAACTATTAG

Protein sequence:

>DPOGS200118-PA
MKRKNSRLDEEETRLSELLFNKTRKFTENISENKKDIDIDRKPAWIDDDDEKTPINIIPQVKSEGLYVQKLKQKYETLIGTPSWAKVKEKNDEDDKLLRTVGHLQKTKTKSLKKDVLEIKKFPKINSETSNEGPFISSVEFHPNMSVVLVGGKAGIVSLFSIGGDVNSKLHSFKLKKWNVTTSQFSPDGSEAYIASKLCHSYCVYNLAKAEPMLVQLPRIVKTAKIFKLSPDGKFIATSDGFDEIYIICANSKELLRALKHNTNVESVVFSNNSENLYCYGIQGEITVWDLSMFRSIKKFTDNGCITASKIAMSHCGQLLATGSGEGIVNIYETKNLATQNPLPLKTIMNLTTKITDLKFNSTTEILSITSSYFPNALKLIHIPSYHVFCNFPKQNLYQVETVSFSPNSGYMGIGNNKGCAYLYRLKHFKNY-