Monarch geneset OGS2.0

DPOGS212117
TranscriptDPOGS212117-TA1047 bp
ProteinDPOGS212117-PA348 aa
Genomic positionDPSCF300038 - 206843-208528
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0077275e-4158.40% 
BombyxBGIBMGA006748-TA8e-5066.15% 
Drosophila% 
EBI UniRef50UniRef50_UPI00021A78D85e-3740.69%UPI00021A78D8 related cluster n=3 Tax=unknown RepID=UPI00021A78D8
NCBI RefSeqXP_001633052.17e-3035.02%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|3407220622e-3640.69%PREDICTED: hypothetical protein LOC100649892 [Bombus terrestris]
NCBI nr blastxgi|3838605767e-3940.08%PREDICTED: zinc finger CW-type PWWP domain protein 1-like [Megachile rotundata]
Group
Gene OntologyGO:00082701.2e-13zinc ion binding
KEGG pathwaygga:4228972e-06 
 K11424 (NSD1_2)maps-> Lysine degradation
InterPro domain[86-128] IPR0111241.2e-13Zinc finger, CW-type
[144-223] IPR0003132.2e-12PWWP
Orthology groupMCL18847 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212117-TA
ATGGAAAATGTAGAAAATTGTAACTTGGCAATAGGCAGGGCTCATGAACCACAAGAAGAAGCTAATCAACCCCTGACACAAGACAATTCTAAACCTCAAAAGAACGTAAAAGACTACAGTACAGCGGTTGCAACAAATGAGAAACCTGATTTACAAATTGGCAGTACACACGAACTTCATCTTAGTCAGCCGTCGCGAGGTCTAACACACTTACAAAAGTTGATATGGCTTCAAAGCAAACGGACCAAAGGTCTCTGGGTGCAATGCGATGAATGTGATCGCTGGAGATATTTACCAGACATCCTTGATAGATATGAACTCCCAAAGAAGTGGTTTTGTTGGATGAATTCTGACAAATCTCTAGCAAGTTGCGCAGCACCAGAATCTCCAATCCACTTGCATGATGAAGAGGATTTAATTCACAATGAATATTCTGCTGGATCGTTAGTTCTAGCCAGAATTCCCGGCTGGCCGTGGTGGCCGGCAATGGTAGACGACTGTCCAGATACTGAACAGTTCTACTGGCTGGATGGATTCTCTGATATACCAACGCACTACAACGTGGTTTTCTTCGATCAATATGAAGTAACCAGAGCCTGGATAGATCCTCTACACCTGAAGCCTTATTGTAAACAAAAAAATACTGTAAAATTATCATCAAATAACAAGAAATACAGGAATCGCTTAGAGGCAGCTATATTACAAGCTAATGATGCTGAAACATTGTCTCTGATGGGGCGCTTAAAAAAATACAGTTTTATATGCAGATACAAAGGGCCTATCCTAAAACCGAAAACGGTCACGAAAAAATATAAAGAGAAATATCAAAAACAATTCAAAAGAAAATATAACATCGATTTGCTCGATGACTCCTCTGAGTCTGAGAGTGATAGTGATAGTATTAGTAACAAGACGACGCTAAGAAAAAATGTGACGGCTCAGTTCAGAAACATTCCTCTATTTTGGTCCAAGAAACGAAAAAAATCTGATTCAACTCATGTAACTGTCACACATGGTGAGATATTCTGTCGTTTAGATGTTTTTTGA

Protein sequence:

>DPOGS212117-PA
MENVENCNLAIGRAHEPQEEANQPLTQDNSKPQKNVKDYSTAVATNEKPDLQIGSTHELHLSQPSRGLTHLQKLIWLQSKRTKGLWVQCDECDRWRYLPDILDRYELPKKWFCWMNSDKSLASCAAPESPIHLHDEEDLIHNEYSAGSLVLARIPGWPWWPAMVDDCPDTEQFYWLDGFSDIPTHYNVVFFDQYEVTRAWIDPLHLKPYCKQKNTVKLSSNNKKYRNRLEAAILQANDAETLSLMGRLKKYSFICRYKGPILKPKTVTKKYKEKYQKQFKRKYNIDLLDDSSESESDSDSISNKTTLRKNVTAQFRNIPLFWSKKRKKSDSTHVTVTHGEIFCRLDVF-