Monarch geneset OGS2.0

DPOGS206920
Transcript         DPOGS206920-TA   1104 bp
Protein            DPOGS206920-PA   367 aa
Genomic position   DPSCF300001 - 1373927-1375313
RNAseq coverage    96x (Rank: top 62%)
Annotation
Heliconius        HMEL010626         3e-126   60.55%
Bombyx            BGIBMGA013018-TA   7e-81    50.18%
Drosophila        CG31223-PA         1e-26    28.99%
EBI UniRef50      UniRef50_E9IUG0    3e-60    37.33%   Putative uncharacterized protein (Fragment) n=2 Tax=Myrmicinae RepID=E9IUG0_SOLIN
NCBI RefSeq       XP_001842860.1     3e-60    36.51%   HIT zinc finger family protein [Culex quinquefasciatus]
NCBI nr blastp    gi|322791264       1e-59    37.33%   hypothetical protein SINV_05393 [Solenopsis invicta]
NCBI nr blastx    gi|340717689       3e-68    38.29%   PREDICTED: LOW QUALITY PROTEIN: zinc finger HIT domain-containing protein 2-like [Bombus terrestris]
Group
KEGG pathway 
InterPro domain   [11-39] IPR007529   3.7e-08   Zinc finger, HIT-type
Orthology group   MCL16581   Patchy

Nucleotide sequence:

>DPOGS206920-TA
ATGTCATCTCAACCATCCAAAAGAATTGAAAGGTTATGCGGTTTATGTGAGAACAATTCAAGCAAATATTGTTGCCCTCGTTGCGAAGTATTTTATTGCTCACTTGACTGTTACAAGTCTGAAAAGCATTTGGAATGTTCAGAGAATTTTTATCGAGAATGTGTGAATCAAGAACTTGCTTCGAACACGGCGGACGATGAAGCTAAAAACAAAATGATAGATATTCTAAAAAGGATGCACAATGAAAGCTTTGACAATGATTTAGAAAACATAGAAGAACCAGTAGAGATTGACTCGGATGATGATTCTGAAATTGATTTGCATACAAGGATCAAAGATTTGAATTTAGATGATCCAGATGCACTTTGGAATGCACTCACATCTGATGAAAAGAATGAATTTGAGGCCATGCTCAGTCAGGGTGATGTTGGGACTATAATTCCACAGTGGCAACCTTGGTGGCTGTTCAGAAAGGAAAAGAAACTGGTTGAAGAAGTAGATCTTAATGGTGAAAAGGAAGCACTTGAAAGGTGTCCTACAATTAAAACAGTCCCTAAATTTAGTTCGCTAACTAGTATTCAGCCTTCGCCATCAATAAGGTTTAATATGGTAAATATCATAGCCGCATATGCCTTTGCTATAAGATATGTTAATGGGGAAATTAATCATATAGAAATATGCACATATATTCTTGAAGTGTGCAGTAATCTGAAAACTAATACTAATTTTGAGGATGCTGAACTCGCCATTGAGTCTGTCGCACAAATGTGTATACAAAATGAACATATAGACACAGACGAAGCAAGTTTGAATGTAATGAGGCATGACACCTTCCTTATCCTACAAGGCCCCAGTGAAGAAAACCATTTACATTACAGTAAAACAGCTTTCTCACATCTCTTAGATATTTTCATTGAAGCTAAGACTCAGATGAAGAGAAATAAACCTAAAGAGAGTAAAACAAATGGTAATTTTTCTAGAAAATTTCCCCAATGCAATAAAAGCCATTTGCCTGATATTGATACAGCTACAATTAAAAGGGTCATAAAGAAATTGGAATACTATCTGGCATACTTAGAGAGCTGCAATAATAAGGCGGAGTAA

Protein sequence:

>DPOGS206920-PA
MSSQPSKRIERLCGLCENNSSKYCCPRCEVFYCSLDCYKSEKHLECSENFYRECVNQELASNTADDEAKNKMIDILKRMHNESFDNDLENIEEPVEIDSDDDSEIDLHTRIKDLNLDDPDALWNALTSDEKNEFEAMLSQGDVGTIIPQWQPWWLFRKEKKLVEEVDLNGEKEALERCPTIKTVPKFSSLTSIQPSPSIRFNMVNIIAAYAFAIRYVNGEINHIEICTYILEVCSNLKTNTNFEDAELAIESVAQMCIQNEHIDTDEASLNVMRHDTFLILQGPSEENHLHYSKTAFSHLLDIFIEAKTQMKRNKPKESKTNGNFSRKFPQCNKSHLPDIDTATIKRVIKKLEYYLAYLESCNNKAE-
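
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration and not part of the original record. It assumes that the transcript listed here is coding sequence only (it begins with ATG and ends with the stop codon TAA, which the protein sequence shows as a trailing "-"), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals one codon per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
transcript_bp = 1104
protein_aa = 367
genomic_start, genomic_end = 1373927, 1375313

print(coding_length_consistent(transcript_bp, protein_aa))
print(span_length(genomic_start, genomic_end))
```

Under these assumptions the genomic span is longer than the transcript, which would be consistent with the gene containing intronic sequence, although the record itself does not list exon boundaries.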