Monarch geneset OGS2.0

DPOGS204925
TranscriptDPOGS204925-TA792 bp
ProteinDPOGS204925-PA263 aa
Genomic positionDPSCF300160 - 680221-710983
RNAseq coverage192x (Rank: top 48%)
Annotation
HeliconiusHMEL0200193e-5767.88% 
BombyxBGIBMGA011419-TA2e-5062.50% 
DrosophilaCG42741-PA1e-3877.17% 
EBI UniRef50UniRef50_UPI00017585A17e-4036.76%UPI00017585A1 related cluster n=1 Tax=unknown RepID=UPI00017585A1
NCBI RefSeqXP_971645.21e-4036.76%PREDICTED: similar to Krueppel-like factor 5 (Intestinal-enriched krueppel-like factor) (Colon krueppel-like factor) (Transcription factor BTEB2) (Basic transcription element-binding protein 2) (BTE-binding protein 2) (GC-box-binding protein 2) [Tribolium castaneum]
NCBI nr blastpgi|1892398223e-3936.76%PREDICTED: similar to Krueppel-like factor 5 (Intestinal-enriched krueppel-like factor) (Colon krueppel-like factor) (Transcription factor BTEB2) (Basic transcription element-binding protein 2) (BTE-binding protein 2) (GC-box-binding protein 2) [Tribolium castaneum]
NCBI nr blastxgi|3454971014e-3938.40%PREDICTED: Krueppel-like factor 5-like [Nasonia vitripennis]
Group
Gene OntologyGO:00036761.6e-16nucleic acid binding
KEGG pathway 
InterPro domain[202-233] IPR0130871.6e-16Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25848 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204925-TA
ATGGTTTACATGGATGAAAGGCATGCCGTTGGGCGCCGTGTTGGTGCCTCGCTCCTACCATCGATGGAGGCGTGCGGCCCTCCGATTCAGCTGGAGCCTGTGGATCTTAGTCTGAAGAGACGCAATAAAATTGCGTCTTCAGCGACATCTTCTAAGGTCAGAATGGAGGAGGCAGATTCTCTGGATTGGAGCGACCGAAAAGACCTAAATTTCAGCAAGAAAATTAGAATGAATCGTCTCATTCACAGGTCAAGCCCTGAATCAGTCAGGATTAAGAGAGGTCCGGGTCTGAGGAGCGCCAGAAAAGTTAAGAATGCCATCAGATTCAAGCAGCAGGACACGCCAACACAGCTGTCACAGTACACAAACAAACAGATACTCACGCCCGTCATGAATTTCAGCTACCAGCAGATAAAGCCAGCGGGGGATATGAGTTTTTTTCTAAACACAGCCCTACAGAACGCTCTCACAGATTCACTGACAGAACTAAATAAGAAGTACAGAGAAGAGTCAGACGCCAATCAGAATAGGAAGAGGCTTCATAAGTGCGATGTGGCGGGCTGTCACAAGGTTTACACGAAGAGCTCCCATCTGAAGGCGCACAAACGGACCCACACAGGAGAGAAGCCTTATAGCTGCGGCTGGGCCGGATGCAACTGGCGCTTCGCCAGATCTGACGAGCTGACTCGGCACACTCGCAAGCACACGGGACATAGGCCCTTCTCATGCCCTTTGTGCCGACGGGCCTTCGCTAGATCTGATCATCTAGGACTCCATATGCGGAGGCACTGA

Protein sequence:

>DPOGS204925-PA
MVYMDERHAVGRRVGASLLPSMEACGPPIQLEPVDLSLKRRNKIASSATSSKVRMEEADSLDWSDRKDLNFSKKIRMNRLIHRSSPESVRIKRGPGLRSARKVKNAIRFKQQDTPTQLSQYTNKQILTPVMNFSYQQIKPAGDMSFFLNTALQNALTDSLTELNKKYREESDANQNRKRLHKCDVAGCHKVYTKSSHLKAHKRTHTGEKPYSCGWAGCNWRFARSDELTRHTRKHTGHRPFSCPLCRRAFARSDHLGLHMRRH-