Monarch geneset OGS2.0

DPOGS213025
TranscriptDPOGS213025-TA1158 bp
ProteinDPOGS213025-PA385 aa
Genomic positionDPSCF300024 + 324662-326826
RNAseq coverage830x (Rank: top 16%)
Annotation
HeliconiusHMEL0053400.095.47% 
BombyxBGIBMGA006943-TA0.093.96% 
DrosophilaSp1-PD3e-9261.44% 
EBI UniRef50UniRef50_D5GS302e-11860.10%Sp6-9 protein n=2 Tax=Parhyale hawaiensis RepID=D5GS30_9CRUS
NCBI RefSeqNP_001034509.13e-13163.62%Sp-like zinc finger transcription factor [Tribolium castaneum]
NCBI nr blastpgi|865153665e-13063.62%Sp-like zinc finger transcription factor [Tribolium castaneum]
NCBI nr blastxgi|865153661e-14266.91%Sp-like zinc finger transcription factor [Tribolium castaneum]
Group
Gene OntologyGO:00036765e-14nucleic acid binding
GO:00082704.5e-06zinc ion binding
GO:00056224.5e-06intracellular
KEGG pathway 
InterPro domain[290-314] IPR0130875e-14Zinc finger, C2H2-type/integrase, DNA-binding
[292-316] IPR0070874.5e-06Zinc finger, C2H2
Orthology groupMCL11661 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213025-TA
ATGTGCATTTATGATATTTTTACAGACGAAGAGCATCCGAACTTGCGCGGCACGCCATTGGCGATGCTCGCTGCGCAGTGCAGCAAGCTGTCCAGCAAGTCGCCACCACCACTGGCTGATGCAGCGGTCGGCAAAGGTTTTCATCCGTGGAAGAAAAGCCCTGGAACACATTCTCCACCGGGAGCTGGTTTGGTGCCTCGATCGCAGGCGTCGGCTTGCACACCATATGCACGAGCCCCTACCTCATGTGCTGCGGCGCCTTCATACGGAAACGAGCTCTACTTTCCTTCATCGGGTGATCAGTTGCTAGGGAAAAGTGAATCGAGCGCCAGTCTAGGCTCCATGTACTCAAGACACCCTTACGAGTCCTGGCCTTTTAATGTTGGAGGTGGCGGTGGTAGTGGTGCTTTGAAAGCAGCTGAAATGGGCGGTGTAAGCGCTGTAGGTAGTACTTGGTGGGATGTCCACAGTGGGTGGTTAGACGTTGGAGGTCAAATGGCAAACTACGCTGGGCAAGATTATTCTCAATTGACGCACTCTCTTTCTGGAGGAGCTCATTTGCTTCCTCCAGCGCCCCACCTCCTACAAGATGCATATAAATCTGTGTTGCCTACACAGGGATCTTTCGGTCTTCATGCACCAGGATCCCCAGCACCACCAGCTCAGGCTCCGTCACCGCGATCTCAGCGACGATACGCCGGCCGCGCTACTTGTGACTGTCCTAACTGTCAAGAGGCCGAAAGACTCGGACCGGCTGGAGCTCATCTTCGTAAGAAAAATATACATAGTTGTCATATACCTGGATGTGGAAAAGTATACGGGAAAACATCCCACCTTAAGGCTCATCTACGCTGGCACACTGGCGAGAGGCCTTTCGTGTGCAACTGGCTGTTCTGTGGAAAACGTTTCACACGCTCCGATGAACTACAGAGGCATCTGAGAACGCACACAGGCGAAAAAAGATTTGCATGTCCTGTGTGCAACAAACGTTTCATGAGGTCGGATCATCTCGCTAAACACGTCAAGACTCATAATGGAGGAAAGAAGGGCAGTTCGGAATCTTGCTCGGATTCCGAAGAGAATAGCCAAGGGGAGAGTCATGCTGGTGGAAGGTCGCCAGAGCATCACTTGGATGTGAAACCAGGTGCACTCGTGTGA

Protein sequence:

>DPOGS213025-PA
MCIYDIFTDEEHPNLRGTPLAMLAAQCSKLSSKSPPPLADAAVGKGFHPWKKSPGTHSPPGAGLVPRSQASACTPYARAPTSCAAAPSYGNELYFPSSGDQLLGKSESSASLGSMYSRHPYESWPFNVGGGGGSGALKAAEMGGVSAVGSTWWDVHSGWLDVGGQMANYAGQDYSQLTHSLSGGAHLLPPAPHLLQDAYKSVLPTQGSFGLHAPGSPAPPAQAPSPRSQRRYAGRATCDCPNCQEAERLGPAGAHLRKKNIHSCHIPGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKRFTRSDELQRHLRTHTGEKRFACPVCNKRFMRSDHLAKHVKTHNGGKKGSSESCSDSEENSQGESHAGGRSPEHHLDVKPGALV-