Monarch geneset OGS2.0

DPOGS208517
TranscriptDPOGS208517-TA1818 bp
ProteinDPOGS208517-PA605 aa
Genomic positionDPSCF300064 - 25061-30084
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0028111e-13443.85% 
BombyxBGIBMGA008453-TA2e-3335.79% 
DrosophilaCG11247-PC3e-2530.43% 
EBI UniRef50UniRef50_F1R8F32e-3030.60%Uncharacterized protein n=7 Tax=Danio rerio RepID=F1R8F3_DANRE
NCBI RefSeqXP_001862548.16e-2933.64%zinc finger protein 8 [Culex quinquefasciatus]
NCBI nr blastpgi|1417957793e-3030.60%LOC100005466 protein [Danio rerio]
NCBI nr blastxgi|1417957799e-3330.48%LOC100005466 protein [Danio rerio]
Group
Gene OntologyGO:00036761.4e-11nucleic acid binding
GO:00056341.8e-07nucleus
GO:00082701.8e-07zinc ion binding
GO:00056225e-06intracellular
KEGG pathway 
InterPro domain[520-547] IPR0130871.4e-11Zinc finger, C2H2-type/integrase, DNA-binding
[4-73] IPR0129341.8e-07Zinc finger, AD-type
[528-550] IPR0070875e-06Zinc finger, C2H2
Orthology groupMCL22182 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208517-TA
ATGGAAGTCATTTGCAGAGCTTGTCTATCCAAACATGAACAAACGGATCTTTTACAATATTCTGAGAAAAATAGACGATTGTTTGTATATTGCACAGGATTGCAGGTGAAAAGAAATGATGCTTTTGCTTTTCAAATGTGCAAGGATTGTTATATCAACATGAAGGTGGCTTGCCATTTTAAGAAGACTTGTAGAAATTCCGATAAAAAAATTAAAAAGTATTCAGCCATCAAAGAAAGTGGAGATTATATAGATATTTATGATTTTCTTAAAGACAATGACGATCCAATAAAATTACGGCTACCATTAAATTTGGGTAAAAGTCCATCACCATACAGAAGAGATGAAGACAATGAATCAACATGTACAAGCATCCAGAACTTCATGACCGACATCCTTCAAGAGCTGCCAGATTCCGAAGCTAACATCATCAAACAGGTTATTGAAGAAGAGGCTGACATACTCGAAGATTCACTCGACTCACACTGGCTAGAAGACCTGTCGGATAATGAGCTCGGAATGGATTTCAGTTTCAGCCCGTTTTCAACCCCGAGGACACTTCAGAATGACAAAGACGATGACATTTCGACTTCAAAGACATTGAATAATAATGAAGAAATCAAAATAAATAAAACACATGAATGCAGCTTAGATAATATTATAACTAGGAGCATGGACATTGATCTGGAATCACTGGACAAAAGAATTAACTGCGAGGCTGGGAATAATGTATGTGTGATTGATATTAACATAGAGAACGCTTTGAAAAACACAGCAGAGAAGGTTATGTTAGACGATCTACTATCAACACCTCCAGTTTTACCGAACGTGACATCACCGGCTACTCCACTTATAAAGAATATTCTGTTTGGGGATATGGAAGAAACATCACGTTATAATGATGAACCACTAACAAGAACGACAGAAATCAAAGAAAATATTGAAGTTATAGATGAATTTTTATACAAAAATGTATTAAATGATGTAAATACAGAAGACGGTTATAATGATATTAATGAAATAGAGAGATATTTGACTAAACCAGAAATAAAAAGGAAAAATTTAGAAGATATACCATCAGCAAAAAATAAATATTGCATAACAAATTTTTATTGTAAAATGTGTGATAGGAAATTTAAGAATTTGGTCGCTTTGAAAGTCCACTGTGCTAAATATCACAAGCTTAGGATTCCAAAAGATAATGTTGCCAGGATAAGAAAAAAAATGATTTGTGATTATTGCGGGAAAATATTTAATTCGCCAAGATGTATAATCAAGCATATTGAAAATCATAAGAATCCTGTGTCATATGAATGTAAAAGATGTTCGCTGAAATTTGATACAAAGTCGAAGTTGAGACTTCATCAAGGCACTCACGAGCATAATGCCAACAACAATAATGTTATAAAGAATCACGTGTGTTCAATATGCGGCTGGGCGTGCACAAGTTCATCGAATTACAACATACATATCAAGAGACATTTCAATATATACATGCATACTTGCGAGGAGTGCGGACAGGGTTTCTACAGGAAATGTGATATGAATGCACATATGAGACGTCACACAGGCGAGCGTCCGTTCCAATGTTCGTATTGTTCTAGAAGTTTCGCTAGACACGACGCACTCAACAGGCACATTAAGCGACACACGGACGAAAGGCCGTATCCATGTAACTTTTGTAAATCAACATTCACTAACGCTTACGATCTTAGACACCACAAAGAAAATGCGAAGTCTTGTTTGAAAATTCAAATGTTGTTAGCGAAGAAGAGTAATGTTGCCAGTGATATAGTATTAACAACAAATGTTACCTGA

Protein sequence:

>DPOGS208517-PA
MEVICRACLSKHEQTDLLQYSEKNRRLFVYCTGLQVKRNDAFAFQMCKDCYINMKVACHFKKTCRNSDKKIKKYSAIKESGDYIDIYDFLKDNDDPIKLRLPLNLGKSPSPYRRDEDNESTCTSIQNFMTDILQELPDSEANIIKQVIEEEADILEDSLDSHWLEDLSDNELGMDFSFSPFSTPRTLQNDKDDDISTSKTLNNNEEIKINKTHECSLDNIITRSMDIDLESLDKRINCEAGNNVCVIDINIENALKNTAEKVMLDDLLSTPPVLPNVTSPATPLIKNILFGDMEETSRYNDEPLTRTTEIKENIEVIDEFLYKNVLNDVNTEDGYNDINEIERYLTKPEIKRKNLEDIPSAKNKYCITNFYCKMCDRKFKNLVALKVHCAKYHKLRIPKDNVARIRKKMICDYCGKIFNSPRCIIKHIENHKNPVSYECKRCSLKFDTKSKLRLHQGTHEHNANNNNVIKNHVCSICGWACTSSSNYNIHIKRHFNIYMHTCEECGQGFYRKCDMNAHMRRHTGERPFQCSYCSRSFARHDALNRHIKRHTDERPYPCNFCKSTFTNAYDLRHHKENAKSCLKIQMLLAKKSNVASDIVLTTNVT-