Monarch geneset OGS2.0

DPOGS207044
Transcript: DPOGS207044-TA; 1173 bp
Protein: DPOGS207044-PA; 390 aa
Genomic position: DPSCF300001 + 1823917-1825170
RNAseq coverage: 349x (Rank: top 33%)
Annotation
Heliconius: HMEL006863; 0.0; 79.06%
Bombyx: BGIBMGA012850-TA; 1e-164; 70.50%
Drosophila: Meics-PA; 5e-36; 35.02%
EBI UniRef50: UniRef50_A8E518; 2e-36; 37.80%; Zgc:173726 protein n=10 Tax=Danio rerio RepID=A8E518_DANRE
NCBI RefSeq: XP_001946669.1; 7e-37; 37.18%; PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp), partial [Acyrthosiphon pisum]
NCBI nr blastp: gi|328726418; 6e-38; 37.61%; PREDICTED: zinc finger protein 271-like [Acyrthosiphon pisum]
NCBI nr blastx: gi|301628651; 2e-43; 38.63%; PREDICTED: zinc finger protein 91-like [Xenopus (Silurana) tropicalis]
Group
Gene Ontology: GO:0003676; 1.1e-09; nucleic acid binding
GO:0005634; 1.7e-05; nucleus
GO:0008270; 1.7e-05; zinc ion binding
KEGG pathway 
InterPro domain: [250-283] IPR013087; 1.1e-09; Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL26016 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species
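
The annotation above assigns this protein a C2H2-type zinc finger domain (IPR013087) and zinc ion binding (GO:0008270). As a rough illustration of what such a motif looks like at the sequence level, the sketch below scans a protein string with a regular expression derived from the PROSITE-style C2H2 pattern C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H. This is a swapped-in, simplified check and not the method InterPro used to produce the hit listed above. The function name `find_c2h2_motifs` and the variable `FRAGMENT` are hypothetical, and the fragment is a short excerpt copied from the DPOGS207044-PA protein sequence.

```python
import re

# Regular-expression form of the PROSITE-style C2H2 zinc finger pattern
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H (assumed here for illustration).
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")


def find_c2h2_motifs(protein):
    """Return (start, end, motif) for each non-overlapping pattern match.

    start and end are 0-based, end-exclusive offsets into the protein string.
    """
    return [(m.start(), m.end(), m.group()) for m in C2H2_PATTERN.finditer(protein)]


# Short excerpt of the DPOGS207044-PA protein sequence (hypothetical variable name).
FRAGMENT = "RYSCTLCTKSFTRIYSLRYHMAKHTDFRKYLCPRCGKTFHTSSGLRQHLVSH"

if __name__ == "__main__":
    for start, end, motif in find_c2h2_motifs(FRAGMENT):
        print(start, end, motif)
```

A regular expression only finds candidates by spacing of the cysteine and histidine residues; profile-based tools remain the more reliable way to call domains.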

Nucleotide sequence:

>DPOGS207044-TA
ATGTATAGGAATATACAAGATTTGATTGAACATGAAATGAATAAAATAAAGCTTATAGACATTTTAGTTTTTCTGAATTGTCTGGAAAATAATGATGAAGAAAACTGGCCCCAAGGAATGTGTGCCTCATGTGTTTCTACTGCGTTGCTGGCTTATAACTTCAAGTTAAATTGCTTAAAAGCCAATGTTACACTATCGCAAATTTTTACCATACAATCTACGTCAAATAATTTGCAGAGTGATGACTTAAATTCAATTGATATAAATGTTGTTTATCAGGACCATGAATATGATGTTCCTCTCTTTAATAGTCACCCAGCCATTGATTTTGCACCTGAAATACCTGAAAATAAGGAACTAACACCCCTACCTCCAGTGACTGAAATAACATCAACAGAGGGACTTCAGCGTAAAGAGGCAGAAAAGCGCTATTCATGTACTTTATGTACAAAGTCATTTACTAGGATATACAGTCTGAGATATCATATGGCAAAACATACTGATTTCCGCAAATATCTCTGCCCAAGATGTGGTAAAACATTCCACACATCTAGTGGTTTAAGGCAGCATTTAGTGTCCCATATTGACATCAATCAGTTTAAATGTGGTTTCTGCAACAAAACTTACAAATCAAGGCAGTCTCTAAAAGAACATTTTAGGGTGGCTCATTCCAGTAATCGCAAATTGTTTGCTTGTGTTACTTGTGACAAGAGATTCACAGCAAAATCTACATTGATGATGCATATCAAGATTCATAAGGGCGTAAAAGAATTTGCTTGTCCTGATTGTCCTAAAACATACACAAGGGCTACTTATTTGAGAGCTCATAGACTGACTCACTTGGGTCAACAGAAACCGAGGCCATTTGTATGCCTGTATAAACATTGTGACAGGAGTTTTGCTACTAAACATTCTTTAGTAGTACATATTGCACATACACACACTAGTGAGAGACCTCATAAGTGTGATGTGTGTTTTAAAGGATTTGCTACATCATCTGGTCTGAAGATCCATAAAGAATCTCATTCCAACAGAGAGGTTCCATGCAGCATTTGTGGGAAGAAGTTGGCTAACAAGAGAGTATTGCAGAAGCATGTTAGGGCCCATAATGCCCAGAGCAATGTTATTTCAGAGCATGTAGTTGATTTTTGCTCACCAGTATCAATCATTTAA

Protein sequence:

>DPOGS207044-PA
MYRNIQDLIEHEMNKIKLIDILVFLNCLENNDEENWPQGMCASCVSTALLAYNFKLNCLKANVTLSQIFTIQSTSNNLQSDDLNSIDINVVYQDHEYDVPLFNSHPAIDFAPEIPENKELTPLPPVTEITSTEGLQRKEAEKRYSCTLCTKSFTRIYSLRYHMAKHTDFRKYLCPRCGKTFHTSSGLRQHLVSHIDINQFKCGFCNKTYKSRQSLKEHFRVAHSSNRKLFACVTCDKRFTAKSTLMMHIKIHKGVKEFACPDCPKTYTRATYLRAHRLTHLGQQKPRPFVCLYKHCDRSFATKHSLVVHIAHTHTSERPHKCDVCFKGFATSSGLKIHKESHSNREVPCSICGKKLANKRVLQKHVRAHNAQSNVISEHVVDFCSPVSII-
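
The two sequences above are consistent in length: 1173 bp is 391 codons, that is 390 amino acids plus one stop codon, which the protein record writes as a trailing `-`. A minimal sketch of that relationship follows, assuming the standard genetic code and a coding sequence that starts in frame. The names `translate`, `CODON_TABLE`, and `stop_symbol` are illustrative and not part of any tool referenced on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 33 nt and last 18 nt of DPOGS207044-TA, copied from the record above.
    print(translate("ATGTATAGGAATATACAAGATTTGATTGAACAT"))  # MYRNIQDLIEH
    print(translate("CCAGTATCAATCATTTAA"))                 # PVSII-
    # Length bookkeeping: 1173 bp -> 391 codons -> 390 residues plus a stop.
    print(1173 // 3 - 1)                                    # 390
```

Running the same function over the full DPOGS207044-TA sequence should reproduce the DPOGS207044-PA record, including the trailing `-`.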