Monarch geneset OGS2.0

DPOGS205502
TranscriptDPOGS205502-TA2166 bp
ProteinDPOGS205502-PA560 aa
Genomic positionDPSCF300056 - 441908-445065
RNAseq coverage82x (Rank: top 64%)
Annotation
HeliconiusHMEL0112860.062.90% 
BombyxBGIBMGA000133-TA1e-7966.67% 
DrosophilaCG31224-PA3e-1726.48% 
EBI UniRef50UniRef50_F4WFF19e-2428.21%Zinc finger protein 484 n=5 Tax=Myrmicinae RepID=F4WFF1_ACREC
NCBI RefSeqNP_001152818.12e-2530.74%zinc finger protein 2-like [Nasonia vitripennis]
NCBI nr blastpgi|2268232004e-2430.74%zinc finger protein [Nasonia vitripennis]
NCBI nr blastxgi|2268232001e-2729.45%zinc finger protein [Nasonia vitripennis]
Group
Gene OntologyGO:00056346.8e-07nucleus
GO:00082706.8e-07zinc ion binding
GO:00036768.4e-07nucleic acid binding
KEGG pathway 
InterPro domain[23-96] IPR0129346.8e-07Zinc finger, AD-type
[415-441] IPR0130878.4e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26602 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205502-TA
ATGGCAAAGGCAGTCAATTTACAAAATCTTGTAAAACATATTGTTAACGGTACGCTGGCGGAAAATCTTTGTCGTATATGTTTATTGCCATTAAATGATCTCTATGAGGATATTTTCACAAATATTTGTAAAGAAAACAACGAATACTGTGTGGCTGATGTATTAAACACGGTCTGTCAAATAAAGATTTCGCATGTGGACAACTACAGAGTATGTGCAAATTGTTTTATTAGTGCCTCTGAGGCTTACAAATTTTACCTTCTCACTCAAAGAAGTAATCAGATTCTAGAGTTCTATGTTAACGAATTGGAAAATCACGTTAATTCAATAGACTACCCGGAGAATATGTCTGTGGACTCTCTATGCATACCATTGCCAGTCATAACAGTTGATACGGATTACGATTTTGATATTAGTAAATTTCCATTTAAAAGTGTTCTTCACGAAAAGCCTATAGTTAACGGAGGAAATGGTACAACTGATGGTGGTGTAGATTGTGTCAAATCAGAAGATATAGATGAAGGGAACATAGTTATTGTGATGGATGATGGGGTGCCGACATTTTTTAAGCCGCAAAGAGATGGATCATTACTCACACTGGACGGAGAAAAAGATCTGAGTATAGACTTCGATAATGAGCCGGAACGGAAGATGATGAAGAAGAGAAAAAGGAAACGGGAGCCAATGGAATATAAAATATGCAGTCGTTGTCCCGTTAGATATAGATTTGTAAGGAAACTTAAAGAACACATGAAAGAGGAACACGGTGTTGACCTTTTTGTTTGTAAGATCTGCCAAGCTATTATAGAAGATGAACAGGAATATAATAATCACCTGCAAACTCACACAGGTATACACACATGCGCCATCTGCAACATGGTGTTTAAGAAACGTAACACAATCATAAAACATTTCAAATGGCACGAAGAAATGAAAACCAAGAGCCAATCCGACGGACTTCATATTTGCGAATACTGTGGCCTCATGTTGGTCAGTGAGGACGATTTAAAAGATCATAAGGAGAAGAAACATGTAAAAAAATACACGTGTTATTACTGCGGCAAGATGTACAAAGGGGAATTGAGTTTCGAAACGCACATTAAGAAACATGAAGCTAACAATGATTCGAATAGAAATGCAGTTAAAGTAAAGCAAGAAACAAAGAAGAAACGCAAGTACACCTGCCCTACCTGTAACCGAGACTTTGTTGATGAGAGGGCTCTTCTGTGGCATGAACGACTTCACACTAACGAAAGGCCGTATGTTTGTGAGGTGTGCGGCCGCGGGTTCGTTTCCCTGAACCGCCGTAACCAGCACTCGGTGTGCGCCCACACAGCCCCCGCGCGCCGCTGCCCGCTCTGTCCGGCGCTGTTCCATCTAAGGTCGATGGTCAACACCCACGTTAAGAAGGTACATCTGAAAACTCACAAGCGTCGTAATCGCACCAGCAAGTATCAGAAGGTGTTCTGGAAGACTGAACGCGTTCCCATCCAGGAGCTGAGCGTGTCCATACAAAATGAGCTGTTTGAGCTGCAGTCCACACAGAGCCAGCCTGTGCTCGCTAAGAAATGGGTATTTATATGTTTGTTCGCTGTGCTCGTGGGGGTGACAGATGGACGGCAGCGGACGGAATACGTTTTATTGAAGTTTATGTTTTCCGTACTTGTGGCTACATATTTTTAGATACAGATTCTATCTGGTTTCCAACACGGAAACACGCAACATGCAATCGTCCGTGTGGAGTGCAATCCACATCCAAATCTGGACGCACGTTTGTTAAGTTGTCCTTGAGCTCTGTTAACCGTGACCGCGAACGGCAATCGAATTGAGAATAGCAGTCTCTTAAATTGGAACGGTGTATCGCTCGGGATCACGGGGAGTCGAGGAACGAGGACATCTTCACGTTTCAAAAGCCCCGTTAAAATCGTTGCTTCCCCTGTTGTTCATTAATTTTTTAACTGTAAGTCGTGTGTCGTGGCAGAGTTTTGGCTGGTTAATGGCGCACTTAGGTGTATGTTAGTATCGTGAAAAATAGATAAAAACAATCCGTGAAGCCTGTTTTATGTCCTGTCAAAACTTTAGCTGAATCGATGCAGAACATTTTGAGTTTTTGAAGCATACACAAACCAACTTAACATTTTATATACATACATTTTGATTCAATAG

Protein sequence:

>DPOGS205502-PA
MAKAVNLQNLVKHIVNGTLAENLCRICLLPLNDLYEDIFTNICKENNEYCVADVLNTVCQIKISHVDNYRVCANCFISASEAYKFYLLTQRSNQILEFYVNELENHVNSIDYPENMSVDSLCIPLPVITVDTDYDFDISKFPFKSVLHEKPIVNGGNGTTDGGVDCVKSEDIDEGNIVIVMDDGVPTFFKPQRDGSLLTLDGEKDLSIDFDNEPERKMMKKRKRKREPMEYKICSRCPVRYRFVRKLKEHMKEEHGVDLFVCKICQAIIEDEQEYNNHLQTHTGIHTCAICNMVFKKRNTIIKHFKWHEEMKTKSQSDGLHICEYCGLMLVSEDDLKDHKEKKHVKKYTCYYCGKMYKGELSFETHIKKHEANNDSNRNAVKVKQETKKKRKYTCPTCNRDFVDERALLWHERLHTNERPYVCEVCGRGFVSLNRRNQHSVCAHTAPARRCPLCPALFHLRSMVNTHVKKVHLKTHKRRNRTSKYQKVFWKTERVPIQELSVSIQNELFELQSTQSQPVLAKKWVFICLFAVLVGVTDGRQRTEYVLLKFMFSVLVATYF-