Monarch geneset OGS2.0

DPOGS210021
TranscriptDPOGS210021-TA1968 bp
ProteinDPOGS210021-PA655 aa
Genomic positionDPSCF300372 - 67316-69993
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0061975e-7243.81% 
BombyxBGIBMGA010719-TA7e-8144.97% 
DrosophilaCG6654-PA2e-2235.80% 
EBI UniRef50UniRef50_UPI00023AD3433e-4030.35%UPI00023AD343 related cluster n=3 Tax=unknown RepID=UPI00023AD343
NCBI RefSeqXP_002429661.12e-3224.95%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3584170001e-3930.35%PREDICTED: zinc finger protein 836-like [Bos taurus]
NCBI nr blastxgi|3584170002e-4830.35%PREDICTED: zinc finger protein 836-like [Bos taurus]
Group
Gene OntologyGO:00056344.2e-14nucleus
GO:00082704.2e-14zinc ion binding
GO:00036767.9e-07nucleic acid binding
KEGG pathway 
InterPro domain[361-430] IPR0129344.2e-14Zinc finger, AD-type
[546-581] IPR0130877.9e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL30772 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210021-TA
ATGGTAACTTCGTATTATATTATGAAAATAAACAAATCACATAAAAATGTGGAGGTTGACGAATACTTTGATAATTTAAGTAAAGATAACAATATAGTTAACATAACCCAGCGATTTAACAGCCAACAGACCTGCAGAGTTTGTTTAAAAGAAGGTTCACTGCCAATCTTCGGGAATCAAAATACTCCCGATATAATTGAAGCCTTAAGTATATTCGGCGATGTTGAAGTCAACAAACGTGAAGAATATCCAAACAAACTGTGTAAAATATGTTTTAAATTCCTTAAAGGTGCAATTTTATTCAGAAAACTCGCCAAACACACCAATGAATTACTGAAACAACCTTTGAAAGTGGAGCCTGAAAATCAAGATGTTTCAGAGACATCAAATGTTGATGAAGATATGCACAGTTTAAATGATAAAAAACAAATTATTCTTCCAAAACTAGAAAAAGACAAAAGGAATCTTAAAGTTCAATGTTATGTCTGCAATAGAATAGTTAATAGGTCCTATTATAAAGAACACATGACTATGCATGACCCCGATCATAAGAAATATGTTTGTGATATCTGCGGAAAGTCCTTTAGGCTACGATGTGCATATCACAACCACAGTCTCAGACACCGCAACGACTTCCCTTTCAAATGCCAATTCTGTCCATATAAAGGCAGATATGCTGAACTCCTAAAAACACACATGCACACACACACTAAAGACTACAGATATATGTGTACAGAGTGTCCGGCGAGATTTTTATTTAAAAGCAATCTTAACAGTCATATCCTCCTCAAACATAAGGAACCACAGTTTAAATGTGACGCCTGCAAGCGGGCATTTCATACAAAGCTAACACTTCGGAGACATTTTGAGGCTGATCATTTGGGTATCAGAAACCATGTATGCAATGTATGTGGTAAGGCATTTGGTTATAGAAATGCTATGATGAAACACCAACGTCACGTTCATAAGAGAGAAAAATTCATGTACTCTCGCATGCCTGCATATTTACAGGCTGAAAGTGCTTATTACGAAACATTACAGAATTCGACATTCAAAAATGTGTATACTTCTAACTCTGCAACGTGTCGCATTTGTCTTAGAAAGGGTGTCGAACCAATCTACGGAGTTGAAAAGGGTACGACTATTGAAGAAGCCTTAAAATCATTTGGGAACATTGAAATACATGAGGATGATGAATATCCAAAATATTTGTGTAAAGTGTGTTTACAATTCTTAAATAAAGCCATTTCATTTAGAAATTTAGCAAGAAAAACAAACGAATTTTTAAGAGACAGAATTAAAAACGAACCATATCAAGATGACTTCATTCCTAACCAAATTATTAGACGAACGGACGTAGACGCTACTAATAATCAAACAGATGTGCATTCCACAAACCCTGATTTTAAAAAAGAAAAAGATATCAGAATTCAATGCGGTACTTGTAAAAAAATTATACGGAAGTCATATTACAAAATACATAAGACCATGCACGATCCTGAACATCACAAATATGTCTGTGATGTTTGTGGTAAAACGTTTAGGTTGAGAGTAGGCTACCACAACCATAGATTGCGTCATAGAACAGATTTTCCATACAAATGTCACCTGTGTCCTTATAAAGGAAGATATGCCGAGAGACTCAAAAGTCACATGAGGACTCACAGCGGTGAATATAGATATATGTGTACGGAGTGCCCGGCTAGATTTTTGTTCAAGGGAAACTTTAACAGTCATGTTCTACTAAACCACAGAAACCCAGAATATAAATGTGGTTCTTGTGGTAGAGCTTTTCACACACAACTAATTCTACAAAGGCATAACGATGTTGAACATTTAGGGATCAGAAGTAACGTGTGTAATATCTGCGGAAGGGCCTTCGGATATAGAAACGCTATGATGAAACATCAGAGGCGAGTCCATAAGAGGGAAAAGTTACGATTTTCCTTCAGGCCTTCTGAAGTTTGA

Protein sequence:

>DPOGS210021-PA
MVTSYYIMKINKSHKNVEVDEYFDNLSKDNNIVNITQRFNSQQTCRVCLKEGSLPIFGNQNTPDIIEALSIFGDVEVNKREEYPNKLCKICFKFLKGAILFRKLAKHTNELLKQPLKVEPENQDVSETSNVDEDMHSLNDKKQIILPKLEKDKRNLKVQCYVCNRIVNRSYYKEHMTMHDPDHKKYVCDICGKSFRLRCAYHNHSLRHRNDFPFKCQFCPYKGRYAELLKTHMHTHTKDYRYMCTECPARFLFKSNLNSHILLKHKEPQFKCDACKRAFHTKLTLRRHFEADHLGIRNHVCNVCGKAFGYRNAMMKHQRHVHKREKFMYSRMPAYLQAESAYYETLQNSTFKNVYTSNSATCRICLRKGVEPIYGVEKGTTIEEALKSFGNIEIHEDDEYPKYLCKVCLQFLNKAISFRNLARKTNEFLRDRIKNEPYQDDFIPNQIIRRTDVDATNNQTDVHSTNPDFKKEKDIRIQCGTCKKIIRKSYYKIHKTMHDPEHHKYVCDVCGKTFRLRVGYHNHRLRHRTDFPYKCHLCPYKGRYAERLKSHMRTHSGEYRYMCTECPARFLFKGNFNSHVLLNHRNPEYKCGSCGRAFHTQLILQRHNDVEHLGIRSNVCNICGRAFGYRNAMMKHQRRVHKREKLRFSFRPSEV-