Monarch geneset OGS2.0

DPOGS210422
Transcript: DPOGS210422-TA  900 bp
Protein: DPOGS210422-PA  299 aa
Genomic position: DPSCF300062 - 498564-499885
RNAseq coverage: 77x (Rank: top 65%)
Annotation
Heliconius: HMEL021588  4e-102  58.39%
Bombyx: BGIBMGA002752-TA  2e-54  52.17%
Drosophila: dwg-PA  6e-35  38.95%
EBI UniRef50: UniRef50_UPI0002060C67  4e-43  42.47%  UPI0002060C67 related cluster n=1 Tax=unknown RepID=UPI0002060C67
NCBI RefSeq: XP_001942940.1  7e-43  37.18%  PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastp: gi|328704977  1e-42  42.47%  PREDICTED: zinc finger protein 135-like [Acyrthosiphon pisum]
NCBI nr blastx: gi|328704977  5e-48  37.93%  PREDICTED: zinc finger protein 135-like [Acyrthosiphon pisum]
Group
Gene Ontology:
GO:0003676  1.2e-15  nucleic acid binding
GO:0005634  4.5e-09  nucleus
GO:0008270  4.5e-09  zinc ion binding
GO:0005622  2.1e-05  intracellular
KEGG pathway 
InterPro domain:
[148-174]  IPR013087  1.2e-15  Zinc finger, C2H2-type/integrase, DNA-binding
[3-45]  IPR012934  4.5e-09  Zinc finger, AD-type
Orthology group: MCL12594  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210422-TA
ATGTCTTTAGCCAATATTGAAATAAAACCTGACGATGGTTTGCCAGATGGAATATGTGAATCATGTAATGATCAACTAAACAACTGTATTAAATTCATTGATCGCTGTAAAAAGTCGGATTCCATATTAAGAAACGTGTGCTACAATGATATACCATCTAAATCACAAACGACAGACTTAAATAATGTGTGTATTAAATTAGAAGTTAGTGAAATAGATATACAAGAAAAATCTAGTTTACAAAATAATACTAGCTGTGATAAAGTATTAAGTGATTCCAATAATACAAACAAACAACAATGCTTTACTTGTGGAAAAGTTATGTCTTCCAAGTTCCGGCTGAAAACTCATTTAGCAACACATGCTACCGAGAAAGCATATATCTGTAGCATATGCAAGAAATCATTTTCAATATTACAGAATTTGAATGTACATTTGAGAACTCACACGGGTGAGAAACCATTTAGTTGTTCAACATGCGGGAAGACGTTTGCTCAGTCATCAGGTTTAACGGCACACAAACGTAAACACACAGGACTTTTGCCATACCAATGTGTTCTGTGTCCACGTAAATTTAGAACTTTTGGTCATTTGCAATATCATATAAGACAACACACGGGTGAAAAAAAATATGAATGTGATGTTTGCAGCCGAGCATTCATAACCAGAAGTGATCTGAAACAGCATGTAATGACCCACAGAGGTGATAAACCGCATATTTGTTCGATATGTGGCATGAGACTGTCGCGTGCATCAAATTTAAAGCGTCACATAACATATTTACATGACAAGAGTAAAACTTTCAATTGCTCTCAGTGCCCTTCGAAATTTATGAATAAAAGTGAACTGACAAAACATGAAAAAAAGCATCAGGAGCTTAAAGTGCCTAATAGTAATTAA

Protein sequence:

>DPOGS210422-PA
MSLANIEIKPDDGLPDGICESCNDQLNNCIKFIDRCKKSDSILRNVCYNDIPSKSQTTDLNNVCIKLEVSEIDIQEKSSLQNNTSCDKVLSDSNNTNKQQCFTCGKVMSSKFRLKTHLATHATEKAYICSICKKSFSILQNLNVHLRTHTGEKPFSCSTCGKTFAQSSGLTAHKRKHTGLLPYQCVLCPRKFRTFGHLQYHIRQHTGEKKYECDVCSRAFITRSDLKQHVMTHRGDKPHICSICGMRLSRASNLKRHITYLHDKSKTFNCSQCPSKFMNKSELTKHEKKHQELKVPNSN-