Monarch geneset OGS2.0

DPOGS205017
TranscriptDPOGS205017-TA1686 bp
ProteinDPOGS205017-PA561 aa
Genomic positionDPSCF300442 - 44103-49143
RNAseq coverage36x (Rank: top 74%)
Annotation
HeliconiusHMEL0225022e-9766.80% 
BombyxBGIBMGA001656-TA3e-6445.00% 
DrosophilaCG12299-PA2e-2529.80% 
EBI UniRef50UniRef50_UPI00020F5B257e-3130.50%UPI00020F5B25 related cluster n=2 Tax=unknown RepID=UPI00020F5B25
NCBI RefSeqXP_002427629.16e-2729.49%gonadotropin inducible transcription factor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3343132912e-3030.50%PREDICTED: zinc finger protein 160-like [Monodelphis domestica]
NCBI nr blastxgi|3343132912e-3930.50%PREDICTED: zinc finger protein 160-like [Monodelphis domestica]
Group
Gene OntologyGO:00036767e-08nucleic acid binding
KEGG pathway 
InterPro domain[509-540] IPR0130877e-08Zinc finger, C2H2-type/integrase, DNA-binding
[394-414] IPR0227551.6e-06Zinc finger, double-stranded RNA binding
Orthology groupMCL34574 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205017-TA
ATGGATGAAAACACTGAAATGAAATTAAAAACAATATGCTGTGCATGTTTGAGTGTTGATAGAAAGCTTACGAAACTGTGCCGAGTCGAGGATGGTGTAAATAGTTTATTCTTCCTACTTTCATACAACTGCGAAGTTTTCGAAGAAATATTCAACAAGAGAGCATCGCTTTTATCAATATGTTGGGAGTGTAAGGCTATTATATATAGACTGCAACAGTTCCGCAAACAAGTAAGCCTCGCCCAGAAACAACTGTCCGAGCTGACAGACGGAAGAGATCTCAAAAATTTCCACAGTCTATCAAAACTACAGTATTCCCATCAGGATAAATACAATTTTATTATAGAAAATATATCAAAGCCAGATAATTTTATAGACTGTGGCCCTGACATCAGTTTCTTGAAGACTGAAAGTGACACAGATGATATACCGCTAGCAGATTTATATTTAAATAATTGTAAAGATATACAACCTCCACCAAACGATATTAACATTGGCTGTATAGAAAATAATACAGATAATAAACATGTTGGGTCTGGATGTTCCAAATATGAAATGACAGAAGAGGAAATGTGGGAGAGTATGAAAACACAGAAAGAAAATGAATTTTACATGAACAGTTCAAGCAAATGTGAATCTTGTGTTAGAGTTTTCAATGACTATAAATGTTTGGAGCGACACAACTTGAAATTACATAGACCGAATCATATACAGTGCGGCATATGTAAAGTGTACGTGAAACGAAGGGGTTTTAAACAGCACAAACAGAGTCACTACACAAAACATGTATGTGATATCTGTCGGTTCGTTGCATACAAAATTGTCACAATGAATACACATCTGAGGAACGAGCATGGTGTTGATGTTGAATACAAAAAGGTAAGAAGGAAACCAAATGAGTCTAAGACATCGGACAAACAGACGGGGGGGTTTCTATGTACAGAGTGCGACAAATGGTTCGAGAACAAAAACAAAAGATACAAACACACACAGAAGTGTCACAGAGACGGCTTCAAGTGCGGCTCCTGCGGGAAGAGGTTCGCCTTCAGGAACACGCTGACCAGGCACGAGAGGGTACATTCCTCGCCACTGCCCAGGGAGCAGTGTCCCACGTGTGGGAAGCTCATCCGCCAGGACCTGATCAAAGCCCACGCTCACACACACACTCACAGACAGACCCATGTGTGTGTCGCGTGTGATAAACGCTTCATATCAAGGGCCTCGTACGAGAATCACCTCAAGTATGCCAAATCACACGCCGTTGGAGACGTCTTGAAATACAAATGTCCTGAATGCAAGAAAGGATACAGATCCCGTGGCGAGTTACGAGATCACGTCAACTACCAACACATGGGCAGGACCCTGCACAAATGTCCTATATGCGACAAGGCCCTAGCGACCCGCAGGTGCATAACCCGCCACGTCAAACGAGCGCATCACGGGATCAAAGAGAACGACAAGGACAAGATATGTCAGACGTGCGGGAAGACATTCCGGGACAATAAGTGTCTCCGGGAACACGAACTAATCCATACAGGCGAGAGACCGCTGTCCTGTGATGTTTGTGGTCGGACGTTCAGACAGAGCGCGTCATTATACACACACAGACGGAGGGTCCATCATATAGTGGCCGCTCAGAGGATCGTCGTACACGAGGAAGGTTCGGAGAAATTGAAACCGGTCTAG

Protein sequence:

>DPOGS205017-PA
MDENTEMKLKTICCACLSVDRKLTKLCRVEDGVNSLFFLLSYNCEVFEEIFNKRASLLSICWECKAIIYRLQQFRKQVSLAQKQLSELTDGRDLKNFHSLSKLQYSHQDKYNFIIENISKPDNFIDCGPDISFLKTESDTDDIPLADLYLNNCKDIQPPPNDINIGCIENNTDNKHVGSGCSKYEMTEEEMWESMKTQKENEFYMNSSSKCESCVRVFNDYKCLERHNLKLHRPNHIQCGICKVYVKRRGFKQHKQSHYTKHVCDICRFVAYKIVTMNTHLRNEHGVDVEYKKVRRKPNESKTSDKQTGGFLCTECDKWFENKNKRYKHTQKCHRDGFKCGSCGKRFAFRNTLTRHERVHSSPLPREQCPTCGKLIRQDLIKAHAHTHTHRQTHVCVACDKRFISRASYENHLKYAKSHAVGDVLKYKCPECKKGYRSRGELRDHVNYQHMGRTLHKCPICDKALATRRCITRHVKRAHHGIKENDKDKICQTCGKTFRDNKCLREHELIHTGERPLSCDVCGRTFRQSASLYTHRRRVHHIVAAQRIVVHEEGSEKLKPV-