Monarch geneset OGS2.0

DPOGS204389
TranscriptDPOGS204389-TA1155 bp
ProteinDPOGS204389-PA384 aa
Genomic positionDPSCF300002 - 1524017-1525171
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0078170.084.20% 
BombyxBGIBMGA007843-TA2e-17977.18% 
Drosophiladwg-PA2e-1634.90% 
EBI UniRef50UniRef50_UPI00022AFE344e-2135.84%UPI00022AFE34 related cluster n=1 Tax=unknown RepID=UPI00022AFE34
NCBI RefSeqXP_002738214.19e-1934.15%PREDICTED: zinc finger protein 347-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|2964773563e-2033.60%zinc finger protein 347-like [Bos taurus]
NCBI nr blastxgi|3266672472e-2536.18%PREDICTED: zinc finger protein 850-like [Danio rerio]
Group
Gene OntologyGO:00036767.6e-06nucleic acid binding
KEGG pathway 
InterPro domain[233-266] IPR0130877.6e-06Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25396 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204389-TA
ATGTCTAAAAAAGAATCGATTGATCAAAACGAAAGAAAGTATGATACCAGCGATTTTGATAATATTATATCACCGCACAGTATATTTGAATCATCAAAACCTAAACACCAGGAATTACCGAGTTTATCAGGATTTTCTAATGCTGAAACTCAAACAAAACAAAGTGACTTAAAAGGACAAGGGATCCAAAGCGGTCTTGGTACCTGGCATTATAATCCCTGGTGGATGCTAGCGAGTACATCTCGAAGTGTACAATCACAAGATAATTTCTCAGAAAGCAGTGACCAAACAAAAGTAAAACAAGAACCCAGCGGTGCAGGTACTCCCGATCACGACACTCTACAAAGTGATCTGGATCAATTTCAAACCACACAAACACCTCCACCTCCAGCAGGACTGGATTCCTTTTGTGATGACTGTTCCGATCCTTTTTGTGATTCGAGTATATCTATCTGTCGAAAACTTTTCCATTGCCCACACTGTCGAAAAAGCTATCCTACGATTCTCGAATTTAATACACATCTAACGGGTGTTCATCCAGCACAAAAACCATTTAGATGCCAAATCTGTTTGGAACCATTTTACAAAAAATCTCATTTACGGAGACATTTAGATTCGAACCATACTCGAAAAGATGTGAACAAATGTACTGTATGCTCAAAATATATAAAGGATAAAAGTAACTTACGTAAACATATGCAAGTACACACCGGCCGCGTACCCCAAAAACAGTTCAAGTGTGACCTTTGCAATAATAAACGTTACATGTCATTAGATAGACTCAACAATCACAAGGTGGTTTGCACGGGTGAGAAAGTACTAAAGTTTTGTGATATGTGTACTAAGGTATTTGACAACTCGAGATCCTTGAACAGCCATAAGAAGGTTCATGCTAGGGAACTAAAATGTGACATTTGTGGAGAACAGCTACGCTCTTTAGAGCAATTTAACAGCCACAAAATGATATGTTTAGGTAAAGCACAGGAAGCTGGTAGTAGTGGTTCAACAAATGCAGCTACACCACCACTAAACAATTGTTGTACCCAGCCAGGACTGTGTGAGCATGATAAACCAGCCTACCTAAACATACCTAGTTATGCCGCCGGAAGAGTTATTGATGCTACTCTGTCATCTTTAAAATCAGAAATGAACTAG

Protein sequence:

>DPOGS204389-PA
MSKKESIDQNERKYDTSDFDNIISPHSIFESSKPKHQELPSLSGFSNAETQTKQSDLKGQGIQSGLGTWHYNPWWMLASTSRSVQSQDNFSESSDQTKVKQEPSGAGTPDHDTLQSDLDQFQTTQTPPPPAGLDSFCDDCSDPFCDSSISICRKLFHCPHCRKSYPTILEFNTHLTGVHPAQKPFRCQICLEPFYKKSHLRRHLDSNHTRKDVNKCTVCSKYIKDKSNLRKHMQVHTGRVPQKQFKCDLCNNKRYMSLDRLNNHKVVCTGEKVLKFCDMCTKVFDNSRSLNSHKKVHARELKCDICGEQLRSLEQFNSHKMICLGKAQEAGSSGSTNAATPPLNNCCTQPGLCEHDKPAYLNIPSYAAGRVIDATLSSLKSEMN-