Monarch geneset OGS2.0

DPOGS206321
TranscriptDPOGS206321-TA1215 bp
ProteinDPOGS206321-PA404 aa
Genomic positionDPSCF300082 - 490940-493622
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0125925e-7246.24% 
BombyxBGIBMGA014137-TA2e-8940.18% 
DrosophilaCG6654-PA2e-1329.89% 
EBI UniRef50UniRef50_UPI0001DE84518e-1527.89%UPI0001DE8451 related cluster n=1 Tax=unknown RepID=UPI0001DE8451
NCBI RefSeqXP_001813550.15e-1528.71%PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum]
NCBI nr blastpgi|2962102641e-1529.02%PREDICTED: zinc finger protein 729-like [Callithrix jacchus]
NCBI nr blastxgi|3343273756e-1829.41%PREDICTED: zinc finger protein 420-like [Monodelphis domestica]
Group
Gene OntologyGO:00036763.1e-09nucleic acid binding
KEGG pathway 
InterPro domain[276-308] IPR0130873.1e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26175 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206321-TA
ATGGAAGAAAATGATGATATGTTTATCGTGGTCATCGGGAACGAAAGTGATCAAACTTTTCCCGCGCCAATGCAAATTAAAGAGGAAGTTTTAGAAGAAAGTGCAGTCGAAGATGTCGAGGATGTAATTAACAAAGCAAAAGAGTTCGGAGAAGTAGTGGATGTAAAATGTCAGACTGTAAAAGAGGATTATGATAGAAATGTAGATATACAACACGACTTTGAATACGAATTTGTGGATGTGGATTTTATCAAATCTGAAAAAGACGAATCTGATGAGATAAGTGATGAAGATGCTAGAACTATGAAGACAAGAATTAAGGGTTTAAAAATTAAGAGACGGAAGCTTCTGAAACCATCCGAAAGACGGTCGTATATAAAGGAGCTGAAAGAACAGTTTCCAGAATTGCAAGATGATGAAGAGCTTCTCGTTAGATGTTTGGTTGAAATAATGAAGACTACCAAGCCCCCGCCGCCGCCGCTGGATTATTATGTTATGGACGGAATAATGTTAGAGTGTGTTATATGTGGAACTCATAATGAATCAATACCGGCAGCTGGTCGGCATTACCAGGAGAAACACGGAGAGAGATATTTGTACTGCTATGCTTGTGGAGCTAACTTTAGAAGTACTACCAACCTATATAAGCATGAGAAGCGCTGCGAAGCTCCGCATATCAAGTTGGTACTACGTGCGAGAGCCCTGTCCCTAGGGAGAAAGGGACGTTCCAGACCTTTCCTACCTAAATTTGATAACACTGAGCCGAGAAGATACAACTGTGATGAATGTTCAGCGTCTTTCTCAACAAAGTATTCTCTACAAGCACATCAGTTACTCCACAGAGGCCTGAGACCTTATCGCTGTCCCGTGTGTCCCTGCGCCTACACCTCCAGCACTGTCTTGACTCGCCACATGAAGAAGCACGGCTCGTCCCGGTTCGTGTGCGCTCACTGTGACCGCAGCTTCAACGTCAAGGCGGCCCTCGTGGCGCACCTCAACACACATCTGCGTATGTTACAGTATCTTACATTCAAATACCCGAGGATGTCAGTACTGAAAGAGCACATGAGGAAAGTACACGGCATGGAGCTCATGACGAGAAAGATGTTCTTCAAGAAGCTGCCGACGCTGTCCGACACTCAGTTGCATCAGGCCAAAGTGATTCTGAAACACGAAGTACACGAAAACGACTACTACATGTCCAAACATACATAA

Protein sequence:

>DPOGS206321-PA
MEENDDMFIVVIGNESDQTFPAPMQIKEEVLEESAVEDVEDVINKAKEFGEVVDVKCQTVKEDYDRNVDIQHDFEYEFVDVDFIKSEKDESDEISDEDARTMKTRIKGLKIKRRKLLKPSERRSYIKELKEQFPELQDDEELLVRCLVEIMKTTKPPPPPLDYYVMDGIMLECVICGTHNESIPAAGRHYQEKHGERYLYCYACGANFRSTTNLYKHEKRCEAPHIKLVLRARALSLGRKGRSRPFLPKFDNTEPRRYNCDECSASFSTKYSLQAHQLLHRGLRPYRCPVCPCAYTSSTVLTRHMKKHGSSRFVCAHCDRSFNVKAALVAHLNTHLRMLQYLTFKYPRMSVLKEHMRKVHGMELMTRKMFFKKLPTLSDTQLHQAKVILKHEVHENDYYMSKHT-