Monarch geneset OGS2.0

DPOGS208489
Transcript: DPOGS208489-TA, 1905 bp
Protein: DPOGS208489-PA, 634 aa
Genomic position: DPSCF300064 - 1058223-1063233
RNAseq coverage: 50x (Rank: top 70%)
Annotation
Heliconius       HMEL004546               2e-47   29.58%
Bombyx           BGIBMGA010340-TA         3e-82   45.01%
Drosophila       CG5245-PA                2e-37   33.46%
EBI UniRef50     UniRef50_UPI00022AFBF1   6e-50   43.11%   UPI00022AFBF1 related cluster n=1 Tax=unknown RepID=UPI00022AFBF1
NCBI RefSeq      XP_001952754.1           6e-51   41.06%   PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastp   gi|296189420             9e-49   43.90%   PREDICTED: zinc finger protein 782-like [Callithrix jacchus]
NCBI nr blastx   gi|189236701             2e-55   33.57%   PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum]
Group
Gene Ontology:
  GO:0003676   4.7e-15   nucleic acid binding
  GO:0005634   5.7e-15   nucleus
  GO:0008270   5.7e-15   zinc ion binding
  GO:0005622   7.2e-07   intracellular
KEGG pathway 
InterPro domain:
  [423-445]   IPR013087   4.7e-15   Zinc finger, C2H2-type/integrase, DNA-binding
  [55-127]    IPR012934   5.7e-15   Zinc finger, AD-type
  [426-448]   IPR007087   7.2e-07   Zinc finger, C2H2
Orthology group: MCL30743, Lepidoptera specific

Nucleotide sequence:

>DPOGS208489-TA
ATGGAAAACTCAACTAATGAAAATGACTATGAGAATAAAATACCTCTGAATGCAATCGTAAATATAAAACCAACTAGTTCTGCGCATCATAAATTTGAGAATGTTTTGATAGAAAACGATATGAAATTGATAGCAATAAACACCATTGACGATATAAGATACCTTTGCCGTATTTGTCTTACAAATGAAGATAACATGATCTCTTTAATGTCTACTATTGATTCAGAATTGCTGGTGGATATATTCTCTTATGTTACATCAATAAAAGCACAATTGGAGGCAGATCTTCCTCAACAAATTTGTGACACATGTTATGATGATTTATTACAATGTTATAAACTTAGAAAAAAATCATTGAAATCAGAACAAACACTGAGGAAAGTGTTGAAATTAGATTCCGGATGCTGTTCTGATGATATAAGTGTGACAATAATGACAAAGGAAAAGGGAATACAGACTGATTGTAATGATTTGATTAAATTATGCGAAGAGAATGGTCCAAGAAGGATTAACTTTAAGGATGAAGATTTTAAAACAGAGTATGAAGAAAGTCATGAGGTTGAATATTTGGACGATGATTTCTTTGAATCAAATGATATTGTCACAAAAGATGTTGTCCATAGCATGAACAGAGAAACACAGCAAAATGATATGAAAACATTGATAGAAATGAAATCTGAAAAAGCGGTAAAGAACTTCCGTAAAATAAAAACATTAAGAGCTCATATGAAGAAGTGTAGAAATACAGAAGTGAAAAATTCTTTTCCCTGCGGAAAATGTAAGGAGACATTCAGCCACGAACAAGACCTGTGCATACATTCAGCATTACATACCAAAGGGAATAAGTGGACTTGTAACGAATGTCAGAAAGAATTCACTGAGAGGAACCGTTTCCGTCGTCACATCCGCCGTCACATGGCGTGCTGGAGGCTGGCCTGTGACGCGTGCGGCAAGACCTTCGCTGAGCCGTGCGCGCTACGCAGACATGCGCGTGTTCATACCGGCGAGAGGAAAGAGAAGACACTGCGCTGCGACATCTGTGACAAACGTTTCAGCGACCGCACCCAGCTAGCGACACATTCCACCCGTCATAGCGGCCTCATGCCCTGTTCGTGTTCCGTCTGCGGTAAGGCGTTTCCATCTCAGAGACTGCTCGCCTCGCACGCTCGTGTACACTCAGACTTGAAGCCCTACGCCTGTCTCTACTGTGACAAACGATTCAGGCACGAGTCCACCAGGAACACACACCACCGCACCCATACCGGCGAAAAGCCATACGTATGTTCCATATGCGGAAAAACCTTCATACAGAACTCTAACCTCAAGCTGCACATGAGGACTCACACGGGAGAGAAACCCTTCGAGTGTGCTATTTGTTCGGAGAAGTTCGGTCGGAAAAATTACTTGGTAAAGCATCTCCGGACACACAAAAATAAAGTAAATAAGGATACGGTGAAAAATCAGGAAATAGTTATTCTCCAGGAAGTGCCATTTGTTGTGGAGGATAGCGTTATATACAATGAGGACCCCGGCAACGAAGTATCCCTAGACACCGGCGTCAATACTGTTAGTAACAAAAATATAACTTTGGAAGTGTCCGAGGAAATCCCAATAGGGGTCTCGGGAGAACTTATTTTACACGAAAATGGTGAAGAGAAGACAGAGCTGGTGGTGGTGGACAACATTCAGAATGGTATGAATTTCACCAACGGTGATATATGTCTTGATGGGAACGTCAACTATGTGAATGATGTCAGTTTAGTTGGAGTCAACGATGATGTACCATCATCAGCCTCATCTGCCGAGGAAACCACCATGAAACTTTACAAATTGGACCAAAGTTTAGTACAAATACAAACATCCACCGGACAACTTACAATAAGGAAAATGACAGCTAACTTTTAA

Protein sequence:

>DPOGS208489-PA
MENSTNENDYENKIPLNAIVNIKPTSSAHHKFENVLIENDMKLIAINTIDDIRYLCRICLTNEDNMISLMSTIDSELLVDIFSYVTSIKAQLEADLPQQICDTCYDDLLQCYKLRKKSLKSEQTLRKVLKLDSGCCSDDISVTIMTKEKGIQTDCNDLIKLCEENGPRRINFKDEDFKTEYEESHEVEYLDDDFFESNDIVTKDVVHSMNRETQQNDMKTLIEMKSEKAVKNFRKIKTLRAHMKKCRNTEVKNSFPCGKCKETFSHEQDLCIHSALHTKGNKWTCNECQKEFTERNRFRRHIRRHMACWRLACDACGKTFAEPCALRRHARVHTGERKEKTLRCDICDKRFSDRTQLATHSTRHSGLMPCSCSVCGKAFPSQRLLASHARVHSDLKPYACLYCDKRFRHESTRNTHHRTHTGEKPYVCSICGKTFIQNSNLKLHMRTHTGEKPFECAICSEKFGRKNYLVKHLRTHKNKVNKDTVKNQEIVILQEVPFVVEDSVIYNEDPGNEVSLDTGVNTVSNKNITLEVSEEIPIGVSGELILHENGEEKTELVVVDNIQNGMNFTNGDICLDGNVNYVNDVSLVGVNDDVPSSASSAEETTMKLYKLDQSLVQIQTSTGQLTIRKMTANF-
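
The reported lengths (1905 bp transcript, 634 aa protein) can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record: it assumes the transcript is coding sequence only, ending in a single stop codon, and the helper names `parse_fasta` and `lengths_consistent` are hypothetical.

```python
def parse_fasta(text):
    """Return {header: sequence} for a FASTA-formatted string."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; the header is everything after '>'.
            header = line[1:]
            records[header] = ""
        elif header is not None:
            # Sequence lines are concatenated under the current header.
            records[header] += line
    return records


def lengths_consistent(transcript_bp, protein_aa):
    """True when transcript_bp bases encode protein_aa residues plus one stop codon.

    Assumption: the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Values taken from the record: 1905 bp transcript, 634 aa protein.
print(lengths_consistent(1905, 634))  # True
```

With the two FASTA blocks of this record saved to a string, `parse_fasta` returns the sequences keyed by `DPOGS208489-TA` and `DPOGS208489-PA`. The trailing `-` in the protein sequence marks the stop codon and should be stripped before counting residues.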