Monarch geneset OGS2.0

DPOGS209864
TranscriptDPOGS209864-TA2553 bp
ProteinDPOGS209864-PA850 aa
Genomic positionDPSCF300510 + 3039-10091
RNAseq coverage100x (Rank: top 61%)
Annotation
HeliconiusHMEL0115310.061.57% 
BombyxBGIBMGA001744-TA2e-15948.96% 
Drosophilajim-PE5e-2828.23% 
EBI UniRef50UniRef50_Q5TYW11e-5029.03%Zinc finger protein 658 n=22 Tax=Primates RepID=ZN658_HUMAN
NCBI RefSeqXP_001946669.16e-4829.66%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp), partial [Acyrthosiphon pisum]
NCBI nr blastpgi|3266671062e-5228.26%PREDICTED: zinc finger protein 729-like [Danio rerio]
NCBI nr blastxgi|892739992e-6229.55%novel protein [Xenopus (Silurana) tropicalis]
Group
Gene OntologyGO:00056341.7e-11nucleus
GO:00082701.7e-11zinc ion binding
GO:00036766.2e-09nucleic acid binding
GO:00056223.8e-05intracellular
KEGG pathway 
InterPro domain[60-124] IPR0129341.7e-11Zinc finger, AD-type
[740-766] IPR0130876.2e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL19952 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209864-TA
ATGAAGACGTCTTGGTTGAGCTTGTTAGGGTTACCGCAATGGGTTCCTCGTAGTTTGGTGTGCTCTGACCACTTTAAGGACGACGATATTTGTGAAGTCGAAGGTGGCGAGAGGAAACTCCTACCAGGAGCTGTACCTAAAGCTGTTACTACACCAGTGAACAGTACTGTTACACAGGTATGTCGTATATGCTTGGGGGCAGAGATGAGCGTCTACCCGATACTAAATCCCGATCTTCAGCAGACATTTATATTACTCACAGGCATGAACATATACAAAGATGACTACCTGCCTCAGAATCTTTGTATCGAATGCATGCAAAGATTAAAAAATTGCTATAAATTTAGAACGCAGACGTTACAGGCGCAGACTATTATTGTGGATCTCATTAGAATAGGCAATTTCACAACAGAAGCTATAAAATCATTAAATCACAATCCCAAATGGAATCTGCAAATGCAGAGGTTTGATGCGAACCACTGCGATGCATACATTAACAAAGAGGACGTGAACCCCATACAAGAGATCGTTATAGAGGAACAGGTTAAAATTGAAAAGGAATTTATTAACGAGCAAGACACAGTCGCTATGGGCTACACAACAGACGACAGTCTGCCACTAGAGACGCAGAGAACCAAAGCCAAGAAGACAAAGAAGAAGAAAGAGAAGAAGATACAAGAACCGAAGGTAGATAGGAGGAGAAAGCCGTTCCTTAACGATGATCTGAATGAGAGTCTGTTCACTATCACCGATCTGACCTTGGAGGAACAGATAGCTGATATCCAGAAGAGACAGGAGAGTTCTAACTTCAAGAATTCAGTGTACAAGTGTATGGAGTGCTTTAAGGGTTTCCTTGATGAAGGAGCGTACAACGGACATATGACAAGGCATACTACTCAATGCGGTGAATATTGTTGTGAAATTTGTAAGACACATTTCAAACACTCGCACGCATTGAGGAAACACACGACGGCGCATCACGCGCAGAGGTTCAATTGTAACCGGTGTGCTTTCGTTACTACACACAGACAAACAGCACGACTCCACGAGCGATGGCACAAGGGCACCAAATACGAGTGTCCGCACTGTAATGAAGTGTTCCTTAAATTCACAACCTACATGGGACACATTCGCATCAAACACCCGTCGGATTTCGTGTGCGCTCTGTGCGGTTATTGCTTCGTCAGTCAGAAGGGGATAGATTTGCACAAGAAACTGAAACACAGATTACATCTCGGACAGATCCCGGAGGACGGGCCGCTCTGTGAGCTATGTGACGTGCGGTTCATATCACAAGAGGCTTACAAACGACACCTCAGCGTCTCCGCGAGACACGCTGGCGACGAAATATCCAAGGATCCCAGCAAACCGAAACGCGGAAGAAAATCAAGGGACGCCCTCGACAAAGACGACAGCGAAAAAAACGACTTAAACGACAAGAAAGTGTATCCGAGTCAAGTCAGAAAAGCAGAAGGTCCTATACCGTGCGAGCAATGCGGTATGCAGTTAGAGGACTCGCGCGCCTACCACGCTCACTTCAGACGGAACCATCCCGACAAGAACAGGACCAACTACCCCAGCATGAAGTCGCCCTGCATGTGCGAGGTGTGCGGCAGGATGTTCCAGAGTCACGCATTGTTGAAAGACCATCGCTGGGTGCATACAAACGAGAGGCCATTCGCCTGTGAGTGTGGCAAGAGGTTCCGTATGAAACAACGCCTGGTGGCGCACAGAAGGGTACACAGACAGACCAGGCACTACACATGTGCTCTGTGCGGGAAAGGATTCAGCACACACAGCAACAGGCAGAGACATATGATTGTACATTCACACCGGGCTAAAACCATTAGCAAACCGAAACGCGGAAGAAAATCAAGGGACGCCCTCGACAAAGACGACAGCGAAAAAAACGACTTAAACGACAAGAAAGTGTATCCGAGTCAAGTCAGAAAAGCAGAAGGTCCTATACCGTGCGAGCAATGCGGTATGCAGTTAGAGGACTCGCGCGCCTACCACGCTCACTTCAGACGGAACCATCCCGACAAGAACAGGACCAACTACCCCAGCATGAAGTCGCCCTGCATGTGCGAGGTGTGCGGCAGGATGTTCCAGAGTCACGCATTGTTGAAAGACCATCGCTGGGTGCATACAAACGAGAGGCCATTCGCCTGTGAGTGTGGCAAGAGGTTCCGTATGAAACAACGCCTGGTGGCGCACAGAAGGGTACACAGACAGACCAGGCACTACACATGTGCTCTGTGCGGGAAAGGATTCAGCACACACAGCAACAGGCAGAGACATATGATTATTCACACCGGGCTAAAACCATTTAAGTGTGAGATGTGCGGCAAATGTTTTAAGCATGCCAGCGAGAAACGGGCTCACATAACATATGTACATCTCAAGAAGCCCTGGCCGAAGAGATCACGGGCGAAGAGACAAGGACAGAATATAACAGGTATGGCGAGCGCGAGCGAGATAGACATACAAGGATGGAACGATCCCAAGATAGATCTGATGATGGATAAACAGTATTTCAATGTCAAGATGTAG

Protein sequence:

>DPOGS209864-PA
MKTSWLSLLGLPQWVPRSLVCSDHFKDDDICEVEGGERKLLPGAVPKAVTTPVNSTVTQVCRICLGAEMSVYPILNPDLQQTFILLTGMNIYKDDYLPQNLCIECMQRLKNCYKFRTQTLQAQTIIVDLIRIGNFTTEAIKSLNHNPKWNLQMQRFDANHCDAYINKEDVNPIQEIVIEEQVKIEKEFINEQDTVAMGYTTDDSLPLETQRTKAKKTKKKKEKKIQEPKVDRRRKPFLNDDLNESLFTITDLTLEEQIADIQKRQESSNFKNSVYKCMECFKGFLDEGAYNGHMTRHTTQCGEYCCEICKTHFKHSHALRKHTTAHHAQRFNCNRCAFVTTHRQTARLHERWHKGTKYECPHCNEVFLKFTTYMGHIRIKHPSDFVCALCGYCFVSQKGIDLHKKLKHRLHLGQIPEDGPLCELCDVRFISQEAYKRHLSVSARHAGDEISKDPSKPKRGRKSRDALDKDDSEKNDLNDKKVYPSQVRKAEGPIPCEQCGMQLEDSRAYHAHFRRNHPDKNRTNYPSMKSPCMCEVCGRMFQSHALLKDHRWVHTNERPFACECGKRFRMKQRLVAHRRVHRQTRHYTCALCGKGFSTHSNRQRHMIVHSHRAKTISKPKRGRKSRDALDKDDSEKNDLNDKKVYPSQVRKAEGPIPCEQCGMQLEDSRAYHAHFRRNHPDKNRTNYPSMKSPCMCEVCGRMFQSHALLKDHRWVHTNERPFACECGKRFRMKQRLVAHRRVHRQTRHYTCALCGKGFSTHSNRQRHMIIHTGLKPFKCEMCGKCFKHASEKRAHITYVHLKKPWPKRSRAKRQGQNITGMASASEIDIQGWNDPKIDLMMDKQYFNVKM-