Monarch geneset OGS2.0

DPOGS205199
Transcript          DPOGS205199-TA    1869 bp
Protein             DPOGS205199-PA    622 aa
Genomic position    DPSCF300265 - 305836-308425
RNAseq coverage     75x (Rank: top 65%)
Annotation
Heliconius        HMEL013456                8e-72    29.34%
Bombyx            BGIBMGA014612-TA          7e-94    45.76%
Drosophila        crol-PE                   2e-26    24.67%
EBI UniRef50      UniRef50_UPI00022567FB    5e-37    26.93%    UPI00022567FB related cluster n=1 Tax=unknown RepID=UPI00022567FB
NCBI RefSeq       XP_002730599.1            2e-39    26.54%    PREDICTED: zinc finger protein 197-like [Saccoglossus kowalevskii]
NCBI nr blastp    gi|260795687              1e-39    25.55%    hypothetical protein BRAFLDRAFT_65421 [Branchiostoma floridae]
NCBI nr blastx    gi|328705603              2e-47    25.94%    PREDICTED: zinc finger protein 62 homolog [Acyrthosiphon pisum]
Group
Gene Ontology       GO:0003676    1.9e-12    nucleic acid binding
                    GO:0008270    1.5e-06    zinc ion binding
                    GO:0005622    1.5e-06    intracellular
KEGG pathway
InterPro domain     [179-217]    IPR013087    1.9e-12    Zinc finger, C2H2-type/integrase, DNA-binding
                    [192-214]    IPR007087    1.5e-06    Zinc finger, C2H2
Orthology group     MCL31088    Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205199-TA
ATGAAAGATTGCGATACACTCAAAGAACATACAACGGTCGATCACAATAGTGTGGACCTAGAACACTTCATACCCCAAAGAGTGATATCCAAAGATGTACCAGTTAAGATTGATGTAACTGATATGGGTTGCAAGTTTTGCAACTTGCCTCTCTCAAACGTAGAAGATCTGATCACTCATATTATTGCGGTTAGATTGAAGCGACACATCAGCAACAGTCATGTCGGATACCGATGCAGGATATGCGAGAAAGTTTTCGACGCGTTTCACAAGGTCGAGAAACACAAACAAAGGACTCACGGAATCCACAGGAGCGTCGCGTGCGATCTGTGCAGTGCAACTTTTCAAAACAATTACCAGATCAAGGTTCACATGGGGAAAGTGCACAATGTAGAAAAGTACAGAATAAAATGCGAGCATTGTCCCAAAATCTGCACGACCAAAGGCGCTTTAGTCTTGCACGTGCAGTCGATGCACTCAGACCAGAGATTTGAATGCGATCTCTGCGATTACAAGACGGGAATAAAGTGGATGATAAAGTTACACAAGCGTAGACATTACGGGGAAAAGAATTACGTTTGTAGCATTTGTGATAAAAGGTTTGGAAGGTCGAGCAATCTGAGGGCGCATATGAAAGTACATACCGGTAGCACTGGGCGTGTTTGTCGCTGGTGTCGCCACGGCTTCGTGGACCTGGAAGGCCTGGACAAGCACGAAAAAGAAACTCCTGGAGTCTCTCGAGTCTCCTGGTTACTTCAAGATTTTAACAAAAGCGAACCTAGGGGTGGCCAGAAGGCTGTCCGTGATGATAGTGAGATCTTCAGCGATGCGCTCATAACATCAGCAAAACCCTCCCTAACCGCAAAGGATAAAAGGAAAATCATGAGAAACAACGTTCTTCAAGTTTTGTTAAAGTCAACAGTGATGCCGTTCCGATGGCTGAAGAGTTCCTATAGATGCTTCTATTGCTACGACATATTCCAAGAATCGTCTGACTTGAAAAATCACCAGCATATGCACGTAGGAAATGACGTCAAAGAACAAGCAATGAACAATTACTGGGAACCGGTTGTCTATGTGGACATATCTAATTTAGCTTGCAAGCTCTGTCCGGAAAATGTCACCGACCTCTATGATTTAATAGATCACTTGGTCGCTAAGCATGAAGTTAATTACAACAAGGACGTGGGAGTTTGTATGGTCGCGTTTAAGTTAGACAATTTCGCGGTGAATTGCCTAGCTTGTGGGGCTAGCTTTTATACTTTTGGACCTCTATTGCATCACACAAATAAAGATCATAAGGGTATCTCGGCCATTCTGTGTGACGTGTGTGGACAGCGGTTCAAGGACGCAAACTTATTACGACTACACGTCAAAACCGTTCACGAGAACACTGGCCTATTGTGTCCGGAATGCGGCGAGAAATTCGAGACACGTTCAAAATTGAAAACACATCAAAAAAATCAGCACGACATGGACAAAAAGTACAAATGTCTCGTCTGCTCACAAACCTTTCAGAGTCATTATAAACGGTCGAGACATATGGCGACCGAACATAAGAATCGGCAGGAAATAAAGTGCATACACTGTCCAAAGACCTTCGTGTTCCGCAGCATGATGATGACCCATCTGAGAGACACTCACCTGAAAGTCAGGAACCATATATGCGGTGTTTGTGGCTGGAAGGCCTTCAATTCGCATCGGTTAAAGAATCACATGTACAAACACAGCGGGGAGAAGAATTTTAAGTGCGATGCCTGCGACAAGGCGTTCACAACTAAGAAGATTATGAGGGCACACTTCGCTCGGATGCATAAGACCATGGACCAGCCGATCGTGTACGAACAACATCCATATGTTGGCCATTAA

Protein sequence:

>DPOGS205199-PA
MKDCDTLKEHTTVDHNSVDLEHFIPQRVISKDVPVKIDVTDMGCKFCNLPLSNVEDLITHIIAVRLKRHISNSHVGYRCRICEKVFDAFHKVEKHKQRTHGIHRSVACDLCSATFQNNYQIKVHMGKVHNVEKYRIKCEHCPKICTTKGALVLHVQSMHSDQRFECDLCDYKTGIKWMIKLHKRRHYGEKNYVCSICDKRFGRSSNLRAHMKVHTGSTGRVCRWCRHGFVDLEGLDKHEKETPGVSRVSWLLQDFNKSEPRGGQKAVRDDSEIFSDALITSAKPSLTAKDKRKIMRNNVLQVLLKSTVMPFRWLKSSYRCFYCYDIFQESSDLKNHQHMHVGNDVKEQAMNNYWEPVVYVDISNLACKLCPENVTDLYDLIDHLVAKHEVNYNKDVGVCMVAFKLDNFAVNCLACGASFYTFGPLLHHTNKDHKGISAILCDVCGQRFKDANLLRLHVKTVHENTGLLCPECGEKFETRSKLKTHQKNQHDMDKKYKCLVCSQTFQSHYKRSRHMATEHKNRQEIKCIHCPKTFVFRSMMMTHLRDTHLKVRNHICGVCGWKAFNSHRLKNHMYKHSGEKNFKCDACDKAFTTKKIMRAHFARMHKTMDQPIVYEQHPYVGH-
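
The reported lengths can be cross-checked against each other: a 1869 bp transcript is 623 codons, which matches 622 amino acids plus one stop codon. The sketch below shows one way to run that check on the two FASTA records. It assumes the transcript is coding sequence only and that the trailing "-" in the protein record marks the stop codon. The function names `parse_fasta` and `lengths_consistent` are hypothetical helpers written for this illustration, not part of any MonarchBase tool.

```python
def parse_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    # Header without the leading '>', sequence lines joined into one string.
    return lines[0][1:], "".join(lines[1:])


def lengths_consistent(cds, protein):
    """Check that a coding sequence is 3 * (protein length + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein marks the stop codon
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(residues) + 1)


# Lengths reported in the record: 1869 bp transcript, 622 aa protein.
assert 1869 == 3 * (622 + 1)
```

To apply it to this record, pass the text of each FASTA block to `parse_fasta` and then give the two sequences to `lengths_consistent`.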