Monarch geneset OGS2.0

DPOGS213643
Transcript: DPOGS213643-TA | 1506 bp
Protein: DPOGS213643-PA | 501 aa
Genomic position: DPSCF300165 - 58482-60823
RNAseq coverage: 33x (Rank: top 75%)
Annotation
Heliconius: HMEL004591 | 0.0 | 76.83%
Bombyx: BGIBMGA004584-TA | 0.0 | 66.60%
Drosophila: lmd-PB | 4e-63 | 58.91%
EBI UniRef50: UniRef50_E0VTF9 | 2e-62 | 68.94% | Zinc finger protein transcription factor lame duck, putative n=1 Tax=Pediculus humanus corporis RepID=E0VTF9_PEDHC
NCBI RefSeq: XP_002429403.1 | 3e-63 | 68.94% | zinc finger protein transcription factor lame duck, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242017859 | 6e-62 | 68.94% | zinc finger protein transcription factor lame duck, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|195443848 | 9e-65 | 43.11% | GK11610 [Drosophila willistoni]
Group
Gene Ontology: GO:0003676 | 1.9e-14 | nucleic acid binding
KEGG pathway:
InterPro domain: [391-420] IPR013087 | 1.9e-14 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL25036 Lepidoptera specific

Nucleotide sequence:

>DPOGS213643-TA
ATGTTCCAGGACCGAGGAGACGTCACCGACCTTCACGCTTACTACATGACGCAGTTTAACCTGGAACTAGAAAACGGGAAAGGAAAGCCGAAAAGTGACGGCTTCAAGTTTGGAATCGAAGAGAGCTTCGACTTCGAAGTTGTGGACAACTGGAACGGCTGGAACTGCAACACCAAAACAGACTTCGACATCATAGACGAAGTCATCGACAACAACGTCAGCGAGCAAAATATCAAAGAGGTACTTCTAGACCTGGACACCATTGAGTTTGATGATAACAGCTTCAAGTCAGCTTCTTGCGAAGGAAGTAAGAGAAGTGAAGCAAATTATGAAACGAACGACTACATAGACGACGAGTGTCTCATCGACGAGCTGTGCAGAGAGGAAGGAGAGTCGTGTCGTCTGACGCCAGACGTGTTTAGCGATGACTGCACCAACATCTTGAACAACGACACGCAGCTGCCATCCATAGAGACCGCCTTCTCTAAAAGATACGGAGCTTTTAACAACTTGGACAGTTATAACACACAAAATGCTTACGAAGCGAACCCATCTCAGAACACAAACACACCGACTGTCAGCAGTTTGGAACATTACAGTTATCCTAATAATATTCTTCACAACTTGGATAATCCAAAGAACTACGAGCTCCCGGACACACCGACAAGCTGCCAGGATTTTAATTTTGACAGAAATATTAGAAAAGTATCGATATCAGATTCTATCGAGAGTGACGTCCAGAGCGCCGGTTATTATGACGACAACTCAGAAAACTTAGACGAAGACGACCTGTTCATAAACCTCGATGACTTCGGAATCGCCTTTGAAACGGAGAATGAAGGAAACAACTGCGAAAAGAGCCATCACGCCGAGAAAAGAAATGACAAAGACAAAACTCAAGGCGAAAGGGTTTGTTTATGGGAGCACTGTTTTGAAAGATATCCAAATCAAAACACACTCGTGGAGCACATAGAGCGCGCACACGTCAACACTTACAAAGGTGACGAGTTCAGTTGTTTGTGGCGGGACTGTGCGCGCGGTCGCCGTCCGTTCAACGCGCGTTACAAGCTACTCATACATATGAGAGTACACTCGGGACACAAACCCAATAGATGTCATCATCCCGGCTGCGGCAAAGCGTTCTCTCGCCTGGAAAACCTTAAGATCCACGTGAGATCCCACACGGGCGAGCGACCCTACGCCTGTCCCGCTCCTCACTGCAGGAAGGCCTTCTCTAATTCCTCAGACCGAGCCAAGCATCAGCGAACTCACTTTAATGCCAGACCGTACGCGTGCGGCGCTGCAGGCTGTAACAAGCGCTACACGGACCCCTCCTCGCTGCGGAAGCACGTCAAGTCCCACCCGCATGCTCCCCCTCGCGCGCGCCTCCCCCCGGCCAGACCTCCTCCTCACGAGGAACAGCTTGTGCCGAGTTCACCCGCCAAACTCGACACACTCAGATGTATCAGAGACAAACTCACCGTTCCCCGACTACGGAACATGTAG

Protein sequence:

>DPOGS213643-PA
MFQDRGDVTDLHAYYMTQFNLELENGKGKPKSDGFKFGIEESFDFEVVDNWNGWNCNTKTDFDIIDEVIDNNVSEQNIKEVLLDLDTIEFDDNSFKSASCEGSKRSEANYETNDYIDDECLIDELCREEGESCRLTPDVFSDDCTNILNNDTQLPSIETAFSKRYGAFNNLDSYNTQNAYEANPSQNTNTPTVSSLEHYSYPNNILHNLDNPKNYELPDTPTSCQDFNFDRNIRKVSISDSIESDVQSAGYYDDNSENLDEDDLFINLDDFGIAFETENEGNNCEKSHHAEKRNDKDKTQGERVCLWEHCFERYPNQNTLVEHIERAHVNTYKGDEFSCLWRDCARGRRPFNARYKLLIHMRVHSGHKPNRCHHPGCGKAFSRLENLKIHVRSHTGERPYACPAPHCRKAFSNSSDRAKHQRTHFNARPYACGAAGCNKRYTDPSSLRKHVKSHPHAPPRARLPPARPPPHEEQLVPSSPAKLDTLRCIRDKLTVPRLRNM-
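
The transcript and protein lengths in this record are consistent with each other: 501 amino acids plus one stop codon make 502 codons, or 1506 bp. The following is a minimal sketch of how the two sequences relate, assuming the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The helper name `translate` and its `stop` parameter are hypothetical and not part of any MonarchBase tooling; only the first ten and last four codons of the transcript are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop="-"):
    """Translate a coding sequence in frame 1; stop codons become `stop`.

    Hypothetical helper for illustration. The default stop symbol "-" follows
    the convention seen in this record's protein sequence.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop) for c in codons)


# First 30 nt and last 12 nt of DPOGS213643-TA, copied from the record above.
head = "ATGTTCCAGGACCGAGGAGACGTCACCGAC"
tail = "CGGAACATGTAG"

print(translate(head))  # start of DPOGS213643-PA
print(translate(tail))  # end of DPOGS213643-PA, including the stop marker

# Length check: 501 residues plus one stop codon, three bases per codon.
print((501 + 1) * 3)
```

Passing the full nucleotide sequence to `translate` in place of `head` would, under the same assumptions, be expected to reproduce the full protein sequence listed above.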