Monarch geneset OGS2.0

DPOGS215087
Transcript: DPOGS215087-TA; 1719 bp
Protein: DPOGS215087-PA; 572 aa
Genomic position: DPSCF300187 + 203118-211608
RNAseq coverage: 102x (Rank: top 61%)
Annotation
Heliconius: HMEL021439; 7e-65; 41.50%
Bombyx: BGIBMGA007188-TA; 2e-46; 36.33%
Drosophila: CG6654-PA; 2e-23; 27.47%
EBI UniRef50: UniRef50_Q9H0M5; 2e-31; 32.27%; Zinc finger protein 700 n=52 Tax=Eutheria RepID=ZN700_HUMAN
NCBI RefSeq: XP_001603104.1; 4e-32; 27.64%; PREDICTED: similar to zinc finger protein [Nasonia vitripennis]
NCBI nr blastp: gi|332253374; 9e-32; 32.59%; PREDICTED: zinc finger protein 700 [Nomascus leucogenys]
NCBI nr blastx: gi|332253374; 1e-37; 32.59%; PREDICTED: zinc finger protein 700 [Nomascus leucogenys]
Group
Gene Ontology: GO:0003676; 1.4e-10; nucleic acid binding
KEGG pathway:
InterPro domain: [387-407] IPR013087; 1.4e-10; Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL19959; Lepidoptera specific
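
The genomic position field packs scaffold, strand, and coordinates into one string. The following is a minimal sketch of how it might be split apart and how the genomic span could be computed. The names `GenomicPosition` and `parse_position` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_position(text: str) -> GenomicPosition:
    """Parse a string of the form '<scaffold> <strand> <start>-<end>'."""
    m = re.fullmatch(r"\s*(\S+)\s+([+-])\s+(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("DPSCF300187 + 203118-211608")
print(pos.scaffold, pos.strand, pos.span)  # DPSCF300187 + 8491
```

Under that coordinate assumption the locus spans 8491 bp, against a 1719 bp transcript; the difference would be consistent with intronic sequence between exons.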
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215087-TA
ATGGAAGAACTTAAAGTGTGCAGAATATGCCTTGTGATGGATGTTAAAATGCACGATCTTTGTTCGCACCCCCTTGATGTTTATTATGAAAATGTTATTGGTGGCAACGTATTGAAGATGGTACAACTAAGGGGATATGCATGTTATATGTGTGCACCGATGCTGAGAAAATTTCATTTCTTCAGAGAGAAATGTCTAAAAGGACAAACAGAACTCTATGGTTTACTGAACTCCTTCGGAAAGATTTCCAGAGACGGCATTATACATTTACATAAACAAAATCAAAGTTCTGCCTATTTAGAAAATATAAAATTATACAACATAGTCATCGACACTAACGAAGAGGATGGCGGAATGAAAATAGAACCGACGGAAATATATCAAAGTAGAATATATAATGTGGAGAGGGGTACGAGGGAAAATGACAATGAGAGGTTTAAGCAAGAAGCAAACACTAATACAGTTCTGCATGAATTATGTGGTATTATGACTAAGAAGAAGAGGGGTAGGCCTAAGAAGGGGGAGGTTAGGAAGAAGATCATTAAGAGAAGTAAACATAGTCAACACATAGCGCTGTTGGACAACGATGGGCTCGATGTAGAAGACTATTGCAATATAATAACACTCACCGAGGAGGAACAGAGGGAGGAGGTGATCCAGCGCCAGCAGTCTCTCAACTATATGAACTCCGTGTACAAATGTAATCTGTGCTACAAAGGATTCGTCGACACTCACGCCTGGAAACACCACGTCGCTAAACACGAACCGAGCGCCGGCGATCTGGAGTGTAGCATATGCAAGATGAGATTCAAAACTTTGAGGATTTTGAAGAAACACGCCGGGAACCACGGCAAGAGGTTCTCGTGCAAATCGTGTTCCTACGTGTCGAAAAACACTATATTATCCGCCCGTATCACTGCATCTAGATCTGTAGCGGTTAGCGCGCCGGAATCGTTATATAGAGGGGGTGACGAGTTTGCTCTGCTTTGTGCGGAGGTGATCCGCAGCGCCCGCGAGTACTGGTCACACTTCAGGCGAGTGCATCCCGACAAAACGTATCCCACGCAGAAGGACTTTGTCTGTGACGTGTGCGGGAAGAGCTTCCGGAGCAACGCCTTCCTGAACTATCACAAGCGGACTCACTCGTCGGAGCGCTCGTACAAGTGCAGCCAGTGTCCGAAGGCGTTCCACAACAGGAACAATCTGCAGATGCACCAACGCACACACTCCGACGAGCGACCCTACCCTTGCGCGTTGTGCGACAAAGCCTTCAAATGCAAGAGCGCTCTGGACAGGCACTTCCGGTGGACGTCGTATCTGAGCCACGTCCGCATGAAGCATCCCTCTGAACACATCTGCGGAGCGTGCGGGTATTCCTTCGTGTCGCGGCTGGGACTTAACATGCACAGGACTATGATGCACAAGGATCTGCTGGAGCAGGATGGTGTTGGTGAGGATAACAAAGACTCGCCGTACTGCGGACAGTGTGACGTGAAGTTCATTACCCTGGAGGCGTACAAGAGGCATATGAGTCACAGCGGCGAGAAGCCGTACGTGTGCCAGGTGTGCGGGAAGGCGTTCGCTCAGTCCAACAGTCGCAAGCTGCACGTGAGGACGGTGCACCTGAAGCAGTCGGCGCCGTACGTGTCGCGGGCGAGGCTCGAGAGGAGGACCCGGGCGACCAAGGAGCACGCACCGGCCTCGCATTTTCTCTACTGA
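
Because the transcript is a coding sequence, a few basic consistency checks apply to it: the length should be a multiple of three, it should begin with ATG, and it should end with a stop codon. The sketch below illustrates those checks. The function names `check_cds` and `expected_protein_length` are hypothetical, and the stop codons are those of the standard genetic code, which is assumed here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)


def check_cds(seq: str) -> dict:
    """Run basic sanity checks on a coding sequence given as a DNA string."""
    # Remove whitespace and newlines left over from FASTA line wrapping.
    seq = "".join(seq.split()).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    return cds_length // 3 - 1


# Toy CDS for illustration only (not the monarch sequence): ATG GAA GAA TGA
print(check_cds("ATGGAAGAATGA"))

# Lengths reported in this record: 1719 bp transcript, 572 aa protein.
print(expected_protein_length(1719))  # 572
```

The reported lengths agree with each other: 1719 bp is 573 codons, that is, 572 amino acids plus one stop codon.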

Protein sequence:

>DPOGS215087-PA
MEELKVCRICLVMDVKMHDLCSHPLDVYYENVIGGNVLKMVQLRGYACYMCAPMLRKFHFFREKCLKGQTELYGLLNSFGKISRDGIIHLHKQNQSSAYLENIKLYNIVIDTNEEDGGMKIEPTEIYQSRIYNVERGTRENDNERFKQEANTNTVLHELCGIMTKKKRGRPKKGEVRKKIIKRSKHSQHIALLDNDGLDVEDYCNIITLTEEEQREEVIQRQQSLNYMNSVYKCNLCYKGFVDTHAWKHHVAKHEPSAGDLECSICKMRFKTLRILKKHAGNHGKRFSCKSCSYVSKNTILSARITASRSVAVSAPESLYRGGDEFALLCAEVIRSAREYWSHFRRVHPDKTYPTQKDFVCDVCGKSFRSNAFLNYHKRTHSSERSYKCSQCPKAFHNRNNLQMHQRTHSDERPYPCALCDKAFKCKSALDRHFRWTSYLSHVRMKHPSEHICGACGYSFVSRLGLNMHRTMMHKDLLEQDGVGEDNKDSPYCGQCDVKFITLEAYKRHMSHSGEKPYVCQVCGKAFAQSNSRKLHVRTVHLKQSAPYVSRARLERRTRATKEHAPASHFLY-
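
The record annotates a C2H2-type zinc finger, whose classical signature is two cysteines and two histidines at characteristic spacing. As a rough illustration, the sketch below scans a protein string for the simplified pattern C-x(2,4)-C-x(12)-H-x(3,5)-H. This regular expression is an assumption made for illustration and is not the method behind the InterPro hit listed above; the function name `find_c2h2` is hypothetical, and a regex scan can both miss divergent fingers and report false positives.

```python
import re

# Simplified C2H2 spacing pattern: C-x(2,4)-C-x(12)-H-x(3,5)-H
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(protein: str) -> list:
    """Return (start, end, segment) tuples for non-overlapping pattern matches.

    Positions are 1-based and inclusive. A trailing '-' or '*' (stop marker)
    is stripped before scanning.
    """
    protein = protein.strip().rstrip("-*").upper()
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein)
    ]


# A short fragment copied from the protein sequence above.
fragment = "YKCNLCYKGFVDTHAWKHHVAKHE"
for start, end, segment in find_c2h2(fragment):
    print(start, end, segment)
# 3 23 CNLCYKGFVDTHAWKHHVAKH
```

The shortest possible match of this pattern is 21 residues, the same length as the annotated InterPro interval [387-407]. Passing the full DPOGS215087-PA sequence to `find_c2h2` would list all candidate fingers with their coordinates.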