Monarch geneset OGS2.0

DPOGS211825
TranscriptDPOGS211825-TA1548 bp
ProteinDPOGS211825-PA515 aa
Genomic positionDPSCF300031 + 276247-278899
RNAseq coverage82x (Rank: top 64%)
Annotation
HeliconiusHMEL0042820.070.94% 
BombyxBGIBMGA012183-TA1e-3628.35% 
DrosophilaCG6654-PA4e-2829.07% 
EBI UniRef50UniRef50_UPI00020F6A932e-3333.91%UPI00020F6A93 related cluster n=1 Tax=unknown RepID=UPI00020F6A93
NCBI RefSeqXP_312223.45e-3128.35%AGAP002705-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3387269211e-3432.01%PREDICTED: zinc finger protein 709-like [Equus caballus]
NCBI nr blastxgi|1942130923e-4534.90%PREDICTED: zinc finger protein 709 [Equus caballus]
Group
Gene OntologyGO:00036766e-10nucleic acid binding
GO:00082703.6e-05zinc ion binding
GO:00056223.6e-05intracellular
KEGG pathway 
InterPro domain[376-397] IPR0130876e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34910 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211825-TA
ATGTATTTCAAATACACTGATTATCTTTTTTATGTACTGCTTATAATGATACAAACAGTTCAAAGAGTTTCAGGAAAAGCGTTCATCGAAAATGAACCAGTGACCACTCCAGAACTGCAAATATTTCCAATGGGCTTCGATGAAGTTTTTAATGATGCTACTAATGTGCTTGGTAATGTTGTAAGGAAAATCATATCAAAGAAACGTTTACTTGGACTAGATTCTAATTGGTACTGGTTATGTTTTAAAAACCTTACGGAACAATACGTTCGTTTTGATGATGCGGTTTCATTGCACCCAGAAAGTGGTGTATTTCAACCTTTATCCGAAATATTACTTAAACTATTGGGCGACAATATATGTGATGAGATTAAGGGTGTTGAAGCTGTATGTACAGATTGTGTGGAAAATGCATTGTTGTCAGCCCGCTTTGTAGAGAAGTGCCAGCATTCAACAAAAGCTTTAAATGAAGTGTTCAATAATATTAGCAATACTTTAGATGTAGATATTGATAATAAAGATAGCAATAAAACATTGTATGTTGTAATAGAAGATCTAGAATCTAAACTGTTAGTAGTAAAGAAGACAGATGAAAGAAACAGTCTTCAGGGGACATTTGAATGTGAAGTGTGCACAGATAGTTTTGATACATTTACAGATTTAAAAGTCCATAATTTGACCAATCATGGTACTTTGACATGTGACAAATGCTATGATACATTTGATAGTAATACTGAATTTTCCCTTCATGAGAGTCAGCATCATGTTTATAAATGTCCTGAATGTCCACAATACAGAAACACAGAGGAGAGTTTAGAAGACCACCAAAACAGACTTCACAATGTTTTTGTATGTAAGGAATGTGGAAAACGTTGTCGTGGCCTTTATAAGCTCCAAGTACATGAAGAGAAGCATAAGACAAAAAATTCATGCCCTAAGTGTGGAAAGTCTTATACAACAAAGGAGTTTTTTGATAGGCATGTCAATCTGTGCATCAACAACCTCATAGATCCTCATCCGATAAGAAGCAGCATGGTTAAATCATACTCCTGTGAGAAATGTGATAAGGCTTACAGTACGGCTGGAGGGCTTAGAGTGCATAATAGATTTGCCCATGGAAATGCTAAGCCTCATGAATGCAAGGAATGCGGGAAACAGTTCACTGCTCCCAGTTATTTGAAAGTTCATATGATAAAACATACAGGGGAGAAGAACTTCAAATGTGATATTTGTCATAGTAAATTTGTATCAAAAGAGGCATTGTTGTATCACACTCGACGACACACCGGCGAAAAACCATACAGTTGCAAATACTGCAATGAAAGATTCGTCAATGCCTCAACCAGGGCCGAGCATATCAAATTTAAACATGTGGGACCTACATTAATGTGTGAAATATGTTCTAGAAAATTTGTTACAAGTCACTTCTTAAAGCAGCATATAAATCGTCATCACGATCCTACAAGTAAGCTCTACTATGGCAGGAATATGATTCCACCTAACTTGCCGCTCCAACAGAACATGAAGAAGGTTGTTATACACAACTGA

Protein sequence:

>DPOGS211825-PA
MYFKYTDYLFYVLLIMIQTVQRVSGKAFIENEPVTTPELQIFPMGFDEVFNDATNVLGNVVRKIISKKRLLGLDSNWYWLCFKNLTEQYVRFDDAVSLHPESGVFQPLSEILLKLLGDNICDEIKGVEAVCTDCVENALLSARFVEKCQHSTKALNEVFNNISNTLDVDIDNKDSNKTLYVVIEDLESKLLVVKKTDERNSLQGTFECEVCTDSFDTFTDLKVHNLTNHGTLTCDKCYDTFDSNTEFSLHESQHHVYKCPECPQYRNTEESLEDHQNRLHNVFVCKECGKRCRGLYKLQVHEEKHKTKNSCPKCGKSYTTKEFFDRHVNLCINNLIDPHPIRSSMVKSYSCEKCDKAYSTAGGLRVHNRFAHGNAKPHECKECGKQFTAPSYLKVHMIKHTGEKNFKCDICHSKFVSKEALLYHTRRHTGEKPYSCKYCNERFVNASTRAEHIKFKHVGPTLMCEICSRKFVTSHFLKQHINRHHDPTSKLYYGRNMIPPNLPLQQNMKKVVIHN-