Monarch geneset OGS2.0

DPOGS206205
TranscriptDPOGS206205-TA1809 bp
ProteinDPOGS206205-PA602 aa
Genomic positionDPSCF300405 - 81844-87315
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0225411e-6350.21% 
BombyxBGIBMGA011906-TA6e-7846.34% 
DrosophilaMeics-PA1e-2123.78% 
EBI UniRef50UniRef50_Q9UJU32e-2727.11%Zinc finger protein 112 homolog n=44 Tax=Eutheria RepID=ZF112_HUMAN
NCBI RefSeqXP_784091.11e-2627.66%PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastpgi|3266806679e-3026.89%PREDICTED: zinc finger protein 729-like [Danio rerio]
NCBI nr blastxgi|3266806672e-3526.86%PREDICTED: zinc finger protein 729-like [Danio rerio]
Group
Gene OntologyGO:00036766.8e-07nucleic acid binding
KEGG pathway 
InterPro domain[364-396] IPR0130876.8e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25900 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206205-TA
ATGAATTTAAGTGAATTAAATATATGCGGAGGATGTCTCTCCGTTGACCGCGACTTGGTAAACTTCAGCGAAGATATGTTCATGTTGTTCTGTTATTTATTGGGAACGGATTACAGCGAAGTTAAGCTGTGCTGGGAATGTGCGGCTTATATGAAGAGGTTCCTGAGATTTAAACAACAGCTAAAACGAGCGCATCACGTGCTGTCTCAATTAGTTTTGGACGACATAAAACCTCTTTCGAGCTTAAAAATAACGAAATGTGATAACGTAAGGGAATATTCACACAATTATGAAATAGCGTTCGTCAGAATGGAAGATCATAACTCTGACGGTGTCAACGAATATGACATTGATTTACAGCTGCCCAAACCGGAACCAATTGAATCTGTGATAACGGCTGACGAAGAGTTCGACGATTCAGACGATGAACCGCTGATGATGAAACAAAAGGCGAGCAAGAAAAAGAAAGGGCAAGTCAAAAGGAAAAAAGAGAAAGTAGTGATTGTGAACAATGCCAGGGTCGATAAGAAATTGAAACAGTTGAACATCGGAGACGAGCATATAGAGATGGTTGTACTTACATGGGATGAGGTCCGTCGCCAACGCGCCCGAGCCCTGTCCAGCGAGTCGTTCACGAGACACGAGTACAAGTGTCACGACTGTGTGCTCGGGTTCAACCACCGGTGTAAGCTGGAGCAGCACATGGTGAAACATTACGAGGCGTCCGGGTCGTGTGTGTGCAGTCTGTGCAGGGTGAGGTGCAAAGACGAACGAGCCCTCGCCGCGCACGAGAGGCGACACAGAGTACGATGGCGCTGTCGCTGGTGCGGCGGCACCTGGTCCCGCGCCGCAGTGTGCGCGGACCACGCGGCCAGGGAACACTGCGCGCCCACACCCACACACACGTGCGGGAGATGCGGACACACGGAGACATCGCTCGGCAGGTTGCGCAACCATATCAAGAACCACTCAGAGAGACAGAAGTGTGATCTGTGCGGCAAGACCTTCAGGGACCGGACCTCTCTCAAGACGCATCTCTTGTTAGTATACGATATGCGGGCACAGCGCCGCACTGAGTGTGTGTACACGAGGCGCATCCATAAAGGCGAGAAGGAATACTCCTGTCCGCGCTGCGACAAGAAGTTCCTGTTCAAGAAGGCCATGGAGATCCACCTGGTCACGCACGAGGCGCCCGCGCACCTCTACTGCTACCAGTGCGACATGAACTTCAAGAACAGGATGTCGTACAACCAGCACCTGAAGTACAGCCTGAAGCACGTGGACCCGGCCGACATCAAGTTATTATCGTTCTTTACGTACGGAGCCGCACCGGTGGCAGTTCCGCCGGTGCGGCGACAGATGGCGCTAGTGCTGCAAACTAGTAACTCCCCACACGCGTGCAAGCTCTGCGACAAGAGGTTCGTGAAGGCCGCCCGCCTCCAGGAGCACAACCTGGCCGTGCACTTGAAGATGACGCCCGTCAAGTGCACGGTGGCCGGATGTGGCTTCGCGTGCTCGTCTAAGCCGGTTCTCCGTAGTCACATCCGTTCCTGTCACCGTCTCGCTGGACGTGTCAGGAATCACGTGTGTCACGTCTGTGGGAACGCTTACACGAGCAACAAGTCCCTGGAGGGTCACCTGCGCAGTCACGCCGGCTCCCGTCCGCTGCACTGCGCTCGGTGCCCGGCGACGTTCGCTTACGACGCGGCGCTCTACAACCACACCAAGCTCGTGCACAGAGACATACAGCCCACCCCAGCCGCTGACCCAGCACGCCCAGCCTCCCAGCAGCCGGAGACGCAGCGATAA

Protein sequence:

>DPOGS206205-PA
MNLSELNICGGCLSVDRDLVNFSEDMFMLFCYLLGTDYSEVKLCWECAAYMKRFLRFKQQLKRAHHVLSQLVLDDIKPLSSLKITKCDNVREYSHNYEIAFVRMEDHNSDGVNEYDIDLQLPKPEPIESVITADEEFDDSDDEPLMMKQKASKKKKGQVKRKKEKVVIVNNARVDKKLKQLNIGDEHIEMVVLTWDEVRRQRARALSSESFTRHEYKCHDCVLGFNHRCKLEQHMVKHYEASGSCVCSLCRVRCKDERALAAHERRHRVRWRCRWCGGTWSRAAVCADHAAREHCAPTPTHTCGRCGHTETSLGRLRNHIKNHSERQKCDLCGKTFRDRTSLKTHLLLVYDMRAQRRTECVYTRRIHKGEKEYSCPRCDKKFLFKKAMEIHLVTHEAPAHLYCYQCDMNFKNRMSYNQHLKYSLKHVDPADIKLLSFFTYGAAPVAVPPVRRQMALVLQTSNSPHACKLCDKRFVKAARLQEHNLAVHLKMTPVKCTVAGCGFACSSKPVLRSHIRSCHRLAGRVRNHVCHVCGNAYTSNKSLEGHLRSHAGSRPLHCARCPATFAYDAALYNHTKLVHRDIQPTPAADPARPASQQPETQR-