Monarch geneset OGS2.0

DPOGS212720
TranscriptDPOGS212720-TA1203 bp
ProteinDPOGS212720-PA400 aa
Genomic positionDPSCF300012 - 417638-420285
RNAseq coverage544x (Rank: top 23%)
Annotation
HeliconiusHMEL0083121e-12368.67% 
BombyxBGIBMGA013126-TA7e-10253.46% 
DrosophilaCG6654-PA9e-3844.05% 
EBI UniRef50UniRef50_Q133604e-4349.15%Zinc finger protein 177 n=8 Tax=Eutheria RepID=ZN177_HUMAN
NCBI RefSeqXP_001945749.16e-4537.46%PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastpgi|3287266027e-4437.46%PREDICTED: zinc finger protein Xfin-like [Acyrthosiphon pisum]
NCBI nr blastxgi|2608184517e-5140.47%hypothetical protein BRAFLDRAFT_58764 [Branchiostoma floridae]
Group
Gene OntologyGO:00036768.9e-15nucleic acid binding
GO:00082707.4e-05zinc ion binding
GO:00056227.4e-05intracellular
KEGG pathway 
InterPro domain[30-56] IPR0130878.9e-15Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL30986 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212720-TA
ATGAAGAAGCACAGAGGCATCAGGAACCACGTGTGCAACATATGTAGCAAGGCCTTCTACGAGGTGTCCAAACTGAACGCCCACATGAGAGTTCATACCGGCGAGCGTCCGTTCGAGTGTCAGTTCTGTTCGCGTAAGTTCGCCCAGCAGTCAGCTCTGATCTATCACAAACGGACCCACACAGGGGAGAAGCCTTACGCGTGTAAGGCGTGCCCCGCCAGGTTCACCACCTCCTCCTCGAGGAACAACCACATGATGACGCACACCGGCAATAAGAAGTTCGTGTGTCCAGTGTGTTTCAAAGGCTGCACGTCTCGGTCGGAGCTGCGAGTGCATTCGAGCAAGCACACGGGGGAGAAACTGTTCGCGTGTGAGATCTGCTCTCAGAGGTTCAGCTCGCCCTCCTACCTCGCTGTGCACAGACGGACGCACACGGGAGAGAAGAGATATCAGTGCACTGAATGTGGAAAAGGTTTCGTGGAGTGCACCTCATACAAGAAACATATGAAGACTCACGCGAAGGACAAACCGCAGAATGAAGAGACGAAGACGCAAAACAACCAGGAGGAAGCAGCCAAGCTGCAGGAAACGAGCTTGGAGATAGCTGAAGAAGAGAAGAAAGAAGACAAAGATGTTACAGCGGAAGCGGAAATAAACCAGGACGGACAAATAGTTATACAAGATGCCGGTACCCAGAAGAGATTCAAGTGTGGGCTGTGCGTTAAGAGCTACACGTATTTGCACAGTCTTAAGAAGCACATGCTGAGTCATGTGCAGATGTGCGTGTTCAGTCCGGACGGGTCGGAGAGGATGTACGCCTTCAAGCAGCAACAGCAGCAGGAGCAGGAGATGCAGCAGCAGCAGGTGCAGCAGGTGGTGGTGGGCGGGGTGGGGGGAGTGCAGCAGCTAGCGGTGCCCGTCCACCAGCACGTGCAGCCTCTCGTGCCGCAGCAACCACAACTGCAAGTGCAAACGATTCAGATACATCCTCAACATCAACCCATACAAATACATGCTGTTGGTGTTCAACAAGAACAACTTCACGTGGCCACTAGCAGTTGTCAGACAATGGTGCCCAATATCCTACAGTTACAGCCGGGTGCTGTCGTTTCCGGTTTGGGGCAGGAACTGGGTGGTGTCCATCGCATCATATTACAGCCACCTCACGCTCACCCCCACGCCCTGTACACTATACACCACTAG

Protein sequence:

>DPOGS212720-PA
MKKHRGIRNHVCNICSKAFYEVSKLNAHMRVHTGERPFECQFCSRKFAQQSALIYHKRTHTGEKPYACKACPARFTTSSSRNNHMMTHTGNKKFVCPVCFKGCTSRSELRVHSSKHTGEKLFACEICSQRFSSPSYLAVHRRTHTGEKRYQCTECGKGFVECTSYKKHMKTHAKDKPQNEETKTQNNQEEAAKLQETSLEIAEEEKKEDKDVTAEAEINQDGQIVIQDAGTQKRFKCGLCVKSYTYLHSLKKHMLSHVQMCVFSPDGSERMYAFKQQQQQEQEMQQQQVQQVVVGGVGGVQQLAVPVHQHVQPLVPQQPQLQVQTIQIHPQHQPIQIHAVGVQQEQLHVATSSCQTMVPNILQLQPGAVVSGLGQELGGVHRIILQPPHAHPHALYTIHH-