Monarch geneset OGS2.0

DPOGS214983
Transcript: DPOGS214983-TA | 1047 bp
Protein: DPOGS214983-PA | 348 aa
Genomic position: DPSCF300256 - 260262-262928
RNAseq coverage: 2x (Rank: top 92%)
Annotation
Heliconius: HMEL010160 | 4e-94 | 47.78%
Bombyx: BGIBMGA012174-TA | 2e-66 | 40.41%
Drosophila: CG15436-PA | 5e-24 | 27.59%
EBI UniRef50: UniRef50_D6WJ55 | 2e-24 | 37.78% | Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WJ55_TRICA
NCBI RefSeq: XP_001944018.1 | 4e-26 | 32.64% | PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastp: gi|193591726 | 7e-25 | 32.64% | PREDICTED: zinc finger protein 271-like [Acyrthosiphon pisum]
NCBI nr blastx: gi|193591726 | 2e-29 | 32.64% | PREDICTED: zinc finger protein 271-like [Acyrthosiphon pisum]
Group
Gene Ontology:
  GO:0005634 | 4.5e-12 | nucleus
  GO:0008270 | 4.5e-12 | zinc ion binding
  GO:0003676 | 2.2e-11 | nucleic acid binding
KEGG pathway
InterPro domain:
  [27-95] | IPR012934 | 4.5e-12 | Zinc finger, AD-type
  [243-270] | IPR013087 | 2.2e-11 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL21053 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214983-TA
ATGGCCTCTCGAGGGAGAATGTTTTGTAGGAATACAAGACCCAAGAAATCCTTCAAGATAGAACTCAGCAGTGGATCCTGTAGGACCTGTGGTAGCACCAGGAGTTTAGGATCTCTATTTAGTGGGGGCGAAAGTAAGGAAAAAATATCTGAACAACTACGGCTAGTAACTGGTTTAGTAATAAGACCTAACGATGGATTATCCCAAAAAATATGTCGCCGGTGCTACAGAGCGGTTAGGTCTGCCATAAGACTGAGACGAGTCAGTAGAAGAACTGAAAAGCTTCGTCTCGGTATGAGGGTGAAGACTGAGAATGTACCAAAACTAGCGATGAATACAGCTGTGAAGACGGAACCTCAATCAGACAACTTACAAGCCAACGACTTTGGACAAATGGGCTTCATTGACAGATATCAGAATTACAACTTCCCCGATCTGTACAATATCTGCAAGAAAGAAGCGGAGGAGACAAAACCTCTCGGCACAACAAGTGACGCGGTCCCATACAAGTGTAAAACATGCGGAAAACCGTTCACCTCCAGGGAGTTTTATGATGCACATCGGAAGACTCACAAGAACGTGTGGGTCTGTGAAGAGTGCGGGAAGTCTTGCCGGAGTAACTTCGAGCTGAAGAACCACAAGCGGGCCAAGCACGGCATGGAGCGGATCCACAAGTGCTCCTACTGCGACTACACACCGGCCACGCAGGAAGCACTCACAATCCACGAGCGGCGCCACACAGGAGAGAGGCCGTATGTGTGTGATCACTGCGGCGCGGCTTTCTACAGACGCAACACGCTGGTTCAGCATCTCGCTCTGCATCTACCAGACAACAAGTACCAGTGCGATCTGTGCCCGAAGCGGTTCAAATCGAAGAACTTCCTGCTCATCCACAAACACGACACTCACACGGGGAAGAGGTACGGCTATCTGTGCTCGGTTTGTGAGCGTCGCTTCCAGAAGCCTTACAAGGTACGAGAGCACGCGAGGAAGGTGCACGGGCTGGCGAACGAGCACCAGGCGCCTGTAGTGCGGGTCGAGCTCTAG

Protein sequence:

>DPOGS214983-PA
MASRGRMFCRNTRPKKSFKIELSSGSCRTCGSTRSLGSLFSGGESKEKISEQLRLVTGLVIRPNDGLSQKICRRCYRAVRSAIRLRRVSRRTEKLRLGMRVKTENVPKLAMNTAVKTEPQSDNLQANDFGQMGFIDRYQNYNFPDLYNICKKEAEETKPLGTTSDAVPYKCKTCGKPFTSREFYDAHRKTHKNVWVCEECGKSCRSNFELKNHKRAKHGMERIHKCSYCDYTPATQEALTIHERRHTGERPYVCDHCGAAFYRRNTLVQHLALHLPDNKYQCDLCPKRFKSKNFLLIHKHDTHTGKRYGYLCSVCERRFQKPYKVREHARKVHGLANEHQAPVVRVEL-
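
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is entirely coding sequence (it starts with ATG and ends with the stop codon TAG) and that the trailing "-" in the protein sequence marks the stop codon rather than a residue. The function name `protein_length_from_cds` is hypothetical and is not part of any database tool.

```python
def protein_length_from_cds(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    cds_bp: length of the coding sequence in nucleotides.
    has_stop_codon: whether the final codon is a stop codon
                    (assumed True for this record, which ends in TAG).
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # A terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: 1047 bp transcript, 348 aa protein.
transcript_bp = 1047
protein_aa = 348
lengths_agree = protein_length_from_cds(transcript_bp) == protein_aa
print(lengths_agree)
```

Here 1047 bp corresponds to 349 codons, of which the last is the stop codon, leaving 348 amino acids.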