Monarch geneset OGS2.0

DPOGS215019
TranscriptDPOGS215019-TA1038 bp
ProteinDPOGS215019-PA345 aa
Genomic positionDPSCF300256 + 244231-248191
RNAseq coverage4x (Rank: top 89%)
Annotation
HeliconiusHMEL0101602e-9247.75% 
BombyxBGIBMGA012174-TA2e-6439.70% 
DrosophilaCG15436-PA8e-2427.30% 
EBI UniRef50UniRef50_B4GJE36e-2428.44%GL26275 n=7 Tax=pseudoobscura subgroup RepID=B4GJE3_DROPE
NCBI RefSeqXP_001944018.11e-2632.79%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastpgi|1935917262e-2532.79%PREDICTED: zinc finger protein 271-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1935917266e-3032.79%PREDICTED: zinc finger protein 271-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00056344.4e-12nucleus
GO:00082704.4e-12zinc ion binding
GO:00036761.2e-07nucleic acid binding
KEGG pathway 
InterPro domain[27-95] IPR0129344.4e-12Zinc finger, AD-type
[250-270] IPR0130871.2e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL21053 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215019-TA
ATGGCCTCTCGAGGGAGAATGTTGTGTAGGAATACAAGACCCAAGAAATCCTTCAAGATAGAACTCGGCAGTGGATCCTGCAGGACCTGTGGTAGCACCAGGAGTTTAGGATCTCTATTTAGTGGAGGCGAAAGTAAGGAAAAAATATCTGAACAACTACGGCTAGTAACTGGTTTAGTAATAAGACCTAACGATGGATTATCCCAAAAAATATGTCGCCGGTGCTACAGAGCGGTTAGGTCTGCAATAAGACTGAGACGAGTCAGTAGAAGAACTGAAAAGCTTCGTCTCGGTATGAAGGTGAAAACTGAGAATGTACCAAAACTAGCCATGAATACAGCTGTGAAGACGGAACCTCAATCAGACAACTTACAAGCCAACGACTTTGGACAAATGGGTTTCATTGACAGATATCAGAATTACAACTTCCCCGATCTGTACGATATCTGTAAGAAAGAAGCGGAGGAGACAAAACCTCTCGGCACAACAAGTGATGCGGTCCCATACAAGTGTAAAACATGCGGAAAACCGTTCACCTCCAGGGAGCTTTATGATGCACATCGGAAGACTCACAAGAACGTGTGGGTCTGTGAAGAGTGCGGGAAGTCTTGCCGGAGTAACTTCGAGCTGAAGAACCACAAGCGGGCCAAGCACGGCATGGAGCGGATCCACAAGTGCTCCTACTGCGACTACACACCGGCCACTCAGGAAGCACTCACAATCCACGAGCGGCGACACACAGGGGAGAGGCCGTATGTGTGTGATCACTGCGGCGCGGCTTTCTACAGACGGAAAACTCTGGTTCAGCATCTCGCTCTGCATCTACCAGACAACAAGTACGAGTGCGATCTGTGCCCGAAGCGGTTCAAATCGAAGAACTTCCTGCTCATCCACAAACACGACACTCACACGGGGAAGAGGTACGGCTATCTGTGCTCGGTTTGTGAGCGTCGCTTCCAGAAGCCTTACAAGGTACGAGAGCACGCGAGGAAGGTGCACGGGCTGGCGAACGAGCACCAGGCGCCTGTAGTGGACTAG

Protein sequence:

>DPOGS215019-PA
MASRGRMLCRNTRPKKSFKIELGSGSCRTCGSTRSLGSLFSGGESKEKISEQLRLVTGLVIRPNDGLSQKICRRCYRAVRSAIRLRRVSRRTEKLRLGMKVKTENVPKLAMNTAVKTEPQSDNLQANDFGQMGFIDRYQNYNFPDLYDICKKEAEETKPLGTTSDAVPYKCKTCGKPFTSRELYDAHRKTHKNVWVCEECGKSCRSNFELKNHKRAKHGMERIHKCSYCDYTPATQEALTIHERRHTGERPYVCDHCGAAFYRRKTLVQHLALHLPDNKYECDLCPKRFKSKNFLLIHKHDTHTGKRYGYLCSVCERRFQKPYKVREHARKVHGLANEHQAPVVD-