Monarch geneset OGS2.0

DPOGS215017
TranscriptDPOGS215017-TA1317 bp
ProteinDPOGS215017-PA438 aa
Genomic positionDPSCF300256 + 231915-235356
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0101602e-9845.89% 
BombyxBGIBMGA012174-TA5e-4680.81% 
DrosophilaCG7372-PA2e-2128.52% 
EBI UniRef50UniRef50_F4X0Y73e-2333.07%Zinc finger protein 235 n=5 Tax=Endopterygota RepID=F4X0Y7_ACREC
NCBI RefSeqXP_001603104.11e-2231.73%PREDICTED: similar to zinc finger protein [Nasonia vitripennis]
NCBI nr blastpgi|3320194031e-2233.07%Zinc finger protein 235 [Acromyrmex echinatior]
NCBI nr blastxgi|3517087902e-2828.30%Zinc finger protein 624 [Heterocephalus glaber]
Group
Gene OntologyGO:00056341.8e-09nucleus
GO:00082701.8e-09zinc ion binding
GO:00036761.5e-08nucleic acid binding
KEGG pathway 
InterPro domain[36-105] IPR0129341.8e-09Zinc finger, AD-type
[345-364] IPR0130871.5e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL21053 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215017-TA
ATGTTGCCATTATTCCTTTTAATTCATAGTATGGCGTCTCGGGGAAGAATATTTTGTAGGAATACAAGGCCTAAGAAATCCTTCAAGATAGAACTCGGTAATGCAACCTGCAGGACCTGTGGTTGCACCGGGAGTTTAGGATCTCTATTTACCGACGGCCGTAATCACGAAAAAAGATCTGAAGAATTACGCATTGTGACTGGTTTAACCATAAAGTATAACGATGGACTGTCGCAAAAAATATGTCGCCGCTGCTACAGGTTCATCAAAAGATCGCTAAGATTCCGGCGAGTCAGCAAAAGAACTGAAAAGCTTCGATTCGGTATGAGAGTGAAAACTGAGAATGGACCAAGACTAGCGATGACGACGGTTGTGAAGACGGAACCTCAATCAGACAACTTACAAGCCAACGACTTTGGACAAATGGGTTTCATTGACAGATATCAGAATTACAACTTCCCCGATCTGTACGATATCTGTAAGAAAGAGCCAGAGGAGGTGGATCCTCACAACACAACGAATGACATAGTCTCATTCACTTGTAAAACTTGTGGCGAACTGTTCACCTCAAGGAACCTATACGACTCACATCGGAAGACTCACAACAATTTATGGGTGGATGAAAATGCGACCCCCGGGATACAGAAGCAGATCGGTGCCAACGGAATGTACAAATGCCATATCTGTGGCAAGGAGTTCAGGATGAGGGCCACCTACACCTCGCATCTCAGGTTCCACACGCATTACTGCGTCTGTGAGTTTTATAGATATTTTCGTTACAACACCATCAGCCACATCTTTATTACTGGATCTAAGTCTCTCAGTAATTACCGCCAACGTTGGGCCTCTCCCATCAATTCACCTAGACCCTCGGTACAGGACTGCGGTAAGCGCTGCCGGAACAACAACGAGCTCCAGGAACACAAGCGAGCTCGACACGGCTTGATGAAGATACACAAATGTTCCCACTGCGAGTACAGCTCCGCGACCAAGGAGGCGCTCACCATCCACGAGCGACATCACACGGGAGAGCGGCCTTACGTCTGCGATCACTGCGGCTCCTCCTTCTTCAGGCGGTCCACGCTGGTGCAGCACATCGCCATACACCTGCCAGATAAGAACTTCCAGTGCGACATGTGCCCTAAATGGTTCAAATCCAAAAAGTTCCTTCAAATCCACAAACACGACGCTCACACCGGAAAGAAATACGGTTACCTGTGTTCAGTTTGCAATCACCGTTTTGAGAAGCCGTATATAGTGCGCGCGCACACGAGGAAGGTCCACGGGGTCCCCGACGAACAGACTGTAGACGACTAG

Protein sequence:

>DPOGS215017-PA
MLPLFLLIHSMASRGRIFCRNTRPKKSFKIELGNATCRTCGCTGSLGSLFTDGRNHEKRSEELRIVTGLTIKYNDGLSQKICRRCYRFIKRSLRFRRVSKRTEKLRFGMRVKTENGPRLAMTTVVKTEPQSDNLQANDFGQMGFIDRYQNYNFPDLYDICKKEPEEVDPHNTTNDIVSFTCKTCGELFTSRNLYDSHRKTHNNLWVDENATPGIQKQIGANGMYKCHICGKEFRMRATYTSHLRFHTHYCVCEFYRYFRYNTISHIFITGSKSLSNYRQRWASPINSPRPSVQDCGKRCRNNNELQEHKRARHGLMKIHKCSHCEYSSATKEALTIHERHHTGERPYVCDHCGSSFFRRSTLVQHIAIHLPDKNFQCDMCPKWFKSKKFLQIHKHDAHTGKKYGYLCSVCNHRFEKPYIVRAHTRKVHGVPDEQTVDD-