Monarch geneset OGS2.0

DPOGS212967
Transcript: DPOGS212967-TA (1785 bp)
Protein: DPOGS212967-PA (594 aa)
Genomic position: DPSCF300057 + 415305-417373
RNAseq coverage: 74x (Rank: top 66%)
Annotation
Heliconius: HMEL011306; E-value 0.0; identity 81.27%
Bombyx: BGIBMGA011610-TA; E-value 0.0; identity 90.86%
Drosophila: jim-PE; E-value 1e-158; identity 71.90%
EBI UniRef50: UniRef50_E0VAN3; E-value 7e-176; identity 57.69%; Zinc finger protein Xfin, putative n=7 Tax=Neoptera RepID=E0VAN3_PEDHC
NCBI RefSeq: XP_393705.3; E-value 0.0; identity 69.03%; PREDICTED: similar to jim CG11352-PC, isoform C [Apis mellifera]
NCBI nr blastp: gi|380029269; E-value 0.0; identity 65.71%; PREDICTED: zinc finger protein 93-like [Apis florea]
NCBI nr blastx: gi|380029269; E-value 0.0; identity 66.83%; PREDICTED: zinc finger protein 93-like [Apis florea]
Group:
Gene Ontology:
GO:0003676; 3.1e-14; nucleic acid binding
GO:0008270; 8.1e-07; zinc ion binding
GO:0005622; 8.1e-07; intracellular
KEGG pathway:
InterPro domain:
[421-446] IPR013087; 3.1e-14; Zinc finger, C2H2-type/integrase, DNA-binding
[421-443] IPR007087; 8.1e-07; Zinc finger, C2H2
Orthology group: MCL12539; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212967-TA
ATGCAACACCAAAGCCCTGGGGGTCCTTCCCCTCAGCCGGAACCACAACTCGCGTCCCCGGCCCCCTCCCCATACCCCCCTGCGCCCCCGCCCCCCGACCCCTCCGCGGATGAGGCCGCACAACTTGCTCAACAACAAGCGCAACAACAAGCACAGCAACAAGCGCAGCAACAAACATCCCCAGAGCACATGCAAGTCGAGCAGCAGCTACAAAATCAACCTCAGCCTCAACAACCATCGCAAGACCATCTAGACCGAACGCCATGCTTCGTCACGCAACCTGACTTACAACAGCAATACACGCCATATTTCAAAGAAGCGAGACCTGCGAGCCATATGCTCGGAACTGGTGGTTTTCCCTTACACTACCTAAAACAGAACGGCGGAGTTCTATCGTTGGAGGGTCCACTTGACCAATATGCAACGGCGGACTTGCGCCACTTACCAGATATCGTGCAACCTGAACCACCGAAACCTCGGAAGCACAACCCAAATACTGAGCTTCGTCTATTCAAATGCCTGACCTGCGGCAAAGACTTCAAACAGAAGTCCACCCTGCTCCAGCACGAGCGGATCCACACAGATTCCAGGCCATATGGATGTCCAGAGTGCGGCAAGCGGTTCCGGCAGCAGTCCCATCTGACGCAGCATCTACGCATCCACGCCAACGAGAAACCGTACGCGTGCGTGTACTGCCCGCGCTCCTTCCGCCAGCGGGCCATCCTCAATCAGCACCTACGCATCCATTCCGGTGAGAAGCCATATACGTGCGGCGAGTGCGGCAAACATTTCCGCCAGAAGGCCATCCTGAACCAGCACGTCCGCACACACCAAGGCGAGCGCTATTCACACATATATGCCCCGCCCCGGTCCTCGCACGTTTCACCGCATCTTATCTTCAAGAACGGGACGACGCCGACTTTGTGGCCTCAGGATGTACCATTTCCACCGGATGAGAACAAGGAAGAAGTTCAATCGACTTATGGCGACACAGATGGTCAGAATGGTGAACAACGAGGATCTTGCTTTTCACCAGGAGATACTGCACAGTATCCTGCTTATTTTAAAGATACTAAAGGCCTCAACCATGCCGTATTCGGTTCTAGCCTTTCATTGCAGTACCTAAAAGGAGGAAAACTGCCGGATGTATTAGGGGGACGTGCTATGCCTTTGTATGTGCGATGTCCTATTTGTCAAAAGGAATTCAAACAGAAGTCAACATTGCTGCAACACGGCTGTATCCATATTGAATCACGGCCTTATCCATGCCCCGAATGCGGAAAACGTTTCAGGCAGCAATCACATCTTACCCAACACTTACGCATTCACACAAATGAGAAACCATACGGCTGTGTGTATTGCCCACGATTCTTTAGGCAGCGAACGATTCTCAACCAACATCTTCGCATCCACACCGGCGAAAAGCCATACAAATGTACGCAATGCGGGAAAGATTTCAGGCAGAAGGCTATTTTAGATCAGCACACGCGGACTCACCAAGGCGACAGGCCGTTCTGTTGTCCGATGCCAAACTGCCGTCGCCGCTTCGCCACCGAACCCGAGGTGAAGAAACATATCGACAACCACATGAACCCGCACGCGGCGAAGGCTCGGCGGGCGGACGCGAAGACCCCTCGGCCACTGCCGCCGGCCTCGGCGGTTGTGAAGCCGGAACTGTACTTCCCGCAGTGCTACGCGCCGCCGTTCCAGCAGTTCCCCACAGGCGGCGCAGGTGAGTTCAAGCCGGCGGTCGGTGGCGGCGTAGCTTGTCTGACGACACAGTGA

Protein sequence:

>DPOGS212967-PA
MQHQSPGGPSPQPEPQLASPAPSPYPPAPPPPDPSADEAAQLAQQQAQQQAQQQAQQQTSPEHMQVEQQLQNQPQPQQPSQDHLDRTPCFVTQPDLQQQYTPYFKEARPASHMLGTGGFPLHYLKQNGGVLSLEGPLDQYATADLRHLPDIVQPEPPKPRKHNPNTELRLFKCLTCGKDFKQKSTLLQHERIHTDSRPYGCPECGKRFRQQSHLTQHLRIHANEKPYACVYCPRSFRQRAILNQHLRIHSGEKPYTCGECGKHFRQKAILNQHVRTHQGERYSHIYAPPRSSHVSPHLIFKNGTTPTLWPQDVPFPPDENKEEVQSTYGDTDGQNGEQRGSCFSPGDTAQYPAYFKDTKGLNHAVFGSSLSLQYLKGGKLPDVLGGRAMPLYVRCPICQKEFKQKSTLLQHGCIHIESRPYPCPECGKRFRQQSHLTQHLRIHTNEKPYGCVYCPRFFRQRTILNQHLRIHTGEKPYKCTQCGKDFRQKAILDQHTRTHQGDRPFCCPMPNCRRRFATEPEVKKHIDNHMNPHAAKARRADAKTPRPLPPASAVVKPELYFPQCYAPPFQQFPTGGAGEFKPAVGGGVACLTTQ-