Monarch geneset OGS2.0

DPOGS202265
TranscriptDPOGS202265-TA1794 bp
ProteinDPOGS202265-PA597 aa
Genomic positionDPSCF300032 - 502664-507936
RNAseq coverage468x (Rank: top 26%)
Annotation
HeliconiusHMEL0056024e-8551.58% 
BombyxBGIBMGA004914-TA6e-6051.56% 
Drosophilasu(Hw)-PB3e-3924.71% 
EBI UniRef50UniRef50_UPI000179124E1e-6732.62%UPI000179124E related cluster n=1 Tax=unknown RepID=UPI000179124E
NCBI RefSeqXP_001944230.13e-6832.62%PREDICTED: similar to gonadotropin inducible transcription factor [Acyrthosiphon pisum]
NCBI nr blastpgi|1935826125e-6732.62%PREDICTED: zinc finger protein 91-like isoform 1 [Acyrthosiphon pisum]
NCBI nr blastxgi|2420185901e-7332.62%zinc finger protein, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00036769.4e-11nucleic acid binding
GO:00082709.8e-06zinc ion binding
GO:00056229.8e-06intracellular
KEGG pathway 
InterPro domain[355-396] IPR0130879.4e-11Zinc finger, C2H2-type/integrase, DNA-binding
[174-197] IPR0070879.8e-06Zinc finger, C2H2
Orthology groupMCL15785 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202265-TA
ATGCCACCAAAACTTAAAAGAGGCCGTCCACCAGGCTCAAAAAACAAGAAGACTATCAATCAATTAGCTCAGAAAATAAAGGATGAACAAAACTGTGAGAACATTAAGCAACAAAATGATGATTGCTATTGTGAATCTTGTAAACAATCGTTTCCATCGCTTCGGGTTTATCGAAAGCATGTCAAATCTTGTAAAATTAAACAGGAAATAACACTCGCGGGTGTCGATGTAACACCAATAAAGCCTGCATACAACTGTAAGCAATGCAGCAAAAAGTTCAGATTTATCAAGAGTTATGAAAGTCATCTATTAAACGAACATTCCTCGACTCCGGGCAGTGTTTCTTGTAACCAATGTGATGTGGTGTGTCCTAATGAAGATGTTTTGGCCCAACACGTCCAGAATGTGCATCTTAGAGAGATCTTTGAGTGTGAACATTGTGATCAGAAGTTTGTTAGGAGGTCCCATGTCCTTAGACACATGATGCAAAAGGGCTGTGATGGCAGAAAAGTCGTCACCTATCCATGTGAAATCTGCGAGGCATCATTCTCAAGGAAAGATAACCTGATGGTACATCTCCGGTTGCAGCATATAACAACCAAGCATTTCAGTTGTAAGCTCTGTGAATATCATACAAGAAACTTCTCCAAGCTAATAATACACAATCAGCAGAAACATACAGAAACACCAAGATTTGAGTGCGATCATTGCGGCAAGGTAACCGGCTCACGAGGAGCCATCGCCAAACACCTCGAAATACACGGCGAAAAGAAATACGGCTGTGATGTGTGTGGTTACACAACCTTCACGGTGGAGGTAATGAGACGTCATGTTCTAACACATGTCAGAGAAAAGCCATTCAAATGTGATATATGCGATCAGTCGTACATACAGAAAGTCCAGCTACAACGACACTTGGAGAAACACACCGGAAACCAATGCAACATCTGTTCTAAGAGCTTTGATTCTAAATCAAAATTAATTGTTCATGAAATCATGCACAAGGGAGAGCCTTTGTTGTGTCCGATCGACAACTGTGATACAAAAAAGTTTAAAGATGCCAACGATCTGGCCAGTCATATCAAGATGCACTTGAACGATAAGCCGTTCGAGTGTGAGGTGTGCAGCAAGAGGTTCCACCTGGAGGTGAACATGCGCCGTCATATGAGCACTCACACTTTGGAAAAAGCCCGTCGCTGTATGTACTGCGTCTCCGCGCGAGCCTACGTCAGGGGGGAGCAGTTGGTCCGTCACGTGAGGAAGACTCACCCGCAAGTGTTCCAGCAGAGACTATTGCACGTGCGGAGTGTCCTTGGCTCGAATCTCGGGCTGAGTCGCGTGAGGAAGTCTGAGATAGAGTCTATCCTGAACGTATTGGACGCGGAGTCGGACCGCATCCTTGAAGGTTGTGGTGACGGAGCTTTGTACGGTGGACTCCAAGAGAACGATGACTCCAATGTGAAGCAGGAAGAAGAAAAACCACTCATGGCAGAGGATGAACTAGCTGATAGTTTGAACAAACTCCTCACACACATGATAGACAAGGAGATGCTGGAGTGTTTCGGATGGCCGGACGAACCTATAGATGAGGTTTTGGAGCACATGATTTCAAACTGTGGCGCCAAAGCAGCTGATCACAGTTGGCCTCGAGTCCAACGTTTGAGAGAGAATGCCAAACAATTATTCCTGCACGTAGTGGAAGACGAGACAGTGGCTCGCATGCTGCACACACACACCATCGACCAGGTCATCAAACACATACTTGCCCAAGTCGCTGATGATGTACAGAAGTGA

Protein sequence:

>DPOGS202265-PA
MPPKLKRGRPPGSKNKKTINQLAQKIKDEQNCENIKQQNDDCYCESCKQSFPSLRVYRKHVKSCKIKQEITLAGVDVTPIKPAYNCKQCSKKFRFIKSYESHLLNEHSSTPGSVSCNQCDVVCPNEDVLAQHVQNVHLREIFECEHCDQKFVRRSHVLRHMMQKGCDGRKVVTYPCEICEASFSRKDNLMVHLRLQHITTKHFSCKLCEYHTRNFSKLIIHNQQKHTETPRFECDHCGKVTGSRGAIAKHLEIHGEKKYGCDVCGYTTFTVEVMRRHVLTHVREKPFKCDICDQSYIQKVQLQRHLEKHTGNQCNICSKSFDSKSKLIVHEIMHKGEPLLCPIDNCDTKKFKDANDLASHIKMHLNDKPFECEVCSKRFHLEVNMRRHMSTHTLEKARRCMYCVSARAYVRGEQLVRHVRKTHPQVFQQRLLHVRSVLGSNLGLSRVRKSEIESILNVLDAESDRILEGCGDGALYGGLQENDDSNVKQEEEKPLMAEDELADSLNKLLTHMIDKEMLECFGWPDEPIDEVLEHMISNCGAKAADHSWPRVQRLRENAKQLFLHVVEDETVARMLHTHTIDQVIKHILAQVADDVQK-