Monarch geneset OGS2.0

DPOGS203435
TranscriptDPOGS203435-TA1374 bp
ProteinDPOGS203435-PA457 aa
Genomic positionDPSCF300242 - 252376-254447
RNAseq coverage279x (Rank: top 39%)
Annotation
HeliconiusHMEL0115701e-15274.10% 
BombyxBGIBMGA001685-TA7e-5340.68% 
Drosophilacrol-PE6e-4333.10% 
EBI UniRef50UniRef50_Q96MU63e-5640.34%Zinc finger protein 778 n=11 Tax=Euteleostomi RepID=ZN778_HUMAN
NCBI RefSeqXP_001945749.12e-5939.23%PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastpgi|3287266026e-5941.41%PREDICTED: zinc finger protein Xfin-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287266023e-6441.06%PREDICTED: zinc finger protein Xfin-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00036761.7e-11nucleic acid binding
GO:00082701.1e-05zinc ion binding
GO:00056221.1e-05intracellular
KEGG pathway 
InterPro domain[334-362] IPR0130871.7e-11Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34502 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203435-TA
ATGACTTCAGAGTTTATCCAAATTAAGAAAGAGCCAGTGTGTGGCGAAACACCATCGTCCCATGAAATCGTAAATAATGTTAATAATAAACAAATTACAGAAAATGTCGGTGTCTTTGAACAGAAAATAGTAAGTGTTGAATTTGTGAACATTAAAGTGGAGCCGCGGGACGATGTTAAACATGAGCCCAGCCTGCTTGTTGAGACCTCAACATTTCACAATACAACCATTAAAGAGGAACCCGGTTACCACAACATAGACGTAGAAGTGTCCATTAAAGAAGAGCCCTTGGATAGTGATCATGTGGAGTCGTCCAACTTCAGTGTGCCCGAGACTGTGAGAGGTCCCCCATGTAATGTTTATATCAGTAATGTGTCGGAGGAGGTGTTGAGTCTGAACTCGACTGTGGACGGTAAGTACATGTGTCCCATATGTTACAAGAGATTCGCCAACAAAGGTAATGTTCAGAGACATGTGGCTCTACACTCGAGGGAGAAATGGGAGTGTGAAGTGTGCTTTAGACAGTTCTTCAAGAAGATGCTGTATGAGAAGCACGTCCTGACGCATTCCACGGAGAAGAGATTCAAGTGTGAGGAGTGCCAGAAGACGTTCAGGACGTCCTCCAACCTGGAGCAGCACAAGAGGATACATCTCACGGTCAAACCCTTCGAGTGTGACACCTGCAAGCGACAGTTCTCCGTGAGAGCGAACTTACTGAAGCATCAGGGCGCGGGCAGGTGTAAGAAACCCAGCGACAAGCCCATAGAGTGCGGAGTCTGTCAGAAGGTTTTCCAGAAGGAGTTCCTGTTGAAGAGTCACCTCAGGAGACACACGACGGAGCGGCCGTTCGTCTGCGACAAATGTAAGATGAGTTTCAAGTACAAGTCGACTCTGATACGTCACGTGCAGCTCCACAACGGCATCAAACCTTACTCCTGCAAGATATGCAGGAAGAGGTTCACCCACGCGGGACTCATCAAACCTCACATGAGGAAACACACGGGCGAGAAGCCCTACTCGTGTCCGGTCTGCGCCAAGAGCTTCGCCCACAAACACAACATGCAGCGACACACGGTCCGCCATTCGAAGATAAAGAACACGGTGTGCGCCGTCTGCAACAAGACATTCCCCAAGGAGAGCCGGCTCATCTACCACATGAGGACGCACACCAACGCGAAGCCGTTCGCGTGCGGAGTCTGCGAGAAGAAGTTCTCCCACAGGCAGAACATCATCAGGCACTACGGGCGGAAACATCCCGGAGACACGTACGAGTGTCGGGACACGGACGCCAGCGTGGCCAAGGACGTGTGGGAGAACGTCGTGAGGAACATGAACCCGGACGCGCACGGCGACGTCAGGATAACGCCGGACTGA

Protein sequence:

>DPOGS203435-PA
MTSEFIQIKKEPVCGETPSSHEIVNNVNNKQITENVGVFEQKIVSVEFVNIKVEPRDDVKHEPSLLVETSTFHNTTIKEEPGYHNIDVEVSIKEEPLDSDHVESSNFSVPETVRGPPCNVYISNVSEEVLSLNSTVDGKYMCPICYKRFANKGNVQRHVALHSREKWECEVCFRQFFKKMLYEKHVLTHSTEKRFKCEECQKTFRTSSNLEQHKRIHLTVKPFECDTCKRQFSVRANLLKHQGAGRCKKPSDKPIECGVCQKVFQKEFLLKSHLRRHTTERPFVCDKCKMSFKYKSTLIRHVQLHNGIKPYSCKICRKRFTHAGLIKPHMRKHTGEKPYSCPVCAKSFAHKHNMQRHTVRHSKIKNTVCAVCNKTFPKESRLIYHMRTHTNAKPFACGVCEKKFSHRQNIIRHYGRKHPGDTYECRDTDASVAKDVWENVVRNMNPDAHGDVRITPD-