Monarch geneset OGS2.0

DPOGS203099
Transcript: DPOGS203099-TA, 1476 bp
Protein: DPOGS203099-PA, 491 aa
Genomic position: DPSCF300391 + 14955-18496
RNAseq coverage: 137x (Rank: top 55%)
Annotation
Heliconius: HMEL004546  0.0  65.79%
Bombyx: BGIBMGA011111-TA  1e-149  57.40%
Drosophila: crol-PE  7e-64  37.35%
EBI UniRef50: UniRef50_UPI0002061274  4e-84  44.64%  UPI0002061274 related cluster n=6 Tax=Takifugu rubripes RepID=UPI0002061274
NCBI RefSeq: XP_001945749.1  2e-93  48.70%  PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastp: gi|328726602  2e-92  48.70%  PREDICTED: zinc finger protein Xfin-like [Acyrthosiphon pisum]
NCBI nr blastx: gi|328726602  1e-100  48.70%  PREDICTED: zinc finger protein Xfin-like [Acyrthosiphon pisum]
Group
Gene Ontology:
  GO:0003676  1.2e-15  nucleic acid binding
  GO:0005634  8e-07  nucleus
  GO:0008270  8e-07  zinc ion binding
  GO:0005622  3.6e-06  intracellular
KEGG pathway 
InterPro domain:
  [427-452]  IPR013087  1.2e-15  Zinc finger, C2H2-type/integrase, DNA-binding
  [13-89]  IPR012934  8e-07  Zinc finger, AD-type
  [406-428]  IPR007087  3.6e-06  Zinc finger, C2H2
Orthology group: MCL25812  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203099-TA
ATGTTAAAAATGATGGATACGTTCAGTGAAGGCGGTATTTGCCGCTGTTGTCATGCCCAAGGAACATTTAAGAGTCTAAACGAAACGTATACAAGTGAAAACAGGCAAGATACTTACTACAACGCTCTGCAAAAAACATTTAATATAGAAATAAAATATATATCAAGCATGGATGGTATGTCGATTTGTAACGTTTGCATCGAGAAACTGCTAGATGGGGAATTATTTAAACAGCAGGTTGAGGCTTGCGAGACAATCTTGAAACAGCACTACAGCAATAAAGATGATTTTAAGCAAGTAAAAGATGAACTCTATGACTTTCCCGACGTTGAAACAATAGTTAATGATGATTTAGATGATCAACCGCTGTCGGCTTACAAGACTTTGGAAATTGATTGGAAAATCCAACGGAATGATAAATATGTAAATAAATCAGAAAATCCAAAAATAAAGAAGTACGGCTGCGATTTATGTGATAAAAGATTCTCATACAGCGGTTCCTTAGATGTACACATGCGAGGTCACAGCGGTGAAAAATCGTATATATGCAGGCTATGTGACAGTAAATTTGCTTTGAACAAGGAATTAATACGACATATAAACTCTGAGCACTCCCAAAATGGACTGTTCCCGTGTAATTTCTGCTCGAAGAAACTAGAAAATGCCAGATCTCTGAAAAATCATCTGACAATACATTTCGGAGAGAAGTTTAAATGTCCTCACTGTGACAAAGAGTTCCAAAGAAAGAAAGGCCTCAAGGAACACTTGAAGACGCACACGGGTGAAAGGAATTTCAGTTGTACGCTGTGCGATAAATCCTTCTGCCACAACCAGACACTGAAATCACATATGCTGACCCACACGGGCGAGAGGCCGTACGTGTGTTTTGTCTGCGGGAGAAGGTTTCCTCAGCGAACACATCTGAAACGACACATCTTCTTAATACACACCGGCATAAAACCTCACACCTGCAAAGTCTGCAATAAACAGTTCTCGAGCAAGAGCTATCTGTCGATACACCAACGGACGCACACAGGAGAGAGACCCTATTCGTGTGATGTCTGCAAGAAAGACTTCACAGCATACACGACGTTAAAGGTCCATATGCGTGTGCATACTGGTCTAAAACCATATTTGTGTACATTCTGCAATCGCCAATTCGCTCAGTTAGCGAGTTTTAAGCTCCACGAAAGGACACACACTGGAGAGAGGCCGTACTCTTGTAAGGTTTGCAAGAAATCCTTCTCCGACAACGGCTACCTGAAGATCCACATGCGTGTACACACGGGCGAGAAGCCGTTCAGCTGCGACATCTGCAAGCGTTCGTTCAGAGAGACCGGACAACTGAAGCGCCACATGCGCGTGCACACGGGCGTGAAGCCCTACACGTGCAAGGTGTGCAACAAACAAATCGGCAATCTGTCCAAACACATGCGTGTCCACACCGACGACAGGCCCTACAGCTGTAATGTGTGA

Protein sequence:

>DPOGS203099-PA
MLKMMDTFSEGGICRCCHAQGTFKSLNETYTSENRQDTYYNALQKTFNIEIKYISSMDGMSICNVCIEKLLDGELFKQQVEACETILKQHYSNKDDFKQVKDELYDFPDVETIVNDDLDDQPLSAYKTLEIDWKIQRNDKYVNKSENPKIKKYGCDLCDKRFSYSGSLDVHMRGHSGEKSYICRLCDSKFALNKELIRHINSEHSQNGLFPCNFCSKKLENARSLKNHLTIHFGEKFKCPHCDKEFQRKKGLKEHLKTHTGERNFSCTLCDKSFCHNQTLKSHMLTHTGERPYVCFVCGRRFPQRTHLKRHIFLIHTGIKPHTCKVCNKQFSSKSYLSIHQRTHTGERPYSCDVCKKDFTAYTTLKVHMRVHTGLKPYLCTFCNRQFAQLASFKLHERTHTGERPYSCKVCKKSFSDNGYLKIHMRVHTGEKPFSCDICKRSFRETGQLKRHMRVHTGVKPYTCKVCNKQIGNLSKHMRVHTDDRPYSCNV-