Monarch geneset OGS2.0

DPOGS201361
TranscriptDPOGS201361-TA1293 bp
ProteinDPOGS201361-PA430 aa
Genomic positionDPSCF300083 - 366717-387773
RNAseq coverage45x (Rank: top 71%)
Annotation
HeliconiusHMEL0147012e-17490.91% 
BombyxBGIBMGA000607-TA5e-13085.82% 
Drosophilascrt-PA2e-8881.42% 
EBI UniRef50UniRef50_D2CFX72e-10955.87%Scratch 2 n=1 Tax=Tribolium castaneum RepID=D2CFX7_TRICA
NCBI RefSeqXP_001602332.12e-11150.41%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3800195722e-10949.12%PREDICTED: uncharacterized protein LOC100866730 [Apis florea]
NCBI nr blastxgi|910932501e-12158.12%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00036762.3e-11nucleic acid binding
GO:00082709.9e-06zinc ion binding
GO:00056229.9e-06intracellular
KEGG pathway 
InterPro domain[338-358] IPR0130872.3e-11Zinc finger, C2H2-type/integrase, DNA-binding
[340-362] IPR0070879.9e-06Zinc finger, C2H2
Orthology groupMCL14289 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201361-TA
ATGCCGCGCTGCCTCATGGCCAAGAAGTGGAAGGTGGCTCCGACGACTGAGCTGGCCGACGACTCCGACGAGGAGATCGACGTGGTCGGTGAGGGGCGCGGCCGCAGCCCCGCGCCCGCTGCTACCGCGCCGGGACCTTCGCCCCGCGATCCCGAGCCCACGCTACTCTACAACGGCTACACTGAGGAGCAGCCTTCCACGCAGTTCACCGCGCAGCTGCCGAGCGAGGTGAGCCCGGCTCTCTCCGGCCCCTCTCAGCCGGCGCCTCAGCCTGTCTCCTACGCACTTCTGCACCTCTCCTCCGACTCGACAGATCCACCTTTCCTCCAAGTACAACTCCAGTCTTCACCTGTCTCCCCTCCTCCGCCACCGGCGAGCGGCAAATCCAACTCCATGTGCTATACTAGCACGGGCGCTCTTCTATCTCTACCACCGAAGAAGAAGGATATTTACCGCCCCTACTGTCTCGGCGAACGCGCATACCTTCATCGAGCGGAGGAGGACCTCAATGCAGCCCACGCCATCCTTGATTTAAGCCAGCACGCCGTTTTCCTTACAACTCCACCGCGCATCGAACCCCAAGCACCTCCCGAGCAACCGCCACCACCACCACCCGAACCACAGCAGGAGTCGACAGAGCGTCCCAAAGAGACTGTAGCTTACACCTACGATGCTTTTTTCGTAAGCGACGGCCGATCTAAGCGTCGAAACTATGCGAACGCTCCCGTTGAAAGTGCTCAGCAGCCTGAACAAAAATCAAAATACACCTGTAGCGAGTGCGGCAAGCGTTATGCCACGTCATCGAACTTATCAAGACACAAGCAAACTCATCGCAGCCTCGATTCGGTCGCAGCCAAGCGCTGCGGAGAATGCGGCAAGGCTTACGTATCGATGCCAGCGCTCGCTATGCACGTGCTTACTCATCAACTCTCACACGTGTGCGGTGTTTGCGGAAAATTATTTTCTAGGCCATGGTTGTTGCAAGGTCACCTCCGTTCTCATACCGGCGAGAAGCCTTATGGGTGCGCACACTGCGGCAAGGCATTTGCAGACCGTTCAAATTTACGTGCACACATGCAAACGCACTCTGCCGATAAGAACTTCGAATGTTTGCGATGCCACAAAACTTTTGCCCTCAAGAGCTATCTCAACAAACATCAGGAGTCAGCCTGTGTGCGCGACGGGGAATCACCACCACCCGAGTCTCTTTCAGGCTCGCCTCCCACCAATGAACCTACCACCGAAGCTACCACCGATCCCGCCATCACCACCTCCGTCATAGCCATCGGCTAA

Protein sequence:

>DPOGS201361-PA
MPRCLMAKKWKVAPTTELADDSDEEIDVVGEGRGRSPAPAATAPGPSPRDPEPTLLYNGYTEEQPSTQFTAQLPSEVSPALSGPSQPAPQPVSYALLHLSSDSTDPPFLQVQLQSSPVSPPPPPASGKSNSMCYTSTGALLSLPPKKKDIYRPYCLGERAYLHRAEEDLNAAHAILDLSQHAVFLTTPPRIEPQAPPEQPPPPPPEPQQESTERPKETVAYTYDAFFVSDGRSKRRNYANAPVESAQQPEQKSKYTCSECGKRYATSSNLSRHKQTHRSLDSVAAKRCGECGKAYVSMPALAMHVLTHQLSHVCGVCGKLFSRPWLLQGHLRSHTGEKPYGCAHCGKAFADRSNLRAHMQTHSADKNFECLRCHKTFALKSYLNKHQESACVRDGESPPPESLSGSPPTNEPTTEATTDPAITTSVIAIG-