Monarch geneset OGS2.0

DPOGS205627
Transcript: DPOGS205627-TA (1404 bp)
Protein: DPOGS205627-PA (467 aa)
Genomic position: DPSCF300023 - 486989-491689
RNAseq coverage: 455x (Rank: top 27%)
Annotation
Source: hit | E-value | identity | description
Heliconius: HMEL006202 | 0.0 | 89.33%
Bombyx: BGIBMGA001139-TA | 0.0 | 85.92%
Drosophila: CG12769-PA | 4e-83 | 71.50%
EBI UniRef50: UniRef50_D1ZZX4 | 1e-145 | 56.64% | Putative uncharacterized protein GLEAN_07382 n=1 Tax=Tribolium castaneum RepID=D1ZZX4_TRICA
NCBI RefSeq: XP_975219.1 | 2e-146 | 56.64% | PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastp: gi|91081013 | 4e-145 | 56.64% | PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastx: gi|91081013 | 2e-148 | 56.26% | PREDICTED: similar to zinc finger protein [Tribolium castaneum]
Group
Gene Ontology: GO:0003676 | 1.4e-09 | nucleic acid binding
KEGG pathway:
InterPro domain: [109-133] IPR013087 | 1.4e-09 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL15747 | Insect specific

Nucleotide sequence:

>DPOGS205627-TA
ATGTCATCAAAAGGGGAAACAGAGGCCGATTCTGGCACCGTGTCGCCAAATCTACCTGCCGTCAAAAATGAACCGGCACCAACAGCGAGCAAGGAAACACCTAAAAGGGGCAGTGGTACTCTATTGAAGTGTACTACATGTAACAGTTTTAGTACGCTAAGCAGTCGAGCATTAACCACACACATGGCACAGTGTTCACCTGACAATAACAATGTGGCGGCCGCTCAGAACACAGACGCTAGGCCACACAGAAAGCTTTTCGAGTGCGATGTGTGTAACATGAAGTTTTCCAATGGCGCCAACATGCGACGCCACAAAATGCGCCACACCGGCGTCAAGCCTTACGAGTGTCGTGTGTGTCAGAAGAGGTTCTTCAGGAAGGACCATTTAGCGGAGCACTTCACGACCCACACAAAGAGCTTGCCGTACCATTGCCCCATATGTAATCGTGGCTTCCAGCGACAGATAGCCATGCGTGCTCACTTCCAGAACGAGCACGTTGGCCAGCATGATCTCGTCAAAACTTGCCCGCTCTGCAGTTACCGAGCGCCGACAATGAAGAGTCTCCGGGTCCATTTTTTCAATAGACACGGTATTGATCTGGACAACCCAGGGCCCGGTAACAACTCTGTTTCCCTTCTCGCGGCCGGCATAGCCAACGCCGCTTATGCTGAAGGCACAATGGACCCCAGCGCCATATCAGCATTAGGGGCATTGGGCTCAGGCCTCTCAGTGTCTGTAGCGGCTGCTTACTCAGATAGTGGTGACAGCAATGGAGAACGTTCAGTGGACAACGCAACCCCACCTATGCATTATTTGACCCCACATGTAGAAATATCTATGGCCGATAACAACGAAACATTCTCACCTGGCCAGAATAACCATCAAGAGTCCCGTATGAATGGTTCAGGGTCACCTCATTCAGAGAGCGCTGCAAGTAGTTCCCTGGCGTTGCCTGCTATAACACCATCAATCACGCTCATACCCATTAAACAGGAACCTAATGCTCAGGAGGAAGGTTCAGGAGATGCTGGTGAAGGGGAAGACAAACGTGACGTCTCATCATCGCTGTCTTCACTCATACAAGTGTCGCCATTGAAGAGTTTGTTGCGAGAGGACCTGCGCAGACGCATCTCAGCCAGGGGCCGGTCTAGGGCTAATAATGCGTCCCGAGCCTCCCCTTCGGAGGGGGGAGTGACTACTTCGACCCAAGGGGACGCAGCCCTACTGCCTTCGTCGCTTGTTTGCTCGTTCTGCTGCATCACGTTCCCCGACTCCACACTATACTTCCTGCACAAGGGCTGCCACTGCGACGCCAACCCCTGGAAATGCAACATCTGCGGCGAGCAGTGCTGCAACGTTTACGAGTTCAATTCCCATCTGCTGTCGAAGAGCCACCAATGA

Protein sequence:

>DPOGS205627-PA
MSSKGETEADSGTVSPNLPAVKNEPAPTASKETPKRGSGTLLKCTTCNSFSTLSSRALTTHMAQCSPDNNNVAAAQNTDARPHRKLFECDVCNMKFSNGANMRRHKMRHTGVKPYECRVCQKRFFRKDHLAEHFTTHTKSLPYHCPICNRGFQRQIAMRAHFQNEHVGQHDLVKTCPLCSYRAPTMKSLRVHFFNRHGIDLDNPGPGNNSVSLLAAGIANAAYAEGTMDPSAISALGALGSGLSVSVAAAYSDSGDSNGERSVDNATPPMHYLTPHVEISMADNNETFSPGQNNHQESRMNGSGSPHSESAASSSLALPAITPSITLIPIKQEPNAQEEGSGDAGEGEDKRDVSSSLSSLIQVSPLKSLLREDLRRRISARGRSRANNASRASPSEGGVTTSTQGDAALLPSSLVCSFCCITFPDSTLYFLHKGCHCDANPWKCNICGEQCCNVYEFNSHLLSKSHQ-
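
The transcript and protein lengths reported in this record can be cross-checked against each other: a coding sequence with one stop codon should be three times the residue count plus three bases. The snippet below is a minimal sketch of that arithmetic, assuming the transcript is a pure coding sequence with no UTRs; the function names are hypothetical and are not part of the OGS2.0 release or any MonarchBase tool.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp for a protein of protein_aa residues.

    Assumption: the transcript contains only the CDS (no UTRs) and,
    when include_stop is True, ends with exactly one stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Check whether a transcript length matches the protein length plus a stop codon."""
    return transcript_bp == expected_cds_length(protein_aa)


if __name__ == "__main__":
    # Values taken from this record: 1404 bp transcript, 467 aa protein.
    print(expected_cds_length(467))
    print(lengths_consistent(1404, 467))
```

For this entry the check passes: 467 residues plus the terminal stop codon (TGA in the nucleotide sequence, shown as "-" at the end of the protein sequence) account for all 1404 bp.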