Monarch geneset OGS2.0

DPOGS203104
TranscriptDPOGS203104-TA1515 bp
ProteinDPOGS203104-PA504 aa
Genomic positionDPSCF300391 + 99388-120856
RNAseq coverage419x (Rank: top 29%)
Annotation
HeliconiusHMEL0148843e-7452.65% 
BombyxBGIBMGA011150-TA5e-3771.43% 
Drosophila% 
EBI UniRef50UniRef50_D6WG983e-1043.33%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WG98_TRICA
NCBI RefSeqXP_002407566.13e-0841.94%conserved hypothetical protein [Ixodes scapularis]
NCBI nr blastpgi|2700046891e-0943.33%hypothetical protein TcasGA2_TC010361 [Tribolium castaneum]
NCBI nr blastxgi|2700046891e-0942.57%hypothetical protein TcasGA2_TC010361 [Tribolium castaneum]
Group
Gene OntologyGO:00036767.3e-17nucleic acid binding
KEGG pathway 
InterPro domain[4-86] IPR0066127.3e-17Zinc finger, C2CH-type
Orthology groupMCL34482 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203104-TA
ATGGGTTATTATTGTACAGTGCCACAGTGCACCTCGCTTGCCGGCAAGACTAAAAACGTAAAGTTCCACCGTTTCCCTCGCGACAACGACATGGCACTCAAGTGGAATACTATTCTCAAGCGCGGCAAACCAGTAACGAAATACTCTAAAGTTTGCAGTCTTCACTTCACACCAACAGATTACAACATAACAACAATGGGTAAGAATAAAGGTCAATGGAAAACATTGTCCAAAGATGCTATACCGACCCAGAACCTGCCGAAGCTCAACCCTGATGGCACCGTCATGGTTATAAGGAAGAGCAGGACAGTCAAATACAAGGATTGCAAGAAAGAGGAGGTCAGTGCCAAAGAGGAGAAGTTCACAAGCGTGGCACAAACACAACCCCACTTAACCGACGACGTCAAGCAGGAAGATACCGAATCGACTGTGAGGACCCTTGACTCCGCTCCGCGCCTTCCGCACCTCCCCCTTTGTGCCGGCAGCAGCGTGACGTTGCTTTCGTACTATCCATCATTGTCACGTCCCGGACCAGAACTACTGATTGCCAAACAGAACGCTCTGGCGGCTCACACACAACTATGTCAATGGAAAACATTGTCCAAAGATGCTATACCGACCCAGAACCTGCCGAAGCTCAACCCTGATGGCACCGTCATGGTTATAAGGAAGAGCAGGACAGTCAAATACAAGGATTGCAAGAAAGAGGAGGTCAGTGCAAAAGAGGAGAAGTTCACAAGCGTGGCACAAACACAACCCCACTTAACCGACGACGTCAAGCAGGAAGATACCGAATCGACTCTAGCTTACCGTCTGATGTGTAATCTAGCGACCGAGTTACCAAAACCAAGGAAGCAAGACGCGACTATGCAAACAGATCCCATACTGTCCGAGGACGAGCAGAACTCTATGGAAATAAATTTTGATATAGACTACGACATGCCAAAAGACTTCTCCAAACAGAACGATGATTTCATACCGCAGACCAACGAGGAATACAACAAGGAGGCGGAAAAATACGCCCTACAGATAAAAAATCACAGCCAGTACGACCAGAAAGTCGAAAGCTATACGAAGTTACCCGAAGTGTACAACGGGTTCAGGGAGAAGGCTTACAGTGATGTAATATATCAGAACACTTACACAGACAGCGTGGAGATACTGAGGAACCAGCAGCACAACCTCCAAATGCTGCAAGAACAGCACAGGATCAACGAGAACCAGAACTTCTTCAATGAAGACCACATAAAGCAAGAAATAGACTTCACCAACGACGAAGAGAAATTTTTGTATGAAAGGAATAAGGAACAAATTGATCATTCGATGTACGAGCACAGAGAACAACAAAGCATAATTGATCACGTGACGACCGTGAAACAGGAACCGGAAGTTACAGGAGTTATAAATAGAATACAAAAACTGAATATTAATTATGTACAAAGCAACGTCACGCTGCTGCCGGCACAAAGGGGGAGGTGCGGAAGGCGCGGAGCGGAGTCAAGGGTCCTCACCTAG

Protein sequence:

>DPOGS203104-PA
MGYYCTVPQCTSLAGKTKNVKFHRFPRDNDMALKWNTILKRGKPVTKYSKVCSLHFTPTDYNITTMGKNKGQWKTLSKDAIPTQNLPKLNPDGTVMVIRKSRTVKYKDCKKEEVSAKEEKFTSVAQTQPHLTDDVKQEDTESTVRTLDSAPRLPHLPLCAGSSVTLLSYYPSLSRPGPELLIAKQNALAAHTQLCQWKTLSKDAIPTQNLPKLNPDGTVMVIRKSRTVKYKDCKKEEVSAKEEKFTSVAQTQPHLTDDVKQEDTESTLAYRLMCNLATELPKPRKQDATMQTDPILSEDEQNSMEINFDIDYDMPKDFSKQNDDFIPQTNEEYNKEAEKYALQIKNHSQYDQKVESYTKLPEVYNGFREKAYSDVIYQNTYTDSVEILRNQQHNLQMLQEQHRINENQNFFNEDHIKQEIDFTNDEEKFLYERNKEQIDHSMYEHREQQSIIDHVTTVKQEPEVTGVINRIQKLNINYVQSNVTLLPAQRGRCGRRGAESRVLT-