Monarch geneset OGS2.0

DPOGS213596
Transcript: DPOGS213596-TA, 1500 bp
Protein: DPOGS213596-PA, 499 aa
Genomic position: DPSCF300033 + 633222-636315
RNAseq coverage: 640x (Rank: top 20%)
Annotation
Heliconius: HMEL007898, 2e-162, 73.03%
Bombyx: BGIBMGA011662-TA, 2e-114, 63.31%
Drosophila: CG31601-PA, 4e-10, 28.31%
EBI UniRef50: UniRef50_D6WGD3, 4e-41, 48.85%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WGD3_TRICA
NCBI RefSeq: XP_971602.1, 8e-42, 48.85%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|91092090, 1e-40, 48.85%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|91092090, 7e-62, 38.37%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology: GO:0008270, 2.8e-05, zinc ion binding
Gene Ontology: GO:0003676, 2.8e-05, nucleic acid binding
KEGG pathway 
Orthology group: MCL22013, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213596-TA
ATGGATTCTGAAGATTCTCGTGATGCCACTGGAGATGGCCGTTCATCATCCAGCAGTAGTAGCTCAAGTTCCAGGAGTCGCCGTAGTTCATCATCAGATAGTTTTAAAAAGAATCCCAAGTCCCCACCTCAAAGACCCAAATCTCCAAAATCTAATGGACCCCGTTTCCACTCCCCAGGGAGTCACTCTCAATCACCTCACAGAAGTTCTGACAAAGCCCCACAGTCACAAGATAACTTAAGTGGTGCACATTCAGACATTAATTCACCAGAGAATTCAATGCCTGCTTCTCCTTTTGGGCCAAAATCTCCTGATGCCCCTAAGTCTCCAAGTGAAGTGGGTTCCAATATAGGATCTCCCGAACAACCTGAGCACTATTCATCCAATCGTTCAGGAGGTGAGGGTCCTAAAACTCCTCAATCTCCTGCTAGTTCAGTAAAATCTGCTCCAGGTTCACCAACTAGTAGATCCCAACCAATATCACCCGACTCAGCATCCCAAATCACAAGTCCATCTAGATATCGTGATTCTTGGTTGCCTTCCTACAATAATCGTAGAGCATCTCCTCATTCCTCCCCTACTGCTTCTCCTGTTAAGAAAAATGACTGGAGTCCTAAAGAGAATTGGCGACGGCATACAAAGGCAGAACAGAAAATGGTTCCCACTCCAAGAAATAGAAGTAGGTCGGATTCAAGTCGATCCAGTCAAAGTTCATCATACAATAAGAGAACTAACACTAAAGATCCCGCCACCGAAGAAATATCTGATGGGGAAATGGAATCGGATCAAGAGGACAATAAGTCCAGGCGCAAGTCTCCGGGCAGCGAGAAACATGAAAAAGAGAATAATATCAGTCATGAAGATTTAAGTGACGTGTCAGACGAGGACTCCGACACACAAGATGACAATAAAGACATAAAGACCAAACCAAACCCCCCATCCACTGATAAGAAAAATGAAATTTTTAATGGTAACGATGGTGAAATTTCACAGCTGTCTTCTATTTCGGAGGGTGGAAAAAATGCTAAAAAAACAGAAAAGGTGTCACAAGAAAAAACAAACAATACAGAAGACGGCGAGGAACAGTTGGACTTTGAAGCTGAAGAAGGAGAGTGTATTGAAACGACAAAGACAAAAGATCAATCTAACGGGGATGTTGATAGTCAGAGCAATGAGAAAGAGGAAGTGAAAGCGGGTCGTAGATCGCGAGGTTCTTCTCTAGAAGAGGGCGAGGTGTCTGACGAGGCAGAGCGTCGGCCCGAGGAGAGTGAACCGCGACCGGTCTGCAGATTCTTCTCACGGGGCGCCTGCACCTGGGGAGTCAGCTGCAGGTTCCTGCATCCCGGTGTAACGGACAAAGGTAACTACAACATGTTTGATGTGGTCCGAGGTGTACCCAGTGGAACCATTCCCGCGCCGAGGGACTCCGCCCCGCCAGAGTCGGCCTGGGAAAGGGGACTGCGGACTGCCAAAGAAGTAAGGATACTGATAAGATTTTGA

Protein sequence:

>DPOGS213596-PA
MDSEDSRDATGDGRSSSSSSSSSSRSRRSSSSDSFKKNPKSPPQRPKSPKSNGPRFHSPGSHSQSPHRSSDKAPQSQDNLSGAHSDINSPENSMPASPFGPKSPDAPKSPSEVGSNIGSPEQPEHYSSNRSGGEGPKTPQSPASSVKSAPGSPTSRSQPISPDSASQITSPSRYRDSWLPSYNNRRASPHSSPTASPVKKNDWSPKENWRRHTKAEQKMVPTPRNRSRSDSSRSSQSSSYNKRTNTKDPATEEISDGEMESDQEDNKSRRKSPGSEKHEKENNISHEDLSDVSDEDSDTQDDNKDIKTKPNPPSTDKKNEIFNGNDGEISQLSSISEGGKNAKKTEKVSQEKTNNTEDGEEQLDFEAEEGECIETTKTKDQSNGDVDSQSNEKEEVKAGRRSRGSSLEEGEVSDEAERRPEESEPRPVCRFFSRGACTWGVSCRFLHPGVTDKGNYNMFDVVRGVPSGTIPAPRDSAPPESAWERGLRTAKEVRILIRF-
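
The 1500 bp transcript and the 499 aa protein above are consistent with a coding sequence of 500 codons whose last codon is a stop, shown as "-" at the end of the protein record. A minimal sketch of that translation follows, assuming the standard genetic code applies to this nuclear gene. The function name translate and its stop_symbol parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein record above (an assumption about its meaning).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six and last three codons of DPOGS213596-TA, copied from the record.
print(translate("ATGGATTCTGAAGATTCT"))
print(translate("AGATTTTGA"))
```

Passing the full DPOGS213596-TA sequence to the same function would be one way to check it against the DPOGS213596-PA record.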