Monarch geneset OGS2.0

DPOGS214819
Transcript: DPOGS214819-TA | 1809 bp
Protein: DPOGS214819-PA | 602 aa
Genomic position: DPSCF300059 + 713785-716338
RNAseq coverage: 133x (Rank: top 56%)
Annotation
Heliconius: HMEL004972 | 0.0 | 59.93%
Bombyx: BGIBMGA012052-TA | 4e-176 | 58.12%
Drosophila: CG11247-PC | 3e-80 | 35.90%
EBI UniRef50: UniRef50_E3XCL2 | 2e-103 | 44.08% | Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XCL2_ANODA
NCBI RefSeq: XP_969615.1 | 8e-105 | 43.86% | PREDICTED: similar to AGAP012410-PA [Tribolium castaneum]
NCBI nr blastp: gi|91079348 | 2e-103 | 43.86% | PREDICTED: similar to AGAP012410-PA [Tribolium castaneum]
NCBI nr blastx: gi|91079348 | 2e-110 | 43.96% | PREDICTED: similar to AGAP012410-PA [Tribolium castaneum]
Group:
Gene Ontology: GO:0003676 | 7.3e-11 | nucleic acid binding
KEGG pathway:
InterPro domain: [387-415] IPR013087 | 7.3e-11 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL16568 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214819-TA
ATGGCTAATGTAAAGATAAGAATTAACTCATCCAAGCCATCGTCGAATAAAATAAGTGGGAGTGGTAGAATAAAAAAATCATCCACTACAAATAATGATAAAAGTTCAGGTTCAAAAATTGCTAAAAGTAGTTATGTGAAAGAAAAATTGAATGACAGTGTGCTAATAAGAGATGGAAAACAAAAAAAAGATATACAAGTAACCAAAGACAAGTTGCTGACCAAACATGGTCACCTGAAGCTCAAAAACAAGAATAATTTGAAAATTTGTAACAACATTTTAATGAGTGAGTTGCTTATTGATAATAAGAATAAAAAACCAACAGAGGACACTCCACCTAACAGTTCTAATAACAACAACATTGTTATTAATCACAACAAGAACATCAACTCACAAAATGTGCAAAATGATGCCAATGATGCCAATGATGCCAATGATGATGACTTTTTTGTAGATTATCAGCCTAAAGTCCGCACTAAACCTCCTGCAAAGAAACAAAAAAAAGGAAACTTTGTATGTGATCATTGCCAAAAATCATTTCTTACTAAAGCAACTCTAAGAAGACATATTTATATACATATACAATCAGAATCACACCCTTGCAAGCACTGCAACAAAGTCTTCAGCAAAAACATATACTTGTCCGCCCATATAACAAAACAACACCCAGATTGGGAACAGCATTTTATTTGCAACATGTGCGATAGAACATTTTTGTTGAAGGAAAACCTAGTGTTGCACATAGCCAGCCACACGCAACCTGTGCCAATGTTCAAATGCATTTATTGCAAGGAGAAATTTAAAAAGCAAATTGAATTAATGCAACATGAAAAAACTCACTTGGTGAGCGGAGTATATGATTGCATTATATGCGAGCAAAGCTTTGATTGCAGAAACAAATTGACATCACATTTTAAAACTCACTTAAAAGTAAAAGACTACATATGTCAACATTGCGGCAAAGAGTTTTTGAGGAATAACTCAATGAGACGTCACGTTCAAATCTCTCATGTTGGAGTAAGAATACAATGTCCAATATGTAAGAAATATCTTAGGGGGCATTTGACAGAGCATATGCGTACTCATGAGAAGAAGCGTCCTCATAAATGCCCTGACTGTGGTCTGTGTTTCACTCAATCTACACAGCTGACTGTCCATCGACGTTCTCACACTGGGGATCGTCCATACCCTTGCAGGATATGTGACCGACCATTCACTCACTCAAACGCACTCAGATTGCATATTCGAAGACATACTGGTGAAAAACCATTTGAATGTGCCATGTGTCCTTTGTCATTTTCACAATTGCCGCACATGAAGTCACACATGCGTAAAATTCACGGTAAAGAGAAACCGTACAGGTGCCAAAAATGCAAATTGTTTTTCAAACTCAAAGCAGATCTCGAAAACCATAATAAAAAGTGTAAAGATGTTGAAAAAGAGATGTCATTTGAGGAGCAAATTCAGGCTTCTGTGTCAGTTAAAGAGGAAGAGGTTGTAGAATCCCCGATGACTTTGTCGCATATGAGATATTTATTGGCTCTCCTCTTGACAATGATTGCTACAAAAGAAAAATTGAAGTACTTAGGGTTCAACAAACGTCTAATTGATGATCTGCTAGTTGAATCTCTTGAAGCAATGGGCCACACGCCCTGTAGAGATGCATCTCTGACGGCTCTTAAGCGTTTGAAGACAAATATTCAGATTCTGTTGAATGGAACAGTCCCAAAGGCACAGATGGAGAAATTTCAAAATGAAAATAAAAGCATGGAAGACATTCTGGTGTTATTGACAGATGATAAGAAATAA

Protein sequence:

>DPOGS214819-PA
MANVKIRINSSKPSSNKISGSGRIKKSSTTNNDKSSGSKIAKSSYVKEKLNDSVLIRDGKQKKDIQVTKDKLLTKHGHLKLKNKNNLKICNNILMSELLIDNKNKKPTEDTPPNSSNNNNIVINHNKNINSQNVQNDANDANDANDDDFFVDYQPKVRTKPPAKKQKKGNFVCDHCQKSFLTKATLRRHIYIHIQSESHPCKHCNKVFSKNIYLSAHITKQHPDWEQHFICNMCDRTFLLKENLVLHIASHTQPVPMFKCIYCKEKFKKQIELMQHEKTHLVSGVYDCIICEQSFDCRNKLTSHFKTHLKVKDYICQHCGKEFLRNNSMRRHVQISHVGVRIQCPICKKYLRGHLTEHMRTHEKKRPHKCPDCGLCFTQSTQLTVHRRSHTGDRPYPCRICDRPFTHSNALRLHIRRHTGEKPFECAMCPLSFSQLPHMKSHMRKIHGKEKPYRCQKCKLFFKLKADLENHNKKCKDVEKEMSFEEQIQASVSVKEEEVVESPMTLSHMRYLLALLLTMIATKEKLKYLGFNKRLIDDLLVESLEAMGHTPCRDASLTALKRLKTNIQILLNGTVPKAQMEKFQNENKSMEDILVLLTDDKK-
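
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below assumes that the transcript is coding sequence only (it starts with ATG and ends with a TAA stop codon, shown as the trailing "-" in the protein) and that the genomic coordinates are 1-based and inclusive. Both are assumptions, not statements from the record, and the function names are illustrative.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1809
PROTEIN_AA = 602
GENOMIC_START, GENOMIC_END = 713785, 716338


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon.

    Assumes the sequence has no UTRs and ends in a stop codon.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


print(expected_protein_length(TRANSCRIPT_BP))      # 602, matches the protein length
print(genomic_span(GENOMIC_START, GENOMIC_END))    # 2554
```

Under these assumptions the genomic span is 745 bp longer than the transcript, which would be consistent with one or more introns, although the record does not state the exon structure.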