Monarch geneset OGS2.0

DPOGS212559
TranscriptDPOGS212559-TA2853 bp
ProteinDPOGS212559-PA950 aa
Genomic positionDPSCF300075 - 377598-380728
RNAseq coverage141x (Rank: top 55%)
Annotation
HeliconiusHMEL0077740.088.64% 
BombyxBGIBMGA012283-TA0.082.18% 
Drosophilavfl-PA9e-3570.83% 
EBI UniRef50UniRef50_D6WJS78e-4136.06%Vielfaltig n=1 Tax=Tribolium castaneum RepID=D6WJS7_TRICA
NCBI RefSeqXP_001812268.12e-4136.06%PREDICTED: similar to CG12701 CG12701-PA [Tribolium castaneum]
NCBI nr blastpgi|1892379793e-4036.06%PREDICTED: similar to CG12701 CG12701-PA [Tribolium castaneum]
NCBI nr blastxgi|2420161118e-10431.28%zinc finger protein, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00036762.7e-08nucleic acid binding
KEGG pathway 
InterPro domain[906-934] IPR0130872.7e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL20551 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212559-TA
ATGATTACCAAACCAAGTCCAGAGTTTCAACAGCGCGCGACGCCTGAAACAACAGTGCAGTTTCCGCACCCTGCAACACCACAGAGCTATCACAGCGCACCATCACCTTACCAAAATCCAGATCAAACAAACTTCTCACCGGGAGCTCAATTTGGTAATAATTTTACTCATTCCAATTTTTCTCAAAATCAACACACTGAGCAAATAAACTGGGAACAGTCGCAATATAATCAAGAATATCATAAGCCGAATCGTTTCCATCCATATAATATGCAAGATCGTGTATCACAAGTCTCATCTTCGAGCCCATTGTATGGCCAGCCGCTAAATCAACCAACGCCATCACCTTCGCCGAATCAATGCGACAAGTGTGGCTACGTTTGTGATTCCGCTGTACAGTTAAATGAGCACTGCAATTCGGCTCACGCAGGTACGAGCGCTGTACCTGCTACAGGAAACATTCCCTTTCAACAGTTTCCTAGCAAATCATATAACAATTCTAGCTATCAAAATGATAGTATAAAAGTTAAAGAAGAACATGAGGAATCATCGGATATTCTAGATTTAGATTCCCAAAAGGTAGTTTACCAGGGAAATGAAGGAGAACAACAAAATCCACCTTACGAAGAAACATCGTCTCAGGTACGTGAAGTAAACACAAGGACTGTACCTATGATGCCATGGGAAACTCAAAAGATATATACAAATCCTCAAATGAATGGCGACGTATCCCTGTTCAAAGAACAAAAAATGTTTGCTGATCAAAAGGCATATGCAACTGAAGGAAAGATGTTTCATCCAGATCAAAAATTTTCCTATTCACAAGATAAATTTTTAGTGCATCATGAACAAAAACCTTTTATGCACGTGGAACAAAAAATTTATTCTGGTGTTCAAATGCAACCGTTAACAGATTACTCAGGGAATGTGGCTTCAACTAACTCAGATATGAAGCCACCATACCGACCATATGATTCCCCTAATGCACCGCAAATAACAAGCACACAACCTGCCAATCCTACATCGTCATCATTGCCCTCTATTGGGGGTAAAGGAGCTAATTGGAAGTCGAATGAAGCAAGAAGACCGAAGACCTATAACTGTACTGCTTGTAACAAGTGGTTTACTAGCTCGGGCCACTTAAAAAGGCACTATAATACTACACTACATAAAAACGCTGTAAGGTCATCTGGGCAACCTGATCCAGCAACGATGCCCATTTCAAGCCATCATCATCCAAGTCGTGATTCCTTACAAAATCGAGCTCAACAACAGAATCCGGATTCCAATACGCAAAGTCCAGTCCCCTCCGAAGACAGCAGGAACGTGGATGACGGTGCTTTGCAGTCGCCATATGCACCGCAAAACTTTGAACGCGCTCACCGTGTTGCAACTATGCAGTCCAAATCACCATACACGCATCTCCAACAGGGTAATTTAGATAATAATTTCAGTAACATTCCTCTAACTAATCACCCCTTACAGCATCAAGTGGGTTCACACCCGATTAATATAAATATAAGTAGCCAACCACCAGGCATTGGGAATCCAAACGATAGTCCTCAAGGTCCAAAAAGTACTTCAATATCAACTGGTACGAATCCCCCAAACGGGGAAGCAGGTCCCTCTGTTTCTCAAAATCACCATATGAGGGGCCTGCTATCAGTGTCAACCAGCAATATTTCAACACCAGTACTAACCCAGAGTACACCAGCGCTTACGGCTCACACTCTACCGCCGTTCAGCCATTTGGGTGTCAACCCGTACAGCCCGAGGTCTACGGATCCTTTGGGGCCTTCGGTACCGGACCCCACGCACACCCCATTATATTTGGGTCAGAATTTTCAACAGACTATAGCACCGAGTTACCCAAACGGGATGGCCCCCCACGTTATGGATATGGCTATCAACAATCTGCCTATAGCCAATCCGGCTACTTTTGGTGAAGCCGCACCAGAAGAGGTCGATGTTATGGAACAAGAAACGTCTAGTCAGCCTGCCAATGGACGCTTGCCTAGTTTCTCGCAGCTACAAATGCAGAGCTTCAGCGTTTATGTTTCAAACTATATTACATCACCTAACGTGGGGGGTCAAGTTGTTGCCGACGATTCTACCGCCGGTTACATTATTGTAGATCCAGTTAACTCTCCTATACAATACAATGGCATGGATATAAATGGACAAAGTATAGATTATGATTATAATTATTCACCTAAAGCTAACCCAATGAAGGAAAATGTAGTTAGCTACAATACGGATACAATTAAGGTTTATCCGTACGGCTACGCAGTTGGAGGTAAAACTATAAAAAGAGAAGATGACGTATTAAGAACGGATTCTAGTGAACTACAAATTTTAAAAATCGAAGACATAATGGATTATGCTAATAAAGAAAATTATGGAACACAAATGAAGTCTCCAGCAAGTCCAGAGAGCGCTAAAGCTGAGAATGAAAGTCACGGGTCCCCTTCAATATCATCAACAGTTTCGAGTACGCCACTAACAGAGAGAAATCATTTGAAAAAGTCAACACCAACAACGGTACACAAATGTTTTGAATGCGACAAATTATTCAACAAGGTTTGTTATCTTACTCAACATAATAAAACTTTCCATTCGGGAGCTAAACCATTTAAATGTGATAGATGTGGTAAACGCTTTTCGGACGGTGTTTCATATGAGGGTCATTACTTAAAACATGCAGACAATAAACCCTTCAAATGTAATGAATGTCCAAAGTCGTTTAATCATAAAACTGATTTGCGTCGTCACATGTGTTTACACTCTGGGTGCAAGCCGTTTGCGTGCGATCATTGTGCAAAGCGGCAGAACAACTCCGACGCATGCATTTAG

Protein sequence:

>DPOGS212559-PA
MITKPSPEFQQRATPETTVQFPHPATPQSYHSAPSPYQNPDQTNFSPGAQFGNNFTHSNFSQNQHTEQINWEQSQYNQEYHKPNRFHPYNMQDRVSQVSSSSPLYGQPLNQPTPSPSPNQCDKCGYVCDSAVQLNEHCNSAHAGTSAVPATGNIPFQQFPSKSYNNSSYQNDSIKVKEEHEESSDILDLDSQKVVYQGNEGEQQNPPYEETSSQVREVNTRTVPMMPWETQKIYTNPQMNGDVSLFKEQKMFADQKAYATEGKMFHPDQKFSYSQDKFLVHHEQKPFMHVEQKIYSGVQMQPLTDYSGNVASTNSDMKPPYRPYDSPNAPQITSTQPANPTSSSLPSIGGKGANWKSNEARRPKTYNCTACNKWFTSSGHLKRHYNTTLHKNAVRSSGQPDPATMPISSHHHPSRDSLQNRAQQQNPDSNTQSPVPSEDSRNVDDGALQSPYAPQNFERAHRVATMQSKSPYTHLQQGNLDNNFSNIPLTNHPLQHQVGSHPININISSQPPGIGNPNDSPQGPKSTSISTGTNPPNGEAGPSVSQNHHMRGLLSVSTSNISTPVLTQSTPALTAHTLPPFSHLGVNPYSPRSTDPLGPSVPDPTHTPLYLGQNFQQTIAPSYPNGMAPHVMDMAINNLPIANPATFGEAAPEEVDVMEQETSSQPANGRLPSFSQLQMQSFSVYVSNYITSPNVGGQVVADDSTAGYIIVDPVNSPIQYNGMDINGQSIDYDYNYSPKANPMKENVVSYNTDTIKVYPYGYAVGGKTIKREDDVLRTDSSELQILKIEDIMDYANKENYGTQMKSPASPESAKAENESHGSPSISSTVSSTPLTERNHLKKSTPTTVHKCFECDKLFNKVCYLTQHNKTFHSGAKPFKCDRCGKRFSDGVSYEGHYLKHADNKPFKCNECPKSFNHKTDLRRHMCLHSGCKPFACDHCAKRQNNSDACI-