Monarch geneset OGS2.0

DPOGS200439
TranscriptDPOGS200439-TA2139 bp
ProteinDPOGS200439-PA712 aa
Genomic positionDPSCF300236 + 404693-412012
RNAseq coverage286x (Rank: top 38%)
Annotation
HeliconiusHMEL0045461e-3630.94% 
BombyxBGIBMGA008897-TA0.052.46% 
DrosophilaOpbp-PA7e-7638.18% 
EBI UniRef50UniRef50_D6WY011e-10741.77%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WY01_TRICA
NCBI RefSeqXP_968302.22e-10841.77%PREDICTED: similar to Optix-binding protein CG30443-PA [Tribolium castaneum]
NCBI nr blastpgi|3838547766e-11838.13%PREDICTED: zinc finger protein 221-like [Megachile rotundata]
NCBI nr blastxgi|3838547761e-12435.70%PREDICTED: zinc finger protein 221-like [Megachile rotundata]
Group
Gene OntologyGO:00036764e-12nucleic acid binding
KEGG pathway 
InterPro domain[523-546] IPR0130874e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL17396 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200439-TA
ATGGGAGACTTCGTTTTTTACTGTTCCCTTTGCTGCAATGCCACATTTGATTCTAAGAAATCTCTTATTGATCATTTGTCCCAAATATCTTTAAATTTAGGTTGTCCGATATGCGTTAAAAAGTTCTCGTCGATAGAAGATCTGGTAGAACACTTAAAAGATGACGATTGTGACGTTAAATCCGTTAACTTGGAAATCAATGATCATGATGAGACCAATGGTGTTAGTAAGGAGGTCCAAACAGACGCGTCTTCCTATTCAGAGCCAAATACTATAAATGATGTTGATGATGATGATAAAATGTATGTCGAAGTACTCAGCAAGCAAATGATGAAACCATGCCTACAAACACAGGAGTTGAAATTAGTTAAGGAGCATGGCGAGAGTCGTTATGTGATTGTCACCGATGATGATTTGTCATTAAATTCTGGAATTACAATAGTCACTAAACAAAATAAAGATGGAACCATATCTTTGACAACACAAGAATCCCTCAATGTGTCAAGCGCAGAGACAGGAATAGAAATAGAGGAGACAGATGAAAAGAATGAAACCGTAAAGAATGAAACAGAAAGTTCCCAAGAGGAGAGATACAGCTGTAACACCTGTGATATGACATTCACATCGGTTCTAGAACACATACAGAACTACCACAACGATCAGGATGTTGTGGTCGAGGAGCCGTTGGATGAAAATTCACAAGAAATTAACCTGGAATTTGATCAAACGGAAAATGAAGATGTGATAGAGAAACAGACGTCACGCAGAATGATAACGGACACGGGAGATATCATAGAGACCCCGATTACTGAAGAAATTGACACTGAAATGGAAGAAATGAAACAAGCAAACAATAGACGATATAAGAAAATAGATCAATTCTGTAACAGCATTGTTAAAGACATTAAAAACGATGATACCAAGAACGGTCCGTACCACAAGGTTGTTATAAAGGAGGTGCAGACCACGGCCGGCAACAAAGTCAAAGTATACAATTGTATGTCATGTAATGTATATGTGACGACACTATCGGAATTCAAGCAGCATCCCTGTAAGATTTTAAAATACCCGTGTTCGCAATGTCCAGTGGCATATGAGAATTCAAAGTCTCTCTGCGCCCATATGAAGAGGAATGTTTCAGAAATAACAGCGGCCATAACGAAACACGAGTGCGAAGTCTGCAACACGATATTTCCATCGAGCAAGTCGTTGAAACTCCATAAAAGAATGCACGATCCCGTCAAGTCACGTCCATTGGGTCCCCAAGAGTCGGAAGACGGTACGACTGATAACGTAGAAAAATTCCTGTGCAACGTCTGCAATAAATTGATACCCGAGAACTATAGGACCATACATCAGAACTCGCACAAGAGCAGCAACAAGATGAACTGCGACATATGCAACAAGAAGTTCCTGTCCAAAGAAAACTTGGAGATGCACATGGGCGTGCACAACCTGGATAAGATAACCATCGGCAAACAGGATAAGGCGTTGCCGTACGAGTGTTTGTATTGTAACAGAAAATTCGGAAGACCGCACGAGAAGGTCAAACACGAGAGAATACACACGGGTGAGAAGCCCCACTCGTGTGATATCTGCGGCAAGTCTTTCCGAGTCTCGTACTGTCTGACGCTACACATGCGCACGCACACGGGCGCGCGGCCGTACGCGTGTCCGCACTGTAACAAGAGGTTCAAAGCCCACAGCGTGTACAACCACCACCTGCTGACGCACTCCGACGTGCGCGCGTACAAGTGTCCGTACTGTCCGAAGGCCTTCAAGACGTCGGTGCAGTTGGCGGGGCACAAGAACTCGCACACCAAGCCCTTTTCATGCCAACAGTGCAATAGGCCTTTCGCTTCTTTATACGCTGTCCGAGTCCACATGGAGATACACAGCCGTCAGAATAGTCTCAAATTTTCATGTACCATGTGCGGCGCGAGCTACGCTAGGGCCTTCGCCCTCAAGGACCACGTGAGGCAGGCGCATAAAGATGTGGCGCCGATGGTTAAAGATGAGGAGTGGATGAGAACAGAGAACAGCGCCAACGTGGAAACAGACGACATACAGCTGAACAAGGACTTCCCACAGGATATAAACGAACTGGGCACCTACGAGACGGAGTTGATAATACCTTGA

Protein sequence:

>DPOGS200439-PA
MGDFVFYCSLCCNATFDSKKSLIDHLSQISLNLGCPICVKKFSSIEDLVEHLKDDDCDVKSVNLEINDHDETNGVSKEVQTDASSYSEPNTINDVDDDDKMYVEVLSKQMMKPCLQTQELKLVKEHGESRYVIVTDDDLSLNSGITIVTKQNKDGTISLTTQESLNVSSAETGIEIEETDEKNETVKNETESSQEERYSCNTCDMTFTSVLEHIQNYHNDQDVVVEEPLDENSQEINLEFDQTENEDVIEKQTSRRMITDTGDIIETPITEEIDTEMEEMKQANNRRYKKIDQFCNSIVKDIKNDDTKNGPYHKVVIKEVQTTAGNKVKVYNCMSCNVYVTTLSEFKQHPCKILKYPCSQCPVAYENSKSLCAHMKRNVSEITAAITKHECEVCNTIFPSSKSLKLHKRMHDPVKSRPLGPQESEDGTTDNVEKFLCNVCNKLIPENYRTIHQNSHKSSNKMNCDICNKKFLSKENLEMHMGVHNLDKITIGKQDKALPYECLYCNRKFGRPHEKVKHERIHTGEKPHSCDICGKSFRVSYCLTLHMRTHTGARPYACPHCNKRFKAHSVYNHHLLTHSDVRAYKCPYCPKAFKTSVQLAGHKNSHTKPFSCQQCNRPFASLYAVRVHMEIHSRQNSLKFSCTMCGASYARAFALKDHVRQAHKDVAPMVKDEEWMRTENSANVETDDIQLNKDFPQDINELGTYETELIIP-