Monarch geneset OGS2.0

DPOGS205369
TranscriptDPOGS205369-TA5835 bp
ProteinDPOGS205369-PA1944 aa
Genomic positionDPSCF300373 - 105318-118663
RNAseq coverage71x (Rank: top 66%)
Annotation
HeliconiusHMEL0134450.073.79% 
BombyxBGIBMGA008762-TA0.063.38% 
DrosophilaCG12299-PA6e-2027.97% 
EBI UniRef50UniRef50_D6WI021e-3820.53%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WI02_TRICA
NCBI RefSeqXP_002429661.13e-4121.24%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3266671107e-5222.57%PREDICTED: zinc finger protein 729-like, partial [Danio rerio]
NCBI nr blastxgi|3266674652e-7120.61%PREDICTED: hypothetical protein LOC571721 [Danio rerio]
Group
Gene OntologyGO:00036763e-07nucleic acid binding
GO:00082702.5e-05zinc ion binding
GO:00056222.5e-05intracellular
KEGG pathway 
InterPro domain[1909-1943] IPR0130873e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL18340 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205369-TA
ATGAAAGGCATTTTGTCTGTTTTAGACACGTTATCCCAGGCCAGGCGGAATGCCGAAGTAGTGGTGAAATACTCGACAGCCTACCCGTTCAGGCTGCCAGAGACCTCGCTCATGTGCGTGTACTGCTGCGAAACGTACTCCGACACAGCACAGTTCAGATCGCACATGCACGAGGAACATCAGACTTTTAAAGTCGAAACAGCGTTCGCACATTGCAATGAGGGATACTTGAAAGCTGATTGCACAGACCTCAAGTGCAGAATCTGTTGGGAACCATTTAAAAAGTTGGATGATGTCGCCAAACATATTAACGATGTCCATAATATTAAAATACTATTTGAATTTCACATTGGCATACAACCTTTTAAGTTCGATGATGAAAAGCTTTTATGCGGTATATGTGATAGGAATTTCCCTTGTCTCCGGCAGCTTAGTCGTCACATGACTTCCCATTACCAGAACTATACCTGCGAAGAGTGCGGGAAGTCTTATACTACAAACAGCTCTCTACAGCAGCATATCAGATTTTCGCATATATCAAACAAAAGGATTTGCAGAAGGTGTAAGAAGACATTTAATTCGTTGCTAGATAAAAGGGAACATGTTAAATCTTCATCAAAATGTTGGGCTTACCAGTGCGCCGTTTGTGGTGTACGATTTATGACGTGGACTCTGAAAGAACAGCATTTAGAAAATGTTCATGGACAAGCTAAAAAGACACATAAGTGTCCCGAATGCGCAACAATATTCTTGACCAGGAACGCCTACAGAAATCATTTTGCGACAATGCACGCGGGAATTAATTTCGTTTGCTCTTATTGCGGAAACACATTGGATATTCGTGGACGTAGGGAGAATGAGAGTGAGAAAAAGAAAATAGTACTCGTTTATCAAACTCCTCAGCGTCGTAACGCAGAATTAATTTTAAGGCATTCGACCGCTTATCCGTTTAAAACGCGATTTAGTCAAATTTTATGCGCCTATTGTCATGACGAATATAACACATTATCATCTCTCAGATATCATATGAAAAACGAACACAAAACATCCGATTTTAAAAACGTTTTCTATAGAGCTAAAGATAACTTAATAAAAGTGGATATTACCAATTTGACATGCAATATTTGTAATGAAAGCATACAAGATATTGATACTTTAATGGGTCATTTGTCTAGAGAACACAATAAGCCAGTGAAATTCAACTCTCGATTTGGTGTTCTGCCTTACAAACAAAATGCGTCCAATCATTGGGTTTGCGTTTACTGTCAAAGGACTTATATAGAATTTGTTCAATTCAAACGTCACATAGCTTCACATTTCATGAATTTTAGCTGCGATAAGTGCGGAACGACTTTTATATCGGATCATGCTCTACGAGATCACAGGCGACAGGTTAAATGCTTCAGAACAGCGTATAAACCTCGCAATGGTAAAGTGATGAGGCCAAGAAGTAATGCAGAGATCATATTGCAATGTTCAACGGCCTGTCCCTTCAGGACATGGAAAAGTAACTTTAACTGCGTGTTCTGTAGAGTACAATCCAATGATCCCAGTATACTTAGAAGCCACGTAGCGACAAGACATGAAAACTATGACGTACAGTCAGCGTTCTATAAAAAACTAGGCAAGGAATTCCTTAAGATTGATATAACAGATCTGCAGTGTAAGTTATGCTTTATGCCGATAACTAATTTTGAAAATTTGACGTATCATTTGATGAATGATCACCAACAGCCTATAAACTCTGACGCCCAATTAGGGGTGTTGCCATTCCGACTTAACGATGGTTCAGTTTGGAAGTGCACAATGTGTCCAAATGTTTTTAAAGATTTCGTCTCTCTCAATAAGCACACGTCAGAACATTTCCAAAATTACGTTTGCGATACCTGCGGTGAAGGTTTTATTACTGAATCCGCGATGATTGCACACACTAAAGTTCCTCACGAAAATAAATATAGCTGTAGCCGTTGTGTAGCGACTTTTTGCGCGTTAGAGGAAAGGAATCCCTATATGTGTGTGTACTGTAAAGATAAACCACGCTTCGCCAATTGGGAACTGCGGAAGAAACACTTGATGGAAGTTCACAATTATAAGACAGGACCCACGAAGGATATTGACGGACTCAAAGAGAAAGAGAGAAAACAAAAAAAGGGGTTATTCGAAAGATCCGTTAAACATAACCCCCAGCGTCGAAACGCTGTTTTGGTTCTTAGACATTCCACGGCTATTCCTTTTAAAACACGATTTAATAGAATACTCTGCTCTTATTGCCATGATGAATTCCAGCCCATGGAGGCTCTTAGAATACATATTAAGGAGAAACACATTAACGCTGATTTTAACAGTGCTTTTTATAAGGTGGTCGATGATCTCAAAATTGATATAAGCCATTTCAAATGTAACATATGCTCGCAAGACATTGAGAACGTTGACACATTTATGAATCATTTATCAGGGGATCATGGGAAACCAGTTAATTTTGATGTACCTTTCGGTGTGTTACCGTATAGACAGAATGAGACTGGTGCTTGGCTGTGCCTTCACTGTGATAAAATATATCCGGAATTCTCCCAAATAAACAGCCATTTACGAACCCACGCCAAAATTTCTACTTGCGATAAGTGCGGGGCGACTTTCCTCTCGGAGCACGGTCTAAAACAACACGAGCGTAATTTCCAATGCTATAAAGCAACATACAAACCTCGCTTCGGTAAAGCCTTGAAGCATAAATACAATACTGAAATTATTTTACAATGTTCAACTGCATGTCCTTTCAGAACGTGGGGACAAAATTTTAACTGTGTCTTTTGCAGAGTGCAATCAAATGATCCCAATGGGCTACGAGCTCATATGGCATCCAGACATGCCAACTTTGACATACAACTAGTATTTAGCAGGAAATTACGAAAGGAATTTTTAAAAGTCGATATAACAGATTTGCAATGTAAACTTTGCTTCATGCACATTGACACTTTAGATGATTTATTGACACATCTCAAAAATGATCACAAACAACCGGTGAACATAGACGTCCAACCGGGGGTCTTGCCGTTCAAGTTGAACGACGGCTCTTGTTGGAAATGTGCTATATGCAAAATACAGTTCTCCGATTTCATATCGTTAAAAAAACACACAGCGGAACACTATCAGAACTACGTTTGCGACACATGTGGGGAGGGTTTCATAACAGAAGTCGCATTGCGGGCGCACACGAAAATACCGCATGATAATAAATACACCTGCAGTAGATGCGTTGCGACGTTCTCCACGTTAGAAGAGAGAAGTGTTCACATAAAAACACAACACACGAACCTACCGTACATGTGTACTTATTGCAAGGACAAACCGCGGTTCGCCACCTGGGAGCTCAGGAAACGGCATTTATATGAGATACACAATTATAAATCAGGGGCGGAAATGTACGAGTGTACCACCTGTCACATGATGTTCAAGACGCGATCTCAGAAATACCACCACAACGTCAAAGTTCATCGGACAAAAAAGGAAATAGATTTCGGTTTCTCTTGCGGCCACTGCGCTAGAGGAAGAGGCAGAGAAACAGAAAATATCACTGATATAAACATACTCCAACAGTCACATCTCGATACAGATAATGACATTAAGGTTGGAGAGAAACGAAAGTATCAAAAAAGTGCTAGATCCCAAGCGAGATTCATGACAAAGAAGAACGCAAGCTCTATTCTCGAATGCTGGTCTGGAATACCATTCAGATGGAAAAAAAATAGATTTAAATGCGCCTATTGTGAAGAAAATTTTAATGAGTGTTCGGATTTGAGGGAGCATGTTAGATTATGTGCTACCCAGTACAATGTAGGCAGTATATTCAGTAAATTTAAAGAAATGACTCTCATAAATATGGATGTCAGTGAGGCCGCTTGTCGAATATGTTCTGAGCCGTTCAGAGAACTTGATGGTATGCGAGAGCACGTCATTCGACACGGCTACGAATTAGATGTTTCGCATCCGGACGGTGTTATACCGTTTTGTCTCACGAAAGAATCCTGGTCGTGTGTCTTATGTCGCGAGACATTCAATAACTTTCTGAAACTTTACGAGCATATGAACACGCATTATCAGTACCACATATGTTCTATATGCGGAAAAGGTTACATGACTGGACCGAGGCTAAGGAAACATCTAGAATTACACATAACGGGAACATTTCCTTGCGATAAATGCAAGAAAGTTTTCACAAAACGCACAGGGAGAGACAATCACAAAGCCTACGCCCACGCCAAAGGTCCGCGTTATGAATGCCCACAATGTAATATGAGATTTGAAGGTTATTATGATCGAATGAATCATTTGAAACAAGCTCACAGAGAAAAGGAAGTGAAGTACGGCTGTTCACACTGTGATCTGTCATTTAAAACGAGCGGCAAGCGAGCCATCCACGTCAAAACGGTTCACTTTCCTCGTCAGAGTAACTTTAGTTGCCCTTATTGCAAAACCCTATTCAAAACAGCCTTCGGTATGAAACGTCACATGGTAAAACATAATGGAGAAACCTGTACTGTTTGTGGTGAAAGTTTTACTAAAAGTAAAGCATTGAAGGAACACTTGGCAGGTTCCGCTAAGGCGGACTTGGCAGGTACTGTAGGCGTTAAACCGGTTAGGAAGTTAAAGGAAAATGTCTGCGCGAGACAAATGCGCCGCCGGAGACGCGCTAACAACGAACTCCCCGAAGAATCTGAGAAACGTATCGCGAAAACTATGATGCGCAGAAATGCTTTAACTATACTAGAAAGTTCTACAGCTTGGGCCTTTCGTGATGATCACGGTCAAGATGATGTAGTTGTCAGCGCTGTAATGCACAGTGACGATAGCTTGTTGGGTCAGGGAATTTTAAAGGAAGATAGTAATGTCGGCTTTGAAACTATTATTCCCTTTAGAGAACATGATTACGATTTTATAACTGGAATAGACGGTGACAGTTTTATGATTAAAGACAAACATTTATCTGAGGAGGGCGATTGTGCTCTTCATACGCCAATATCTTGTAGTGAACAGGAATATGATACAAATTCTGGTATAAATTTTGAAAGCATTATCTCGAATGATATACTCATGGCTGGAGAGAGTAATTCATGCTCTCAAGCTCTCATATCATATAATGAGCCTGATTGCGATGTAACTTCTGATATAAATAAGGAGGAGAGTGTTGATTGGCCCGGTGTAGACTTGGATCTAGACGACATATTCATATTTGAAATCAGCGAAAGGTCGTTTTGTACGACATGCGACGAAGAATTTATAAATCTCGACCTATTAAACGAACACATGATTATACACGATGACAAATACATTTGCGAACAGTGTGGAACGGATTTTAAGAAATTAGATTTGTTAGAAAACCACTCTTTGACGCATCTGTCTAAAACGTTTCTGTGTCTGGTTTGCTCTTTAGCTTTACCGTCAGAAGATGAAAGGAATAAACATTTTAGAGCTAACCACAAAAATATAGTTATACACAAATGCCCGTTCTGTCCGGAAATATTCCGGAATTATATTTATAGAGACAAACATGCATTACGGAAACACGGTGTCAAGTATAAAGGGTTCCCATGCTTTCATTGTTCCAAATCATACGCTTCGAATGCAAAATTGAAGGCTCACACACATTCGATTCACAGCGAAGAAAAACGTTTCTCATGTAAAGTCTGCAACCAGAAGTTTCATTTCCAATATAATTTAAACGACCACATGATCAAGCATTTGGGAGCTAGAAGCTATCAGTGTTTTGTGTGCAAGAAATATTTTGCAAGGAAACGCGCCGTCGCAAGACATATGAAGAAACATGATAATGAAACTTAG

Protein sequence:

>DPOGS205369-PA
MKGILSVLDTLSQARRNAEVVVKYSTAYPFRLPETSLMCVYCCETYSDTAQFRSHMHEEHQTFKVETAFAHCNEGYLKADCTDLKCRICWEPFKKLDDVAKHINDVHNIKILFEFHIGIQPFKFDDEKLLCGICDRNFPCLRQLSRHMTSHYQNYTCEECGKSYTTNSSLQQHIRFSHISNKRICRRCKKTFNSLLDKREHVKSSSKCWAYQCAVCGVRFMTWTLKEQHLENVHGQAKKTHKCPECATIFLTRNAYRNHFATMHAGINFVCSYCGNTLDIRGRRENESEKKKIVLVYQTPQRRNAELILRHSTAYPFKTRFSQILCAYCHDEYNTLSSLRYHMKNEHKTSDFKNVFYRAKDNLIKVDITNLTCNICNESIQDIDTLMGHLSREHNKPVKFNSRFGVLPYKQNASNHWVCVYCQRTYIEFVQFKRHIASHFMNFSCDKCGTTFISDHALRDHRRQVKCFRTAYKPRNGKVMRPRSNAEIILQCSTACPFRTWKSNFNCVFCRVQSNDPSILRSHVATRHENYDVQSAFYKKLGKEFLKIDITDLQCKLCFMPITNFENLTYHLMNDHQQPINSDAQLGVLPFRLNDGSVWKCTMCPNVFKDFVSLNKHTSEHFQNYVCDTCGEGFITESAMIAHTKVPHENKYSCSRCVATFCALEERNPYMCVYCKDKPRFANWELRKKHLMEVHNYKTGPTKDIDGLKEKERKQKKGLFERSVKHNPQRRNAVLVLRHSTAIPFKTRFNRILCSYCHDEFQPMEALRIHIKEKHINADFNSAFYKVVDDLKIDISHFKCNICSQDIENVDTFMNHLSGDHGKPVNFDVPFGVLPYRQNETGAWLCLHCDKIYPEFSQINSHLRTHAKISTCDKCGATFLSEHGLKQHERNFQCYKATYKPRFGKALKHKYNTEIILQCSTACPFRTWGQNFNCVFCRVQSNDPNGLRAHMASRHANFDIQLVFSRKLRKEFLKVDITDLQCKLCFMHIDTLDDLLTHLKNDHKQPVNIDVQPGVLPFKLNDGSCWKCAICKIQFSDFISLKKHTAEHYQNYVCDTCGEGFITEVALRAHTKIPHDNKYTCSRCVATFSTLEERSVHIKTQHTNLPYMCTYCKDKPRFATWELRKRHLYEIHNYKSGAEMYECTTCHMMFKTRSQKYHHNVKVHRTKKEIDFGFSCGHCARGRGRETENITDINILQQSHLDTDNDIKVGEKRKYQKSARSQARFMTKKNASSILECWSGIPFRWKKNRFKCAYCEENFNECSDLREHVRLCATQYNVGSIFSKFKEMTLINMDVSEAACRICSEPFRELDGMREHVIRHGYELDVSHPDGVIPFCLTKESWSCVLCRETFNNFLKLYEHMNTHYQYHICSICGKGYMTGPRLRKHLELHITGTFPCDKCKKVFTKRTGRDNHKAYAHAKGPRYECPQCNMRFEGYYDRMNHLKQAHREKEVKYGCSHCDLSFKTSGKRAIHVKTVHFPRQSNFSCPYCKTLFKTAFGMKRHMVKHNGETCTVCGESFTKSKALKEHLAGSAKADLAGTVGVKPVRKLKENVCARQMRRRRRANNELPEESEKRIAKTMMRRNALTILESSTAWAFRDDHGQDDVVVSAVMHSDDSLLGQGILKEDSNVGFETIIPFREHDYDFITGIDGDSFMIKDKHLSEEGDCALHTPISCSEQEYDTNSGINFESIISNDILMAGESNSCSQALISYNEPDCDVTSDINKEESVDWPGVDLDLDDIFIFEISERSFCTTCDEEFINLDLLNEHMIIHDDKYICEQCGTDFKKLDLLENHSLTHLSKTFLCLVCSLALPSEDERNKHFRANHKNIVIHKCPFCPEIFRNYIYRDKHALRKHGVKYKGFPCFHCSKSYASNAKLKAHTHSIHSEEKRFSCKVCNQKFHFQYNLNDHMIKHLGARSYQCFVCKKYFARKRAVARHMKKHDNET-