Monarch geneset OGS2.0

DPOGS203342
TranscriptDPOGS203342-TA2112 bp
ProteinDPOGS203342-PA703 aa
Genomic positionDPSCF300003 - 127073-135076
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0225547e-8644.73% 
BombyxBGIBMGA008140-TA6e-1930.41% 
Drosophilaerm-PA1e-1732.93% 
EBI UniRef50UniRef50_UPI0001760B422e-2137.68%UPI0001760B42 related cluster n=1 Tax=unknown RepID=UPI0001760B42
NCBI RefSeqXP_002429661.12e-2226.77%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420183924e-2126.77%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420183926e-3225.43%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00036761.8e-09nucleic acid binding
KEGG pathway 
InterPro domain[87-124] IPR0130871.8e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL23309 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203342-TA
ATGCTAGCACATACTACAGGACAGGTGAAGCCGAGAGACATCCATTGTCTCGCATTTGAATCTTGGCACTATAAAACCAAACGCGTCCTCAAGCTGCACATATCCGAGGTTCACCTGGGCGTGAAACTAAACAACACGCCTTGTCCGGATTGCGGGAAAGAGTTTAAGTCTCTGGAGCGAAGGAATACTCACATTAAGATCGTACACGACCGCGTTTATTCCTGCACATGCGACATCTGCGGTCTCATCATCAGCAACAAGTACATGATGGACACTCATCTCACCACGCACTCGGACTTGAAGCCGTTTTTATGCAGTTTCGGGAACTGCGAGAAGCGTTTTAAGGACAAGGGCACTCTTAAGAAGCACACAATAATCCATTACCCGGACCAACATCACGCGTGTCCGGCATGCGGTAAACTATTCGCCAGGATAAGTAGACTGAAGAAGCACAGCTTACAGCACAAGGAAAAGACGAAGTGTGTCTTCTGCGATCACTGTGGGAAGGGGTTTTATAATAAAAATTACTTGGCGTCGCACATCACCAGGAAGCACGTGTTGAGAGAGCGATTCGTTTGCGACCTCTGCGATCTCATCACATACAACAAGCCCAGCATAGTGATGCACCTCAAGTACGGACACGTCAACGAGCGAGATCGTAAATGCAGAATATGCAAGAAGATATTCAAGGAACACAAGTATCTGAAGCAGCACTACTGGGTCACTCATTGCATCAAGTACAAAGTGATGCAGAGACAAACAAAAAAACCGATTCAGATAAAACGGGAAGTTGTTGACTACAAGCAGATAGAAATATTACACGAGGTCAAGGTCGAACGCGTCGACGTTACTCGGATCAAATATGTTAAAATGAATGGCGAAGTCAACGAGCGAGATACAAAGGACGAGGGATCGCGTAACGAAGCGGAGGTTAACGATTTGAATGATGCTGAAACACCAGCTGTCATAGATGATGTAGATGGAGCGGTTGTAGATGAGATAGATGAAGACGATACAGATAAAGAACGAGATGAAAATATGAAAAACAATAAATTCAAATTAAATTCACATCAGTGCTACGTCTGTTTGAAGCTTTACGGAACTAGGGAGGAGCTTTTGGAGCACTGTCAGGAGCATTTCGATGTTTGCAACTCAACAATACTTAAAAAATGTCCTCTGTGCGACTACGTCACCAAGTTAAACATATCGCGGCATATAAGAAACGTTCATAACATAAAAAACAGAATATGTACAAGAATTAACGAGAATAAGAACGAGAAAGATGGCACTGGGTACTATTACAGCATAGAGGACAGAAGGAACATCATTGAAATTATACCCAGCGTTAAGGAGCTGAACAAACGAGCTTGCATCAGATTGGACAAAAGAAGAAAGGACAACGAACGCAGAGGAATACAGAAGACTAGACTAGTCAAAAAGAACGGCGAGTGGATTGTTGAGAAACAGGATATAAATGTCAGTGATTACATTTTACCGGAAATCGGTAATGTTATGAATGTTGCCAGCAACGATTACGTCAGCAGGTTGAAAGTTTTGGGGCTCATAGCGAAAAACGAGGGTAGGAGCATAATGTATCCGTGCGATAAATGCGAGAAGATATGCCAAACTCTCAGCGCATTGAAACTGCATTATAGGAAACACGAGGATAATCCCAAGCCGTTCAAACCGAAGGTATGGAAACATAAAACCGGTTTATCGCGAGAACCGGTCACAAATGAGCCGGTCCCATCAGAAAATCGGTATCAAAAACCAAAACCGATCGTCAGCAAGCACAGATGCGATCCGAAGCTCGAGGAGTTTTATGCGAACAACATAAAAGGTGGTGACATAGAGTTCTGGCATTTCTTGAAGATATTCAATAAGATGTCCAAAGAAAACGTCAACGATTTCAAAGATCTAGAGAAACGTACGGATTTCGGTATACATGTTAAAGGAAAAGCCATCAAAGAGACGTCCGCGAGACGCGAAAAAACACACAGAGCGTTCACGCGGACAATAATGTTGACCAGGAAAGACTATAACCAACGAAAAGATACGATAGCAAAAATGAGGCAAAACATACGTCGCTATCATGAGGGAAAAAAAATCTAG

Protein sequence:

>DPOGS203342-PA
MLAHTTGQVKPRDIHCLAFESWHYKTKRVLKLHISEVHLGVKLNNTPCPDCGKEFKSLERRNTHIKIVHDRVYSCTCDICGLIISNKYMMDTHLTTHSDLKPFLCSFGNCEKRFKDKGTLKKHTIIHYPDQHHACPACGKLFARISRLKKHSLQHKEKTKCVFCDHCGKGFYNKNYLASHITRKHVLRERFVCDLCDLITYNKPSIVMHLKYGHVNERDRKCRICKKIFKEHKYLKQHYWVTHCIKYKVMQRQTKKPIQIKREVVDYKQIEILHEVKVERVDVTRIKYVKMNGEVNERDTKDEGSRNEAEVNDLNDAETPAVIDDVDGAVVDEIDEDDTDKERDENMKNNKFKLNSHQCYVCLKLYGTREELLEHCQEHFDVCNSTILKKCPLCDYVTKLNISRHIRNVHNIKNRICTRINENKNEKDGTGYYYSIEDRRNIIEIIPSVKELNKRACIRLDKRRKDNERRGIQKTRLVKKNGEWIVEKQDINVSDYILPEIGNVMNVASNDYVSRLKVLGLIAKNEGRSIMYPCDKCEKICQTLSALKLHYRKHEDNPKPFKPKVWKHKTGLSREPVTNEPVPSENRYQKPKPIVSKHRCDPKLEEFYANNIKGGDIEFWHFLKIFNKMSKENVNDFKDLEKRTDFGIHVKGKAIKETSARREKTHRAFTRTIMLTRKDYNQRKDTIAKMRQNIRRYHEGKKI-