Monarch geneset OGS2.0

DPOGS211815
TranscriptDPOGS211815-TA1191 bp
ProteinDPOGS211815-PA396 aa
Genomic positionDPSCF300031 - 279089-280366
RNAseq coverage49x (Rank: top 70%)
Annotation
HeliconiusHMEL0042817e-11951.25% 
BombyxBGIBMGA012183-TA3e-4029.04% 
DrosophilaCG12299-PA8e-2732.53% 
EBI UniRef50UniRef50_UPI0001CBA32F1e-3031.07%UPI0001CBA32F related cluster n=4 Tax=unknown RepID=UPI0001CBA32F
NCBI RefSeqXP_002734413.12e-3131.07%PREDICTED: zinc finger protein 111-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|2192818227e-3132.11%zinc finger protein 26 [Mus musculus]
NCBI nr blastxgi|3266765311e-3729.85%PREDICTED: zinc finger protein 850 [Danio rerio]
Group
Gene OntologyGO:00036763.8e-07nucleic acid binding
KEGG pathway 
InterPro domain[306-328] IPR0130873.8e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34909 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211815-TA
ATGCTAGAAAATCTCGGTATTCCACCAGAGTCTCATCTCCCACAATCTATCTGTAAGAGTTGTGCAACAATCACATCTCACTGCTACTTGTTCCAGAAGCTAGTTCGTTTCTCACATGACAAGTGGAGTAGTATAACAAAACTTCTGGATCAATCAATAGATAAATCAAAAGACGTAAACCTCCAAAATGCTAAGAGTGCCTTTCTATTTATTAATGATGATGACACATTTATTATTTCCAGTCGTAAACATTATTCAACCAAGAAAAAAAAAGATATTCTTTTGAAAGTGAAACATATTATAAAAACGGGTATTGAGGCTAAACAGAGAAGTAAGGAGTCTGTTTGTAATGAATGTGGAGAGAGTTTCACAGCACGAGTGCTTTCAAAACATAAGAAATTACATAACAAACTGAATCACCCATGTGCACAGTGCCCTAAAATATTTTCTACAGCAATGCAATTAGAAGAACACATAGAAAGACTACACTTTCCAAAGAGACTGAGATGTGACCAGTGTTCAAAGAAATTTAGTACAGAAAAATTATTAAACATTCACACAAGAAATAATCATGTAGCAGTATATTGCAAGTTATGTAATTTAGAATTCCCATCTAGAACAAGTCTTAGAGCACATGTAGACAAGCATGGAACAAATATATGTCCACAGTGCAACAAAAAATTTATAAACAGACAAACTTTCAAATTACACCTCAAATGCTGTGGTAAGAGTATTGAGCAACAAAATTTCATTTGTGACATATGCAAGAAGTGCTACGCTAGAAAAAATGGATTACGATCACATCTCAAGATAGACCACGGGTTTGGGAATGTTCTAACGTGCAACTGGTGTAATAAAAAGTTTGACGCTGTAAGTAGACTGAAAAACCATATCGTGAAACACACAAGAGAACGGAAGTTTCATTGCGATCAATGCGGCGGTAAATTTGTAACATATCCAGCATTGGTCTATCACACAAGACTTCACACGGGTGAACGACCTTTCCCTTGCGACCTTTGCGATGAAAGCTTTTTATCAGCTTCTAGAAGAATGGAACACAAGAAAAGAAAGCACTTCGGACCGAGTCATGAATGTGGAATTTGTAGAGGAAAGTTCACAACAAAGCACCAGTTAAGAAAACATATTAAAAGGCATTTTAATCCCGGAAGCAAGCTGTATGTTACTGATTAA

Protein sequence:

>DPOGS211815-PA
MLENLGIPPESHLPQSICKSCATITSHCYLFQKLVRFSHDKWSSITKLLDQSIDKSKDVNLQNAKSAFLFINDDDTFIISSRKHYSTKKKKDILLKVKHIIKTGIEAKQRSKESVCNECGESFTARVLSKHKKLHNKLNHPCAQCPKIFSTAMQLEEHIERLHFPKRLRCDQCSKKFSTEKLLNIHTRNNHVAVYCKLCNLEFPSRTSLRAHVDKHGTNICPQCNKKFINRQTFKLHLKCCGKSIEQQNFICDICKKCYARKNGLRSHLKIDHGFGNVLTCNWCNKKFDAVSRLKNHIVKHTRERKFHCDQCGGKFVTYPALVYHTRLHTGERPFPCDLCDESFLSASRRMEHKKRKHFGPSHECGICRGKFTTKHQLRKHIKRHFNPGSKLYVTD-