Monarch geneset OGS2.0

DPOGS201602
TranscriptDPOGS201602-TA1575 bp
ProteinDPOGS201602-PA524 aa
Genomic positionDPSCF300152 + 194372-200607
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0134432e-0639.62% 
BombyxBGIBMGA001689-TA2e-0622.33% 
DrosophilaCG4707-PA2e-0627.11% 
EBI UniRef50UniRef50_G3UVV38e-0736.30%MCG1556, isoform CRA_b n=43 Tax=Eutheria RepID=G3UVV3_MOUSE
NCBI RefSeqXP_001994311.13e-0726.97%GH23864 [Drosophila grimshawi]
NCBI nr blastpgi|3807960672e-0636.30%zinc finger protein 407 isoform 1, partial [Macaca mulatta]
NCBI nr blastxgi|3227989823e-1122.36%hypothetical protein SINV_02496 [Solenopsis invicta]
Group
Gene OntologyGO:00056342.8e-09nucleus
GO:00082702.8e-09zinc ion binding
KEGG pathway 
InterPro domain[15-83] IPR0129342.8e-09Zinc finger, AD-type
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201602-TA
ATGAAGAGGCTAAATATGGAGGAGGAAGAGGACTTTGTAGGTTGCATTACCTGCATGTCCACAAATGATTTATGCGACCTATTTGTTAATTACGCAAACAATGAAGAAACTTACGCCCAAATGCTTGAAACCTGTTTCGAAATAAAGGTAACAAATGATTTTAAATACATATGTGCAATTTGTGTGGAGAAATTGGAAAAATCCTACGAGTTCAAAGATCAGACGGTTAAGAGTATTGATATGCTTCAATCCATTAAAGAGGAAGAGTATCTGGATGAGGCGATCTTTGACGATCCGGACAAGGATAAGTCCAGGAAAGCCGATTTGAATTACGAGGATGATGAAGAGCATGAGATCGAAGCGCTGGACGAGACCCGCTGTATCTACTGCGACCTCCAGCTGGGGTCGTCGGAGGAGGTGCAGCACCACGTGAGGCTCAGACACGGTCTGGAGCCCCGCACTGGGCGGAGGGTGACACAGTGCCACCTGTGCGGGGCTTCGCTGAGGGACCTCGCTGACCATATTAACAGATGCCACAGCAGTTCGGAGACCGAGCGCCAGTACGGGTGCCACTTCTGCGACAACGTCTACAACAGCAAGAAGGCGTGCTTCACGCATCTCAGGATGAAGCATGGATTGAAGCTCTGCAATGACCACACTCCGAAGTACTCGTCACGCGACAGGAAGAAGTGCCACATCTGCGGCAGGGACTTCAGCCAGAAGCAGATCCTGAACAACCACCTGTGGAAGGCTCACGGGTTCGAGGTTTCGTTCCACCTCTTCGCGTTCCGGTTCTTCTGTCCGCTCTGCTCGGAGCGCGTCAGTTACGGCGCCAACTTCAGCGCCCACCTCACGCAGCAGCACGACGTCACCGAGGACGTGGAACAGCTGGAGTTCAGCTCCATGGACGATTTCATGCTGTACAAGAGCGCTATTCAAGAGGAAACCAAGTTTCGGTTCAGGAAGACCACCGCCAGCAAGCAGACGATAGAGGGAGTCAGGTCGCACTACATGTGCAGCCAGTCCGGCATATACGTGTATCAGGTGAGGACGGGCAGGAGGCTGGGCTGGCTCCCGGCCCATTTCATGCTGTACAAGAGCGCTATTCAAGAGGAAACCAAGTTTCGGTTCAGGAAGACCACCGCCAGCAAGCAGACGATAGAAGGAGTCAGGTCACACTACATGTGCAGCCAGTCCGGCATATACGTGTATCAGGGTAAGGGCAAACGTCCGGCCCCCGAGCGTCAGATCTACAAGACGGGCAAGGCGTGCCCGGCCCACATGATAGTGACGGAGACCCTGGACAGAGTCCTCGTCACCTTCTACAAGACGCACGTCGGACACGGCACGTGTCCGTACTACGAGCCGCGCGATCCGAAGCGCTCCAAGCAGGAGGAGCAGGCGCTGGTGTGCGACACGTGCGGGGCGAGGGTCGCCGCGGGCAGGCTGCGGGCTCACGTGGCGGCTCACGGTCTGCACCTCTTCCCGTGCGACTACTGCGACCAGCTGTTCCAGAACATCGACGCGTGGACCCAGCACACGAGGACGGAGCACGCGGTCGGCTCGCTGTTCTGA

Protein sequence:

>DPOGS201602-PA
MKRLNMEEEEDFVGCITCMSTNDLCDLFVNYANNEETYAQMLETCFEIKVTNDFKYICAICVEKLEKSYEFKDQTVKSIDMLQSIKEEEYLDEAIFDDPDKDKSRKADLNYEDDEEHEIEALDETRCIYCDLQLGSSEEVQHHVRLRHGLEPRTGRRVTQCHLCGASLRDLADHINRCHSSSETERQYGCHFCDNVYNSKKACFTHLRMKHGLKLCNDHTPKYSSRDRKKCHICGRDFSQKQILNNHLWKAHGFEVSFHLFAFRFFCPLCSERVSYGANFSAHLTQQHDVTEDVEQLEFSSMDDFMLYKSAIQEETKFRFRKTTASKQTIEGVRSHYMCSQSGIYVYQVRTGRRLGWLPAHFMLYKSAIQEETKFRFRKTTASKQTIEGVRSHYMCSQSGIYVYQGKGKRPAPERQIYKTGKACPAHMIVTETLDRVLVTFYKTHVGHGTCPYYEPRDPKRSKQEEQALVCDTCGARVAAGRLRAHVAAHGLHLFPCDYCDQLFQNIDAWTQHTRTEHAVGSLF-