Monarch geneset OGS2.0

DPOGS210231
Transcript: DPOGS210231-TA, 2430 bp
Protein: DPOGS210231-PA, 809 aa
Genomic position: DPSCF300196 - 255889-258675
RNAseq coverage: 430x (Rank: top 28%)
Annotation
Heliconius: HMEL020450 (0.0, 80.30%)
Bombyx: BGIBMGA002551-TA (0.0, 73.76%)
Drosophila: zfh1-PB (2e-78, 37.73%)
EBI UniRef50: UniRef50_E0V9Q0 (4e-146, 41.27%) Zinc finger protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0V9Q0_PEDHC
NCBI RefSeq: XP_002422823.1 (7e-147, 41.27%) zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242003685 (1e-145, 41.27%) zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|242003685 (3e-153, 41.45%) zinc finger protein, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005515 (1.6e-18) protein binding
GO:0006355 (5.4e-18) regulation of transcription, DNA-dependent
GO:0043565 (5.4e-18) sequence-specific DNA binding
GO:0003700 (5.4e-18) sequence-specific DNA binding transcription factor activity
GO:0003677 (1.5e-17) DNA binding
GO:0003676 (1.4e-10) nucleic acid binding
KEGG pathway 
InterPro domain: [472-544] IPR009057 (1.6e-18) Homeodomain-like
[485-547] IPR001356 (5.4e-18) Homeobox
[486-545] IPR012287 (1.5e-17) Homeodomain-related
[113-142] IPR013087 (1.4e-10) Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL16158 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210231-TA
ATGGTACCGGGTGCTATCCGGCCTCTAGCTGGTGGAGGTTCCGAGATCAAGACGTGTGAGGAGGAAGAGGATCGTGGGTCTTCGCCTCTGGGTCCCGGCGGGCCTTTCCCCTGCAGCCACTGTCGCGGAGCCTACCCGACCCGCGAACAGCTCGAGCGACATGAGACGCTGCACGCGCCGAGCACTCAGACGTGCAAAATATGCCACAAGAGTTTTCAAAATGTCTACAGACTTCAGCGTCACATGATAAGTCATGATGACAGCGCGAAGTTACGCAAATATAAATGCAATGACTGCGATAAAGCATTTAAATTTAAACATCATCTCAAAGAGCATCTCCGGATACACAGTGGAGAAAAACCGTTCGAATGCGCTAACTGTGGAAAAAAGTTTTCACATTCCGGCTCTTATTCGTCTCATATGACTTCCAAGAAATGTCTTGTTATGAATCTCAAAATGGGAAGAATAAAGCCAAATAATCCGGCATTGAATCCAGACCGGAGTCCATCTCGGAAACGTGCAAACGCCATGGCAGCTAGTCAGTTGAATAATAATATCGCGCCAAACGGCAATTCGTTTTTGCCAATATTGCCAAAGTATAACGAAGCTGCAGCATTTTTCGCATCTATGTCATCTCAAGAAAATAATTTCCTGAGGCCCCCGTTGGGTCAACCTGGATTAAATCCTTTCTATATGCCTCCTGGTATGCCAATGAGCCCAGCCAACGGTATTGCACCTTACACCTTCCCTACTTCATTAAGCCAATTATTTGAGCAACTGGCCTCTCAACATTATCAACAACGAAAAATAGAGATTCCTAGTCCAAAGCTTGTGAGTCCACCCGCAAACCCTGAAGACTTAATCGAGGAAGTAGTGGATGAGGAAGATAAGCGCTCCGAGGCCAGCGCAGAACTAGTGATGGATATAGACGACGATGACAACGTTACCGTAAAGAAAGAACAAGAAGACCGGGAAACTGAGGCGAGTTCTCCTTCCCGTAATTACGAATCTATCTTAAGTAGCAATGAACGCGGCGAGTCCGACATCAATCATTCGGATTTTAATACTGTTAAAGCATCCGATACTAAATATTATTTTAAGACGCACAATGATCAGCAGTCTCCTATATCTGGCCAGGAATACCCCGGTGAAGCTTTACCATTGACACAGATAAATGTTAAAGAGGAACCTGATATTGATACCCTACGTTGCTTAAAATGTAACGTATTGTTCAATGACAAAAATGATTTATTGGAACATGACAAAGCAGCGTGTGGTAATATTTTTAGAAAACATGAAGGACTAGCTGCTCAAGTGGCTGAGACGGTGGCTCTGAATAGATTAGAAGCTGAAATGCGCGCATCTATACAAAGTGGGGTAAGTGCGAGTGAAGATGAGGATTTCGGGAGAGAGGATCGGGAAGACAAAGCCTCTATAAATGAAAATGACAGAAAAATCAGAGTTCGCACCGCACTTACCGAAGAACAACAGATGGTGTTAAAAAGGCACTATTCGATCAACCCTCGACCGAATCGAGAGGAATTTAAGAAGATCGCACAGCAGATAGGCTTAGATAACCGAGTAGTACAAGTTTGGTTCCAAAATAATAGAGCCAGAGTACGGAGGATGACTCAGGCGGTCGCGATATCTGATCAACCTCTAGATTTATCTACAAAAAAATCGAATACCTCCGTTACTTCAAGCCCGTCACCTTCACCGACTTGCAGCATTTCAGTAACACATTCCGATTCCGAGGAAGCGGTTAATTTAAGTCAAAAATCTTCTCGCAGCACGACCCCACATCGCGCTAACTACATAAATACGTATCCACATTCCAACTGCTCGTCCTCATCGTTCACGGATTTTCGGTTATCACCCTCACCAGGTGAAACTATGAACGGTTACAAAAGAATGTTGCAACAGAAAATGCCCATCAATCCTATGATGCCGATGGACAAACTTCTTCATTACAACGACCTGAGTAACGGACGATCTCCAAT
TCTTAACATGCAAGTGCCCGAGAGGCAAGAATCCAGTCCGTCTTACGACCGCCCAATATGGAACGAGGATCTCCAGACTCAAATCGAATTAGAAGATGAAACCACTGTACTTAAAAAGAGCAAAATAAAGGCTGGAAATGAGTTGAAAGAGGGAGAAGGACAATTCGTTTGTGATCAATGTGATAAAACTTTTGTCAAGCAAAGTTCTCTCGCGAGGCATAAATACGAGCACTCAGGCCAGCGACCTTACAAATGCTTGGAATGTCCTAAGGCTTTCAAGCATAAGCACCACCTGACTGAACACAAGCGGTTGCACACCGGCGAGAAGCCGTTCCAGTGCTGCAAGTGTCTCAAGAAGTTCTCTCACTCCGGCTCCTACAGCCAGCACATGAACCACAGGTTCGCGATCTGCAAGCCATACAGAGACTAG

Protein sequence:

>DPOGS210231-PA
MVPGAIRPLAGGGSEIKTCEEEEDRGSSPLGPGGPFPCSHCRGAYPTREQLERHETLHAPSTQTCKICHKSFQNVYRLQRHMISHDDSAKLRKYKCNDCDKAFKFKHHLKEHLRIHSGEKPFECANCGKKFSHSGSYSSHMTSKKCLVMNLKMGRIKPNNPALNPDRSPSRKRANAMAASQLNNNIAPNGNSFLPILPKYNEAAAFFASMSSQENNFLRPPLGQPGLNPFYMPPGMPMSPANGIAPYTFPTSLSQLFEQLASQHYQQRKIEIPSPKLVSPPANPEDLIEEVVDEEDKRSEASAELVMDIDDDDNVTVKKEQEDRETEASSPSRNYESILSSNERGESDINHSDFNTVKASDTKYYFKTHNDQQSPISGQEYPGEALPLTQINVKEEPDIDTLRCLKCNVLFNDKNDLLEHDKAACGNIFRKHEGLAAQVAETVALNRLEAEMRASIQSGVSASEDEDFGREDREDKASINENDRKIRVRTALTEEQQMVLKRHYSINPRPNREEFKKIAQQIGLDNRVVQVWFQNNRARVRRMTQAVAISDQPLDLSTKKSNTSVTSSPSPSPTCSISVTHSDSEEAVNLSQKSSRSTTPHRANYINTYPHSNCSSSSFTDFRLSPSPGETMNGYKRMLQQKMPINPMMPMDKLLHYNDLSNGRSPILNMQVPERQESSPSYDRPIWNEDLQTQIELEDETTVLKKSKIKAGNELKEGEGQFVCDQCDKTFVKQSSLARHKYEHSGQRPYKCLECPKAFKHKHHLTEHKRLHTGEKPFQCCKCLKKFSHSGSYSQHMNHRFAICKPYRD-
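
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing `-` in the protein record marks the stop codon; the `translate` function and its `stop_symbol` parameter are hypothetical names introduced for illustration and are not part of the geneset's own tooling.

```python
# Build the standard genetic code table (codons ordered by T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence in frame 1, codon by codon.

    Whitespace and line breaks are removed first, so a FASTA body
    wrapped over several lines can be passed in directly.
    Stop codons are written as `stop_symbol` (assumed convention).
    """
    seq = "".join(nucleotides.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last five codons of DPOGS210231-TA, copied from above.
print(translate("ATGGTACCGGGTGCTATC"))
print(translate("CCATACAGAGACTAG"))

# Length check: 2430 bp gives 810 codons, i.e. 809 residues plus one stop.
print(2430 // 3 - 1)
```

Passing the full DPOGS210231-TA sequence to `translate` and comparing the result with the DPOGS210231-PA record is one way to confirm the two entries were copied intact.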