Monarch geneset OGS2.0

DPOGS208038
TranscriptDPOGS208038-TA1479 bp
ProteinDPOGS208038-PA492 aa
Genomic positionDPSCF300203 + 65267-66990
RNAseq coverage427x (Rank: top 29%)
Annotation
HeliconiusHMEL0040500.091.52% 
BombyxBGIBMGA001468-TA0.089.47% 
Drosophilapygo-PA2e-4375.00% 
EBI UniRef50UniRef50_F4WHY76e-6844.92%Protein pygopus n=8 Tax=Endopterygota RepID=F4WHY7_ACREC
NCBI RefSeqXP_394285.38e-6048.40%PREDICTED: similar to pygopus CG11518-PA [Apis mellifera]
NCBI nr blastpgi|3504235053e-7748.04%PREDICTED: hypothetical protein LOC100746903 [Bombus impatiens]
NCBI nr blastxgi|3838519571e-12151.12%PREDICTED: uncharacterized protein LOC100875705 [Megachile rotundata]
Group
Gene OntologyGO:00055157.3e-08protein binding
GO:00082701.6e-06zinc ion binding
KEGG pathway 
InterPro domain[419-488] IPR0110111.6e-11Zinc finger, FYVE/PHD-type
[426-482] IPR0130832.6e-09Zinc finger, RING/FYVE/PHD-type
[427-480] IPR0197877.3e-08Zinc finger, PHD-finger
[426-480] IPR0019651.6e-06Zinc finger, PHD-type
Orthology groupMCL26673 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208038-TA
ATGAGTCACAATCTGGCGGGTATGCCGTCTTATAGACTGCCTGGGCCCGGGTTGGGTCCTCCAGACTTCAAACCGCCCATGGACACTCCTACACCACCGGCCGCTGCGCCCAGTAACCCCAAAAAGAGAAGAAAAACATCAAATGCAAACAATGCTCTCACACCTCCACAACCTCCACCAACAGCCCAAGATTTATTGCCACCACCCCTCACAGGGTATGGTGATACAATTGTTGCTTCCAACCCCTTTGATGACTCCCCATCTACAGTTTCTCATAATGGACCCATGATGAATCAAAATGGTCCCATGATGAGTCAAAATGGGCCAATGGGAATGATGGGCCCTATGACCCACAGCATGGGTGGGCCACCAATGAGGCATATGAGTCCTTTACCACACAACATGAGTCCTATGAGTCAACAGATGCCACCAAGAGGAGGGATAAGTCCCATGGGGAATATGAGTCCTATGGGTCATAGTCACATGGGTGGGATGTCTCCCATGGGAGGACCAAATATGGGTATGAGTAACCACAGTATGGGTCCAGGTATGGGACCAACATCAAGATCAATGGGAAGTCCGATGAGTCCTATGAATTCAATGCCCATGGGTTCTCCGATGTCTTCAGGACCAATGGGTAGTCCAATGAATATGGGATCCATGGCTGGCAGTCATATGAGCAATAGTCCTATGGGACCCCCAATGCATAGTCCGCTTGGAGGGGGTTCAATGAATGGTCCGATGAATGGACCAATAAATGGCCCAATGGGTGGTGGCCCAGGTATGAATGTTCCTCGCATGAATGGCCCTATGGGACCCAGTTGTTCTAATGGTTCTATGGGCCCAACTAGTTCTATAATGTCACCAAGTCCTATGCAGAGTGGGGGGATGGGTCCGGGACATTGCGGGCCAATGAGACACGGCAGCCCGATGGGCTCAGGAATGGGAAGTGGTCCCATGGGTGGAAATGGACCTATGACATCAATGGGTCCCGGCCCACCATATTCAGGGAACCACATGAGTCACAGTGGTCCAATGGGTATGGGGGGAAGCAGTTCAATGGGAATGGGTCCAGGACCTGGAAATATGGGAAATTGTGGGCCACTGGCTGGTATGAGCGGAATGTCAATGGGCGGTCCAGGTGGTCAGGGACCCATGGGGCAGAATATGGGAATGTTTGGACCAAAACCTATGCCAGTGAGTGCAGGGAAAGTGTACCCACCAGACCAACCTATGGTGTTCAATCCTCAAAACCCTAATGCTCCACCCATTTATCCCTGTGGAGTGTGTCACAAGGAAGTCCATGATAATGATCAAGCTATTTTATGTGAATCAGGCTGCAACTTTTGGTTCCACAGAGGATGCACCGGGCTGACAGAGCCAGCTTTTCAGTTGTTAACAGCGGAAGTATACGCTGAATGGGTGTGTGACAAGTGCCTACATTCTAAAAACATACCACTGGTGAAATTCAAACCTTAA

Protein sequence:

>DPOGS208038-PA
MSHNLAGMPSYRLPGPGLGPPDFKPPMDTPTPPAAAPSNPKKRRKTSNANNALTPPQPPPTAQDLLPPPLTGYGDTIVASNPFDDSPSTVSHNGPMMNQNGPMMSQNGPMGMMGPMTHSMGGPPMRHMSPLPHNMSPMSQQMPPRGGISPMGNMSPMGHSHMGGMSPMGGPNMGMSNHSMGPGMGPTSRSMGSPMSPMNSMPMGSPMSSGPMGSPMNMGSMAGSHMSNSPMGPPMHSPLGGGSMNGPMNGPINGPMGGGPGMNVPRMNGPMGPSCSNGSMGPTSSIMSPSPMQSGGMGPGHCGPMRHGSPMGSGMGSGPMGGNGPMTSMGPGPPYSGNHMSHSGPMGMGGSSSMGMGPGPGNMGNCGPLAGMSGMSMGGPGGQGPMGQNMGMFGPKPMPVSAGKVYPPDQPMVFNPQNPNAPPIYPCGVCHKEVHDNDQAILCESGCNFWFHRGCTGLTEPAFQLLTAEVYAEWVCDKCLHSKNIPLVKFKP-