Monarch geneset OGS2.0

DPOGS210714
TranscriptDPOGS210714-TA1239 bp
ProteinDPOGS210714-PA412 aa
Genomic positionDPSCF300013 - 284534-290114
RNAseq coverage164x (Rank: top 51%)
Annotation
HeliconiusHMEL0075408e-14367.72% 
BombyxBGIBMGA006325-TA0.094.12% 
Drosophilaescl-PA2e-14256.53% 
EBI UniRef50UniRef50_Q5ZKH36e-15161.65%Polycomb protein EED n=165 Tax=cellular organisms RepID=EED_CHICK
NCBI RefSeqXP_002427573.12e-17269.95%Polycomb protein esc, putative [Pediculus humanus corporis]
NCBI nr blastpgi|23524160.094.17%extra sex combs [Junonia coenia]
NCBI nr blastxgi|23524160.094.90%extra sex combs [Junonia coenia]
Group
Gene OntologyGO:00055152e-44protein binding
KEGG pathway 
InterPro domain[41-406] IPR0159432e-44WD40/YVTN repeat-like-containing domain
[51-411] IPR0110463e-44WD40 repeat-like-containing domain
[150-190] IPR0016808.9e-08WD40 repeat
[152-190] IPR0197811.2e-07WD40 repeat, subgroup
Orthology groupMCL13010 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210714-TA
ATGAATTTTTCTGACAATGAAGCAGACGACACTTCTAGTGTTGAAAGTACTTCGAATACCGACAATACTTCTCGCAGTGAAACTCCGACGAACACTCGCGTGAAGAAAAGAAGACGTGGTAAGAAAAAGGCCGTTACTAAACCAGTGAAACCTCCATATAAATTCAATTGCAGTGCAAAAGAGGATCATGGACAACCCCTTTTTGGTTGTCAATTCAACCATCACCTTAGGGAAGGGGAACCTCAAATATTTGCTGTTGTCGGCAGCAACAGAGTCTCTATATATGAGTGCCCAGAATCGGGAGGTTTTAAATTTCTGCAATGCTATGCTGATCCTGATGTTGATGAGACATTTTACACATGTGCATGGTCGTATGAGGAAGAAACTGGTTTACCACTCCTAGCTGTGGCCGGATCCCGTGGGATAGTGAGAATTTTTCATCCCGCAACCCAAACATGTATAAAGCACTACATAGGCCATGGTCATGCTATCAACGAAGTCAAATTCCATCCTCGCGATCCGAATTTGTTGCTGTCTGCGAGCAAGGACCATGCTTTACGGCTATGGAATATCATGACGGATGTCTGCATCGCCATTTTCGGTGGGGTCGAAGGTCACAGGGACGAGGTCCTCAGCGCCGATTTCGACTTAAAAGGCGAAAGGATAATGTCATGTGGCATGGACCACTCGCTGAAACTCTGGAGGCTGGATAAACCATCCATGAACGAAGCCATCAAACAAAGTTATAGTTTTAATCCGCACAGAGCACTCCGGCCATTCAATTCGCTCAAAGAACATTTCCCCGACTTCTCAACCAGAGATATTCACAGGAACTACGTGGATTGTGTGAGGTGGATGGGTGATTTAATATTATCGAAGTCGTGTGAAAACGCTATCATATGCTGGAAACCTGGACGGCTGGAGGACACAGACTTAAGACCTGGAGATAACTCGGTGACGATCGTTCACAGATTTGACTACAAGGAGTGTGAGATATGGTTCATAAGATTTGCTGTTGATTATAGTCAAAGAGTTATAGCTCTCGGTAACCAGTGCGGGAAGACGATGGTTTGGGAGTTGGGCGGCGTGGCGGGAGGGTCGCGCGTGTCGCTACTAGTTCATCCGAGATGTGTGGCCGCCGTCAGACAGGTGACTCTGTCTCGAAACGGCAAAATACTACTGACCTGCTGCGACGACGGCACTATATGGAGATGGGATCGGGTCCACAACGGAAGCTGA

Protein sequence:

>DPOGS210714-PA
MNFSDNEADDTSSVESTSNTDNTSRSETPTNTRVKKRRRGKKKAVTKPVKPPYKFNCSAKEDHGQPLFGCQFNHHLREGEPQIFAVVGSNRVSIYECPESGGFKFLQCYADPDVDETFYTCAWSYEEETGLPLLAVAGSRGIVRIFHPATQTCIKHYIGHGHAINEVKFHPRDPNLLLSASKDHALRLWNIMTDVCIAIFGGVEGHRDEVLSADFDLKGERIMSCGMDHSLKLWRLDKPSMNEAIKQSYSFNPHRALRPFNSLKEHFPDFSTRDIHRNYVDCVRWMGDLILSKSCENAIICWKPGRLEDTDLRPGDNSVTIVHRFDYKECEIWFIRFAVDYSQRVIALGNQCGKTMVWELGGVAGGSRVSLLVHPRCVAAVRQVTLSRNGKILLTCCDDGTIWRWDRVHNGS-