Monarch geneset OGS2.0

DPOGS214197
Transcript: DPOGS214197-TA, 1242 bp
Protein: DPOGS214197-PA, 413 aa
Genomic position: DPSCF300014, + strand, 154327-156337
RNAseq coverage: 229x (Rank: top 44%)
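The lengths listed above can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the values in this record. It assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record above.
transcript_bp = 1242
protein_aa = 413
start, end = 154327, 156337  # assumed 1-based, inclusive coordinates

# A coding sequence of 1242 bp holds 414 codons: 413 residues plus one stop codon.
codons = transcript_bp // 3
in_frame = transcript_bp % 3 == 0
matches_protein = codons - 1 == protein_aa

# Genomic span covered by the gene model, under the coordinate assumption above.
span_bp = end - start + 1
non_transcript_bp = span_bp - transcript_bp

print(in_frame, codons, matches_protein)
print(span_bp, non_transcript_bp)
```

Under that coordinate assumption the genomic span is longer than the transcript, which would be consistent with the gene model containing intronic sequence.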
Annotation
Heliconius: HMEL007328, E-value 0.0, 76.16%
Bombyx: BGIBMGA006222-TA, E-value 1e-165, 69.34%
Drosophila: CG14722-PA, E-value 9e-104, 49.86%
EBI UniRef50: UniRef50_E0VGC9, E-value 9e-112, 54.55%, WD-repeat protein, putative n=2 Tax=Neoptera RepID=E0VGC9_PEDHC
NCBI RefSeq: XP_001810630.1, E-value 4e-118, 54.67%, PREDICTED: similar to CG14722 CG14722-PA [Tribolium castaneum]
NCBI nr blastp: gi|332373704, E-value 3e-117, 53.90%, unknown [Dendroctonus ponderosae]
NCBI nr blastx: gi|189237082, E-value 3e-125, 53.45%, PREDICTED: similar to CG14722 CG14722-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, E-value 1.4e-46, protein binding
KEGG pathway: none listed
InterPro domain:
[62-373] IPR011046, E-value 1.4e-46, WD40 repeat-like-containing domain
[86-370] IPR015943, E-value 4.5e-43, WD40/YVTN repeat-like-containing domain
[335-369] IPR019781, E-value 1.9e-09, WD40 repeat, subgroup
[330-369] IPR001680, E-value 2.2e-07, WD40 repeat
Orthology group: MCL13367, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214197-TA
ATGACGCTAGACTTTAGGGGTTTCGAAAGAGATTCTTCAGATGACAGTATAAGTGATATAAGTGACCATTCTGATATGGAGCCAGAACAAATGAGTGATTCAGATGATGCAAATGAAAATGAAACAATTGAAGAAGTTCAAAATAAATCTGATGAAGAAAACGATAGTTCCGAAAACGATGATGATGATGATGATGTCATAAGAGCCATCAAACAAGAAAAAAATAAGCAAAGAGATCACCCTCCAAGTATAAAATGTGAAGATTTCATTGTAGATATATCATTACACCCTGTTAAAAATATAATAGCTTTAGGAAATATAGTTGGAGATGTTTTACTATATGAATACAGTAATGATGAAACAAAAATTTTACAAACATTAGAACTACATATTAAAGCCTGCAGAGATATTGAATTTGATGCTGATGGTGTAAATTTATTTAGTACAGCAAAAGATAAAGCGATTATTGCTACAGATGTTGAAACAGGCCAGTTGAAACAATGTATTGAAAATGCTCATGAGGAACCGGTTTATAAATTACGAACATTGGATGAAAACAAAATTGTTTCAGGAGATGATAATGGCACAGTGAAATTATGGGATATGAGGAAGCAGGATGCCGTATTCTCTATTAAAGTAGGCGAAGAACATGTTTCCGATATGATAACAAATGATGCACAAAAGTATTTAGTTTGTTCAGGCGGTGATGGGGTACTTACTACAATAGACTTGAAAGGAAGCAAAATTTACACAACATCTGAACAATATGATGCTGAATTAACATGTATGGGACTTTTTCGTACTGATACTAAGTTATTAGTTGGCTCTTCCATTGGAAAATTCTACTTGTTTAATTGGAAAGAATTTGGTTATCATAGTGATGAATATATTGGACAGAAACATTCTATACAATGCATGATCCCAATCACACAGAATATTGTTGTATCTTCCGGAGAAGATGGGACATTACGTGCTGCTCACATGTTCCCTCAGAGACAGTTAGGTGTGGTGGGTCAGCATAGTTTACCAGTTGAATGTCTAGACATAAGCCACGATGGGCAATATATTGCATCATGTTCTCACGACAATGATGTTAAGTTCTGGAATATATCATATTTTGAATCCATTGATTCTTTGATTGATGTTAGCCACAAACAAAATAAGAAGAGAGATATGGTAAACAACTTGCCATCAAGCAGTGTCCAAAATGCATCCGACTTTTTTTCAGGTTTAATATCATAA

Protein sequence:

>DPOGS214197-PA
MTLDFRGFERDSSDDSISDISDHSDMEPEQMSDSDDANENETIEEVQNKSDEENDSSENDDDDDDVIRAIKQEKNKQRDHPPSIKCEDFIVDISLHPVKNIIALGNIVGDVLLYEYSNDETKILQTLELHIKACRDIEFDADGVNLFSTAKDKAIIATDVETGQLKQCIENAHEEPVYKLRTLDENKIVSGDDNGTVKLWDMRKQDAVFSIKVGEEHVSDMITNDAQKYLVCSGGDGVLTTIDLKGSKIYTTSEQYDAELTCMGLFRTDTKLLVGSSIGKFYLFNWKEFGYHSDEYIGQKHSIQCMIPITQNIVVSSGEDGTLRAAHMFPQRQLGVVGQHSLPVECLDISHDGQYIASCSHDNDVKFWNISYFESIDSLIDVSHKQNKKRDMVNNLPSSSVQNASDFFSGLIS-
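
The two sequences above can be spot-checked against each other by translating the ends of the transcript. This is a minimal sketch, assuming the standard genetic code applies; `translate`, `head` and `tail` are illustrative names, and the two fragments are the first 24 and last 21 nucleotides of DPOGS214197-TA copied from the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


head = "ATGACGCTAGACTTTAGGGGTTTC"  # first 24 nt of the transcript
tail = "TTTTCAGGTTTAATATCATAA"     # last 21 nt of the transcript

print(translate(head))  # MTLDFRGF
print(translate(tail))  # FSGLIS*
```

Both fragments agree with the start (MTLDFRGF) and end (FSGLIS) of DPOGS214197-PA. The trailing "-" in the protein record appears to stand for the TAA stop codon, shown here as "*".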