Monarch geneset OGS2.0

DPOGS206810
Transcript        DPOGS206810-TA  1512 bp
Protein           DPOGS206810-PA  503 aa
Genomic position  DPSCF300001 - 3983185-3998025
RNAseq coverage   638x (Rank: top 20%)
Annotation
Heliconius      HMEL012199              4e-157  75.27%
Bombyx          BGIBMGA000632-TA        2e-153  79.94%
Drosophila      CG2812-PA               2e-121  60.42%
EBI UniRef50    UniRef50_UPI0002060F02  1e-148  70.88%  UPI0002060F02 related cluster n=1 Tax=unknown RepID=UPI0002060F02
NCBI RefSeq     XP_001122414.1          4e-165  59.20%  PREDICTED: similar to CG2812-PA [Apis mellifera]
NCBI nr blastp  gi|307185226            5e-163  59.14%  WD repeat-containing protein 47 [Camponotus floridanus]
NCBI nr blastx  gi|307185226            2e-162  58.95%  WD repeat-containing protein 47 [Camponotus floridanus]
Group
Gene Ontology    GO:0005515        1.1e-63  protein binding
KEGG pathway     ddi:DDB_G0282189  5e-19
                 K06666 (TUP1) maps-> Cell cycle - yeast
InterPro domain  [185-501]  IPR015943  1.1e-63  WD40/YVTN repeat-like-containing domain
                 [175-503]  IPR011046  2.4e-59  WD40 repeat-like-containing domain
                 [235-267]  IPR019781  9e-10    WD40 repeat, subgroup
                 [230-269]  IPR001680  3e-07    WD40 repeat
Orthology group  MCL15641  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS206810-TA
ATGGACGCAAGAAACTTTAAAACTTCTTCATCCAAAAGTTCAGCGTATTCTGATTTTTCTGGAATGAGTGCGTACAGTAATAAGTCCCGGGACTATTATTTCCTATCGCCTAGTTATCGATCGGCTGGCGCGCCAACGCCAAGCAATCATGCGGATTTATACTTGAGTAGAGAAAGATCCAGGCAGATTCGAACTGCTAACGCTTCTTCTCCTGCCTGCCCCTGCAGCCGATCTCGATCCCTTGAGGACGTGCGTACGGAAGTGGTTACCGAATGGGAGGATGATGATGAAAATGGAAACCGTATCGTAGCTCCGGCGACCAAATTTAATCGCACAACATACAAAACGAACGCTACGTTTGAAAAACAAAATTTTATGACCCGACATTCCATGGAAAATCTAGTAGAAAGATCGCCACAAGTGCCTCCACCAAAACGCATCAGTGCTTTTCAGGCGCCAAGCAGGAACGGTAGTCAGAGTGCGCTTGGCACTCCAGCCGTTGCGGAAGTGAACGGCTCCCGGCCGACCTTCAAGCCAGTGACAGTGCTAGAAGATCTACAGGCGGTGCGTTGCGCTGAATTCCATCCCGGCGGCAGATTGTACGCTGTGGGCTCCAACACCAAGACGCTCAGGATATGCAGCTATCCCAAGATTGATGATGTCAAAGACAATAGTTCACCGACAGCGCCAACTGTGCTACTAAAACGCACTAAGCACCATAAAGGTAGCATCTATTGTCTGGCCTGGAGCCCGGCCGGAGATCTTCTAGCCACCGGTTCCAATGACAAGACAGTTAAGCTCATGAGATTCAACAGCCACACCTGCAACCTGGAGGGACAGGAGGTTGAGCTGACTATGCATGACGGAACGGTCCGTGACGTGTGCTTCATAGAGGACACCTCAAACAAAACCAGCCTTCTCGTGAGTGGTGGCGCTGGGGACTGCAAGATATACGTAACAGATTGCGCTACCGGGAAGACTTTCCAGGCGCTGAGCGGTCATTCCGGGCACGTGCTGTCTCTATACAGTTGGGGTGGAGCTTTGTTCGTGTCAGGGAGTCAGGATAGGAGCGTCCGTTTCTGGGATCTGAGAACAGCTGGTTGCGTCAACGTGATCGCACCCCCGCCAGCTGGACAGCATGCGAAGGGGTCAGCTGTCGCGTCTCTGGCAGTGGATCCGAGTGGTCGTCTCCTAGTATCTGGTCACGAGGATGGCTGGTGTGGTCTACACGACGTGCGCGGCTCTCGTTCCTTGCAGCGGTTCTCGCCTCACGCGGGCGATGTACGATCAGTTCGTTTCTCACCCGGAGCTTACTATTTACTGACAGCTGGATACGACGGACGTGTAGTACTTACAGATCTTCAAGGTGACCTCACGTGCGCACTTCCTAGCGTACCAGTGGCTCGTCACCCGGACAAAGTGATCTCAGCTCGCTGGCATCCCGATGACTTCTCATTCCTTTCAACGTCAGCTGATAAGACAGCTGTTCTATGGACCATACCGCCACTATAG

Protein sequence:

>DPOGS206810-PA
MDARNFKTSSSKSSAYSDFSGMSAYSNKSRDYYFLSPSYRSAGAPTPSNHADLYLSRERSRQIRTANASSPACPCSRSRSLEDVRTEVVTEWEDDDENGNRIVAPATKFNRTTYKTNATFEKQNFMTRHSMENLVERSPQVPPPKRISAFQAPSRNGSQSALGTPAVAEVNGSRPTFKPVTVLEDLQAVRCAEFHPGGRLYAVGSNTKTLRICSYPKIDDVKDNSSPTAPTVLLKRTKHHKGSIYCLAWSPAGDLLATGSNDKTVKLMRFNSHTCNLEGQEVELTMHDGTVRDVCFIEDTSNKTSLLVSGGAGDCKIYVTDCATGKTFQALSGHSGHVLSLYSWGGALFVSGSQDRSVRFWDLRTAGCVNVIAPPPAGQHAKGSAVASLAVDPSGRLLVSGHEDGWCGLHDVRGSRSLQRFSPHAGDVRSVRFSPGAYYLLTAGYDGRVVLTDLQGDLTCALPSVPVARHPDKVISARWHPDDFSFLSTSADKTAVLWTIPPL-
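
The lengths listed for this record are consistent with each other: 1512 bp corresponds to 504 codons, that is, 503 amino acids plus one stop codon. The sketch below shows one way to check that a transcript entry translates to its protein entry. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The helper names `read_fasta` and `translate` are illustrative and are not part of any tool used to build this geneset.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... and '*' marking stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to match the
    "-" used in the protein record); unrecognised codons become 'X'.
    """
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)
```

With both FASTA blocks saved to one string `text`, the check would be `records = read_fasta(text)` followed by comparing `translate(records["DPOGS206810-TA"])` with `records["DPOGS206810-PA"]`.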