Monarch geneset OGS2.0

DPOGS207618
TranscriptDPOGS207618-TA2310 bp
ProteinDPOGS207618-PA769 aa
Genomic positionDPSCF300248 + 236857-240779
RNAseq coverage199x (Rank: top 47%)
Annotation
HeliconiusHMEL0078610.083.16% 
BombyxBGIBMGA006359-TA0.068.52% 
DrosophilaCG9799-PA0.047.10% 
EBI UniRef50UniRef50_E2BZ010.049.94%WD repeat-containing protein 36 n=6 Tax=Formicidae RepID=E2BZ01_HARSA
NCBI RefSeqXP_966791.10.055.10%PREDICTED: similar to wd-repeat protein [Tribolium castaneum]
NCBI nr blastpgi|3454891160.052.19%PREDICTED: WD repeat-containing protein 36 [Nasonia vitripennis]
NCBI nr blastxgi|910774360.054.97%PREDICTED: similar to wd-repeat protein [Tribolium castaneum]
Group
Gene OntologyGO:00055159.3e-37protein binding
GO:00320405e-26small-subunit processome
GO:00063645e-26rRNA processing
KEGG pathwaytml:GSTUM_000104350018e-09 
 K03130 (TFIID4, TAF5)maps-> Basal transcription factors
InterPro domain[22-353] IPR0110469.3e-37WD40 repeat-like-containing domain
[124-359] IPR0159438.1e-31WD40/YVTN repeat-like-containing domain
[679-750] IPR0073195e-26Small-subunit processome, Utp21
[554-593] IPR0016804e-07WD40 repeat
[557-593] IPR0197814.8e-07WD40 repeat, subgroup
Orthology groupMCL13637 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207618-TA
ATGTCTGAAGAGAGACGAAGTAGCCAAATATTTACGGGTGCTCGAGTACTAGGTTATGTTAGTACACATGTTCCTTTTGTGGCTCGTTTTATTAAAAGACGGGGTGAAACATTGCTATGTACTAGTGTAGGAAAGTGGTTCCATACTTATGGCTGTGATAAATTTCGCCTCTTGAGTGTCAGTGGTGAACATCCTGGTCCCATTACATGCATGACTGGTGATAGTTTTCACGTTTATACAGCCAGCGAAAACGATATATACGCATGGAGACGAGGCTGCGAGCTAAAGCACGTTTACAAGGGACACCAGGCACCGATACACCAGCTATTACCATTTGGAGTTCATCTCATATCAATAGATAAAGATAATGTCCTTAAAATATTTGACATTAAAGAGGGATCAGAGTTTCTCGATCTCAAGTTCGATGAAACTCATTTCAAAATTACAACTTTATGTCATCCACCCACTTATCTTAATAAAATATTACTTGGCAGTAAACAGGGCCAACTCCAGATATGGAATATTAGAACTTCAAAATTGGTGTATACATTTAAAGGTTGGGACTCACCTGTGACAGTTACAGAAGCTGCTCCAGCAATTGATGTTGTAGCTATTGCTTTGGGTAATGGAAAAATTATTCTTCATAATCTCCGTTATGATCAAGAGGTAATGGAGTTTATTCATGATTGGGGCAGAGTTAGTTGTTTGTCATTTAGAATGGATGGAGTGCCCATAATGGTAACAGGAAGTACACAAGGACATTTAGTTATGTGGGATTTAGAAGAGAAAAGAGTGAAGTCACAGATACAGTCAGCTCATTTTGCTAAAATAGCTGGTTTACAATGTTTAAATTCTGAACCACTAATGGTTACCAATTCCCAAGATAATTCATTAAAAATGTGGATTTTTGATATGCCAGATGGAGGGGCTAGACTTTTGAAGAAAAGGGAAGGTCATTCTTTACCTCCAACGATAGTGCGCTACTGTGAGCCAACTGGTGGAAACATTCTTGCAGCAGGCAGTGATAGCAGTCTTCATATTATGAATACAGTAACAGAAACTTTTAACAAAAGCATGGGTAAAGCCTCATACAACAGGAAAGCATCCAAAAAGAAAAAAAGATATCAGATAGATACAAAAATTCTTCCACAAATAACTAATATAAGCTCCTGTATGCAAAGGGATAAGCAATGGGACAGTATTGCAACATTGCATGAAGGAAAGTACTTGGCTACTACTTGGTCATATAATAGAATGTGTATGGGAACACACAAATTAAAGCCACCTGATATGGAAAAAAGTACTCTGTCAACCTGCTTGACGGTAACACATTGTGGCAATTTTGTTATTATTGGTTATAGTAATGGACAAGTGCATAAGTTTAATATGCAGTCAGGCCTTTACCGAGGCCATTACGGCAAAGAAAACAAACAGGCCCACAAAGGAGCACTGAGAGGCGTAGAAACAGATATCTGTAATCAAAGGCTCATTACTGTTGGTGCTGACGATAAACTTAAATTCTGGCATTTTAAAACTGCTACCACCCCATATCATGTACTGAGATTGGATGAATCTGTGAGTATGACAAAATGCCACAGGGAAAGTGGTTTGCTGGCGTTAGCAAATGAAGATTTTACAATTACACTGGTCGATATAGACACCATGAGAGTTGTTAGAAACTTCGAAGGTCATGTTGGTAAAATAAACGACATTGATTTTGATTGTCAAAGCAGATGGTTAGTGTCATCATCTATGGATTGTACAATTTGTACTTGGGATATACCAACTTCACAACTGGTTGATATATTTTCTGTTGAACAGCCATGTACATCTCTAACTATGTCACCAACCGGTGATTATCTGGCGACGTCCCATGTGGGTGAGCTTGGGATCTGTCTTTGGGCCAACAGATTGTTGTATAGCAAAGTCTTCCTCAAGCCCGTTGATAGAAATGATGTGCCGCGATTGAAACTACCAACTACTGCAGCCGAGAAACCTGATATAGATGATATAGGAACAATTGATTTGGGCGATGACGAATATAAATCACCGGAACAAATCAGCGAGGAACTTTTAACACTATCTGGCCAGCCTACATCAAGATGGCTGAATTTGCTCAATTTGGACGTAATAAAACGTAGGAATAAACCCAAAACGCCTTTGACGGTTCCCAAATCGGCGCCATTCTTTCTCCCAACAATCCCAAGTCTTGACCTTGAATTCGATTTAGAAAAGGAAAAGGCGGGAAACACGAAAAAGTTGCTCATACCGGATACATTGTCAACTTTAACGCCATTTGCAAAAAATTGA

Protein sequence:

>DPOGS207618-PA
MSEERRSSQIFTGARVLGYVSTHVPFVARFIKRRGETLLCTSVGKWFHTYGCDKFRLLSVSGEHPGPITCMTGDSFHVYTASENDIYAWRRGCELKHVYKGHQAPIHQLLPFGVHLISIDKDNVLKIFDIKEGSEFLDLKFDETHFKITTLCHPPTYLNKILLGSKQGQLQIWNIRTSKLVYTFKGWDSPVTVTEAAPAIDVVAIALGNGKIILHNLRYDQEVMEFIHDWGRVSCLSFRMDGVPIMVTGSTQGHLVMWDLEEKRVKSQIQSAHFAKIAGLQCLNSEPLMVTNSQDNSLKMWIFDMPDGGARLLKKREGHSLPPTIVRYCEPTGGNILAAGSDSSLHIMNTVTETFNKSMGKASYNRKASKKKKRYQIDTKILPQITNISSCMQRDKQWDSIATLHEGKYLATTWSYNRMCMGTHKLKPPDMEKSTLSTCLTVTHCGNFVIIGYSNGQVHKFNMQSGLYRGHYGKENKQAHKGALRGVETDICNQRLITVGADDKLKFWHFKTATTPYHVLRLDESVSMTKCHRESGLLALANEDFTITLVDIDTMRVVRNFEGHVGKINDIDFDCQSRWLVSSSMDCTICTWDIPTSQLVDIFSVEQPCTSLTMSPTGDYLATSHVGELGICLWANRLLYSKVFLKPVDRNDVPRLKLPTTAAEKPDIDDIGTIDLGDDEYKSPEQISEELLTLSGQPTSRWLNLLNLDVIKRRNKPKTPLTVPKSAPFFLPTIPSLDLEFDLEKEKAGNTKKLLIPDTLSTLTPFAKN-