Monarch geneset OGS2.0

DPOGS209460
TranscriptDPOGS209460-TA1674 bp
ProteinDPOGS209460-PA557 aa
Genomic positionDPSCF300275 + 144331-148834
RNAseq coverage1090x (Rank: top 12%)
Annotation
HeliconiusHMEL0053299e-15274.88% 
BombyxBGIBMGA005843-TA1e-13786.25% 
DrosophilaCG7611-PA8e-16049.27% 
EBI UniRef50UniRef50_Q9H7D79e-16352.04%WD repeat-containing protein 26 n=70 Tax=Euteleostomi RepID=WDR26_HUMAN
NCBI RefSeqXP_001604875.10.054.87%PREDICTED: similar to WD repeat protein 26 [Nasonia vitripennis]
NCBI nr blastpgi|3071791740.056.83%WD repeat-containing protein 26 [Camponotus floridanus]
NCBI nr blastxgi|3071791740.056.93%WD repeat-containing protein 26 [Camponotus floridanus]
Group
Gene OntologyGO:00055151.5e-54protein binding
KEGG pathwayptr:4709765e-16 
 K12857 (SNRNP40, PRP8BP)maps-> Spliceosome
InterPro domain[251-551] IPR0110461.5e-54WD40 repeat-like-containing domain
[250-553] IPR0159432.9e-53WD40/YVTN repeat-like-containing domain
[258-293] IPR0197812.8e-12WD40 repeat, subgroup
[254-293] IPR0016804.9e-11WD40 repeat
[280-294] IPR0204729.2e-06G-protein beta WD-40 repeat
Orthology groupMCL12428 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209460-TA
ATGCACCAGCCCTGCACCAACGGCGCTCACCTCAACGGTGACGCCGCCCGCAACGGAGACCTGCCGCCGGGGCTCCGCATGAGCCAAACAGACCAGGAGATTGTGCGCCTCATCGGACAGCATCTGCTCTCGGTCGGACTAGAACGTAGCGCGACTCTGCTGATGGAGGAGTCAGGGTTACACCTGGAGCACCCCGCGGCGGCCACGTTCCGTACACACGTGTTGGCCGGAGACTGGGTGAAGGCGGACCACGACCTGCGAGCGCTGCACGACCTGCTCAGGGACTCGCCTCAGGTGGAGCCTCACAACCTCGCCGAGATGAAGTTCGTAGTGCTCGAGCAGAAGTACCTCGAGCACCTGGAGGCGGGCCGTGTGCTGGACGCCCTGCACGTGCTGCGGAATGAGCTAACGCCGCTGCAGTACGACACGGCTCGCGTGCACCGCCTGTCCGCGCTCATGATGTGCGCCGACGCCGCTGAGTTGAGGCAGCGCGCTCGCTGGCCAGGGGGCCCGCGCTCGAGGGCACGTCTTCTTGCCACCGTGCAGGCGGTGCTGCCGCCCGCGCTCATGATGTCTCCGGGCCGGCTGCGGGCGCTTCTGGCGCAGGCCGCCGCCCAGCAGGCCGCCCGGTGCCGGTTCCACGCGGCGCCTCGCCCTTCGCCTCCCGTCCCGTCCCCCGATCGCGACGACGAGCTCGCCGCCCCCGAGCACATCCCCTTCTCTCTCCTGGCAGACCACCACTGCTCGGCCGACCAGTTCCCCATACACTCCTTGCAGGTGTTAAACGGTCACTGTGACGAGGTGTGGTACTGCAAGTGGTCCCCGGATGGCTCCAAGCTGGCCTCGGGCTCCAAAGACAACACCGTCATGATATGGGACTACAACCCTGTCACAAAACGATTGGCTTTCAGGAAGTCGCTGGAAGGTCACTCGTACGGCGTGTCCTTCCTAGCGTGGAGTCCCGACGGCCGACACCTGCTGGCCGCCGGACCCGAGGACTGCCCCGACCTCTGGATCTGGAACATGGAGACGGAGCAGCTGCACCTGAAGATGACTCACTCCCAGGAGGACTCGCTGACGGCGGCCGCCTGGCACGCCAGCGGGAACGCCTTCGTCTGCGGCGGCGCCCGGGGACAGTTCTATCACTGCGCACTCGACGGTACCCTCATCAACAACTGGGACGGTGTCCGTGTGAACGCGCTGGCGTGCCGCTCCGAGGGCCGCGTGTTGGCCGCCGACACTCACCACCGCGTCCGGCTCTATGACTTCAGCGACCTCACCGACAGGAACCTCATCCAGGAGGAGCACGCGGTGATGGCGATGACCCTGAACGCGGCGGACACGCTGCTGCTGCTCAACGTGGCCAACCAGGGAGTCCACCTCTGGGATATCCGAGCCCGAGCGCTCGTCCGTCGCTTCAGGGGCCTGTCTCAGGGACACTTCACCATCCACGCCTGCTTCGGAGGAGCTCATCAAGACTTCATAGCGTCCGGCAGCGAGGACAATAAGGTGTACATCTGGCACATCGACGGCGAGGAGCCCATCGCGGTGGTGTCGGGACACACGAGGTGTGTGAACGCCGTGGCTTGGAACCCCGTGCATCATGACGTGCTGGTGTCCGCCTCCGACGACTACTCCCTGAGGCTGTGGGGCCCGAGGACCCACCAGACCTAG

Protein sequence:

>DPOGS209460-PA
MHQPCTNGAHLNGDAARNGDLPPGLRMSQTDQEIVRLIGQHLLSVGLERSATLLMEESGLHLEHPAAATFRTHVLAGDWVKADHDLRALHDLLRDSPQVEPHNLAEMKFVVLEQKYLEHLEAGRVLDALHVLRNELTPLQYDTARVHRLSALMMCADAAELRQRARWPGGPRSRARLLATVQAVLPPALMMSPGRLRALLAQAAAQQAARCRFHAAPRPSPPVPSPDRDDELAAPEHIPFSLLADHHCSADQFPIHSLQVLNGHCDEVWYCKWSPDGSKLASGSKDNTVMIWDYNPVTKRLAFRKSLEGHSYGVSFLAWSPDGRHLLAAGPEDCPDLWIWNMETEQLHLKMTHSQEDSLTAAAWHASGNAFVCGGARGQFYHCALDGTLINNWDGVRVNALACRSEGRVLAADTHHRVRLYDFSDLTDRNLIQEEHAVMAMTLNAADTLLLLNVANQGVHLWDIRARALVRRFRGLSQGHFTIHACFGGAHQDFIASGSEDNKVYIWHIDGEEPIAVVSGHTRCVNAVAWNPVHHDVLVSASDDYSLRLWGPRTHQT-