Monarch geneset OGS2.0

DPOGS204851
TranscriptDPOGS204851-TA2172 bp
ProteinDPOGS204851-PA723 aa
Genomic positionDPSCF300227 + 11759-22124
RNAseq coverage38x (Rank: top 73%)
Annotation
HeliconiusHMEL0024030.076.89% 
BombyxBGIBMGA011750-TA1e-1952.63% 
Drosophilaosm-1-PA2e-14640.24% 
EBI UniRef50UniRef50_Q9W0403e-14440.24%Intraflagellar transport protein osm-1 n=17 Tax=Diptera RepID=OSM1_DROME
NCBI RefSeqXP_001842122.12e-14640.22%osm-1 [Culex quinquefasciatus]
NCBI nr blastpgi|1700284783e-14540.22%osm-1 [Culex quinquefasciatus]
NCBI nr blastxgi|1984665431e-14241.22%GA12544 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00055157.7e-26protein binding
KEGG pathway 
InterPro domain[15-301] IPR0110467.7e-26WD40 repeat-like-containing domain
[15-287] IPR0159436.6e-22WD40/YVTN repeat-like-containing domain
Orthology groupMCL11167 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204851-TA
ATGCGACTGAAATTTTCTAAAACGCTTTTGGATGCTCAGGAATCGGATATCCCAGTAGCCGACATATGTTGGTCACCAAATAATGTAAAATTAGCTGTTGGCACTTCTGAACGGGTGGTACTTTTATTTGACCGCGATGGGATGCGTCGGGATAAGTTTAGCACCAAGCCAGCTGATGCAGCAGCAGGAAAGAAATCATATGTTATCACCGGTTTAGCATTCAGCGACAATTCGGAATTGCTAGGGGTAGCTCAAAGCGATAACATGGTTTTTGTATACCGAATCGGTGCTGACTGGAGCGGCAAAAAAGTAATCTGTAATAAATTTCCTCTCACCGGGTCGCCTTTACGTTTACTTCCAGCTGAAACTGGATTCTTTACAGGAACTAGTGATGGAAAGATAAGATCTTTGGATTGTAAAACAAATAAAAGCTCCAGTTTGTGGTCCGCGGGATCATGTTGCGTGTCTCTGGCACGTGGATCTGAGGCGATGTTGGCATCTGGACACATTGATGGAACTATTTATTTAAATGGCAGATTAATATTACGCTACACTCTACCGCCGACAGCCATGGTCTTGGTATCTTCCTACCTGATAGTCGGAGCTTGTGATGGTAGAATAACAATGTATGAGGCTCAGAGAGGAGCGCTAGTAAGAAGTTTAGAACCAACATTACCTCCTGATAGAAGAGATATCATATCAGCTTCTCTTAGTCCTTCTGGTCAGACGATAGCATTCGGTGTATTTGATGGTTGCCTAATCGGTGAAATAAAGGAATCTGGAAGTATGGAATTATCCACGTTAAATATATCGAATTTGTATGCTGCGCGATCTTTAGCATGGAGTGGAGATGGAACCAAGTTGGCAGTAGCATCGCAAACTGGTGCCGTTTTGGTCTTTGAAGCCGTCCTTAGACGTTGGGTGTGGCGGGACCTTATAGAGGTGCAGCACGTCAGCACACGTCAGCTGCTGCTGCGACGTCGTGGTGCTGATACTGCTGCGCTCACTGTTACCGCTAAACTAGCCCCCGATATATTTAATGTTAGATTTATTGGTAACGATTGGTACGCTGTCTGTCGTACAAGTAATAGTCTGATCCTGTGCGACATAGCTCGTGGTCTGACTAGCGAAATTCCATGGTCCGGTGGAGGGGAGCGTATATATGCAGCTGTGGGTGGAGCCTGTCTGTTGCAGCGTGCTGGCGAACTTGTGTCGTGGAGTACGGCCTTGATAGGGTACTACAAACTGTTTCCGGATGTGCGTACTGAGCGTGTGAATCCTCACGTGCTTAGTGTCCGTATCAATGAGGGCCGGAAGACGGAGGAAGAACGCAAACACTTCGCCTATTTGCTGGATAGACAGACGATTGCTGTCATTGATCTCGTTACTGGAGTTCAGTTAGGGCAATGGTGGCACGAAGCTCGGGTGGATTGGTTGGAGCTGAATGAGAGCGGACATCTGCTTCTGTTCCGCGATACTAGGCGACGTCTAGCACTTCTCCGTATTGATACCGGTGATAAGGAAATTATCGCGAGCGGAGTTAGTTTCGTGCAATGGATAGAAAACAGTGACGCTGTTGTAGCCCAAACCCCAACTCATTTGCTCATTTGGTACAGTGTATGGGAGCCTCAATGTGTTGAAATGTCTGAGTGTGGAGGCGGCTCGGCCGTATCGGTGTCAGAGCGACGAGTCGTACTGGAAGGTGGTCAGATTCAGGCCATCGTTTTGGATGAACATCGACTAGCTTTCAATTCGGCGTTACGTAGCGGGGACTTACAAGATTGTGCTCAGTACTTGGATGCTGTGTCACGGTCTGCGGACGTCGGGACGCTGTGGTGCCAACTGGCTGAGCAAGCTTTGACTGCTTACGATGTTGAGTTGGCGACGAAGTGCTATAGAGCTGTGGGTGATGAGGCTAGAACTTTTTATCTAGAGAAAACTGTCGAGTTAGCTTCAGCCAAAGGGAACGGGAATATCGATGAAGGTTTAAGGAGTCCCGAGGTTCGTGCACGTCTATCAATCTTTGTGGGAGATTTAACCACCGCCGAAGAATATTATGTACGCGGAGCCGCTCAGTCAGAACTGGCCATTAATATGTATAAGCAGTTCAATAGATGGCCGGACGCTATCGCCCTCGCTGAAAAGGTCGATAGACAGGCGGTGACGGCGTAG

Protein sequence:

>DPOGS204851-PA
MRLKFSKTLLDAQESDIPVADICWSPNNVKLAVGTSERVVLLFDRDGMRRDKFSTKPADAAAGKKSYVITGLAFSDNSELLGVAQSDNMVFVYRIGADWSGKKVICNKFPLTGSPLRLLPAETGFFTGTSDGKIRSLDCKTNKSSSLWSAGSCCVSLARGSEAMLASGHIDGTIYLNGRLILRYTLPPTAMVLVSSYLIVGACDGRITMYEAQRGALVRSLEPTLPPDRRDIISASLSPSGQTIAFGVFDGCLIGEIKESGSMELSTLNISNLYAARSLAWSGDGTKLAVASQTGAVLVFEAVLRRWVWRDLIEVQHVSTRQLLLRRRGADTAALTVTAKLAPDIFNVRFIGNDWYAVCRTSNSLILCDIARGLTSEIPWSGGGERIYAAVGGACLLQRAGELVSWSTALIGYYKLFPDVRTERVNPHVLSVRINEGRKTEEERKHFAYLLDRQTIAVIDLVTGVQLGQWWHEARVDWLELNESGHLLLFRDTRRRLALLRIDTGDKEIIASGVSFVQWIENSDAVVAQTPTHLLIWYSVWEPQCVEMSECGGGSAVSVSERRVVLEGGQIQAIVLDEHRLAFNSALRSGDLQDCAQYLDAVSRSADVGTLWCQLAEQALTAYDVELATKCYRAVGDEARTFYLEKTVELASAKGNGNIDEGLRSPEVRARLSIFVGDLTTAEEYYVRGAAQSELAINMYKQFNRWPDAIALAEKVDRQAVTA-