Monarch geneset OGS2.0

DPOGS209531
Transcript: DPOGS209531-TA, 3078 bp
Protein: DPOGS209531-PA, 1025 aa
Genomic position: DPSCF300157 - 57728-66478
RNAseq coverage: 222x (Rank: top 45%)
Annotation
Heliconius      HMEL006077        0.0  94.69%
Bombyx          BGIBMGA013837-TA  0.0  88.98%
Drosophila      CG9062-PB         0.0  61.78%
EBI UniRef50    UniRef50_Q16MY0   0.0  62.91%  WD repeat-containing protein 48 homolog n=1 Tax=Aedes aegypti RepID=WDR48_AEDAE
NCBI RefSeq     XP_974999.1       0.0  65.09%  PREDICTED: similar to CG9062 CG9062-PB [Tribolium castaneum]
NCBI nr blastp  gi|307168132      0.0  64.99%  WD repeat-containing protein 48 [Camponotus floridanus]
NCBI nr blastx  gi|307168132      0.0  64.99%  WD repeat-containing protein 48 [Camponotus floridanus]
Group
Gene Ontology
  GO:0005515  4.8e-69  protein binding
  GO:0005525  5.1e-57  GTP binding
  GO:0003924  5.1e-57  GTPase activity
KEGG pathway
  dse:Dsec_GM20785  9e-165
  K02358 (EF-TU, tufA) maps-> Plant-pathogen interaction
InterPro domain
  [21-379]    IPR015943  4.8e-69  WD40/YVTN repeat-like-containing domain
  [21-316]    IPR011046  4.8e-67  WD40 repeat-like-containing domain
  [623-815]   IPR000795  5.1e-57  Protein synthesis factor, GTP-binding
  [357-622]   IPR021772  4.5e-49  Protein of unknown function DUF3337
  [916-1008]  IPR009001  1.5e-27  Translation elongation factor EF1A/initiation factor IF2gamma, C-terminal
  [819-915]   IPR009000  1.2e-24  Translation elongation/initiation factor/Ribosomal, beta-barrel
  [916-1005]  IPR004160  4.1e-21  Translation elongation factor EFTu/EF1A, C-terminal
  [839-908]   IPR004161  3.8e-12  Translation elongation factor EFTu/EF1A, domain 2
  [624-757]   IPR005225  5.9e-12  Small GTP-binding protein domain
  [193-232]   IPR001680  2.8e-08  WD40 repeat
  [197-232]   IPR019781  3.2e-08  WD40 repeat, subgroup
  [84-98]     IPR020472  2.8e-06  G-protein beta WD-40 repeat
Orthology group: MCL13323, Single-copy universal gene
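The InterPro hits above are listed by e-value, which makes the domain layout along the protein hard to read. As a minimal sketch, the Python below reorders the same hits by protein coordinate and checks that every hit falls inside the 1025 aa protein. The names `INTERPRO_HITS`, `sort_hits`, and `hits_within_protein` are hypothetical helpers introduced here for illustration; only the coordinates and InterPro IDs come from the record.

```python
# InterPro hits for DPOGS209531-PA, copied from the record as
# (start, end, InterPro ID) in protein coordinates.
PROTEIN_LENGTH = 1025  # aa, as stated in the record

INTERPRO_HITS = [
    (21, 379, "IPR015943"),
    (21, 316, "IPR011046"),
    (623, 815, "IPR000795"),
    (357, 622, "IPR021772"),
    (916, 1008, "IPR009001"),
    (819, 915, "IPR009000"),
    (916, 1005, "IPR004160"),
    (839, 908, "IPR004161"),
    (624, 757, "IPR005225"),
    (193, 232, "IPR001680"),
    (197, 232, "IPR019781"),
    (84, 98, "IPR020472"),
]


def sort_hits(hits):
    """Return the hits ordered by start coordinate, then by end coordinate."""
    return sorted(hits, key=lambda hit: (hit[0], hit[1]))


def hits_within_protein(hits, length):
    """Check that every hit lies inside positions 1..length."""
    return all(1 <= start <= end <= length for start, end, _ in hits)


if __name__ == "__main__":
    for start, end, ipr in sort_hits(INTERPRO_HITS):
        print(f"{start:>4}-{end:<4} {ipr}")
```

Read in this order, the WD40-type hits sit in the N-terminal part of the protein (up to position 379) and the translation-factor hits sit in the C-terminal part (from position 623).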
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209531-TA
ATGGGCACGAACATGCGTAAAAAGACCCAGGTATCATTTGTGATTCGAGACGAAGAAGAGCGACGTCATAAAAACGGTGTTAGCTCCTTACAATTGGATCCTATACAAGGCAGACTTTATTCAGCAGGCCGAGATGGAATTATTCGTGTCTGGCATACAGGAGGCGGTACTCAGGATAGATACATACAGAGTATGGAACACCACACCGATTGGGTGAATGACATAGTATTATGTTGTGGCGGAAAAAATCTTATAAGCGCGTCATCTGACACTACTGTTAAAGTATGGAACGCACCAAAAGGTTTTTGTATGTCCACATTAAGAACTCATAAAGATTATGTTCGGACTCTTGCGTACGCTAAAGATAAAGAGCAGGTCGCCAGTGCAGGGCTTGATCGTGCCATATTCTTGTGGGATGTAAATACTTTAACCGCTTTAACAGCTAGCAACAATACTGTTACTACGTCAAGTTTGGTAGGAAATAAAGAATCTATATACAGCTTGGCTATGAATCCTCCTGGGACGATTTTAGTCAGTGGCTCAACTGAAAAAGTTCTTCGAGTTTGGGATCCCAGAAATTGCTCGCGTCTTATGAAGCTTAAGGGCCATGCTGATAACGTAAAAGCGTTAGTCGTGAGCAGAGATGGCTCACAATGTGTATCTGGAAGCTCTGATGGTACAATAAAATTATGGTCTCTGTCACAACAGAGATGCGTTTCTACTATACGTGTTCATTCCGAGGCTGTGTGGGCCCTACTGGCGACTGAAAACTTCACACATATAATATCAGGTGGTAGAGATCGTCTAGTCATCATAACAGAACTCAGGAACCCAGAAAACTACATGATAGTATGTGAGGAAACTGCCCCAATATTAAAATTGTGTTTCACTGCCGACCAACAAGGTATATGGGTGGCAACATCAGATTCAGACATAAGATGTTGGAAATTACCACCACTGAACTCATTAAACTCAGATATGTATACTCAGAACAATTATAATACTAACAATGTGTACCAAACACAACCATTACACAACATAGTCGGCGGCAGAGCCATAAAACACTACACAGTTTTAAACGACAAACGACACATTTTAACTAAAGACACCACCAACAATGTTGTATTGTATGATGTGCTGAAAGCATGCAAAGTCGAAGATTTGGGTGAGGTTGATTATGAAGAGGAATTGAAGAAACGTTTCAAAATGGTTTACGTACCAAATTGGTTTAACGTAGATTTAAAGACTGGAATGCTAACAATACATCTGGGGCAAGATGAGACCGACTGTTTTAGCGCCTGGGTCAGCGCTAAAGAGGCCGGTTTGATAACGGAGAATGATCAGAAAGTCAATTTTGGGGCTCTATTATTGCAAGCTTTGTTGGATCATTGGAATCATCCTAATAGGGTTAATGAAGCAGGTCAAAAAGTCGTCGGTAACATATACTTCAGTGTTCCGTTACACACTCCCCTAATATTTAGTGAAGTTGGCGGAAGAACGCTTTACAGATTGCAGGTTGGTGACGCTGGCGGTGAAACGGAGGGCAACCTTCTCATGGAGACTGTTCCGTCGTGGGTTGTAGATGTGGCCATAGAAATGGCTGCCCCAAAACTGAACAAACTACCATTCTACCTATTGCCACATTCAAGTTGTCAGAGTAAACAGGATCGGCAGAAAAAGGACCGTCTGGTGGCAAATGATTTCATCCAAGTCCGTAAAGTCGGTGAGCATGTCGTGGAGAAGATTGTCGGCGGCGGTGATGTAAACGGCAGTTCCAAGAACGAAGACTGTAACAACGACTCCCCGGAAGAAAGAGTGGAACTGTTGTGCTGTGATCAGGTCCTCGACCCGAACATGGATTTACAGAAAAAACATTGCAATGTAGGAACTATAGGGCATGTCGATCATGGAAAAACGACACTTACAGCTGCGATAACCAAAGTTTTATCTAAAGGTGGTTTCGCGAAATATGTTTCTTACGACGAAATTGATAAGGCCCC
GGAAGAAAAAGCACGGGGAATTACGATAAATGCGGCACACGTAGGCTACAGTACGAATAACAGACACTACGCTCATACTGACTGTCCGGGACATGCAGATTATATCAGAAACATGATATCGGGAGCGTCCCAAATGGATGCAGCAATTGTTGTAGTCGCAGCCAACGACGGGCCAATGCCGCAAACAAAGGAACATTTGTTGTTGGCCAAACAAGTCGGAATCAAATATGTTTTGGTGTACATAAATAAAGTTGATATCGTAGATAATGAATTAGTCGAATTAGTAGAGATAGAGATGCGAGAAATGTTGACAGATTATGGCTTCGAAGGTTTATCAGCACCTGTGGTTCACGGCTCAGCACTCGCCGCCTTAAAAGATGATCAAACTGAAATAGGGGTTCCCTCTATAATCAAACTCTTGGACACTATGGATAATTACATTCCACCAATCATACGAGATCTAGAATCACCATTTCTTCTGCCCATAGACAACGCTTTCACAGTACCGGGTAGAGGTACTGTTGTTGTTGGAACCATAAAAAGGGGTGTCATGAAGAAGAATGATGAGGCAGAGCTCATGGGATTCGGCTATAATATTAAGACAACACTATCTGATATACAGATATTTAGAAATAGTGTGCCCAAGGCACTGGCTGGTGATAACGTGGGCGTCTTACTTCGCGGTATGAAACTCAAGAACGTTGAGACCGGCATGATCCTGTGTGCGGCGAAGAGTTTGAGCCTGAGCAATCATTATAAGGCGAAAGTTTATTTCCTGAGTCACTCAGAAGGAGGGAGAAAAAAGCCTGTCTTTTCAAAATACACGCAACAAATGTTCAGCGGAACGTGGAATATTGCTTGCAGGATAGATTTAGAGCCTTCGATGGAAATGTTAATGCCTGGTGATCACGCGGACGTCTTCCTCACCCTGTTAGAGGGCATGGTGATGGTCAAAGGACAACAGTTCACCATCAGAGAGAACAACGTGACAGTTGCCACGGGCATCATAACGGACGCTATGAACGCCATAGACGTGCCCAACGGGAAGCTAGGGAAGATCGTCCTCGACACTAATTAG

Protein sequence:

>DPOGS209531-PA
MGTNMRKKTQVSFVIRDEEERRHKNGVSSLQLDPIQGRLYSAGRDGIIRVWHTGGGTQDRYIQSMEHHTDWVNDIVLCCGGKNLISASSDTTVKVWNAPKGFCMSTLRTHKDYVRTLAYAKDKEQVASAGLDRAIFLWDVNTLTALTASNNTVTTSSLVGNKESIYSLAMNPPGTILVSGSTEKVLRVWDPRNCSRLMKLKGHADNVKALVVSRDGSQCVSGSSDGTIKLWSLSQQRCVSTIRVHSEAVWALLATENFTHIISGGRDRLVIITELRNPENYMIVCEETAPILKLCFTADQQGIWVATSDSDIRCWKLPPLNSLNSDMYTQNNYNTNNVYQTQPLHNIVGGRAIKHYTVLNDKRHILTKDTTNNVVLYDVLKACKVEDLGEVDYEEELKKRFKMVYVPNWFNVDLKTGMLTIHLGQDETDCFSAWVSAKEAGLITENDQKVNFGALLLQALLDHWNHPNRVNEAGQKVVGNIYFSVPLHTPLIFSEVGGRTLYRLQVGDAGGETEGNLLMETVPSWVVDVAIEMAAPKLNKLPFYLLPHSSCQSKQDRQKKDRLVANDFIQVRKVGEHVVEKIVGGGDVNGSSKNEDCNNDSPEERVELLCCDQVLDPNMDLQKKHCNVGTIGHVDHGKTTLTAAITKVLSKGGFAKYVSYDEIDKAPEEKARGITINAAHVGYSTNNRHYAHTDCPGHADYIRNMISGASQMDAAIVVVAANDGPMPQTKEHLLLAKQVGIKYVLVYINKVDIVDNELVELVEIEMREMLTDYGFEGLSAPVVHGSALAALKDDQTEIGVPSIIKLLDTMDNYIPPIIRDLESPFLLPIDNAFTVPGRGTVVVGTIKRGVMKKNDEAELMGFGYNIKTTLSDIQIFRNSVPKALAGDNVGVLLRGMKLKNVETGMILCAAKSLSLSNHYKAKVYFLSHSEGGRKKPVFSKYTQQMFSGTWNIACRIDLEPSMEMLMPGDHADVFLTLLEGMVMVKGQQFTIRENNVTVATGIITDAMNAIDVPNGKLGKIVLDTN-
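
The stated lengths of the two sequences can be cross-checked with simple arithmetic. The sketch below assumes that the transcript record contains only the coding sequence, from the ATG start codon through the terminal stop codon (shown as a trailing "-" in the protein sequence); the function name `expected_cds_length` is a hypothetical helper, not part of any database tooling.

```python
TRANSCRIPT_BP = 3078  # length given for DPOGS209531-TA
PROTEIN_AA = 1025     # length given for DPOGS209531-PA


def expected_cds_length(protein_aa, include_stop=True):
    """Nucleotides needed to encode protein_aa residues, optionally plus a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    expected = expected_cds_length(PROTEIN_AA)
    print(f"expected {expected} bp, record states {TRANSCRIPT_BP} bp")
```

Under that assumption the numbers agree: 1025 codons plus one stop codon give 1026 x 3 = 3078 bp.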