Monarch geneset OGS2.0

DPOGS210163
TranscriptDPOGS210163-TA1725 bp
ProteinDPOGS210163-PA574 aa
Genomic positionDPSCF300379 + 157361-163749
RNAseq coverage2136x (Rank: top 6%)
Annotation
HeliconiusHMEL0119830.072.69% 
BombyxBGIBMGA004036-TA0.070.92% 
Drosophilabel-PA1e-11150.24% 
EBI UniRef50UniRef50_F4W9M02e-15159.73%ATP-dependent RNA helicase vasa n=11 Tax=Neoptera RepID=F4W9M0_ACREC
NCBI RefSeqNP_001037347.10.079.30%vasa-like [Bombyx mori]
NCBI nr blastpgi|1129835880.079.30%vasa-like [Bombyx mori]
NCBI nr blastxgi|1129835880.072.77%vasa-like [Bombyx mori]
Group
Gene OntologyGO:00055248.6e-52ATP binding
GO:00080268.6e-52ATP-dependent helicase activity
GO:00036768.6e-52nucleic acid binding
GO:00043863.2e-32helicase activity
KEGG pathway 
InterPro domain[214-425] IPR0140014.6e-62DEAD-like helicase
[219-398] IPR0115458.6e-52DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[461-542] IPR0016503.2e-32Helicase, C-terminal
Orthology groupMCL12849 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210163-TA
ATGGATGACGATTGGGATGAAAACTGTGATGTTGTTCAGCCACAACATCAGCCAACAATACAGAACAACTTTAGCAATTATGATGTAGATGAAGGTCACAGTTTGTCCAGAGGAAGGGGATTCCCCACATTTATAGAAGAAAATGGTAGTAATTATGATGATTTCAATAAGGGCAGCAATAGTTTTGAAGAACGAAGAGGCAGGGGAGGTTATGGTTACGATCGGGGTGGACGTGGCCGAGGTGGTCGAGGTACACGTGGAGGTGGAAGAGATAGAGGCTATGAAAATGGAGGTGATAACTTTGATGATGGCGAAAGGGGTGAGAGACACGAGAGGGGTGAGCGTGGGCGTGGTAGGGGAAGAGGGCGCGGGGGGAGGGGCTTCAGACCAACCGGAGATGGCGGGGGAGAAACAAACGAAGAGGCTGGAGATGATAAGAAAACCCCTGTAACATATGTGCCACCAGAGCCAACTGAAAACGAAGACGAGATATTTAGCAGCACAATAAGTTCCGGCATCAATTTTGATAAGTTCGATTGTATTGCGGTTAAAGTAACTGGAGAGAATCCGCCACGAGCGATCGAGAGCTTTGAAACTGCTAATCTGCGGAATTATGTATTAAACAACATTCTAAAGTCGGGATATAAAAAACCTACGCCAATACAGAAACATGCTATACCGATCATCATGAATGGAAGAGATTTGATGGGCTGCGCTCAAACTGGTTCTGGGAAGACTGCGGCGTTTCTTCTTCCTATTATAAATACTCTCTTACAAGATCTTCGTGAACTGGTGGTCGGTCCAAACGGATGTGCGCAGCCTCAGGTTGTAATAGTAGCCCCTACCCGTGAATTGACAATACAAATATTCAACGAGGCCAGGAAATTCTCTTACGGATCCATTTTAAAGATCGCCGTGGCATATGGCGGTACGGCTGTAAGACATCAAGGGGATAATATATCTCGTGGATGCCACATCCTGGTAGCTACACCTGGTAGACTTCACGATTTTGTTGATCGTAACCGTGTGTCGTTCGACAGCGTGCGATTTGTAGTACTGGACGAGGCTGATCGTATGTTGGATATGGGATTTATGCCCAGTGTTGAGAAGATGATGGACCATCCCACCATGGTTAACATTACAGAACGTCAAACCCTTATGTTTTCGGCAACATTTCCTGAAGATATCCAACATTTGGCCGGACGTTTCCTTAATAACTATCTTTTTGTGGCTGTGGGAGTTGTCGGCGGCGCTAGTACGGATGTTGAACAAATATTCCATCAAGTTATTAAGTACGAAAAGCAAAATACTCTAAAGAAACTCATTGAAGAAAATGATGGCAAACGCATCCTTGTATTTGTCGAAACGAAGCGGAATGCGGATTTCATAGCAGCTATGTTGTCTGAACAACAGATGCTCACATCTTCCATACATGGTGACAGAATGCAGAGGGAGAGAGAGGAAGCCTTGCATAATTTCAAAAGTGGCAGGCATTTTATACTAGTAGCTACTGCGGTTGCTGCAAGAGGATTAGATATAAAAAATGTGGATATAGTCGTGAATTACGACCTCCCTAAAAGTATAGACGAGTATGTACACAGAATCGGCAGGACCGGTCGTGTTGGCAACAGAGGCAAGGCTGTGTCGTTCTTTGATTCCGATCAAAACTTTAACAATGCAAGCACAACTGTTGATCAAGGTCAAGAGCCTGATGAGGAGTGGTAA

Protein sequence:

>DPOGS210163-PA
MDDDWDENCDVVQPQHQPTIQNNFSNYDVDEGHSLSRGRGFPTFIEENGSNYDDFNKGSNSFEERRGRGGYGYDRGGRGRGGRGTRGGGRDRGYENGGDNFDDGERGERHERGERGRGRGRGRGGRGFRPTGDGGGETNEEAGDDKKTPVTYVPPEPTENEDEIFSSTISSGINFDKFDCIAVKVTGENPPRAIESFETANLRNYVLNNILKSGYKKPTPIQKHAIPIIMNGRDLMGCAQTGSGKTAAFLLPIINTLLQDLRELVVGPNGCAQPQVVIVAPTRELTIQIFNEARKFSYGSILKIAVAYGGTAVRHQGDNISRGCHILVATPGRLHDFVDRNRVSFDSVRFVVLDEADRMLDMGFMPSVEKMMDHPTMVNITERQTLMFSATFPEDIQHLAGRFLNNYLFVAVGVVGGASTDVEQIFHQVIKYEKQNTLKKLIEENDGKRILVFVETKRNADFIAAMLSEQQMLTSSIHGDRMQREREEALHNFKSGRHFILVATAVAARGLDIKNVDIVVNYDLPKSIDEYVHRIGRTGRVGNRGKAVSFFDSDQNFNNASTTVDQGQEPDEEW-