Monarch geneset OGS2.0

DPOGS207659
TranscriptDPOGS207659-TA1491 bp
ProteinDPOGS207659-PA496 aa
Genomic positionDPSCF300133 - 95850-99139
RNAseq coverage307x (Rank: top 37%)
Annotation
HeliconiusHMEL0037520.077.22% 
BombyxBGIBMGA010510-TA0.071.17% 
DrosophilaKH1-PA3e-12545.11% 
EBI UniRef50UniRef50_Q9VPT35e-12345.11%KH1 n=18 Tax=Diptera RepID=Q9VPT3_DROME
NCBI RefSeqXP_002041625.12e-12545.57%GM16768 [Drosophila sechellia]
NCBI nr blastpgi|1953501915e-12445.57%GM16768 [Drosophila sechellia]
NCBI nr blastxgi|1953501916e-11845.57%GM16768 [Drosophila sechellia]
Group
Gene OntologyGO:00055245e-34ATP binding
GO:00080265e-34ATP-dependent helicase activity
GO:00036765e-34nucleic acid binding
GO:00043864.2e-19helicase activity
KEGG pathwayxal:XALc_27204e-45 
 K11927 (rhlE)maps-> RNA degradation
InterPro domain[105-315] IPR0140015.2e-36DEAD-like helicase
[110-290] IPR0115455e-34DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[360-441] IPR0016504.2e-19Helicase, C-terminal
Orthology groupMCL13967 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207659-TA
ATGTCTAAGTTACGAGAATGTTTGAAACTGTCTTATTGTAGATATTACAGTGCTCAACAAGTTAAAAAGAAATTGCCTATTATAACATGTAAACGTCCCGAATTTAATCATTATGAAGGACAAACCTATGCTAAGTTTGATGGTATTAAACTTGCCTCACAAGGTTGGTTGCATCCGAAATCAAAAAACGACTATTTTATAGTATATGGAGACGCAAATAAAAAAAATCAAAAAGAAGAAAATGTAGTTTACAAGAAAAGTTTCGAAGAAATTGGCTTGTGTGTAGAACTTAAAGAGGTTATGTCTCGACTCGGATATGAACTGCCAACTGCGATTCAGGCTAGGTCATTTTCGCCTATAATCCAGGGATATAATACTGCTTTGACCGCAGAGACAGGCTGTGGAAAGACTTTGGCCTATTTATTGCCAGTACTGCAACACATACTTGAATGGAAGAATCATGTAAAGGAAGATTTTAATAGTCCACTAGCTGTCGTTATCACACCAAGTAGAGAGCTAGCAACACAAATAGCCGAAGTTGCCGAGAATGTTTGTCAGAATCTTAACATTAATATATCCACACTGGTTGGAGGCAAGACCAAGCAGAAGATGATGAATCCTCCGATAGAATACAGCGATTTACTTATAACAACTTTAGGAGCCTACAGCAAGTTAGTCACAACCGGTATATGCAAGATACATAATGTGCATCACATTATATTGGATGAAGCGGACACATTACTTGATGATAGTTTCATTGATAAACTATCTTTGCTGTTGAAGAAATTTCCTATTCAATTCAAAGTAGATATAAAGAATCCTCCATCTGGGTGTCAAGTTACACTGGTCAGTGCAACATTACCTCATGAGTTACCGGAAGCAGTGAACTCCTTCATGGACCCACAGTCATTACGAACAATCACTACAGATAATGTCCATCGTATTCTACCCCATGTGCCTCATAAGTTCATCCGTCTCGGCAAAGCCCAGAAACCTCTTGAACTGTTGAAACTAGTGCAAGCGGATGTTAATTTACAGAGGCCAGTTATGATATTCTCTAACAAGACGTCGACTTGTGATTTCTTAGCGATGTTCCTGAATGAGAATAACATTGAATGTATAAATATTAACGGACGAATGGCTGTTCCACTGAAGATGGGGAAGTATGAAATGTTCAAAAACGGTCAGGTCAATGTACTCTCGTGCACTGATATTGCGTCACGTGGCTTGGATACTTTGAGGACTCGACACATCATTAACTACGATTTTCCTTTGTACACATCCGATTACATCCATCGTTGCGGAAGAACAGGTAGATTGGGTTCATCTAATGATTGTTCCATCACTAATTTCGTGGCTTGGCCAAGAGAGATACAACTCGTGCAAAAAATAGAGACCGCTCTGAGAAAACACGCAGCTTTACCAAATGTCAATGCCAACATCAAACGTCAGATTGAAGACAGAATATCTAGAATGACGTCGGCTCTATAA

Protein sequence:

>DPOGS207659-PA
MSKLRECLKLSYCRYYSAQQVKKKLPIITCKRPEFNHYEGQTYAKFDGIKLASQGWLHPKSKNDYFIVYGDANKKNQKEENVVYKKSFEEIGLCVELKEVMSRLGYELPTAIQARSFSPIIQGYNTALTAETGCGKTLAYLLPVLQHILEWKNHVKEDFNSPLAVVITPSRELATQIAEVAENVCQNLNINISTLVGGKTKQKMMNPPIEYSDLLITTLGAYSKLVTTGICKIHNVHHIILDEADTLLDDSFIDKLSLLLKKFPIQFKVDIKNPPSGCQVTLVSATLPHELPEAVNSFMDPQSLRTITTDNVHRILPHVPHKFIRLGKAQKPLELLKLVQADVNLQRPVMIFSNKTSTCDFLAMFLNENNIECININGRMAVPLKMGKYEMFKNGQVNVLSCTDIASRGLDTLRTRHIINYDFPLYTSDYIHRCGRTGRLGSSNDCSITNFVAWPREIQLVQKIETALRKHAALPNVNANIKRQIEDRISRMTSAL-