Monarch geneset OGS2.0

DPOGS206827
Transcript: DPOGS206827-TA; 1278 bp
Protein: DPOGS206827-PA; 425 aa
Genomic position: DPSCF300001 - 3449970-3451247
RNAseq coverage: 1419x (Rank: top 9%)
Annotation
Heliconius: HMEL009594; 0.0; 97.88%
Bombyx: BGIBMGA012772-TA; 0.0; 96.47%
Drosophila: Hel25E-PB; 0.0; 89.67%
EBI UniRef50: UniRef50_O00148; 0.0; 81.92%; ATP-dependent RNA helicase DDX39A n=244 Tax=root RepID=DX39A_HUMAN
NCBI RefSeq: XP_002089208.1; 0.0; 89.91%; Hel25E [Drosophila yakuba]
NCBI nr blastp: gi|194856731; 0.0; 89.91%; GG24297 [Drosophila erecta]
NCBI nr blastx: gi|340718595; 0.0; 90.82%; PREDICTED: ATP-dependent RNA helicase WM6-like [Bombus terrestris]
Group
Gene Ontology: GO:0005524; 3.5e-42; ATP binding
  GO:0008026; 3.5e-42; ATP-dependent helicase activity
  GO:0003676; 3.5e-42; nucleic acid binding
  GO:0004386; 5.9e-24; helicase activity
KEGG pathway: dya:Dyak_GE18993; 0.0
  K12812 (UAP56, BAT1, SUB2) maps-> Spliceosome
InterPro domain: [62-263] IPR014001; 1.7e-52; DEAD-like helicase
  [68-233] IPR011545; 3.5e-42; DNA/RNA helicase, DEAD/DEAH box type, N-terminal
  [299-380] IPR001650; 5.9e-24; Helicase, C-terminal
Orthology group: MCL10203; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206827-TA
ATGGCTGATAACGACGATCTTCTCGACTATGAAGATGAAGAGCAGGCAGATCAACAGAACGTTGACGGAGCGACCGAAGCGGCGCCAAAAAAGGAGGTTAAGGGTTCCTACGTATCGATTCACAGCTCTGGCTTTAGGGACTTTCTCCTTAAGCCTGAGATTCTACGAGCCATAGTGGACTGTGGTTTCGAACATCCATCAGAAGTTCAACATGAATGTATTCCTCAAGCGGTACTGGGTATGGATATCTTATGCCAAGCTAAGTCCGGTATGGGCAAAACCGCGGTGTTTGTACTAGCCACGTTACAGCAACTAGAGCCTTCGGATAATCATGTTTATGTTCTTGTCATGTGCCACACCCGAGAATTAGCTTTTCAAATCAGCAAAGAGTATGAACGCTTTTCTAAGTACATGGCCGGCGTCCGTGTGTCAGTTTTCTTTGGCGGCATGCCGATTCAAAAAGATGAGGATGTTTTAAAAACTGCATGCCCTCATATCGTCGTTGGGACACCAGGACGTATTTTAGCCCTCGTTAACAATAAGAAATTAAACTTGAAACATCTAAAACATTTCATACTTGATGAATGCGACAAGATGTTGGAATCTCTGGATATGAGAAGAGATGTGCAAGAAATATTCCGAAACACTCCTCATGGGAAACAAGTGATGATGTTTTCTGCCACTTTGAGTAAAGATATCAGACCCGTATGTAAGAAATTCATGCAGGACCCCATGGAAGTGTATGTTGATGATGAAGCAAAACTTACTCTGCATGGACTTCAGCAGCACTATGTGAAACTTAAAGAAAATGAGAAGAATAAGAAGTTATTTGAACTCTTAGATGTGTTGGAATTCAACCAAGTAGTTATATTTGTAAAATCGGTTCAAAGATGCATAGCTCTAGCACAACTTTTGACGGACCAAAACTTCCCTGCCATTGGAATTCATAGAAACATGACACAAGACGAGCGTCTTTCCCGTTACCAACAGTTCAAAGATTTCCAAAAGAGAATATTGGTAGCAACCAATCTGTTTGGAAGAGGGATGGATATTGAAAGAGTTAATATAGTATTTAATTATGACATGCCTGAAGACTCTGATACCTACCTCCATAGAGTGGCCCGAGCAGGAAGATTCGGTACCAAAGGTCTAGCCATAACAATGGTATCGGATGAAAATGATGCAAAAATCCTGAACGAAGTCCAGGACCGTTTTGATGTCAACATAACAGAGCTACCTGATGAGATTGAGCTATCCACTTACATAGAGGGACGATAA

Protein sequence:

>DPOGS206827-PA
MADNDDLLDYEDEEQADQQNVDGATEAAPKKEVKGSYVSIHSSGFRDFLLKPEILRAIVDCGFEHPSEVQHECIPQAVLGMDILCQAKSGMGKTAVFVLATLQQLEPSDNHVYVLVMCHTRELAFQISKEYERFSKYMAGVRVSVFFGGMPIQKDEDVLKTACPHIVVGTPGRILALVNNKKLNLKHLKHFILDECDKMLESLDMRRDVQEIFRNTPHGKQVMMFSATLSKDIRPVCKKFMQDPMEVYVDDEAKLTLHGLQQHYVKLKENEKNKKLFELLDVLEFNQVVIFVKSVQRCIALAQLLTDQNFPAIGIHRNMTQDERLSRYQQFKDFQKRILVATNLFGRGMDIERVNIVFNYDMPEDSDTYLHRVARAGRFGTKGLAITMVSDENDAKILNEVQDRFDVNITELPDEIELSTYIEGR-
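
The figures in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the geneset pipeline. It assumes the genomic coordinates are 1-based and inclusive, that the transcript is a single uninterrupted coding sequence, and that the standard genetic code applies; the helper names `translate` and `span_length` are hypothetical. Stop codons are written as `-` to match the convention used in the protein sequence above.

```python
# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_char="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are emitted as stop_char rather than '*'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_char if aa == "*" else aa)
    return "".join(residues)


def span_length(start, end):
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
genomic_span = span_length(3449970, 3451247)
transcript_bp = 1278
protein_aa = 425

# The genomic span equals the transcript length, and the transcript
# holds 425 sense codons plus one stop codon.
print(genomic_span == transcript_bp)
print(transcript_bp == 3 * (protein_aa + 1))

# First 27 and last 12 nucleotides of DPOGS206827-TA.
print(translate("ATGGCTGATAACGACGATCTTCTCGAC"))
print(translate("GAGGGACGATAA"))
```

Passing the full DPOGS206827-TA sequence to `translate` and comparing the result with DPOGS206827-PA extends the same check to the whole record.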