Monarch geneset OGS2.0

DPOGS210327
TranscriptDPOGS210327-TA1230 bp
ProteinDPOGS210327-PA409 aa
Genomic positionDPSCF300025 - 661492-664547
RNAseq coverage728x (Rank: top 18%)
Annotation
HeliconiusHMEL0138550.094.33% 
BombyxBGIBMGA011965-TA2e-17191.23% 
DrosophilaCG10077-PA3e-14169.52% 
EBI UniRef50UniRef50_Q8MZI34e-13969.52%CG10077, isoform A n=16 Tax=Diptera RepID=Q8MZI3_DROME
NCBI RefSeqXP_001122489.11e-15076.23%PREDICTED: similar to CG10077-PA, isoform A, partial [Apis mellifera]
NCBI nr blastpgi|3800275102e-14976.23%PREDICTED: probable ATP-dependent RNA helicase DDX17-like [Apis florea]
NCBI nr blastxgi|3454976278e-15270.40%PREDICTED: probable ATP-dependent RNA helicase DDX17-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055241.5e-50ATP binding
GO:00080261.5e-50ATP-dependent helicase activity
GO:00036761.5e-50nucleic acid binding
KEGG pathwayame:7267684e-150 
 K12823 (DDX5, DBP2)maps-> Spliceosome
InterPro domain[163-366] IPR0140011.2e-65DEAD-like helicase
[168-339] IPR0115451.5e-50DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL30889 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210327-TA
ATGGAGAAGTATGATGATAGAAGAGGCGGCCGTGGCGGTGGACGTGGTGCCCCACGTGGGCGTGGTGGTAGCCGTGGGTCTATGAGTTCTAGGGGTGGTAGATCTAGCGATAGGTCACAAGAGAGAGGCAATAGGTTTAGCGGTGGCGGCCGCGGAAGAGATGTCGGTCGGGGAGGACGGGGCATGGGAAGCCGAGGCGGAGGAGGACGAAGTGATAACCGCGATGACTTCCGTGGTAAATTCAATAAGAATGACCAGCCCGGTGGTGCGTTACGGAAAATCCGCTGGGACAATGTCCAGCTGACGCCGTTCCAGAAAAACTTTTACGTTCCCCATCCTAATGTCGAAATGCGTTCCCAGGCTGAAGTGGAAGCATACCGTAGTCAACATCAGATTACGGTCAAGGGGAGGGATGTTCCTGCACCCAGTATGTTTTTTGATGAAGGTGGCTTTCCTGATTATGCCATGAAGGAAATACTTAAACAAGGCTTCCCTAATCCGACTCCAATTCAGGCTCAAGGATGGCCAATTGCTCTGTCCGGGCGGGACATGGTCGGAATTGCTCAAACAGGCTCCGGAAAAACCCTAGCATATATTCTGCCAGCTATTGTACACATTATAAATCAGCCTCGGCTCCTGAGAGACGAAGGACCCATAGTCCTCGTTTTAGCTCCAACTAGAGAGCTGGCCCAGCAAATACAAACGGTGGCAAATGAATTTGGTCAGAGCGTTCAAGTACGGAACACGTGTATATTTGGCGGGGCTCCCAAGGGCCCTCAAGGGCGCACACTGGAAAGAGGTGTTGAAATTGTCATCGCTACCCCCGGGAGACTTATAGACTTCCTTGAGAAGGACACAACAAACCTCCGTCGCTGCACCTACTTGGTACTTGATGAGGCAGACAGGATGTTGGACATGGGTTTTGAACCTCAGATTAGGAAGATCATTGAACAGATACGACCTGATCGGCAGGTCCTAATGTGGTCGGCAACTTGGCCCAAGGAGGTGCAGAATTTAGCGGAGGAGTTCCTACATGATTACATCCAGATCAACATTGGGTCGCTGTCGCTGTCTGCCAACCACAACATCCTGCAGATAGTGGACGTTTGCGAGGAGTGGGAGAAGAACGACAAACTGCTCACTCTACTCACTGAAATATCATCGGAGGAGGAAACCAAGACCATCATCTTTGCTGAGACGAAACGGAAGGCAAGCCAACTTATGATATGA

Protein sequence:

>DPOGS210327-PA
MEKYDDRRGGRGGGRGAPRGRGGSRGSMSSRGGRSSDRSQERGNRFSGGGRGRDVGRGGRGMGSRGGGGRSDNRDDFRGKFNKNDQPGGALRKIRWDNVQLTPFQKNFYVPHPNVEMRSQAEVEAYRSQHQITVKGRDVPAPSMFFDEGGFPDYAMKEILKQGFPNPTPIQAQGWPIALSGRDMVGIAQTGSGKTLAYILPAIVHIINQPRLLRDEGPIVLVLAPTRELAQQIQTVANEFGQSVQVRNTCIFGGAPKGPQGRTLERGVEIVIATPGRLIDFLEKDTTNLRRCTYLVLDEADRMLDMGFEPQIRKIIEQIRPDRQVLMWSATWPKEVQNLAEEFLHDYIQINIGSLSLSANHNILQIVDVCEEWEKNDKLLTLLTEISSEEETKTIIFAETKRKASQLMI-