Monarch geneset OGS2.0

DPOGS202155
TranscriptDPOGS202155-TA2490 bp
ProteinDPOGS202155-PA829 aa
Genomic positionDPSCF300162 - 94504-96993
RNAseq coverage555x (Rank: top 23%)
Annotation
HeliconiusHMEL0108790.089.37% 
BombyxBGIBMGA003429-TA0.083.33% 
DrosophilaCG10333-PA0.075.74% 
EBI UniRef50UniRef50_Q9BUQ80.064.71%Probable ATP-dependent RNA helicase DDX23 n=107 Tax=Eukaryota RepID=DDX23_HUMAN
NCBI RefSeqXP_002038991.10.075.03%GM17282 [Drosophila sechellia]
NCBI nr blastpgi|3123751000.078.47%hypothetical protein AND_15065 [Anopheles darlingi]
NCBI nr blastxgi|1571210450.073.58%DEAD box ATP-dependent RNA helicase [Aedes aegypti]
Group
Gene OntologyGO:00055243.8e-51ATP binding
GO:00080263.8e-51ATP-dependent helicase activity
GO:00036763.8e-51nucleic acid binding
GO:00043861.7e-34helicase activity
KEGG pathwaydse:Dsec_GM172820.0 
 K12858 (DDX23, PRP28)maps-> Spliceosome
InterPro domain[420-651] IPR0140012.2e-62DEAD-like helicase
[425-625] IPR0115453.8e-51DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[687-768] IPR0016501.7e-34Helicase, C-terminal
Orthology groupMCL13101 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202155-TA
ATGGGTAGAGAAAGAAGCGCAGAACGTCGACGATCTCGTTCTCGAGATAGGCGTGAGAGGGACCGTCACGATGACCGCGATACTTATAAAGATAAATTAAAACGACGTGATAGATCCAGAAGCCCGAGGAAAGAAAAATCCCGTAGTCCTTCGAAAAAGGAATCAGACGTTCGCTCTCGTAGCCCAAGAAGAGAAAAAGAGCGTTCCAGAAGTCCGATTCGTAAAGAACGCGCCCGCTCTAGAAGTCCACGTAAACGTGATGGCAAAGATAAAGACCGGCGGCCAGATGACAGAGAGAAGAATAAAACTAAAAAGGAAGAAGAGAAAACAGAGGAAGAACCCAAAATGGAAGAAGAACCTAAGCCCGTAAAACGAGAGCCTCTATCTCTAGAAGAGTTACTCGCCAAAAAGAAGGCTGAAGAAGAGGCGCGAAGTAAACCAGTATTCTTAACAAAGGAACAACGAGCAGCCTTGGCACTCGAAAGACGCAGAGAACAAGTTGAAGCCATGAGAGCTGCTGAACGACCAGCTGCGGTGGCGACGATAGATCTTACAGGAACTTCAAAGAAGGATGATGAAAAAAAATACCGTGACGACAGGGAAAAAGAAAGGGAACACGACAAAGAGCGAGAAAGAGAAAGAGAGCGAGAAAGAGAGAAAGAAAGAGAAAGAAGACACGAAGAGAGAAAATCAGGGGCCGATCGTAATAGAGATGATAAAAAAGAGAAAAATGAAGAATACAGCAAAACTAAAGATAAAGAAAGAGAGGAAGAAGCCATCAAAGCGAGGTACTTAGGTATAGTTAAAAAAAAGCGTCGAGTGAGACGGTTAAATGATCGGAAGTTTGTATTTGATTGGGACGCTTCTGAAGACACATCAAATGATTACAATGCACTTTATAAGGAGAGACATCAAGTGCAATTTTTTGGACGCGGCCACATTGCTGGCATTGATATTAAATCACAGAAAAAGGATTATTGCAAATTCTATGGCAATTTGTTGGAGAAAAGGAGAACAGAATTGGAGAAAGAACAAGAAAAATCTCGCCTAAAGAAAGTTAAGAAGAAGGAAGACAAACAGAAATGGGATGACAGACACTGGTCGGAGAAAGATCAAGATGAGATGACAGAGAGAGATTGGAGAATCTTCAGAGAGGATTACAATATTACTTTGAAAGGTGGAAGAATACCTAATCCGATTCGTTCATGGAAAGAAGCTAACTTTCATGAAGATATTATGGAGATCATTAGCAAAGTTGGCTACAAAAGCCCTACACCTATTCAACGACAAGCCATTCCAATTGGTCTTCAAAATAGGGACATAATTGGTGTAGCTGAAACTGGTTCTGGTAAAACATTAGCATTTCTCATACCACTGCTTACTTGGATACAGTCTTTGCCTAAAAATGAGAGGATGGAAGATGCTGATCAAGGTCCGTATGCTATAATTCTGGCCCCAACTCGTGAATTGGCACAACAAATTGAGGAAGAAACAAATAAGTTCGGTATACCGCTTGGTATAACATCTGTCGTTGTTGTGGGAGGTCTGTCGAGGGAGGAGCAAGGATTCAAGTTACGTCTAGGATGTGAGATCGTCATCGCCACGCCGGGTCGTCTCATAGACGTATTAGAAAATCGATATCTTGTGCTCAATCGCTGCACTTATGTGGTTCTTGACGAAGCCGATCGTATGATAGACATGGGTTTCGAACCTGATGTACAAAAAATTCTCGAATACATGCCCGTGTCTAACATAAAACCCGACACAGATGCTGCCGAGGACGCATCAGTGCTTCTTGCTAATTACAATTCTAAAAAGAAATTCCGTCAAACCGTGATGTTCACGGCTACTATGCCGCCGGCCGTTGAAAGACTCGCCCGAACATATTTACGTAGGCCGGCCATAGTGTACATCGGATCTGTTGGCAAACCAGTCGACAGAACAGAACAAGTCGTCTTCATGATCGGTGAGAATGAGAAACGCAGGAAGCTAACTGAGATATTACAACGCGGAGTCGAACCACCGATCATTATATTCGTCAACCAAAAGAAGGGTGCAGATGTTCTTGCGAAAGGTTTGGAAAAACTGGGTTTCAATGCCTGTACATTACATGGTGGTAAAGGCCAGGAACAACGTGACTTCGCTCTAGCTTCCCTCAAAAATGGTTCGAAGGATATACTGGTTGCTACAGATGTGGCCGGTCGTGGTATTGATATTAAAGATGTCAGTGTGGTCATTAATTATGACATGGCTAAGAGTATAGAAGACTACACACATCGCATTGGTCGTACCGGTCGTGCTGGCAAGACTGGTAAGGCTGTATCATTTGTTACTAAAGAGGATTCTGCTATATACTATGATCTGAAGCAAGTGCTACTCGCCAGCTCAGTGTCCACATGCCCTCCAGAGTTAATGAACCATCCAGAGGCTCAACACAAACCCGGAACTGTGGTAACCAAGAAGAGAAGAGAGGAAATGATATTTGCTTAA

Protein sequence:

>DPOGS202155-PA
MGRERSAERRRSRSRDRRERDRHDDRDTYKDKLKRRDRSRSPRKEKSRSPSKKESDVRSRSPRREKERSRSPIRKERARSRSPRKRDGKDKDRRPDDREKNKTKKEEEKTEEEPKMEEEPKPVKREPLSLEELLAKKKAEEEARSKPVFLTKEQRAALALERRREQVEAMRAAERPAAVATIDLTGTSKKDDEKKYRDDREKEREHDKEREREREREREKERERRHEERKSGADRNRDDKKEKNEEYSKTKDKEREEEAIKARYLGIVKKKRRVRRLNDRKFVFDWDASEDTSNDYNALYKERHQVQFFGRGHIAGIDIKSQKKDYCKFYGNLLEKRRTELEKEQEKSRLKKVKKKEDKQKWDDRHWSEKDQDEMTERDWRIFREDYNITLKGGRIPNPIRSWKEANFHEDIMEIISKVGYKSPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLIPLLTWIQSLPKNERMEDADQGPYAIILAPTRELAQQIEEETNKFGIPLGITSVVVVGGLSREEQGFKLRLGCEIVIATPGRLIDVLENRYLVLNRCTYVVLDEADRMIDMGFEPDVQKILEYMPVSNIKPDTDAAEDASVLLANYNSKKKFRQTVMFTATMPPAVERLARTYLRRPAIVYIGSVGKPVDRTEQVVFMIGENEKRRKLTEILQRGVEPPIIIFVNQKKGADVLAKGLEKLGFNACTLHGGKGQEQRDFALASLKNGSKDILVATDVAGRGIDIKDVSVVINYDMAKSIEDYTHRIGRTGRAGKTGKAVSFVTKEDSAIYYDLKQVLLASSVSTCPPELMNHPEAQHKPGTVVTKKRREEMIFA-