Monarch geneset OGS2.0

DPOGS212384
TranscriptDPOGS212384-TA1998 bp
ProteinDPOGS212384-PA665 aa
Genomic positionDPSCF300019 + 753033-758514
RNAseq coverage466x (Rank: top 27%)
Annotation
HeliconiusHMEL0066790.081.09% 
BombyxBGIBMGA012071-TA1e-7679.56% 
Drosophilal(1)G0007-PA2e-15462.28% 
EBI UniRef50UniRef50_Q9VY542e-15262.28%LD24737p n=12 Tax=Bilateria RepID=Q9VY54_DROME
NCBI RefSeqXP_001942903.13e-18051.54%PREDICTED: similar to pre-mRNA splicing factor ATP-dependent RNA helicase PRP16 [Acyrthosiphon pisum]
NCBI nr blastpgi|3287125085e-17951.54%PREDICTED: pre-mRNA-splicing factor ATP-dependent RNA helicase PRP16-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1892358660.056.21%PREDICTED: similar to pre-mRNA splicing factor ATP-dependent RNA helicase PRP16 [Tribolium castaneum]
Group
KEGG pathwayapi:1001658917e-180 
 K12815 (DHX38, PRP16)maps-> Spliceosome
InterPro domain[507-664] IPR0140018.5e-15DEAD-like helicase
Orthology groupMCL34938 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212384-TA
ATGTCGGACAGTGAAGAAAACCTGCATCGTTTGGAGGGAACTCCAAACAATGCTCCTGGAGGTCTTATTATCAAGAAAAAAGATAAACCGCCTGATTTCCAATTTGCCAAACCGTCCTTGTTAGGTCTAGATAAGTTGGCCGCTGCTAAAAGAATGCAGAACCGCTTAATTTCTTTTCAAAACGAAGACAACTATGGAGATGATATTGAGGATAAAAGTGGTTCCAGTGGTGTAAGGGAGCGGAAATATAGGAAGCATAATGAAGAAACCCCGACGTACACCGGTGGCTTATCAGAACAAGCCCGAGCAAGGATGTTGGAAAGATTAGAAATGAAAGAAAAGAAAGCGAGAGAAAAAGGTGTGCATAATTCAACACTAGAAGAGAAAAAGTCACATTCCAAAGATGAAGAAAACAGCCGATTTCACAATTATGGTCGTGGGCATCGTGACAAGGACAGAAGACGTGACTATGATAGAAGAGACAGAGACAGATATAGGGATCGGAATGACAGGAATAAGAGCGAAAGGGATAGAAGGAGAGATTCAGAGAGGAGGGAAAGAGACCGCGACACTGATAGAGATAGCTCACGACGGAGTTACTATGAACCCAGGTTTAAAGATGAACCAAGAACACCAAGCATAAAAGCTCTCAAACCAACCGACAAAACAGCTTGGGATGATGACGACGATGACCCTAAGGCCGTAAGGAAGTCAAGTTGGGACTTCCCCACACCACTGCCCAGAGACTTGGCGGATAGATCAGCTCGGAGTGAACGTAAACCCACCAGAGACTATAAAGGAAGAGCATACGAAGATACCATTAGAGCTACACCTCATAAATGGGTGAGCTCACGTCGAGGACTGGATGTAGACGACCCCGAGTGGCAGGAAGCAGAGAAGAAATTAGATCGAGCCTGGTACAACATGGGAGAGGGTGAAACCGACGAATCGGATCCGTTCGCCGGCACCAGCGCTGAGTATATAGCGAAGAAAGAAGAACAGATAGAGAAGAGAAGGAACCGGAAGGTGTCAGCGGCGAGGCAGCAGATAGACAGAGACAACGAGCTGTGGGAGAGGAACCGCATGCTCACCAGCGGGGTGGTGCACTCCATCAACGTCAACAACGACCTCGACGAGGAGAACGTAGACCGAGTACATCTTCTAGTACACAATATCGTTCCGCCATTCCTCGACGGCAGGATAGTGTTCACTAAACAACCTGAGCCTGTTATACCGGTCAAAGACCCGACATCGGACATGGCGATAAATGCTAGGAAAGGGTCCGCTTTAGTGAAGGCGTTCAGAGAACAGAAGGAAAGAAGAAGGGCACAGAAGAAACATTGGAAGTTGGAGGGAACGAAGATTGGTAACATAATGGGCATACAGAAACAAGAGGAAGAAATAGAAGACGGACCCACGAAACAGGCGTACAAATACGCCGAGCACTTGGATAAAGCGGGCGAGGAAGCGGAGTCCAAATCAGATTTCGTCAAGAAGTTGTCTATAACGGAGCAGAGACGTTTCTTGCCCGTGTTCGCTGTCAGAGAACAACTCATGCAGGTGGTGAGGGAGAACAACGTTATCATTATAGTCGGTGAAACCGGAAGTGGTAAGACGACCCAACTGACACAATACCTCCACGAGGACGGTTACAGTAAGATGGGCGCCATCGGCTGTACGCAACCCAGACGCGTGGCCGCCATGTCCGTGGCCAAGAGAGTTGCTGATGAAATGGGAGTTAAATTAGGTGAGGAAGTTGGTTACGCGATACGTTTCGAGGACTGCACCAACCCGTCCACGGTCATCAAGTACATGACGGACGGGATCCTGCTGCGGGAGGGTCTGCGGGACCCCGACCTCGACCAGTACAGCGCCATCATCATGGACGAGGCGCACGAGAGGTCTCTCTCCACTGACATGCTGTTCGGACTCCTGAGAGAGGCTTTTTGGGCTGCAGCCCAGGGCCCCGCGAATACAGAGGGCCCCCAACCGAACTGA

Protein sequence:

>DPOGS212384-PA
MSDSEENLHRLEGTPNNAPGGLIIKKKDKPPDFQFAKPSLLGLDKLAAAKRMQNRLISFQNEDNYGDDIEDKSGSSGVRERKYRKHNEETPTYTGGLSEQARARMLERLEMKEKKAREKGVHNSTLEEKKSHSKDEENSRFHNYGRGHRDKDRRRDYDRRDRDRYRDRNDRNKSERDRRRDSERRERDRDTDRDSSRRSYYEPRFKDEPRTPSIKALKPTDKTAWDDDDDDPKAVRKSSWDFPTPLPRDLADRSARSERKPTRDYKGRAYEDTIRATPHKWVSSRRGLDVDDPEWQEAEKKLDRAWYNMGEGETDESDPFAGTSAEYIAKKEEQIEKRRNRKVSAARQQIDRDNELWERNRMLTSGVVHSINVNNDLDEENVDRVHLLVHNIVPPFLDGRIVFTKQPEPVIPVKDPTSDMAINARKGSALVKAFREQKERRRAQKKHWKLEGTKIGNIMGIQKQEEEIEDGPTKQAYKYAEHLDKAGEEAESKSDFVKKLSITEQRRFLPVFAVREQLMQVVRENNVIIIVGETGSGKTTQLTQYLHEDGYSKMGAIGCTQPRRVAAMSVAKRVADEMGVKLGEEVGYAIRFEDCTNPSTVIKYMTDGILLREGLRDPDLDQYSAIIMDEAHERSLSTDMLFGLLREAFWAAAQGPANTEGPQPN-