Monarch geneset OGS2.0

DPOGS200681
TranscriptDPOGS200681-TA2979 bp
ProteinDPOGS200681-PA992 aa
Genomic positionDPSCF300353 - 129801-141601
RNAseq coverage1082x (Rank: top 12%)
Annotation
HeliconiusHMEL0177910.091.09% 
BombyxBGIBMGA008914-TA0.090.04% 
DrosophilaCG6227-PA0.077.72% 
EBI UniRef50UniRef50_Q7QE450.066.81%AGAP010656-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=Q7QE45_ANOGA
NCBI RefSeqXP_001603634.10.070.32%PREDICTED: similar to ENSANGP00000016791 [Nasonia vitripennis]
NCBI nr blastpgi|490728400.092.20%DEAD box RNA helicase [Choristoneura fumiferana]
NCBI nr blastxgi|490728400.080.95%DEAD box RNA helicase [Choristoneura fumiferana]
Group
Gene OntologyGO:00055242.6e-49ATP binding
GO:00080262.6e-49ATP-dependent helicase activity
GO:00036762.6e-49nucleic acid binding
GO:00043861.4e-28helicase activity
KEGG pathwaynvi:1001199420.0 
 K12811 (DDX46, PRP5)maps-> Spliceosome
InterPro domain[353-558] IPR0140011.2e-66DEAD-like helicase
[358-531] IPR0115452.6e-49DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[595-676] IPR0016501.4e-28Helicase, C-terminal
Orthology groupMCL13845 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200681-TA
ATGGTTAGGAGTGGTCGTGACAGGGAGAGGGATCGTAGACGCTCACATAGTCGTTCGGCAAGTCCGGATAGAAAAAGAAGACGTTCTAGGTCTAGAAGTAGAGATAGAAATTCCAAATCCTCTAAGAGGAAGCGTAGCCGCAGCAGAGATAGAGATTCCAAGCGCGATCGCAGTAGAGACAGAGAGAGGGATCGTAAAAGTGATAAACGAGATGATAAGAGAAATGGTGCAAGTAGTAAGTCTAGAAAGAAATCTCCAGATAGGGAAAAGGAAAGAGATCGCTCCAAGTCAAAAGAGAAGGCGGTTAAATCTGAATCCGCTGATTATGCTCCTGGTACAGTAGATAAGGAAGAGGAACAAAGTAGATTGGAAGCGGAAATGCAAAAACGCCGCGAACGTATTGAGCGTTGGAGAGCTGAAAGGAAACGTAAAGAATTGGAATCAGCTAAAAAGGAAGTCCAGAAAGGCAGTATTGTGACCAATATACAGGTTCCGGCTGCTAAAAAGTGGTCTTTAGAAGATGATTCTGGAGACGTTGTTGAGGAAAAAATTGATGAAGAAGATGAAATTGATCCTTTGGATGCCTATATGCAAGAAGTTCAACAGGAAGTGCGCAAAGTGAATCAACTAGACCAAGCCCGTGGCATCAGCGTCCCAACAACAGGCGGCACAGGAGTTGTCATACTGACCGGAACGGCTAAGAAAAAAGTTACCGAACAGAAAAATAAAGGGGAGCTCATAGAGCAAAATCAAGATGGCTTAGAATATTCGTCGGAGGAAGAGACAGAAGATATAAAGGATGCAGCGGCTAATCTGGCATCTAAACAAAGAAAGGAATTAGCTAAAGTCGATCATGCCAGTTTGGATTATATGTCATTTAGAAAAGCATTTTATACTGAGGTTAGTGAGCTTGCCAGAATGACGCCAGAAGAAGTTGAGGCATACAGAACAGAGTTAGAGGGTATTAGGGTGAAGGGTAAGGGTTGTCCAAAGCCTATAAAAAATTGGGCTCACTGTGGCATAAGTAAAAAGGAACTTGATATACTGAAGAAATTGGGCTTTGAAAAACCTACCCCGATTCAGGCTCAGGCTATACCGGCCATAATGTCTGGAAGAGACCTGATTGGTATAGCAAAAACTGGTTCCGGTAAAACATTAGCATTCATATTGCCTATGTTCAGACATGTTCTCGACCAACCGCAGTTAGAAGACACAGATGGACCAATATCACTCATAATGACCCCAACGAGGGAACTTTGTATGCAGATAGGCAAAGATATTAAGAAGTTTGCCAAGTCTTTGGGCTTGAGAGTTGTCTGTGTGTATGGCGGAACTGGGATATCTGAACAGATAGCCGAGCTGAAACGCGGTGCTGAGATGATAGTCTGTACTCCTGGCCGTATGATCGATATGTTAGCAGCTAATTCCGGACGTGTGACTAATCTGAGACGAGTTACATACATTGTTCTTGACGAAGCTGACCGGATGTTTGATATGGGTTTCGAGCCGCAGGTTATGAAGATAATAGACAACGTGCGACCAGACAGACAGACGGTCATGTTCAGTGCGACGTTCCCGAGGCAGATGGAAGCCTTAGCCAGGCGTATATTACAAAAACCTATCGAAGTACAGGTTGGAGGTAGGAGTGTTGTATGTAAGGACGTGGAACAACATGTAGCTATACTAGAAGAGGAAGCAAAGTTCTTCAAATTACTGGAACTGTTGGGCCTGTACAGCCAGCTGGGGAGCATCATAGTGTTCGTCGATAAGCAGGAGAACGCGGACAGCTTGCTGAAAGATCTTATGAAGGCATCTTACTCTTGTATGAGTCTGCATGGAGGTATTGATCAATTCGACAGGGACTCGACTATAGTAGACTTCAAGAACGGCAAGGTGAAGCTGCTGGTGGCGACCAGCGTGGCTGCCAGGGGTCTGGACGTCAAACAGCTGGTGTTGGTGGTCAACTACGACTGTCCTAACCATTACGAGGATTATGTACATCGATGCGGTCGTACCGGTCGCGCGGGTAACAAGGGCTATGCCTGGACATTCCTCACGCCGGAGCAGGGCCGATACGCGGGGGACGTGTTGCGAGCCCTCGAAGCCGCTGGGGCTTCTCCCCCGGCCGAACTCAGGGCTCTGTGGGATAAGTACAAAGAGGCGCAGGAGAGGGACGGAAAAAAAGTTCACACAGGCGGTGGCTTCAGTGGCAAAGGTTTCAAATTCGACGAATCCGAAGCCCAAGCGGCGACTGAGAGGAAAAAGTACCAAAAGGCCGCTCTCGGCCTCCAAGACTCGGACGACGAGGACGTTGAGGGCGACCTCGACCAGCAGATAGAGGTCATGCTTGCCGCTAAGAAAATTGTCAAAGAAATTAAGCCGGGTGTAGCGACGGCTAATCCCCCAGCGGCAGCGGGGGCGAGTGTAGACGGGAAACTTGAACTGGCGAGACGGCTGGCCTCCAGAATAAACCTGGCCAAGGGCTTAGGCGTCGAACAGAAGGGAGCCACGCAACAAGCGGCCGAGGCCATACTTAAAGGGAACCCGTCTGCACACACCCTTATCACGGCCAAGACTGTAGCTGAACAGTTGGCGGCCAAGTTGAACACTCGCCTGAACTACCAGCCTCGCGACGAGAGCACGGCTGAACCGGCCGAGGAGGTGTTCAGGAAGTACGAGACGGAGCTCGAGATAAACGACTTCCCTCAGCAGGCCAGGTGGAGGGTCACCAGCAAGGAGGCGCTAGCGTTGATCAGTGAATATTCGGAGGCTGGTATCACAGTCAGGGGGACGTATGTACCCCCAGGGAAAGCTCCACCGGAAGGAGAGAGGAAACTGTACCTGGCCATCGAAAGTTCCCAAGAGCTGGCTGTAGCTAAAGCGAAGTCAGAAATAACAAGGCTGATTAAAGAAGAGCTCCTCAAGCTACAGACGTCAGCTCATCACATGATTAACAAAGCTAGATATAAGGTCCTCTGA

Protein sequence:

>DPOGS200681-PA
MVRSGRDRERDRRRSHSRSASPDRKRRRSRSRSRDRNSKSSKRKRSRSRDRDSKRDRSRDRERDRKSDKRDDKRNGASSKSRKKSPDREKERDRSKSKEKAVKSESADYAPGTVDKEEEQSRLEAEMQKRRERIERWRAERKRKELESAKKEVQKGSIVTNIQVPAAKKWSLEDDSGDVVEEKIDEEDEIDPLDAYMQEVQQEVRKVNQLDQARGISVPTTGGTGVVILTGTAKKKVTEQKNKGELIEQNQDGLEYSSEEETEDIKDAAANLASKQRKELAKVDHASLDYMSFRKAFYTEVSELARMTPEEVEAYRTELEGIRVKGKGCPKPIKNWAHCGISKKELDILKKLGFEKPTPIQAQAIPAIMSGRDLIGIAKTGSGKTLAFILPMFRHVLDQPQLEDTDGPISLIMTPTRELCMQIGKDIKKFAKSLGLRVVCVYGGTGISEQIAELKRGAEMIVCTPGRMIDMLAANSGRVTNLRRVTYIVLDEADRMFDMGFEPQVMKIIDNVRPDRQTVMFSATFPRQMEALARRILQKPIEVQVGGRSVVCKDVEQHVAILEEEAKFFKLLELLGLYSQLGSIIVFVDKQENADSLLKDLMKASYSCMSLHGGIDQFDRDSTIVDFKNGKVKLLVATSVAARGLDVKQLVLVVNYDCPNHYEDYVHRCGRTGRAGNKGYAWTFLTPEQGRYAGDVLRALEAAGASPPAELRALWDKYKEAQERDGKKVHTGGGFSGKGFKFDESEAQAATERKKYQKAALGLQDSDDEDVEGDLDQQIEVMLAAKKIVKEIKPGVATANPPAAAGASVDGKLELARRLASRINLAKGLGVEQKGATQQAAEAILKGNPSAHTLITAKTVAEQLAAKLNTRLNYQPRDESTAEPAEEVFRKYETELEINDFPQQARWRVTSKEALALISEYSEAGITVRGTYVPPGKAPPEGERKLYLAIESSQELAVAKAKSEITRLIKEELLKLQTSAHHMINKARYKVL-