Monarch geneset OGS2.0

DPOGS200424
Transcript: DPOGS200424-TA; 3066 bp
Protein: DPOGS200424-PA; 1021 aa
Genomic position: DPSCF300236 - 89527-94936
RNAseq coverage: 150x (Rank: top 53%)
Annotation
Heliconius: HMEL002498; 0.0; 64.73%
Bombyx: BGIBMGA008997-TA; 0.0; 63.27%
Drosophila: CG9323-PA; 9e-164; 36.64%
EBI UniRef50: UniRef50_E2BMJ4; 0.0; 39.30%; Probable ATP-dependent RNA helicase DHX36 n=9 Tax=Formicidae RepID=E2BMJ4_HARSA
NCBI RefSeq: XP_394965.3; 0.0; 38.22%; PREDICTED: similar to DEAH (Asp-Glu-Ala-His) box polypeptide 36 [Apis mellifera]
NCBI nr blastp: gi|307169079; 0.0; 40.24%; Probable ATP-dependent RNA helicase DHX36 [Camponotus floridanus]
NCBI nr blastx: gi|307169079; 0.0; 39.78%; Probable ATP-dependent RNA helicase DHX36 [Camponotus floridanus]
Group
Gene Ontology:
    GO:0005524; 5e-17; ATP binding
    GO:0004386; 5e-17; helicase activity
    GO:0003676; 5e-17; nucleic acid binding
    GO:0008026; 6.9e-09; ATP-dependent helicase activity
KEGG pathway 
InterPro domain:
    [199-389] IPR014001; 1e-24; DEAD-like helicase
    [500-599] IPR001650; 5e-17; Helicase, C-terminal
    [661-752] IPR007502; 1.9e-15; Helicase-associated domain
    [789-921] IPR011709; 4.4e-09; Domain of unknown function DUF1605
    [209-365] IPR011545; 6.9e-09; DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology group: MCL15106 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200424-TA
ATGTCTCGCGATTTCCAAAACTTTAGCCGACCCCGTGGTAGAGGTAGAAATTGGGATCAATCCCAACGAAATCATCGAACAAGACCTCCAGGACTTCGCGGAGCCGAAATTGGTTTATATTACAAAGAACTCAGCATGAAAAAAAAGAAAAAAGAGCCCGTGATAAATCTCAAAATCCCTTACTCAGTTCTCAAAGCATTAGAAAATGAATTAATAGCCATAAGGAAAATTGCTAGTACACAAAATATCACGTTACCAACAAAATTAACACAGAAATGTGAAAAAGAACAAGGTGAATATAATTTAACTAGTAAACCTGGAGTTTCAAGTAGAGAAAATTTTATGGACACCAATACCAAGGATAAATGTGACTCTAACTCTGGACAAAAACAGCAGTCTACCAGGATGTATGATTATAAATATGGATATGAAGATATCATAACAGGCTCATTTGATGAAAAACTCGACCAATGCATTACAAAAGGTATTACTATAAATACATGTGATGATGAGGTAGAAAGTTTAAATGAAGCATTCTTTATTGAGTATGAGGATATGTTGGAGAGGAATACTTATAAAAATATGTTGAAATTCCGCAAGAAATTACCAGCATACATAAAAGCCAAAGAATTGATTAAATCAATAAACGACAACCAAGTAATTGTTATTAGTGGAGAGACTGGTTGTGGAAAGTCAACTCAAGTACCACAAATCATCTTAGATCATGCTATTTGTAGCAAAAAAGGTGCCCATACTAAGATTTTGGTCACCCAACCGAGAAGAATTGCCGCCTCCTCACTGGCTATCCGAGTGGCTAAAGAAAGGGCAGAAAAGCTGGGCAATTCAGTGGGTTATGCTGTGAGATTAGAAAAGGTTGACGAGAGGTCTCGGGGGAGTATACAGTATTGTACAACTGGTATACTTTTGGCTGAACTGGAAGTAAACCAGGGTCTAACCAACTATAGTCACGTTATATTGGATGAAGTACACGAAAGAGATGTTCATGTTGATTTATCTATGTGCATGTTGCGAAAGGTTTTAAGAAAACGTAAAAATCTTAAACTAATTCTTATGAGTGCTACATTGGATGCTGAGAGCTTATCAGCTTACTTTGACAACTGCCCTCTAATGCACATCGAAGGACTGGCATATCCAGTACAAGATGTATATCTAGAGGATATATTGAATTTAACAAACTTCACACTACCCACCGAAAGACCGAAAGCACCGCAGGCTAAGTGGATGAAGTATAGAAAAAAAAATGTTTCAGATGCCATGGAAACAGACATCCAATACAGAGCTGAAATTGGCAACTGGCTGGAATCAAAGAAGAAAAATCTTAGTCTTCAAACATATAAAACTCTGCAAGACAGTAGAATTGAGGAGCTAAGTTTTGAATTGCTAGTTGATCTTTTGATCTACATCTGCAAGGGTGAACCTGGCGCAATACTAGTGTTTCTACCTGGCATCGGCGATATTACAAAACTGATGCGAATGATGGAATCCACAAATTTATTCCCAGCTAACAAATACGAAATTTATCCTCTGCATTCAAGGTTGCCAACATTAGAACAACATAAAATATTTGAAAGGCCACCAGATAATATTAGGAAAATAATAATAGCGACTAATATAGCTGAAACATCCATAACCATAGATGATGTTGTGTATGTAGTGGATTCAGCCAGGATAAAAATGAAAGGGCTGAATGTTGAGATGAATCTATCAACGTTGCAGACAGAATGGGTGTCTCAAGCAAATTTGCGACAACGGCGTGGGCGCGCTGGTAGATGTCAGCCAGGTATATGCTACCATTTGTTAACTTCATTCAGAGCTGAAAAACTAGAAGAACGTACACTACCGGAGTTACAAAGGAGTGATCTTTTGGAGCCGGTGCTCATGATTAAGAGGCTCCGCTTGGGTTTGGCTGAAGATGCACTGAAGATGGTGCCATCGCCACCAGCAGATTCAACAATACAATCAGCAGTGAAACATTTGCA
AAGGTGTGGGGCCCTCAATACAGTGGAAACTCTTACTCCTTTGGGCTGGCACCTGGCACGTCTTCCAGTTCATCCAGCTGCTGGCAAACTGCTTGTTCTGGGAGCTCTTGCCGGATGCCTCGACAGGGCTGCGAGCCTCGCAGCCGTCTGGGGCTTCAAAGAACCCTTTCAGATGGTTATTGGTAAAGAGTACGAAGTGGATATGGCGAAGCGTGAATTCGCGATGGGCGAACCCAGCGACCATATCGCAGCTTCGGAAGCGATAATTCAATGGGAAAACTGTCCAAGAAGAGAGAGGTCATCATTTGCGTATAGGAACTTCCTGTCGAACAATACTTTGGAATTGCTTGTCGGTATGAAAAATCAGTTCGGGGACAACTTGAGACAGATGGGCTTCCTACGTTCCGGTAACGTCAGGTCTAAATGGGAAAATAGAAATGCAGATAACCTGAGCCTGTTCAAGGCTATCGTTGCTGCATCCCTGTATCCGAACATCGCTACAGTCAGATGGACCAATCTAAATAATTTCCGGAAGCAGCAAAGGATTTCAGCGTATACTCCAGAGGATGGGCGACTAGTTATACACCCGAGTAGCGTCATGGCGCCGCCAAAGAAAGGTCAAAACAGGGGCAAAGGCCCGTGTCCCTCGCAGCTGTGTAATAACCCTGGCGCCAACTGGCTCGTGTATTGGCTTAAGCAGAGATCGTCCGATCTCTTCCTACTTGACGTCACCTTAATTTACACGTTGCCTCTACTATTCTTTGGTGAATTCCAAATAACTGATGATGTAGAAAACCCGGAGAAGTGTTTTGTGACGATATCAAACATCAAAGTATGTTGTAAAAGAGAATGCACTGACAAACTCCTCGAGCTAAGATATCTGTTGGATAAGGTTTTGGAGGCGAAGGTCAATGACTCCAATGCTGCATCCAGTAACAGTGAATTTGAAGAATCTGTTTTGAAAACCGTTATTCAACTCATCACAGCAGAAGACGAGCAAGCTGAATATTTAGGACACGAGTTTTCTGATTCCGATGCTTCGACTACCGATGATAGAAATTATTAA

Protein sequence:

>DPOGS200424-PA
MSRDFQNFSRPRGRGRNWDQSQRNHRTRPPGLRGAEIGLYYKELSMKKKKKEPVINLKIPYSVLKALENELIAIRKIASTQNITLPTKLTQKCEKEQGEYNLTSKPGVSSRENFMDTNTKDKCDSNSGQKQQSTRMYDYKYGYEDIITGSFDEKLDQCITKGITINTCDDEVESLNEAFFIEYEDMLERNTYKNMLKFRKKLPAYIKAKELIKSINDNQVIVISGETGCGKSTQVPQIILDHAICSKKGAHTKILVTQPRRIAASSLAIRVAKERAEKLGNSVGYAVRLEKVDERSRGSIQYCTTGILLAELEVNQGLTNYSHVILDEVHERDVHVDLSMCMLRKVLRKRKNLKLILMSATLDAESLSAYFDNCPLMHIEGLAYPVQDVYLEDILNLTNFTLPTERPKAPQAKWMKYRKKNVSDAMETDIQYRAEIGNWLESKKKNLSLQTYKTLQDSRIEELSFELLVDLLIYICKGEPGAILVFLPGIGDITKLMRMMESTNLFPANKYEIYPLHSRLPTLEQHKIFERPPDNIRKIIIATNIAETSITIDDVVYVVDSARIKMKGLNVEMNLSTLQTEWVSQANLRQRRGRAGRCQPGICYHLLTSFRAEKLEERTLPELQRSDLLEPVLMIKRLRLGLAEDALKMVPSPPADSTIQSAVKHLQRCGALNTVETLTPLGWHLARLPVHPAAGKLLVLGALAGCLDRAASLAAVWGFKEPFQMVIGKEYEVDMAKREFAMGEPSDHIAASEAIIQWENCPRRERSSFAYRNFLSNNTLELLVGMKNQFGDNLRQMGFLRSGNVRSKWENRNADNLSLFKAIVAASLYPNIATVRWTNLNNFRKQQRISAYTPEDGRLVIHPSSVMAPPKKGQNRGKGPCPSQLCNNPGANWLVYWLKQRSSDLFLLDVTLIYTLPLLFFGEFQITDDVENPEKCFVTISNIKVCCKRECTDKLLELRYLLDKVLEAKVNDSNAASSNSEFEESVLKTVIQLITAEDEQAEYLGHEFSDSDASTTDDRNY-
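
Consistency check (illustrative sketch, not part of the original record): the transcript and protein lengths listed above can be cross-checked against each other, since a coding sequence that starts at ATG and ends at a stop codon has 3 x (protein length + 1) nucleotides. The Python sketch below assumes the standard nuclear genetic code and that the transcript consists of coding sequence only; the function names translate and expected_cds_length are hypothetical helpers introduced here. It translates the first ten and last six codons of DPOGS200424-TA and compares the length arithmetic with the 3066 bp and 1021 aa figures.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption:
# the record uses the standard nuclear code, NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_aa):
    """Nucleotides needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


# First ten and last six codons copied from DPOGS200424-TA above.
first_codons = "ATGTCTCGCGATTTCCAAAACTTTAGCCGA"
last_codons = "GATGATAGAAATTATTAA"

print(translate(first_codons))    # MSRDFQNFSR
print(translate(last_codons))     # DDRNY*
print(expected_cds_length(1021))  # 3066
```

The translated ends agree with the start (MSRDFQNFSR) and end (DDRNY) of DPOGS200424-PA, where the record writes the stop codon as "-" rather than "*".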