Monarch geneset OGS2.0

DPOGS200364
Transcript        DPOGS200364-TA    1893 bp
Protein           DPOGS200364-PA    630 aa
Genomic position  DPSCF300026 + 841355-843247
RNAseq coverage   105x (Rank: top 60%)
Annotation
Heliconius        HMEL002019        0.0       75.96%
Bombyx            BGIBMGA005666-TA  0.0       73.82%
Drosophila        mus309-PA         5e-95     37.45%
EBI UniRef50      UniRef50_E0VZQ3   0.0       52.15%  ATP-dependent DNA helicase Q1, putative n=69 Tax=Bilateria RepID=E0VZQ3_PEDHC
NCBI RefSeq       XP_002431597.1    0.0       52.15%  ATP-dependent DNA helicase Q1, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|261335969      0.0       76.11%  putative RecQ Helicase [Heliconius melpomene]
NCBI nr blastx    gi|261335969      0.0       76.11%  putative RecQ Helicase [Heliconius melpomene]
Group
Gene Ontology     GO:0006310        1.6e-252  DNA recombination
                  GO:0008026        1.6e-252  ATP-dependent helicase activity
                  GO:0005524        2.1e-24   ATP binding
                  GO:0004386        2.1e-24   helicase activity
                  GO:0003676        2.1e-24   nucleic acid binding
KEGG pathway
InterPro domain   [4-624]    IPR004589  1.6e-252  DNA helicase, ATP-dependent, RecQ type
                  [79-282]   IPR014001  6.8e-30   DEAD-like helicase
                  [319-400]  IPR001650  2.1e-24   Helicase, C-terminal
                  [85-251]   IPR011545  5.1e-19   DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology group   MCL16422 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200364-TA
ATGAAAACGATACAACAGTTGGAAAATGAAATAAAAGATGTAGATAAGCAGCTAGTCAAGGTCGAAACAGAAATAAATAAGTGGAAGAATGAACAACGGAAGCTTCACGAAAAGAAAACATCTTTAAAGAAAACGATTAGTAAACTAAAATCAGATACTTTAGCTAGTGTTGACTGGGGTGGTAGTTCTTATGAATGGTCTAATGATGTAAAAAATACTCTAATTAATATTTTTAAAATTGATGATTTTCGACCTAAACAACTTAGCGCTATCAATTCAACTTTGTCTCGGCAACATGCCTTAGTTGTGATGCCTACCGGCGCTGGTAAAAGTCTTTGCTATCAACTGCCAGCTCTAATAAAGCCAGGTATTACTATTGTAATATCTCCTCTAGTTTCTTTGATGGAAGATCAAGTGAGATCCTTAACAAATAAAAATATACCGGCGAAACTTATGACAAGCACCAGTTCTAAAGCAGAGACTACAGCAACACTAAATGTTCTTAAAGATAAAAACACAGAGGTCAAACTGTTGTATGTAACTCCTGAGAGACTTGCGAAGAGTAAGAGATTCATGTCAGCACTTCAAAAATGTCATGCTGAAGGTAGACTGCAGAGAATTGCTATAGATGAAGTTCATTGCTGTTCCCAATGGGGACATGACTTCAGACCTGATTATAAGTATTTAGGAATACTGTCTAATATGTTTCCCGGAGTTCCTATCTTAGGATTAACAGCTACAGCTACATCTCATGTATTGAATGATGTTCAAAAGATCCTCAATATAACTGGCTGCTTAGTAATTAAATCAACTTTCAATAGACCAAATTTATATTATAAAATACTAGAAAAACCTACATCACAAGAAGATTGCTTGACCATACTTGAGAAGCTCTTAAAATATAGATATAGGGGGGAAAGCGGAATTATCTATACCAACAGTATTAAAGATTCTGAAGAAATAGCTGAAGGTTTGAAGAAGAGGAATTTAAAAATAGCATGTTATCATGCAAATTTAAGTGCAGAAATAAGGTCAAAAGTACACATTAGGTGGCATGAGAAAAGTCTTCAGGCAATTGTAGCTACAGTGGCTTTTGGTATGGGAATTGATAAACCGGATGTAAGATTTGTTATACATCATACAATAAGTAAATCTATGGAAAATTATTATCAAGAGAGTGGTAGAGCTGGAAGAGATGGTCTGCGGGCAGAATGTGTAACATTGTATAGAATGCAAGATGTTTTCAAAGTTAGTACTATGGTGTTTTCCTCGGTGGGGAGCTTGGATCATTTATATGGCATGGTGAAATATTGTCTTAATGGAACATTATGTAGACGGCAATTGATCGCTGAACATTTTGATGAAGATTGGGGAGATGCAGATTGTAATAAGATGTGTGATGTTTGTTCTAATCCTAATGTAAATAGTAAAGAAATATCTCTTGAAATGCACTGCAAGATTCTGGATAGTATTATTAGCAATGCAGAAAAACAAGACACAAAACTCACAGCACAGAAGTTACTTGATGCTTGGTATCTGAAAGGTCCAGTACCTCTTAGACATAAAGGCAAAGAACCAAATTTTTCCAGACTTGTAGGTGAAGATGTAATAGCATTTTTGCTAACCCAAGGCTATTTAATAGAAGACTTTCATTTTACAGCATATTCAACAATAAGTTACATAAAAAAAGGTCCGAATATGGAAGCCATTAATGATATTAATTTTGAATTGAAAATGCCCGTTAGGAAATATTATGAATTTCATTTAGACAGGCCGGCACCAAGAGAATATGAATCTTTAGAAGGTAAACCTGAAATCAAAGAATCGAGAAAAAGAAAAATTTCTGAAGAGCATGATACAAGAAAATTGAAAACCGTCATAATTGATGATTAA

Protein sequence:

>DPOGS200364-PA
MKTIQQLENEIKDVDKQLVKVETEINKWKNEQRKLHEKKTSLKKTISKLKSDTLASVDWGGSSYEWSNDVKNTLINIFKIDDFRPKQLSAINSTLSRQHALVVMPTGAGKSLCYQLPALIKPGITIVISPLVSLMEDQVRSLTNKNIPAKLMTSTSSKAETTATLNVLKDKNTEVKLLYVTPERLAKSKRFMSALQKCHAEGRLQRIAIDEVHCCSQWGHDFRPDYKYLGILSNMFPGVPILGLTATATSHVLNDVQKILNITGCLVIKSTFNRPNLYYKILEKPTSQEDCLTILEKLLKYRYRGESGIIYTNSIKDSEEIAEGLKKRNLKIACYHANLSAEIRSKVHIRWHEKSLQAIVATVAFGMGIDKPDVRFVIHHTISKSMENYYQESGRAGRDGLRAECVTLYRMQDVFKVSTMVFSSVGSLDHLYGMVKYCLNGTLCRRQLIAEHFDEDWGDADCNKMCDVCSNPNVNSKEISLEMHCKILDSIISNAEKQDTKLTAQKLLDAWYLKGPVPLRHKGKEPNFSRLVGEDVIAFLLTQGYLIEDFHFTAYSTISYIKKGPNMEAINDINFELKMPVRKYYEFHLDRPAPREYESLEGKPEIKESRKRKISEEHDTRKLKTVIIDD-
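
Length consistency check:

The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the genomic coordinates are 1-based and inclusive, and that the trailing "-" in the protein sequence marks a single stop codon that is not counted in the 630 aa. The variable names are made up for this example.

```python
# Values copied from the record above.
start, end = 841355, 843247   # genomic position on DPSCF300026
transcript_bp = 1893          # reported transcript length
protein_aa = 630              # reported protein length

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# Assumption: the coding sequence covers the whole transcript and
# ends with exactly one stop codon that is not counted as a residue.
codons = transcript_bp // 3
expected_aa = codons - 1

print("genomic span equals transcript length:", genomic_span == transcript_bp)
print("transcript length is a multiple of 3:", transcript_bp % 3 == 0)
print("codons minus stop equals protein length:", expected_aa == protein_aa)
```

Under those assumptions all three checks hold, which would be consistent with a single-exon gene whose transcript is entirely coding.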