Monarch geneset OGS2.0

DPOGS214457
Transcript        DPOGS214457-TA  2211 bp
Protein           DPOGS214457-PA  736 aa
Genomic position  DPSCF300441 + 33173-38196
RNAseq coverage   745x (Rank: top 17%)
Annotation
Heliconius      HMEL007792        0.0  86.19%
Bombyx          BGIBMGA009578-TA  0.0  93.09%
Drosophila      CG11107-PA        0.0  82.23%
EBI UniRef50    UniRef50_Q7K3M5   0.0  82.23%  CG11107 n=16 Tax=Eukaryota RepID=Q7K3M5_DROME
NCBI RefSeq     XP_392081.2       0.0  85.87%  PREDICTED: similar to CG11107-PA isoform 1 [Apis mellifera]
NCBI nr blastp  gi|91077430       0.0  86.01%  PREDICTED: similar to ATP-dependent RNA helicase [Tribolium castaneum]
NCBI nr blastx  gi|91077430       0.0  86.01%  PREDICTED: similar to ATP-dependent RNA helicase [Tribolium castaneum]
Group
Gene Ontology    GO:0004386  1e-33    helicase activity
                 GO:0005524  5.3e-17  ATP binding
                 GO:0003676  5.3e-17  nucleic acid binding
                 GO:0008026  3.3e-06  ATP-dependent helicase activity
KEGG pathway     ame:408535  0.0
                 K12820 (DHX15, PRP43)  maps-> Spliceosome
InterPro domain  [478-568]  IPR007502  1e-33    Helicase-associated domain
                 [602-705]  IPR011709  5.3e-28  Domain of unknown function DUF1605
                 [75-262]   IPR014001  4.6e-26  DEAD-like helicase
                 [300-417]  IPR001650  5.3e-17  Helicase, C-terminal
                 [88-236]   IPR011545  3.3e-06  DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology group  MCL10030  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214457-TA
ATGTCTAAGAGAAGGATTGAAGTGATGGATCCGTTTATTAAGAAAAAAAGAGAGGAGAAAGCGGCGGCAGCAGCTAAGGCTGGCGGCAGTGAGGCCTCGGAGTCGACGACAGCAGCCACGACGCCCGGCACACCCTCCAGCACACCCGCCAGCACGCCGGGGTTGAACCCATACACAGGTCTTCCCCACTCCCCTCGCTACCACGAGCTGCTGAGGCGACGTCTCGGCCTGCCGGTGTGGGAGTACAAGAACGACTTCATGAGACTGTTGAATACACACCAGTGCGTGGTGCTGGTCGGAGAGACCGGCTCGGGGAAGACCACGCAGATACCGCAGTGGTGCGTCGAGTTTGCTGCCGTGACCGGCGGACAGGCACACGGGGTGGCCTGCACCCAGCCCCGCAGGGTGGCCGCCATGTCCGTGGCACAGAGAGTGGCCGAGGAGATGGACGTGGCGCTGGGGCAACAGGTCGGCTACAGCATACGGTTCGAGGACTGTTCGGGACCGCAGACCGTACTGAAGTATATGACGGACGGTATGTTGCTGAGAGAGGGAATGTCCGACCCCATGCTGGAGCAGTACAGGGTCATACTGCTGGACGAGGCGCACGAGAGGACCCTCGCCACCGACATACTGATGGGGGTGCTCAAAGAGGTCATCAAGCAGAGGTCCGACCTCAAGCTCGTTATCATGTCGGCCACGCTGGACGCGGGCAAGTTTCAACTGTACTTCGACAACGCACCCTTGATGAACGTACCGGGGCGGACACATCCCGTCGAGATCTTCTACACGCCGCAGCCCGAGAGGGACTACCTGGAGGCGGCCATACGGACCGTCATACAGATACACATCTGCGAGGAGGTGGCCGGGGACATACTCTTGTTCCTGACCGGTCAGGAGGAGATCGAGGACGCCTGCAAGAGGATAAAGAGAGAGATAGACAACCTAGGACCGGACGTCGGCGAACTCAAGTGTATACCGCTGTACTCGACGCTGCCGCCCAATCTGCAGCAGAGGATATTCGAGCCGGCGCCCCCCAACAGGCCGAACGGCAGGATCGGCAGGAAAGTGGTGGTCTCCACGAACATAGCGGAGACCTCGCTCACCATAGACGGCGTTGTGTTCGTCATAGACACGGGCTTCTCCAAACAGAAGGTTTACAACCCGCGGGTGAGGGTGGAGTCTCTGCTGGTGTCTCCCATCAGTAAGGCCTCGGCCCAGCAGCGAGCGGGCCGAGCGGGCCGGACCAGGCCGGGGAAGTGCTTCAGGCTGTACACGGAGAAGGCCTACAAGGACGAAATGCAAGATAACACATACCCCGAGATCCTGAGATCAAACTTAGGATCTGTAGTTTTGCAGTTGAAAAAGCTGGGCATCGATGACTTGGTGCACTTTGACTTCATGGACCCGCCGGCGCCGGAGACGCTCATGCGCGCGCTGGAGTTGCTCAACTACCTGGCCGCGCTCGACGACGACGGCAACCTGACCGACCTGGGCGCTGTGATGGCGGAGTTCCCTCTCGACCCTCAGCTGGCTAAGATGTTGATCGCCAGCTGTAACCACAACTGTTCCAACGAGATACTCTCCATCACCGCCATGCTGTCAGTCCCGCAATGCTTCGTGCGTCCCAACGAAGTAAGGAAGGCTGCGGACGAGGCCAAGATGAGGTTCGCTCACATCGACGGTGACCACCTCACGCTGCTGAACGTGTACCACGCCTTCAAGCAGAATATGGATGACCCCCACTGGTGTTACGACAACTTCATCAACTACAGGTCACTCAAGTCTGGTGATAATGTCCGGCAACAACTCAGCAGGATCATGGACAGGTTCAACTTGAAGAGGACCAGCACCGAGTTCACAAGCAAAGACTACTACATTAATATAAGGAAAGCTCTCGTCAATGGCTTCTTTATGCAAGTGGCGCACCTAGAGCGGACCGGTCACTACCTGACGGTGAAAGACAACCAGCAGGTCCAACTGCACCCCTCCACGTGTCT
AGACCACAAGCCCGACTGGGTCATATACAACGAGTTCGTGCTCACCACCAAGAACTACATACGAACAGTCACCGACATCAAACCGGAGTGGTTGCTCCGGATCGCCCCGCAGTACTATGAGTTGTCCAACTTCCCGCCGTGCGAGGCGCGGAGGCAGCTGGAACTGCTGCAAGCGAGGCTCGATTCCAAACTGTACCAGGAGGGGTTCTAG

Protein sequence:

>DPOGS214457-PA
MSKRRIEVMDPFIKKKREEKAAAAAKAGGSEASESTTAATTPGTPSSTPASTPGLNPYTGLPHSPRYHELLRRRLGLPVWEYKNDFMRLLNTHQCVVLVGETGSGKTTQIPQWCVEFAAVTGGQAHGVACTQPRRVAAMSVAQRVAEEMDVALGQQVGYSIRFEDCSGPQTVLKYMTDGMLLREGMSDPMLEQYRVILLDEAHERTLATDILMGVLKEVIKQRSDLKLVIMSATLDAGKFQLYFDNAPLMNVPGRTHPVEIFYTPQPERDYLEAAIRTVIQIHICEEVAGDILLFLTGQEEIEDACKRIKREIDNLGPDVGELKCIPLYSTLPPNLQQRIFEPAPPNRPNGRIGRKVVVSTNIAETSLTIDGVVFVIDTGFSKQKVYNPRVRVESLLVSPISKASAQQRAGRAGRTRPGKCFRLYTEKAYKDEMQDNTYPEILRSNLGSVVLQLKKLGIDDLVHFDFMDPPAPETLMRALELLNYLAALDDDGNLTDLGAVMAEFPLDPQLAKMLIASCNHNCSNEILSITAMLSVPQCFVRPNEVRKAADEAKMRFAHIDGDHLTLLNVYHAFKQNMDDPHWCYDNFINYRSLKSGDNVRQQLSRIMDRFNLKRTSTEFTSKDYYINIRKALVNGFFMQVAHLERTGHYLTVKDNQQVQLHPSTCLDHKPDWVIYNEFVLTTKNYIRTVTDIKPEWLLRIAPQYYELSNFPPCEARRQLELLQARLDSKLYQEGF-
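
Consistency check (illustrative):

The transcript and protein records above can be cross-checked against each other. The sketch below is a minimal example, not part of the geneset pipeline. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein record marks the stop codon. The function name `translate` and its `stop_symbol` parameter are hypothetical. The example translates only the first ten and last three codons of DPOGS214457-TA, copied from the nucleotide record, and confirms that 736 residues plus one stop codon account for the 2211 bp transcript.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the last character of the protein record).
    """
    cds = "".join(cds.split()).upper()  # drop line breaks inside a FASTA body
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First ten codons and final three codons of DPOGS214457-TA, as listed above.
first_codons = "ATGTCTAAGAGAAGGATTGAAGTGATGGAT"
last_codons = "GGGTTCTAG"
print(translate(first_codons))  # MSKRRIEVMD
print(translate(last_codons))   # GF-

# Length check: 736 residues plus one stop codon cover all 2211 bases.
print(3 * (736 + 1) == 2211)  # True
```

Passing the full nucleotide record to `translate` should, under the same assumptions, reproduce the full protein record. Because the function strips whitespace first, a line break inside the sequence does not affect the result.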