Monarch geneset OGS2.0

DPOGS211320
TranscriptDPOGS211320-TA4362 bp
ProteinDPOGS211320-PA1453 aa
Genomic positionDPSCF300125 + 81574-95716
RNAseq coverage186x (Rank: top 49%)
Annotation
HeliconiusHMEL0093660.080.17% 
BombyxBGIBMGA004949-TA0.070.41% 
Drosophilaspn-E-PA0.033.55% 
EBI UniRef50UniRef50_UPI00022CA45B0.040.56%UPI00022CA45B related cluster n=3 Tax=unknown RepID=UPI00022CA45B
NCBI RefSeqXP_001600067.10.036.54%PREDICTED: similar to ATP-dependent RNA helicase [Nasonia vitripennis]
NCBI nr blastpgi|3800260410.037.46%PREDICTED: LOW QUALITY PROTEIN: probable ATP-dependent RNA helicase spindle-E-like [Apis florea]
NCBI nr blastxgi|3800260410.037.19%PREDICTED: LOW QUALITY PROTEIN: probable ATP-dependent RNA helicase spindle-E-like [Apis florea]
Group
Gene OntologyGO:00043862.4e-18helicase activity
GO:00055243e-17ATP binding
GO:00036763e-17nucleic acid binding
GO:00038241.1e-16catalytic activity
GO:00080261e-10ATP-dependent helicase activity
KEGG pathway 
InterPro domain[126-323] IPR0140011e-21DEAD-like helicase
[568-668] IPR0075022.4e-18Helicase-associated domain
[407-506] IPR0016503e-17Helicase, C-terminal
[1224-1292] IPR0090061.1e-16Alanine racemase/group IV decarboxylase, C-terminal
[1161-1239] IPR0226442.3e-12Orn/DAP/Arg decarboxylase 2, N-terminal
[899-1016] IPR0081914.9e-11Maternal tudor protein
[143-289] IPR0115451e-10DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[945-1005] IPR0029996e-06Tudor domain
Orthology groupMCL11470 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211320-TA
ATGGAAGAACTTAAAGCGTTTTTTAACAATCCGTGCCCGTCAAATGCTCAAGTAATCAAGTTGCGTGGACCTCTTACCGGTGGGCAAAGGATTACAAAAGATGATCCTATAAATGACTTAAATAGACGAGAAGAACATCGCCAATCAGTTGTTATTAGTGGTACTGATTATGCTCGTGAAGTAAAATTAAAAGAAACTGAAATGTATTTGAAGAATGAGCGAGAGCGTGAGAATTTCCAAAGTATATCAGGCGGTTTAGACTCCATGGAACAATTATCATCATGGGGTGCATCATTTCAAAACTTGGAGACGTTGACCAAAGAGGCCATGACACAAGTTTACAATAAATACTCCTTTCAAATGAAAGAAGACACAAAGAACTTAGCTATCAATGCCTATCATGACGATATTTTAGATAGAATAAAAGCATTTCCAGTAGTAATAATTGAAGGACCCACTGGGTGTGGGAAGACCACTCAAGTTCCACAATGGATATTAGATGATGCATATAATAATAGGAAGCCATGTAAAATAGTGGTAACTCAGCCAAGGAAGATAGCAGCTATATCCATTTCGAGAAGAGTTGCTCAAGAAAGGGGTTGGGATGTGGGAGGCCTTGTTGGATATCAGGTGGGCCTTGAAAACAGAACATCTAATGATACCAGAATACATTATGTTACAACAGGAGTACTGCTACAAAAACTTGTAAATGCTAAAAATATGAATGAATACACACATGTGGTGCTGGACGAAGTACATGAGAGGGAACAAGATATGGATTTTTTATTATTGGTAGTTAAAAAACTACTTTACACAGTATCACCAAGTGTCAAAGTCATCCTAATGTCTGCTACATTCAATTGTAGAGCATTTGCCGATTATTTCTTGATACCAACACCAGCTGGTTTGCAAATGTCTTCATGTTTGAAAGTCACAAATGATAAACCTGCATTCACTGTGAAGACGTTTTATTTGAATCACTTGACAAAGTTTGGAGCTATCTTACAACAATCGACACCTAAATCTAACGAGCCAGTTATTTTGCCGCAAATGTACCACCTTGTCGTTAAACTGGTGAATGCGTTTGAAAATATTGATAAACAAGAGGAGTATAGTGACAGGTCGGAGGCAGACCTGCCCTCAGTACTTATATTTCTGCCGGGCATACATGAAATTGAAGAGCTATATGCTTCTTTAATGGACATTGAACTGAGGAAGAAAGTGGGTGATGAAGAGTGTTCTTCATACAAATGGTGGGTTCTGCCGCTCCATTCGACTATCACAGCTGATGAACAGGTGAGAGTATTCCAAAGAGCTCCACCGGGACACAGGAAGATCATACTAGCTACAAACATAGCTGAGAGTTCCATCACTGTACCAGACATTAAATATGTTATCGACTTCTGTCTCATGAAAGTCTTGGTGGCCGATGAGAACACCAATTTCACGTCGCTGCAACTATCCTGGGCGTCGAAGACTAACTGCGAGCAGCGTGCGGGCCGTGCGGGCCGCGTCAGGGACGGCCGCGTCTATAGGCTGGTTCATGAGAAGTTTTATGACAACCTCCCCCAAGAATGCAAGCCAGAGATCATCCGTTGTCCTCTAGAGCGTCTGGTCCTTCTGTCCAAAATGCTGGACATGGGTACCCCGAGCGATATCCTCGCACTGGCTATGGACCCCCCTGACATGTCCAACATACATCGGACCATATTGGTGCTTAAGGAGGTCGGTGCTTTAAAGAAGACGATGGACGGGGAGTGGTGTGTCTCGGACGGTGATATAACACACTTGGGACGAATAATGGCCAAGTTGCCCCTGGACGTGAGAGTGTCGAAGCTGATCCTACTCGGATATATATACGGATGTCTCGAAGAGGCTGTTGTTATGGCTGCCGGTTTGTCCGTGAAGAATGTATTCAGCAGTCCGTTCCGCGAGCGCCTCAACGCCTACAACTCCAAACTGACGTGGGCCGACGGTTCAACGAGCGATTGCATTGCCCTACTTAACGTTTACAAGGTGTGGAATCATCTGCGTCAACAGAAGTATTTCAAGCAGCAAGGCACCAACGAAGTGCAATGGGCGCGTCGGTTCTACGTTCAAGTGAGGGCGTTGAGAGAGTTGGACGATATGGTGAAGGAGATACGGGCGAGGCTCTCGAGGGAGGGCATCCAGCAGGGGACAGCCCCGTGGAACAAACAGGAGCTACCGCTGGTGCTGAAGGTTGTGCTAGCGGGTGCATTCTACCCGCAGTACTTCACCCAAGCCAGCACGGACGAGTCCCGTGAGCGCGACGCGGTGAAGGTCGTGGGTGGGCTGGACCCGCGCGGATCGGTGTACCTCCGCGGGCTGCCGGACGCTCAGCCGGCCGAGATATACCAGGCCGCGGTCCGAGCCGCCGTCCACAGGCTGCTGGGAGACGAACCCAGGGTCACCGTTGACAGAAACAGCAGAAAAATTTACCTCACATTCCAAGACAATGAAAGCTCCAAAACCGGCAATAAAGGAGGTGATCCTACAATTCCCGGTCAAGTTTTGCTGCCTGTATATAAAGCTGTCAAAGCTAGACAACTTAAAATGGATGTCAGGATACCTTTGTTACCACTGGAAAAAGCAATAGCTCTATCAGAAGCAATGAAGTCTAAATTAGTTGATATGGATTTAAACGCCATGGTGCCCAGTCTGCCGGACATAGACGATACACATTTCGCTCTGAAAATCTCTCAGTTTATCAACGTAGGCAAGTTCTGGGTTCAACACGATGACGAGTCTACGAGGAATGAGCTTCGGGAGATCCAGAGGTCATTGAACTCTAGACCTCTTCTGGCTGTCACTGGGGATGTGGCAGTGGACGATCTGGTGACAGCACCCTACGCTGATGGGACCACCACATTGATGTACCGCGCACGGATTATTAGGATACTGCCGAGAGATATGTTAGAGGTGCTGTACATCGACTACGGCAGCGCAGGTCGCGTATCGCGCAGCAGTATACGCGTCCTGCCGCAGGTGTGCTGCAGGCCGCTGGCCTTGTGCTGCAGGCTGGCTGGGCTGGCTCCCGCACCACTGCTCGACTCTCATACACACTGGTCACAACCAGCACAGAAGATGTTCACCGAGCTGGTGGGGAGAGGGCGGCTCATTGCGAAGGTGTATTCAGTGACCCATGGCGTTGTGTCCATAGAATTGTTGTCGGAAGGTGGTAAGATTAATATCAACAAGGAACTCCTGGACAAGGGCTATGCTGTGCCCTGTGAAGAAAGCTATGATTCTAAGCTGAATCACGACCTTAGGGAGATGGCCACAGATTTGAATTTGGCGCAGAAGCGTGCGTACAACAAGGAACAAACCGAACTAGCTTTTACCCAACTACGGGAGGTGGAAGCGCCCGTCTATAGGGAATATACATCAGATGTCTGTTTGAAAGGACCATACAGCATGGAACACACAGACAGAATTGTACACAATGACGGATTATACACATCTCCGGTAACATACACAGCGGTGTACTCGCGCGGGATCCGCCTCGCCCGCGCGCTGTTCGAGGCGGGCGACGCCGCGGGCCACCGCATGTCTCTGCTGGACCTGGGGGGCGGATTCCCCGGACACCACGGCTCCTCCATACGAGAGGTGTCCGAGGTGATATCCGAGGCTCTGGCCACGTACTTCCCCAGCGGCGGGGTGCGCGTGATCGCGGAACCCGGTCGCTACTTCGCAGCAGCTGCTTACACTCTGTTCACGCTCGTACACGCCAAGAGACAGCTGCCTGCTGACGGCGAGGAGGACCCTCACACCATGTACTTCATCAACGACGGTGTGTACGGCTCCTTCAACTGTGTGCTGTACGACCATCAGGTAGTCACTCCAGAACTACTACAAGTAAACCATATCCGCCACCTGATGGACTACATGCTGTACTGCAGCGAGGGTCAGGAATCGCCGGCCATGGACGACGAGCTAAGGGTCAACGTGCCCAAGTATATACGAGAAGATCTCATGCAGTTGCTACTGAAACGTCGGAAACACCGTGAGACTGAAAGCGTTGTTAACGCCTGGGAATGGAAGTCTATACCGGAGGATGAGCTGTTGGAGATCACTATACCGGATATGGTTGAGAGAGCTGTCGTGTACTCATTGCACTTGCCATTAGAACTACGCCCAAAGGACAGAGACACTCTGCTGGCGTTGAAGAGGGACAACGATGACCTCAAAATGAGAGTCAGCAGGTCGCTGGTGTCTAACGACACTGTGCTAGTGTGCAAGCTGTGTGGATCGACGCCAATGCCGGTGCACTCGATGAGAATTCATCTCAGTTCAAATTCTCATCGCGATAAAGAGGAAGACTTCAAAGCTGTGTTATCATAG

Protein sequence:

>DPOGS211320-PA
MEELKAFFNNPCPSNAQVIKLRGPLTGGQRITKDDPINDLNRREEHRQSVVISGTDYAREVKLKETEMYLKNERERENFQSISGGLDSMEQLSSWGASFQNLETLTKEAMTQVYNKYSFQMKEDTKNLAINAYHDDILDRIKAFPVVIIEGPTGCGKTTQVPQWILDDAYNNRKPCKIVVTQPRKIAAISISRRVAQERGWDVGGLVGYQVGLENRTSNDTRIHYVTTGVLLQKLVNAKNMNEYTHVVLDEVHEREQDMDFLLLVVKKLLYTVSPSVKVILMSATFNCRAFADYFLIPTPAGLQMSSCLKVTNDKPAFTVKTFYLNHLTKFGAILQQSTPKSNEPVILPQMYHLVVKLVNAFENIDKQEEYSDRSEADLPSVLIFLPGIHEIEELYASLMDIELRKKVGDEECSSYKWWVLPLHSTITADEQVRVFQRAPPGHRKIILATNIAESSITVPDIKYVIDFCLMKVLVADENTNFTSLQLSWASKTNCEQRAGRAGRVRDGRVYRLVHEKFYDNLPQECKPEIIRCPLERLVLLSKMLDMGTPSDILALAMDPPDMSNIHRTILVLKEVGALKKTMDGEWCVSDGDITHLGRIMAKLPLDVRVSKLILLGYIYGCLEEAVVMAAGLSVKNVFSSPFRERLNAYNSKLTWADGSTSDCIALLNVYKVWNHLRQQKYFKQQGTNEVQWARRFYVQVRALRELDDMVKEIRARLSREGIQQGTAPWNKQELPLVLKVVLAGAFYPQYFTQASTDESRERDAVKVVGGLDPRGSVYLRGLPDAQPAEIYQAAVRAAVHRLLGDEPRVTVDRNSRKIYLTFQDNESSKTGNKGGDPTIPGQVLLPVYKAVKARQLKMDVRIPLLPLEKAIALSEAMKSKLVDMDLNAMVPSLPDIDDTHFALKISQFINVGKFWVQHDDESTRNELREIQRSLNSRPLLAVTGDVAVDDLVTAPYADGTTTLMYRARIIRILPRDMLEVLYIDYGSAGRVSRSSIRVLPQVCCRPLALCCRLAGLAPAPLLDSHTHWSQPAQKMFTELVGRGRLIAKVYSVTHGVVSIELLSEGGKININKELLDKGYAVPCEESYDSKLNHDLREMATDLNLAQKRAYNKEQTELAFTQLREVEAPVYREYTSDVCLKGPYSMEHTDRIVHNDGLYTSPVTYTAVYSRGIRLARALFEAGDAAGHRMSLLDLGGGFPGHHGSSIREVSEVISEALATYFPSGGVRVIAEPGRYFAAAAYTLFTLVHAKRQLPADGEEDPHTMYFINDGVYGSFNCVLYDHQVVTPELLQVNHIRHLMDYMLYCSEGQESPAMDDELRVNVPKYIREDLMQLLLKRRKHRETESVVNAWEWKSIPEDELLEITIPDMVERAVVYSLHLPLELRPKDRDTLLALKRDNDDLKMRVSRSLVSNDTVLVCKLCGSTPMPVHSMRIHLSSNSHRDKEEDFKAVLS-