Monarch geneset OGS2.0

DPOGS208953
TranscriptDPOGS208953-TA5073 bp
ProteinDPOGS208953-PA1690 aa
Genomic positionDPSCF300009 + 540388-550718
RNAseq coverage97x (Rank: top 61%)
Annotation
HeliconiusHMEL0146700.051.43% 
BombyxBGIBMGA002427-TA0.050.22% 
DrosophilaCG7922-PA1e-12740.13% 
EBI UniRef50UniRef50_Q16VR62e-13039.76%Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16VR6_AEDAE
NCBI RefSeqXP_001660079.14e-13139.76%hypothetical protein AaeL_AAEL009460 [Aedes aegypti]
NCBI nr blastpgi|1571232488e-13039.76%hypothetical protein AaeL_AAEL009460 [Aedes aegypti]
NCBI nr blastxgi|1571232486e-12339.69%hypothetical protein AaeL_AAEL009460 [Aedes aegypti]
Group
Gene OntologyGO:00055242.7e-06ATP binding
GO:00080262.7e-06ATP-dependent helicase activity
GO:00036762.7e-06nucleic acid binding
KEGG pathway 
InterPro domain[82-241] IPR0140013.2e-08DEAD-like helicase
[132-215] IPR0115452.7e-06DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL10191 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208953-TA
ATGGATCATGGACCCGCCAATCAGACAAATGAACACAAAAACAGTATCAATATTGATGAGGACCTTTTCGACGATTCTATCTTGGAAATTTTTGAAAATTCAGATTTTGAGGAAACAGCAGAACAAACAAAACATGAAAAAAATAATACCTTAAATAAAAATGCACTTAATGTTAGTGCACTGTGTTGTGATGAAGAACTCAATGGATATGATAAAATGTTAGGAAAGACATGGATCTATCCAACCAACTACCCCATCAGAGATTACCAGTTCAATATTATAAAGGCAGCTATGGTAGAAAATTGTCTGGTAAGTCTGCCCACTGGATTGGGCAAGACTTTTATAGCAGCTGTGATTATGTATAATTTTTACCGATGGTATCCATTAGGTCATATGCAAGTCAACACTAGAAAAAGTCATTGGCAAACGAAAAGAGTATTTTTTGCCACCCCTCAAGTTATCTACAATGATATCAAGTCCGGAACTTGTCCCAGTGATAAGATCAAATGTCTTGTCATAGACGAAGCTCACAAAGCGAAGGGGAATTATGCTTACAGTAGTATTATCAAAACTTTGACAGAAATGGGATACTTTATATATAGAGTGCTGGCACTTTCTGCAACACCTGGTAACAAGGTCGAAGATGTTATACATATAGTGAAACATCTCCATATATCTCGTTTAGAATTAAGAACGGAGAACTGCTCGGATGTGAAGGCTTACAGTCATGCGAGGAACATTAACACTGTGGTAGTTGAACTCGGCCCTGAACTAACGAAGCTGAGAGAACAATATGTGGAGATCCTAGACGGATATACGAGAAAGTTAACAAAATTCAATATAATACAGAACCTGGGAAATCTATCGAAAGGTCGGATCGTTATGTTGTACAAGGAGTTTCAAAATCGGGAACGTGGTGCGAGACACCCACAGCACAGCTATATAATGCGAATATTCACACTTCTAATAACCCTGTACCACGGTCTGGAACTGCTGGTGAAGCACGGCTCGAGGGTATTCCTCAACTTCTTCGACGAACATCCAGAGAAGACTTGGGTACATGAAGACAATGAACTGACAGCTTTCTTTGACAAGCTCAGAGACAAACTCGGCCTCAATCCCCTTGACCTCGACAGAAGTGTCCTACCTGACGGAACAGTTCCAGAGGTGCCATCCAATTTAAATTTCGGTCATCCGAAATTTTCTAAGCTTAAAGAGATCATAATGAGACATTTTGATACTGCCAAAAATAAGGGTCAAGTCACTAAAGCAATAGTGTTCTGCGAGTACCGAGAGAGCGTGAACTTGGTGTACTGCCTGCTTCTTCAATGTCGTCCAACTATAGTGCCGGAAATGTTTGTAGGACATGGCGCTTCCGGTAAAGACGGTAAAACGGTTATATCTCAGAAGCAACAGTTGCGTGTGATGCGTAACTTCCGGTCGGGTGTGTGTAACACGCTGGTGTGCTCCTCTCCCTGCAACGCAATGGACCCGGCACCCGCACGAGGTTTCGACTCATATATAATTTTACATCAGATAAAACATATTACAGTAATGTTTTGTTCGTACAGGTGCGGTCGGACCGGTCGGGAACGAAGCGGGCAAGTGTTCATCCTGGTCACCGAGGGCAGAGAACATAGTACTCTGTTAGATTGTATACGACAGAACGATGGCTTGAACCAGAAGATACTCACGTCGGAGGAGGTTAAGAAAAGTTTGTTTAAGTCCAATCCACGCATGATACCGGCGGATTTCATGCCGGAATGTCAGAAAATGTTTATAACTGTCGCTAAAAAGGAATCGAAAGAAACATCTAAAGACATCAAAGAAAAGTCCAAAGGAAAAAAGGAGAAAGCAAAGGACAAACTCTTGAAGGGTCAAAAAGATTTAAGGAGCATGTTACTCCAGAAAGGTACTAGTTCGATAATATCTCAACGAGACGACACTACAAGTGAACGGACAAAACTGATATCTAAACAAGAATTTGATGTTCTGTTCCCTGAAGGATACGAGGATAGCAATATCTTTTCACAACCCAACGACTGTTGGGCTCTGAACGAGTTTTTGAAAGAAAATAATGACACCAAACAAATAGTGAGGCCGTCGCCACAGTGGGAACCACAGAAAGCATTACAGAATACTTTCAAAGTGAAAAATTCGTCGGATACTAAAATGTTAATAGAATTATTTGAACAAACTAAAAACACCCCAATGACAAAGTCACCGGTCAAAAAAGACGGCGACATCCGTGTTTTATTCAGTAAATCCACGAAATCTATAAAGAATTACACCAAATCCATGAACGAGCTAGGTATAGATAGTCATTCGAATATATCAAACGACGTCCTGAACTTGTTGGTCGATCTGAGTCTAGAAAACAAAAACATGGAAAAACGTTGTTACATATGTAGTGTGGTTTGTAGATGTTCAGGACTCGTAAAGAGATCAGGTCATAGATTTACCGATTTGTACTTCCATGAACAACCTGACCTGCCGGATGTAGAATTGGTTGATTATTTAACAAAAGAATCTATAAAGGATCTATGTGAGGAAATTAATACTAGAAATATTAATAACGAGTCTGATATTGAGACAGACGTCACTTATGTTGGTAAAAGTGAAATGAAAAGTACAGATTTGAATATATCAGTTGGAAACGTAAAGAAAGGTAGCGGGACATCAGATGCAAGGCAATTTGATGTAAATTACAGTGACGATGATTTATTTGAAGAAAGTGATTTATTCGATGAAATCGATATACCTACGGAGTCGTGTGATAGACTCTCTAAGGACAGCAAGTGTATGGATAAAGCTGTGACTGAGTTAGGCTCACGTGACACATCTACAAACAGTGCAGATATAGCTGATAATCATTTAAATGAATCGTGCAGTAAAATAAACACTGAAATAAAATCTCATGCAGATGAAATGTATAGTGTGGGAACAACTAGAAATAAGGCTAACACTAATAACAGTATGATAACAGGTTACGTGAATGATGTCAGGAATGAAAACACAGAAGACTGCAATATAATAGAGGTAGAAGACGTTTTCGATGATTTCGATGAATTTAACAACAAAACGACTGAACTAACGCCGCCATTAAGATATAAAGATGAAAATGAATTAGAAAAATCACCTATTCTGTGTACAGTTAGGACTAGTTATATTGAAAGGGGAGATAGAACAGTTAATAGGAATGAAAAATGTGATCAATACGACATGCTGGAAGTGAATGATATCTTTGATGATTTAAACACATCAGACCTTACATCCGCTAAAACTAAATCTACAAACAATAACAATAAATACTCTGAATTAGCGATACGAGACACAAATGAGGATTTTGAGTGTATAATAATCAATAATGAAGATACCACTGACATGCACACACGCGAGAACGGTTTGACTTCCGGTGGTGACGCTCATAAAGATACTGACACATCGGATGATACAATTACAGAAACATCAACAGCTTCCAATAAAAAAAACGATATAAGAGATGTCCTAAAATATTTTTATATCGATAAAATTTCTGATATATTCGAACACGGCTATTACAGCTTTCAAGGAAGTAATCGTATTGATTTTAAAAGAAAAAATGCTAAACCATCGAATGATTTACATGATGTTACTGAAATTTATAACAAAAATAACGATATAATAGAAATCAAGGATGACGGAACTTACGAAATGTTCAATGATACAGCAATTGATGGTAATAAAATTTATATTAAAAATAACGATATAATAGAAATCAAGGATGGCGGAAGTCACGAAATGTTCAATGATACTGTATTAGAAGGTTCTCTGAGTCCGTCCTTGTTGTCTGGTAGAAGTCAAACTGTGCTCCCGTCAAGATCGGCTTCACCTATTCTAAACACTCAAAGGTCCAAGAAAAATCTTAGCTTGAACAAACAAACTTCTCCAATATTAAGCAGTCAACGACGTAAAATGTCCTTAAATATCAACAGAAGTGACAAAAAACACAACGGTCCCATATCAAGCGTTGGACAGACGAGTTTAATACAAAAAGTTAACAGAACATCGCAAGTGGAACCGGACCATAGAAAAACAGTCAAATTAGAGATGGATAATAATAACAAGAGCAACACTTCCGATATCATCATATTAGATAGCGACGAGGACGATGGAGACGAGAAACACGTGAAACATAATACAACAAAACGTAGACTTGTAGAAGTTGAATGTGAGTCGCCGTATTTTAAGAAAAAACCAAAACTCGAGGGAAACAATACTAAACCGACATCGATAAAAGAAAAGGTCATGGCTGCGCTGACCTCCCATCACAATCTCGATGCGACGTTCCATAACGACGCACAAATAAACTACGCCCTGACGTCACCCATGAGAGACTCGGTAAAAGAAAACAAAGATCCGCTCCGAAGTAAACTGGACGTGTTAAAAAAGTTTCAATATGATAGCAAGAATAGATCGAGCACAGTTTTAAAGGAAAACAAACGTAATGTATCGGCAAGAATTAGCGAGAGTGATGACGATTTTGTGTCCCAGGTGCCGTATAGAAAGAAAATATTAACAAAATTAAAAAGAAAGCCCGCTAAGAGCAAAACGAGAGAGGCGAAAAAGCCAAGTGAATTCCTGGACCTGGAGGCTGAGTTATCAGAAGACGAGGACGTGAGTGAAGACGAGCTCTCAGATGATAGCACAGACAGCATCAAGAACTTCATATGTGACGACACTGTGGCGAGGGATGGCGACATACAAGCGATATACCTCGAGTCTATCAAGAGTCCAGTCAAAGGTGTTTTCAAACTACCACAACTACCGGCCTTGAAGAAACATGAAGTCTTATCACAGTACGTCGAAGAAGACACTTATGAAATGGATAGTTTTTGTGTGGACTCGCATGTAGTATTAAATGAAACCCATGAAATGTCGGAGTTGGAACTGGCCGAGATGATATTAGAGGAAAGAAGACGAAACAGAAGAAACAGAAATCAAGTTGAAGATACAGAGGAAAGTCTGATAATTAAGAAGACTTGTAGGAAGATCAAAAGACAGATAAACAGTGATTCTGAAGATAGCAACTGA

Protein sequence:

>DPOGS208953-PA
MDHGPANQTNEHKNSINIDEDLFDDSILEIFENSDFEETAEQTKHEKNNTLNKNALNVSALCCDEELNGYDKMLGKTWIYPTNYPIRDYQFNIIKAAMVENCLVSLPTGLGKTFIAAVIMYNFYRWYPLGHMQVNTRKSHWQTKRVFFATPQVIYNDIKSGTCPSDKIKCLVIDEAHKAKGNYAYSSIIKTLTEMGYFIYRVLALSATPGNKVEDVIHIVKHLHISRLELRTENCSDVKAYSHARNINTVVVELGPELTKLREQYVEILDGYTRKLTKFNIIQNLGNLSKGRIVMLYKEFQNRERGARHPQHSYIMRIFTLLITLYHGLELLVKHGSRVFLNFFDEHPEKTWVHEDNELTAFFDKLRDKLGLNPLDLDRSVLPDGTVPEVPSNLNFGHPKFSKLKEIIMRHFDTAKNKGQVTKAIVFCEYRESVNLVYCLLLQCRPTIVPEMFVGHGASGKDGKTVISQKQQLRVMRNFRSGVCNTLVCSSPCNAMDPAPARGFDSYIILHQIKHITVMFCSYRCGRTGRERSGQVFILVTEGREHSTLLDCIRQNDGLNQKILTSEEVKKSLFKSNPRMIPADFMPECQKMFITVAKKESKETSKDIKEKSKGKKEKAKDKLLKGQKDLRSMLLQKGTSSIISQRDDTTSERTKLISKQEFDVLFPEGYEDSNIFSQPNDCWALNEFLKENNDTKQIVRPSPQWEPQKALQNTFKVKNSSDTKMLIELFEQTKNTPMTKSPVKKDGDIRVLFSKSTKSIKNYTKSMNELGIDSHSNISNDVLNLLVDLSLENKNMEKRCYICSVVCRCSGLVKRSGHRFTDLYFHEQPDLPDVELVDYLTKESIKDLCEEINTRNINNESDIETDVTYVGKSEMKSTDLNISVGNVKKGSGTSDARQFDVNYSDDDLFEESDLFDEIDIPTESCDRLSKDSKCMDKAVTELGSRDTSTNSADIADNHLNESCSKINTEIKSHADEMYSVGTTRNKANTNNSMITGYVNDVRNENTEDCNIIEVEDVFDDFDEFNNKTTELTPPLRYKDENELEKSPILCTVRTSYIERGDRTVNRNEKCDQYDMLEVNDIFDDLNTSDLTSAKTKSTNNNNKYSELAIRDTNEDFECIIINNEDTTDMHTRENGLTSGGDAHKDTDTSDDTITETSTASNKKNDIRDVLKYFYIDKISDIFEHGYYSFQGSNRIDFKRKNAKPSNDLHDVTEIYNKNNDIIEIKDDGTYEMFNDTAIDGNKIYIKNNDIIEIKDGGSHEMFNDTVLEGSLSPSLLSGRSQTVLPSRSASPILNTQRSKKNLSLNKQTSPILSSQRRKMSLNINRSDKKHNGPISSVGQTSLIQKVNRTSQVEPDHRKTVKLEMDNNNKSNTSDIIILDSDEDDGDEKHVKHNTTKRRLVEVECESPYFKKKPKLEGNNTKPTSIKEKVMAALTSHHNLDATFHNDAQINYALTSPMRDSVKENKDPLRSKLDVLKKFQYDSKNRSSTVLKENKRNVSARISESDDDFVSQVPYRKKILTKLKRKPAKSKTREAKKPSEFLDLEAELSEDEDVSEDELSDDSTDSIKNFICDDTVARDGDIQAIYLESIKSPVKGVFKLPQLPALKKHEVLSQYVEEDTYEMDSFCVDSHVVLNETHEMSELELAEMILEERRRNRRNRNQVEDTEESLIIKKTCRKIKRQINSDSEDSN-