Monarch geneset OGS2.0

DPOGS208572
TranscriptDPOGS208572-TA3402 bp
ProteinDPOGS208572-PA1133 aa
Genomic positionDPSCF300064 + 1571579-1577266
RNAseq coverage204x (Rank: top 47%)
Annotation
HeliconiusHMEL0042910.059.25% 
BombyxBGIBMGA010637-TA0.056.69% 
DrosophilaRecQ5-PB4e-17541.22% 
EBI UniRef50UniRef50_A0NH760.050.78%AGAP001255-PA n=2 Tax=Coelomata RepID=A0NH76_ANOGA
NCBI RefSeqXP_001651140.10.052.61%DNA helicase recq5 [Aedes aegypti]
NCBI nr blastpgi|3479655910.050.78%AGAP001255-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479655910.038.75%AGAP001255-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00063104e-244DNA recombination
GO:00080264e-244ATP-dependent helicase activity
GO:00055241.5e-24ATP binding
GO:00043861.5e-24helicase activity
GO:00036761.5e-24nucleic acid binding
KEGG pathway 
InterPro domain[2-1103] IPR0045894e-244DNA helicase, ATP-dependent, RecQ type
[13-219] IPR0140012.4e-27DEAD-like helicase
[264-345] IPR0016501.5e-24Helicase, C-terminal
[21-189] IPR0115452e-22DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL12331 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208572-TA
ATGGATAACGTTACAGAAAAATTATTGCAATGTTTTGGTCACCGAAGGTTTAAAAGTGAATTGCAAGAGAGAGCAGTGCGAGCAATCGCGCGGGGAGTTCACGATGTCTTCGTTTCTATGCCTACTGGTTCTGGAAAATCATTGTGTTTCCAATTGCCAGCGATGTTGCAGGAGAATAAGGTCGCAGTCGTTTTCTCGCCTTTATTAGCTTTAATTAAGGATCAAGTCGATCATTTGACTAAATTAAAAATCGCAGCCGAATCTATAAATTCTAAAATGACTCAAAAAGACAGAGAAAGAGTTTTGAATGACCTTCGTAGTATGAAACCTAACACTAGGTTTCTATATGTAACACCGGAGCAGGCAGCTACAGGTACATTTAAAGCGCTCATGGAACATCTCGTGAAGTATAAGAAGGTTTCATATGTAGTGGTAGATGAGGCTCACTGTGTAAGTGAATGGGGTCATGACTTTAGACCCGATTACCTTAAGTTAGGCAACCTGAGGGAAAAATTTAAAAGTATACCATGGGTCGCCCTCACTGCAACTGCCAGTGCCGAAGTAACTAAAGACATATTAGAAAATCTTAAGTTGCTAAATCCAGTTGCACAGTACAAAACTCCCAGTTTCAGAAGAAATTTATACTACGATGTTGTTTATCAGAATTGTATCCAGGATGAAATAGGAGACCTGGTGGAATTTTTAAAGAAAAATTTGAAGGATGAAATAAGTGTTAAACCAAAAGATAAGAGTGCGGCCATAGTGTACTGTCGGACAAGGGAACAGACGGAAGATATTGCCAGCATGCTAACAAAAAGAGGTCTAAACTGTTTGGCATATCATGGAGGCTTAAAGAGTAGCGAGCGTGTGTCTGTGCAGGACCGTTGGTCTAATGGTGAAGTTCCCTGTGTGAGTGCGACTGTTTCATTCGGTATGGGTGTTGACAAAGCCAGCGTTAGGGCTGTGGTCCACTGGGGTCTGCCGCAGAACGTGGCCGCTTACTACCAGGAGTCTGGGCGGGCCGGTCGCGACGGTAAGCCCGCGTTCTGTCGTATATACTACTGCCGCAGTGAGCGTAACGCCGTTGATTTTTTGCTGAAATCGGAAATAGCCCGCTCTAAGACGCCCGAGCAAAAACAACGGTGCAAGAACGCGTATAAGAGTTTCGAAGTTATGGTCAAATATTGTGAAGAAGTCAGATGTCGCCACAAGATCTTCGCAGATTTCTTCGGCGAGGAGGTCCCTCAATGCATCACCCGCTGCGACTCGTGCACAGACGAGCGTGCCGTCCGCCGCGCTCTAGACCAGCACACGCGGCGGGCGATGAACGCCAGTCTGGGGAACACTGGCTTCGATAACAACGACTCCGCCAACTTGTACGGGGAGGGACGACACGGACAAAAGAAAGAGGCGGAGTCGTACTATGGCGACGGCAGCGGAGACTCGGACAGTGATTCTAGTAGACGTCGCGTTGCTGAAGAAACGAAAAGTCTTATATTAAAAGAATTCGCAGCTCGAAAAAAGAATACAGAGAAAAATAAAGCGAACAGTGACGCTGAATCCGCTAAGCACTCGAAATGTAAAGCAGCTGAGAGCACAGGGACGAAGGTGACCGGTCTCACAGTAGCGGGTCGTGAAAGCTATCTTTCCTTACTCACTGAAGCTTTGAGAGCAAATATCGCTAACATGAAAGATATTGAAGAAACAGAACATACATTGTCGCGAGCGGACGTTGAACAGTTCGCTATTGATTTAGAATATGACGCGTTCTCTAAAAGCACCGTCATAAGCCTGTATAGGAGGGCCATGTCGAAATTGATAGCAGCTATCAAGGGATGTAAGGACACGCTGTACCCGGGTCTCAAAACTTTTGAACCAAAGAAAACAGATTCGCTTATGGATTTTATAAAAGATTTCGAAGCTAAACGAGGTACACAAAAGTATCTAGGTTTTATAACGGCCGCTCAGTTAGATAACAGTACTGCAAACACAGAACAAACTTTATCTAAATCCGATAAAGACAGTAAGAGGAAAGCTAATTTATTCAAAAAAGATCCTTTGACTCAAACCAAGCTGCAAAACTTCTTTTCAACAAAATCATCTCCTGAACCTCTAAGCTCAGACATGAGCGAGGACGAGGGTGGTCTAGTCATAGATGAGAACGCTAAAGTAGATGACACAGAAACCACTCTACTCATAGAAGATAAAGATAAAACGACTTCCGACTGTAAGGAAAGTTTAAAACCGTATATAGAGAGGAGCGATAGTGATTCCAGTAATAAAAGGCAACAGACTCTCGTGATTAACATAACGTTAGAGGGAATACCGAAACACGAGGAAAATAAAGAAGGAAAAGTTAAACAAGAAAAAGATTGCAAACTAACGTCCCCCGAAAAGATGAAGCCCACTGCAAAGAGAAAAATAAAGGCGCTGTTCGGAGAATCCTCTGAGAGCGAAACGGAATTAAATGAAATCAAACGGTCTCGGACATCGCAGAAACTAAACGAACACAAGTCACCCAATAAACAGTCGCATAGAAGAAGATCTGGAGAAATACAGTCTAAAACACGGAGCACTAAGCATAGAGAGAAGAAGTCAAAGCGTAACGATGAAGACAGAGATAGAAAGATACCAGACGAAAACAAGGCAGCGAGACCCGCTGCTGTACAAAACATATCCAACACATCCATCGTGAAAGAAGAGAGTGAAAGGAGACAAGAGAATAAAGAAGTGTTGCATGAGAACATCGTCATGAATACTGACAGCATTGACGATCCTGAAATAATCAGTGAAAATGATATGATCAACTCGGATGGTGACAAAATAGTAGACGAACAATCTCTAGAAAAAGCACACAAACTTAATCTTGAAGCTGAAAAGGTATTGCAGGAGTTGAAGATATTTTCTGAAATCAAACCGGAAGTAAAACCTGAAGTAAAAGAAAAAAATCAAATCGAAGTCAAAGTAAGTAAACCAAGGAGTCCGGTCACTATGTCCAAATCACCGACAAAACATGGCACAAAAGATAAATTAATACCCGACACTACGAAATCCAAAACCGTTATATCTGCTGACAAACTTAAAGCTAAGCTACCGAAAGCGAGGGAAGAAATAAAAGACGATGTCAAGAAAAAAGAAAGACTCAAAGAAAAGAAGTCGGAGAAAATAGATGTCGCTGGCCTTGTTGTTAAATTGTTGATGCCTTATTATAAAAAGAAAAAAATTAGCAACAGAGACTTGTTTAAGATAACCGCCAGGCATATCGTGCATCAATTGCTTGCCATACAAATAACAGAAGAAAAAGCGATAGAAATGCTTCTTAAGAAGACTTTCAGCAAGGAATTAAGAATTGAAAAGGAAAGTGATCTAGAGACTAAATTAAAGTTCAACAATGTCACGTGA

Protein sequence:

>DPOGS208572-PA
MDNVTEKLLQCFGHRRFKSELQERAVRAIARGVHDVFVSMPTGSGKSLCFQLPAMLQENKVAVVFSPLLALIKDQVDHLTKLKIAAESINSKMTQKDRERVLNDLRSMKPNTRFLYVTPEQAATGTFKALMEHLVKYKKVSYVVVDEAHCVSEWGHDFRPDYLKLGNLREKFKSIPWVALTATASAEVTKDILENLKLLNPVAQYKTPSFRRNLYYDVVYQNCIQDEIGDLVEFLKKNLKDEISVKPKDKSAAIVYCRTREQTEDIASMLTKRGLNCLAYHGGLKSSERVSVQDRWSNGEVPCVSATVSFGMGVDKASVRAVVHWGLPQNVAAYYQESGRAGRDGKPAFCRIYYCRSERNAVDFLLKSEIARSKTPEQKQRCKNAYKSFEVMVKYCEEVRCRHKIFADFFGEEVPQCITRCDSCTDERAVRRALDQHTRRAMNASLGNTGFDNNDSANLYGEGRHGQKKEAESYYGDGSGDSDSDSSRRRVAEETKSLILKEFAARKKNTEKNKANSDAESAKHSKCKAAESTGTKVTGLTVAGRESYLSLLTEALRANIANMKDIEETEHTLSRADVEQFAIDLEYDAFSKSTVISLYRRAMSKLIAAIKGCKDTLYPGLKTFEPKKTDSLMDFIKDFEAKRGTQKYLGFITAAQLDNSTANTEQTLSKSDKDSKRKANLFKKDPLTQTKLQNFFSTKSSPEPLSSDMSEDEGGLVIDENAKVDDTETTLLIEDKDKTTSDCKESLKPYIERSDSDSSNKRQQTLVINITLEGIPKHEENKEGKVKQEKDCKLTSPEKMKPTAKRKIKALFGESSESETELNEIKRSRTSQKLNEHKSPNKQSHRRRSGEIQSKTRSTKHREKKSKRNDEDRDRKIPDENKAARPAAVQNISNTSIVKEESERRQENKEVLHENIVMNTDSIDDPEIISENDMINSDGDKIVDEQSLEKAHKLNLEAEKVLQELKIFSEIKPEVKPEVKEKNQIEVKVSKPRSPVTMSKSPTKHGTKDKLIPDTTKSKTVISADKLKAKLPKAREEIKDDVKKKERLKEKKSEKIDVAGLVVKLLMPYYKKKKISNRDLFKITARHIVHQLLAIQITEEKAIEMLLKKTFSKELRIEKESDLETKLKFNNVT-