Monarch geneset OGS2.0

DPOGS200823
TranscriptDPOGS200823-TA2619 bp
ProteinDPOGS200823-PA872 aa
Genomic positionDPSCF300071 - 610614-618182
RNAseq coverage321x (Rank: top 36%)
Annotation
HeliconiusHMEL0114740.079.33% 
BombyxBGIBMGA009877-TA0.083.77% 
DrosophilaEtl1-PA0.048.68% 
EBI UniRef50UniRef50_D6WXY60.054.02%Putative uncharacterized protein n=4 Tax=Neoptera RepID=D6WXY6_TRICA
NCBI RefSeqXP_967093.10.054.02%PREDICTED: similar to helicase [Tribolium castaneum]
NCBI nr blastpgi|910892090.054.02%PREDICTED: similar to helicase [Tribolium castaneum]
NCBI nr blastxgi|910892090.053.87%PREDICTED: similar to helicase [Tribolium castaneum]
Group
Gene OntologyGO:00036775.9e-76DNA binding
GO:00055245.9e-76ATP binding
GO:00043866.1e-23helicase activity
GO:00036766.1e-23nucleic acid binding
GO:00055154.9e-06protein binding
KEGG pathwaypic:PICST_571102e-129 
 K01509 (E3.6.1.3)maps-> Purine metabolism
InterPro domain[330-628] IPR0003305.9e-76SNF2-related
[323-523] IPR0140016.1e-32DEAD-like helicase
[727-809] IPR0016506.1e-23Helicase, C-terminal
[80-138] IPR0090604.9e-06UBA-like
Orthology groupMCL13175 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200823-TA
ATGTCTGAAAGTACGAGTCCTAGTGTATTAAGTTATCTAAGACAATATAGATTTCAAAAGAAACCTACCCCAGGTTCTAGCGGGGTTGTAACTGTTTCAGCATCATCTTCTGGCTCTCCATCACAAATTCGTATGCCAGCTCCAAGCACAGCAAATAGAAATGGACCTATAGTATATAAAAGAATTAGAATCCAAGATTCTGATTCAGATGACAACCCATCACCAGCTAAGAAGGTTGTGGTAGAACTAACAACAGCAGTCAAAGAAAGAAGGTTCCGGAACATGCTAGAAATGTTTCCTGATATTTCTCCTCTATTTGTAAAGAACACTTTGGTGAAACATGGTTGGAATGAGGAGAGATCTCGTGATGAGCTCCTCAAACACGATCCAGAGAGTAATGACAAAGCTTCATCATCCACATTCTTTGCATCACGTCCTGGATTTTCCGGACAATTACCGAGAACCAATGGTATGATAGTCAGAGTTACCAGCGGGCCTAGTGTGGGAGTTAGAAAACCCTTAGCTGCTAAAGGAGGAAGGGCGAGACGAGGTAGCAGCGGCAGTGAAGATGACGACTATGGAGTTAGAAAGGGCAAAGATGACAGAGTATATGACAGTGAGGACTCCGATAATGAGGTGACAGATGATCTTGTAGGTGATAAACGAAAGGTGTTTGAGTTCCTTAACACAAGCAGCATGAATGAATTGTCTTTACTAAGCGGCTGTTCCCAGAAGAAGGCGGAGGCGATCATGGCCCAAAGACCGTTTAAAGGCTGGGTGGATATGGTGGAAAAATTCAATAACAACAAGATGTTGAGCACAGATCTGTTGAATTCAACACAAGAGTTGCTAAGCACTAGGAACAATATTCAGCGTTTGATGAAGAAGTGTGTGGGGCTCGCCCAGCAGTTAGAGTCCGCAGTGGCAGCTGGAGCTGGGAGGTTGAAGCAACCTGGAATACTAGACTCCAGTCTGAAGTTGGCACCTTATCAGCTGGTAGGTTTGAACTGGCTGGCTGTACTTCACAAACAGGGAGTATCAGGTATATTGGCTGATGAAATGGGACTTGGGAAAACGGTACAAGTCATAGCATTTCTTGCGCACTTGAAGGAAACGGGACAGGCTAGGGGAACGCATCTGATTGTTGTACCTGCAAGTACTTTAGATAATTGGAGTAGTGAGCTGTCGCGGTGGTGTCCTTCACTCAGGGTCAGCAAGTACTATGGAAATCCTGAAGAGAGGAGACAGCTCAGGATAGAGTACTCCAGAGGACTGGATCAGATAGACATTGTACTTACCACTTACACAATGGTGTCCAGCTGTCCCGAGGAACGTAAAATGTTTCGTATAACACCAATGCACTATGTGGTCTACGACGAGGCTCACATGCTCAAAAACATGTCCACACAAAGATATGATAATTTGCTTAAAATAAAGTCGAAGCATCGTCTATTGCTAACGGGGACACCGCTTCAGAATAATCTGGTGGAGTTGATGTCTCTACTGTGCTTCGTCATGCCGCACATGTTCTCTGGGAACACCGATGACCTCAAGAATCTGTTCCAGAAGAACGCGAAAGCAAAAACCACAAAAAAGACAAATGGCAACACCGATGACGAAGTGCCGGCGTTCGAGCAAAGTCAGATCACTCAAGCTAAGAGGATTATGAAACCGTTTGTTCTGCGTCGGCTAAAGCGCGACGTGTTACAAGACCTACCTCAGAAGACGAACCACACAGAACTGTGCCCTATGTCGGAGAAACAACAGAGGCAGTACAAAGAGCTCATAGCTGGCTTTGCGGCTAAAGATGGAACAATCCACGCAACGACGGAACAAAGCGGCATATCGATGATGATGGACATGCGTAAACTGGCCAACCATCCCCTACTGCTGCGTTACCACTACGACCAACACACTACCCGCAAGATGGCCGCCCGCCTGGCCAGGGATCCCGACTACAAGGAGAAGAACGAACAATATCTGTTCGATGATCTCATGTGCATGTCCGACTTCCAAATACATCAACTCACCCAACAGTACTCTTGTATTAGACAATATGCAGTGCCGGATACTTTAATAGAAGATTCCGGCAAGTTCCAAAAGTTAGACTCAATGCTGCCACAATTGCAAGCTGAAGGTCATCGAGTGCTCATCTTCAGTCAGTTCACGATGATGTTGGATGTCATCGAGCCTTACCTTAGAATGAGAAACTACAGGTACCTCCGGCTAGATGGCAGCACTGCAGTCAATGAAAGACAAGACCTGATTGATCAGTACAATACCGAGGATATATTTGTGTTCCTGTTGTCCACGAAGGCGGGCGGACTGGGCATCAATCTGACCGCAGCGGACACTGTCATTATACACGATATAGACTTTAATCCATATAACGACAAGCAGGCAGAAGACAGATGTCACAGGATGGGTCAGACCCGCCCGGTCACTATATACCGTTTGCTGAGCGCTGGTACCATTGAGGAAGGTATCTATCAGGTTGCTCAGGAAAAACTTAACTTGGAGAAACACGTCACTGGCGCCGATGAAAACGAATCGACAGAGCAAAAGAATGTCGTCCGCCTGTTGTCCGCGGCGTTGGGTCTGACGTCACCGAACAAATGA

Protein sequence:

>DPOGS200823-PA
MSESTSPSVLSYLRQYRFQKKPTPGSSGVVTVSASSSGSPSQIRMPAPSTANRNGPIVYKRIRIQDSDSDDNPSPAKKVVVELTTAVKERRFRNMLEMFPDISPLFVKNTLVKHGWNEERSRDELLKHDPESNDKASSSTFFASRPGFSGQLPRTNGMIVRVTSGPSVGVRKPLAAKGGRARRGSSGSEDDDYGVRKGKDDRVYDSEDSDNEVTDDLVGDKRKVFEFLNTSSMNELSLLSGCSQKKAEAIMAQRPFKGWVDMVEKFNNNKMLSTDLLNSTQELLSTRNNIQRLMKKCVGLAQQLESAVAAGAGRLKQPGILDSSLKLAPYQLVGLNWLAVLHKQGVSGILADEMGLGKTVQVIAFLAHLKETGQARGTHLIVVPASTLDNWSSELSRWCPSLRVSKYYGNPEERRQLRIEYSRGLDQIDIVLTTYTMVSSCPEERKMFRITPMHYVVYDEAHMLKNMSTQRYDNLLKIKSKHRLLLTGTPLQNNLVELMSLLCFVMPHMFSGNTDDLKNLFQKNAKAKTTKKTNGNTDDEVPAFEQSQITQAKRIMKPFVLRRLKRDVLQDLPQKTNHTELCPMSEKQQRQYKELIAGFAAKDGTIHATTEQSGISMMMDMRKLANHPLLLRYHYDQHTTRKMAARLARDPDYKEKNEQYLFDDLMCMSDFQIHQLTQQYSCIRQYAVPDTLIEDSGKFQKLDSMLPQLQAEGHRVLIFSQFTMMLDVIEPYLRMRNYRYLRLDGSTAVNERQDLIDQYNTEDIFVFLLSTKAGGLGINLTAADTVIIHDIDFNPYNDKQAEDRCHRMGQTRPVTIYRLLSAGTIEEGIYQVAQEKLNLEKHVTGADENESTEQKNVVRLLSAALGLTSPNK-