Monarch geneset OGS2.0

DPOGS204771
TranscriptDPOGS204771-TA5367 bp
ProteinDPOGS204771-PA1788 aa
Genomic positionDPSCF300231 + 192041-201017
RNAseq coverage1059x (Rank: top 12%)
Annotation
HeliconiusHMEL0150360.081.28% 
BombyxBGIBMGA010194-TA1e-8338.72% 
Drosophiladom-PA0.055.68% 
EBI UniRef50UniRef50_UPI00022467AF0.054.95%UPI00022467AF related cluster n=1 Tax=unknown RepID=UPI00022467AF
NCBI RefSeqXP_002427447.10.050.00%Helicase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3454838730.054.95%PREDICTED: hypothetical protein LOC100115939 [Nasonia vitripennis]
NCBI nr blastxgi|3454838730.055.38%PREDICTED: hypothetical protein LOC100115939 [Nasonia vitripennis]
Group
Gene OntologyGO:00036778.3e-87DNA binding
GO:00055248.3e-87ATP binding
GO:00043861.8e-20helicase activity
GO:00036761.8e-20nucleic acid binding
KEGG pathway 
InterPro domain[575-858] IPR0003308.3e-87SNF2-related
[568-761] IPR0140015e-36DEAD-like helicase
[221-292] IPR0139992.4e-28HAS subgroup
[221-292] IPR0065621e-23HSA
[1132-1215] IPR0016501.8e-20Helicase, C-terminal
Orthology groupMCL15355 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204771-TA
ATGAGTGGACGAAATGATCCCGCCTCAACTTTTGGATCAGCTTCTTCACCGTTACCTCTCATGGGTGTGGAACGTGGAACCGGTTGTGGAGGCGGGGGAGGGGGAGGGGGCGGCGAGGAACCTCTCAACGGGGTCGCCGTGGAGCGACGTGATGACGAGCCGCCTAGAAAGAAGAGCAAGCTGCATGGGGTCGAGGACGTGTCGGCGCTCAGGAAACGCGTGTTGGAGTACAAGTTGCTCCGGTTGAAGAATCTGCGAGAAAGGTTCACGGAGCAACTGAGCGAGTTGTATTTTCTTCAGGCGGGTGGAAACATGATGGACTACTCGGCTTGGCGAAAAAAACCTCCAGGTCCACAGCTGACCGCCTTTTTAGAGTCCCGCCGGCCGCCCCTCGTGGTGCCACCCCCGCCGGAGCCGCCCCCCCAGCGGGCCAGGGTTTCCTCTCCGGCGGTGGCGGTGGCGTCCGAGCCGGCCCCGGCCTGTGCTCCTAGCGTCGAGCCTACTGTGGTGACGTCCTCCGCCGCGCCCCCTCCCGTCACCCCGGCAGCCGCCGCCGCCGCCGCCGCGGACGAAATGGTCGAAAAGGCCAAACAGGAGGCGTACGTGGCGGCGCGAGTAGCGGAGCTCGCCCGTGCTGGCCTGTGGACCGAACGGAGGCTGCCACGCGTGCTCGAACCGCCGAGACCTAAGACGCATTGGGATTACTTGCTCGAGGAGATGGCCTGGCTTGCGCAAGATTTTGCCCATGAACGGAAGTGGAAAAAGCAAGCCGCCAAAAAGTGTGCTCGTGCGGTTCAAAAATATTTTCAAGACAAAGCTGTCGCGGCTCAGAAAGCTGAAAAAGCTCAAGAGTTACAGTTGAAGAAGATTGCTGCCTTCGCAGCCAAAGAAATCAGAAATTTTTGGTCCAATGTCGAAAAGCTGGTCGAGTGGAAACGCGTCCGCCGAGTAGAACGAGCCCGCAAGGAGGCTCTCGATGAGCAACTGAGTTATATCGTAGACCGCACGGAGCGCTACTCGCGTCAACTGGCAGCCAACCTGGGCGCGCCCGCAGCCCCTGCTGCGACCCCGGCCGTCCCGGACACGCCGCCCTCCGACGACGAGTTTCAGCCGCGAGACGACTCCGATGACGACGAGGAGACCATAGCGGCTGCCGAGCGAGAGGCCGCACACGACGCCACGGATCATCGCGACGAGCTTGAGGCGCTACGCCGGGAGTCCGACCTCGACCTCGGCGACCTGTTGCCCCCGGGATACGTGCCAGCCCACTCGCCGCCGCCCTCGGACTACGGGCCCGACGTCGACTCCGCCGACGACGAGGACACCATCGCGGAACAGGAACAGAACGAACGGCCCGAGGACGCCGCCGCAGAGCTCGCCGCGCTCAGGAACGACGCGGACCTCGACATACACGAGCTGCTCGGGAGATACAATGCCGAGGACGGGGACGGCTCCACGGCCACGGAGCGCGACACGGACGACGAAGACGAGCCTTCGGAAATAAGTAGCGACGAGTCCGCGGACTCCGAGCAGCTAGGAGCTCTCATGGAAACTGACGAAATCAAAGAAGAGGCGAGAAGGGACGATGCCAATGCGGAAGGAGAGCAGCGCGTCGAGGCTGCCGCCTCGCTCGCCGCCTCTCTTCAACCCACTGGCACCACGCTTTCGGAGACGGCCGTGGCCACGCCCGTGCCGGGGTTGCTGCGACACTCGCTCCGGGAGTACCAGCACGTGGGCCTTCACTGGCTGGCCACCATGCACGCGCGGGGACTCAACGGCATACTGGCCGACGAGATGGGGCTCGGCAAGACCATACAGACCATAGCTCTGCTGGCGCACCTGGCGCTGGACCGCCGAGACTGGGGACCGCACCTCGTGGTCGCGCCCACCTCCGTCGTCCTCAACTGGGAGATGGAGTTCAAGAAGTGGTGTCCGTCCTTTAAGATCCTCACCTACTACGGCACCATCAAGGAGAGGAAACTGAAACGTGTCGGTTGGACCAAAACGAACTCGTTCCACGTGTGTATAACCTCGTACAAACTGGTAGTGCAGGACCATCAGAGTTTTCGTAGGAAGAAGTGGAAGTATCTCATATTAGACGAAGCTCAGAATATCAAAAATTTTAAATCTCAGAGATGGCAGATGTTACTGAATTTCCAGACAGAAAGACGGTTACTCCTGACGGGCACGCCGCTACAGAACAGTCTGCTGGAGCTGTGGTCGCTCATGCACTTCCTCATGCCGGACGTGTTCGCCTCGCACTCGGAGTTCCGCGAGTGGTTCGCGCCCGTCGCCGGCATCGCGGAGGGCTCGCACCGGTACAGCGACGAGCTCGTGAGGAGACTGCACGAGGTACTGCGACCTTTCCTGTTGCGGCGCCTGAAGGCGGACGTGGAGCGACAGATGCCGCGCAAGTACGAGCACGTGCTCATGTGTCGACTCTCCAAGAGGCAGCGGTTTCTCTACGACGACTTCATGTCCCGAGCGAAAACGAAAGAGAGTCTCGCCTCGGGCAACCTGCTGAGCGTAATCAACGTGCTGATGCAACTCCGCAAGGTGTGCAACCACCCCGACCTGTTCGAGCCGCGGCCGGTGTCCTCTCCGCTCCAGCTGCCACCGTTGCCTTACCGCGTGCCTTCACTGGCGCTCGTTGCCGATGTGGTGCGACGAGTGGAGCTGGCGGCGCGCCTCGGCGGGGACCTCGCGACGCTCGAAGTGTCCGGCGCTGGCGCCTTCGCCGCTCACCGCGCTCGCCACCTGGCTCCGCCGCGGCGCCTCATAGAGGAGCTGCCGGACCCCCCGCCGCCGCCTACTCCGCGTCCCCCGCCCTCCGNNNNNNGTCCGCTCACCGCCGCGCCAGCCTGCGGCGCATGGCCGCGGTCAACGAGCGCCGCTGCTGGCGCCTGCCCCTGTTCGGCGCGGACTTGCGCGCGGCGGTGGACGTGGGTCCGCCTCCCTTGCCGCCGCGGGATATCTCGGACGTGCTCCGCGACCTGCACGACGTCATCGACAGGTTCACTATGTTCATTCCCGTTTTGTCAAGCGAGGGTTCCCATTAAGGAAGGCAATATACCTTCTCCCCCCCCCAAACCCCTCGTTCATGATCTGCTGGTCGTCGTGCCGGGCGCGCGGGCTCCAGATGTGGGCGGGCGCGGGGGAGGGGGCGCGGGCCTGGCGGCGGGCGGCGCGCCCCCCCCGAGGGCGGGGCTGCGAGCCGCGCGCGCCGCCCTCACCTTGCTTCACGTTCCGGCCGCGCGCGCAGCCGTCGCCTTCCCGCACCCGAGACTGCTGCAATACGACTGCGGTAAGCTGCAGACGCTGGACGGTCTGCTCCGGCGGCTGAAGGCGGGCGGCCACCGCGTGCTGATCTTCACTCAGATGACGAGGGTGCTGGACGTGCTGGAGGCGTTTCTCTGTATGCACGGCCACGCCTACCTCCGCCTGGACGGCGCCACCCGTGTGGATCAGCGCCAGCCGCTCGTGGACCGATTCAACGCGGATCCTCGAATTTTCGCTTTCATCCTGTCCACGCGCAGCGGCGGCGTCGGCCTCAACCTCACCGGAGCGGACTCCGTGGTGTTCTACGACTCGGACTGGAACCCCACCATGGACGCGCAGGCTCAGGACCGCTGCCACCGGATCGGTCAGACGCGCGACGTGCACGTGTTCCGCCTCGTCACCACGGCCACCGTCGAAGAGAACATTCTGCGCAAGGCCGAACAGAAACGGACCCTCGGCCACCTCGCCATCGAAGACGGACACTTCACCACGTCCTATCTGAGAGCGGCCAACATCAAGGAGTTGTTCGGAGCGGAGACGGAGCCGACGGCCGGCCAGAGAGACTGCGAATCGGCAGAGGGTGGGGAGTTGGAGTCCGCCCTCGCGGCCGCGGAGGACGAGGCGGACGCCGCCGCCGCCCAGGCGGCCAGGGCGGAGGCTCAGGGTGACCTCGCCGAGTTCGACGAGACCGTACCGCTAGATGAAGACACGCGCGCAGCTTCCCCCGGACATGCGGGAGACGAAGACCGGGGAGAGTTCGCCGCCCTCATGAAACAGTTAACGCCGGTGGAGAAATACGCAATGAGACTGGTGGAGAGCAGCGAGGCGGCCACTGAGGCGGAGCGGGCGGCGCTCGGGGAGATGAGGAGGCAGCTGAGGGAGTGGGAACAGGCGAGGCGCGCGCTCCGGGACGAGGCTAGCGACCAGCACGAGCAAGAGACGGAACCGGAGCACGACCAGGACCTGGAGCTCACTTACTGTCGGGAGGACGCCCGCACGGAGATATGGATCGACGGCAACGGAGCGGCGGAGCGCATGCCGATGTGGTGTCCGCCCACCCCGCCCTCCAGCGACGGAGACGTGTACTGCGACAGCTGGGCGCGAGCCCTGTACCGGCGTGGGGCGGCCGCCGACGCTCTCCTACCGCCCGTCCGCTGGAGAGACTCTCGTGCTCCGCGGTCACCGCGCCGCACTAGGCCGGCTCCCCGCTCCGCCCACGCTCCTCCTTCGCTGTTCGACCGCGGCGGCCCGCGTCCACGGCCCCGCGTCCGCGCGCCCCCCGCCCCGCCCAGGGACCACGTCACTCCTCCGCCCGACTGGGCTCCTTGCGAGGACGCCGCTCTACGGCGAGCGCTCCGTCTGCAGCGTTTACCGCCGGAACCTCCGGCCGCTCACGCTCCCAACTGGGATTGGCTCGCCGACCTGGTCGGGGAGGTCGCCCGCGCCTATCGCTCTCCCCGCTCCTGTCGCGATCGTCACGACGCGCTCGCCGACCCGGAGCGCGCCCGCCGCAAACATCGGAAGCCTCCGCCGGCCCGACGTCGGCCGGACGATGACGCCCCGCGTCCGCCGCTACAACGACTCGACGCGATGCGGGAAGCCGCCGAGCGACGTCGCGCGGCGCCCAAGCGTCGCCTGGACGACGCCTCGCACCACAATCCCAAGCACGCCGCGCTGCTCGCCGACCATGGAGTGGACTACGACGCTCCTCCTTCACCGATGGAGGTGGCCACACGACGAGCGGAGCGTATCGCGAAGGAGAAGATGAAGGTCGGCGCGAGCGCTTCCAACGCCGGGAACGTCGCGCCGCCGCCGGCCGCCGCCAGCGGCGCGCCTCCGCCCGTCACCGCACAGCGTATAGTAGTGGCGGCGCACGGGGCTCCGGCAGCGGCCGCGGCGGCGCCCGGGCCCGGCAAGCCGGAGGTCCGCCGTCCGCGGCCGGGGGAGGCTCCGCGAGCTCAGGCGCCGGCCGCCGCACAGCTGCTGTACCGCCAGCAGACGCTCGCCGGCAGGCACCATCTGAAGATATTGCACCACTCGACACCGACCCAGCCGCAGGTAGGGCCGCGCCGGGGACTGCGGCGCTAG

Protein sequence:

>DPOGS204771-PA
MSGRNDPASTFGSASSPLPLMGVERGTGCGGGGGGGGGEEPLNGVAVERRDDEPPRKKSKLHGVEDVSALRKRVLEYKLLRLKNLRERFTEQLSELYFLQAGGNMMDYSAWRKKPPGPQLTAFLESRRPPLVVPPPPEPPPQRARVSSPAVAVASEPAPACAPSVEPTVVTSSAAPPPVTPAAAAAAAADEMVEKAKQEAYVAARVAELARAGLWTERRLPRVLEPPRPKTHWDYLLEEMAWLAQDFAHERKWKKQAAKKCARAVQKYFQDKAVAAQKAEKAQELQLKKIAAFAAKEIRNFWSNVEKLVEWKRVRRVERARKEALDEQLSYIVDRTERYSRQLAANLGAPAAPAATPAVPDTPPSDDEFQPRDDSDDDEETIAAAEREAAHDATDHRDELEALRRESDLDLGDLLPPGYVPAHSPPPSDYGPDVDSADDEDTIAEQEQNERPEDAAAELAALRNDADLDIHELLGRYNAEDGDGSTATERDTDDEDEPSEISSDESADSEQLGALMETDEIKEEARRDDANAEGEQRVEAAASLAASLQPTGTTLSETAVATPVPGLLRHSLREYQHVGLHWLATMHARGLNGILADEMGLGKTIQTIALLAHLALDRRDWGPHLVVAPTSVVLNWEMEFKKWCPSFKILTYYGTIKERKLKRVGWTKTNSFHVCITSYKLVVQDHQSFRRKKWKYLILDEAQNIKNFKSQRWQMLLNFQTERRLLLTGTPLQNSLLELWSLMHFLMPDVFASHSEFREWFAPVAGIAEGSHRYSDELVRRLHEVLRPFLLRRLKADVERQMPRKYEHVLMCRLSKRQRFLYDDFMSRAKTKESLASGNLLSVINVLMQLRKVCNHPDLFEPRPVSSPLQLPPLPYRVPSLALVADVVRRVELAARLGGDLATLEVSGAGAFAAHRARHLAPPRRLIEELPDPPPPPTPRPPPSXXXPLTAAPACGAWPRSTSAAAGACPCSARTCARRWTWVRLPCRRGISRTCSATCTTSSTGSLCSFPFCQARVPIKEGNIPSPPPKPLVHDLLVVVPGARAPDVGGRGGGGAGLAAGGAPPPRAGLRAARAALTLLHVPAARAAVAFPHPRLLQYDCGKLQTLDGLLRRLKAGGHRVLIFTQMTRVLDVLEAFLCMHGHAYLRLDGATRVDQRQPLVDRFNADPRIFAFILSTRSGGVGLNLTGADSVVFYDSDWNPTMDAQAQDRCHRIGQTRDVHVFRLVTTATVEENILRKAEQKRTLGHLAIEDGHFTTSYLRAANIKELFGAETEPTAGQRDCESAEGGELESALAAAEDEADAAAAQAARAEAQGDLAEFDETVPLDEDTRAASPGHAGDEDRGEFAALMKQLTPVEKYAMRLVESSEAATEAERAALGEMRRQLREWEQARRALRDEASDQHEQETEPEHDQDLELTYCREDARTEIWIDGNGAAERMPMWCPPTPPSSDGDVYCDSWARALYRRGAAADALLPPVRWRDSRAPRSPRRTRPAPRSAHAPPSLFDRGGPRPRPRVRAPPAPPRDHVTPPPDWAPCEDAALRRALRLQRLPPEPPAAHAPNWDWLADLVGEVARAYRSPRSCRDRHDALADPERARRKHRKPPPARRRPDDDAPRPPLQRLDAMREAAERRRAAPKRRLDDASHHNPKHAALLADHGVDYDAPPSPMEVATRRAERIAKEKMKVGASASNAGNVAPPPAAASGAPPPVTAQRIVVAAHGAPAAAAAAPGPGKPEVRRPRPGEAPRAQAPAAAQLLYRQQTLAGRHHLKILHHSTPTQPQVGPRRGLRR-