Monarch geneset OGS2.0

DPOGS212017
Transcript: DPOGS212017-TA (5349 bp)
Protein: DPOGS212017-PA (1782 aa)
Genomic position: DPSCF300054 - 863649-887673
RNAseq coverage: 96x (Rank: top 62%)
Annotation
Heliconius: HMEL013594, 0.0, 93.39%
Bombyx: BGIBMGA010171-TA, 0.0, 86.93%
Drosophila: CG5205-PA, 0.0, 65.48%
EBI UniRef50: UniRef50_Q9VF56, 0.0, 65.48%, CG5205 n=24 Tax=Coelomata RepID=Q9VF56_DROME
NCBI RefSeq: XP_321922.4, 0.0, 66.30%, AGAP001234-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|347965554, 0.0, 66.30%, AGAP001234-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|347965554, 0.0, 66.93%, AGAP001234-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology:
GO:0005524, 1.2e-24, ATP binding
GO:0008026, 1.2e-24, ATP-dependent helicase activity
GO:0003676, 1.2e-24, nucleic acid binding
GO:0004386, 1.1e-16, helicase activity
KEGG pathway 
InterPro domain:
[973-1282] IPR004179, 9.5e-91, Sec63 domain
[5-216] IPR014001, 1.3e-28, DEAD-like helicase
[11-187] IPR011545, 1.2e-24, DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[306-394] IPR001650, 1.1e-16, Helicase, C-terminal
Orthology group: MCL10167, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212017-TA
ATGGCTTTCGAGAATATCAAGGAATTAAACCGCATCCAATCTGTTGTGTTCCAAACGGCGTATAACACTAACGAAAATTTGCTCATTTGCGCCCCCACCGGCGCTGGTAAGACAAACATAGCCCTGCTAACTGTAGTTCACCAGCTCAAACAGCATATAGAGAATGACGTCATCATGAAGAATAAATTTAAGATTATCTACATAGCTCCTATGAAGGCACTGGCTTCGGAGATGACAGCCAGTTTTGGCAAACGTCTTCAAAGTCTTGGCATCACGGTCCGAGAACTCACAGGAGACATGAAACTAACTAAAGCAGAAGTTCAACAGACGCAGATGATTGTTACGACGCCCGAGAAATGGGATGTTGTTACCAGGAAAGGGGCTACGGATACTGAATTGGCGTCAATAGTGAAATTACTCATCATAGATGAAGTACATTTACTACACGGAGACAGAGGACCCATAGTGGAGGCTATCGTGGCCAGGACATTGAGACAGGTTGAATCAACTCAGAACATGATAAGAATCGTGGGCCTGTCAGCAACGTTACCCAACTACGTCGATGTTGCCAGATTCCTCCGTGTGAACCCCAACATTGGTCTATTCTACTTCGACTCCCGTTTCCGTCCGGTGCCGCTCGAACAGCAGTTTATAGGAGTGAAGGAAATAGGTTCCGGCGGGGGAACACACCTGAGACAGATCCAGACAATGAACGAAATATGCTACGACAAAGCCTCTGAGATGGTGCAGAAAGGTCACCAAGTAATGGTTTTCGTTCACGCTCGTAACGCAACCCATCAGACGGCTTTGATTCTGAAAGAGATCGCCCAAAAGAAAGGACACCTCAAGTATTTTGAGCCCGAGGACTCTGGAGGTTTCTTAAAGGCCAAGAAGTCTATCGGCAGCAGTCCTAACAAACAATTGGCAGAGCTCTTTTCCGCTGGTTTTGCCTGTCATCACGCTGGGATGTTGAGAAGCGATAGGAACATGGTAGAAAAATACTTCGCAGAGGGATACATCAAAGTACTCGTGTGTACCTCAACACTGGCTTGGGGAGTCAATCTACCAGCACATGCGGTTGTCATAAGGGGTACAGAAATCTACGACCAAAGTCACGGAACGTTCGTCGACCTCAGCATACTGGACGTGCTCCAAATCTTCGGTCGTGCTGGGAGGCCGCAGTTCGACACATCCGGCACAGGCATCATCATAACGACCCACGACAAACTGACTCACTACTTGAAGAGTATGACCAATCAGTTCCCGATAGAGAGCAACTTCATCAATCTCTTAGCAGACAATTTGAACGCCGAGGTGGCGCTCGGCACAGTCACTAATATCGACGAAGCGGTTGAGTGGTTGAGCTACACTTACCTATTTGTCAGAATGAGGATCAATCCGCAGGTCTACGGTCTGACATATACCGATGTCCAAGAGGACCCCACTTTGGAAACTAGGAGACGCGAACTAATTACAAGCGCGGCCATGCAACTAGACCGCACGCATATGCTGAGATACAACGAACGTACCGGAGACCTGCATATTACCGACCTCGGCCGGACGGCCAGCCACTATTACATAACATGTGAAACCATGGAGGTGTTCAACACCATGGTACGAAAATCTATGACTCAGGGTTATGTGCTGGAGATGCTGACCAGGTGTTCGGATTTCCAGCAGTTGAAAGTCAGAAAAGAAGAATTGACAGAACTGTGGAATCTGAAAGACATGTATTGTGAGTTACGAATCGAAGACGCTCCAGAGGACATACACTGGAAGATCAACATCTTATTACAGACATATCTGTCACGCGGTCGGGTCAGCGGGTCCTCGCTGCAATCAGACTTGAATTACATCAGCCAGACAGCTCGCCATTTGAAGAAATGCGCCGAGGAATTCCCATTGCTGGACATGGAGGCTAGCTTACATCCTATAACGAGGACTGTATTGAGAATAAGACTCACCATCACACCCAACTTTAAGTGGAATGACAAGTACCA
CGGTAAGGCACCAGAGGCCTTTTGGATCTGGGTCGAAGATCCCGACACCGACATAATGTACTACCATGAATATTTCCTCATTACCAAAAAACAAGTCATAACAAACGAGCCTCAAGAATTAGTTATAACGATTCCTATATCTGAGCCTTTGCCTCCTCAGTACTATATCAGAGCTACTTCAGAAAGATGGCTAGGCTCAGAGAGTGTACTGCCTTTGACATTCCAACATCTAATTCTACCAGAAACTCATCCACCTCATACAGGTGACGTTTCACCGGATATCCGGGCCATAAGACAGTCGCAAGTCATAGTGACGACTCCTGAGAAGTGGGACGGCATCAGCAGGTCCTGGCAGACGAGGAATTACGTGAGGGACGTGGCTCTCATAGTCATAGATGAGATACACCTGTTGGGAGAAGATAGGGGACCGGTTTTAGAAGTCATTGTGTCCAGAACCAATTTCATAGAATCCCATACGTCTCGTCGTCTCCGCATCATAGGTCTCTCCACGGCCCTCGCCAACGCTAAGGATCTAGCCAACTGGCTGAACATTGGAGAAATTGGATTATACAACTTCAGGCCGTCCGTCAGACCTGTCCCGTTGGAGGTCCATATATCAGGCCACGCGGGTCGTCACTACTGTCCGCGGATGATGTCTATGAACAAACCCACATTCAGCGCTATCAGAACACACTCCCCTGCCTCACCAGCTCTGGTCTTCGTGTCCAGTAGACGGCAGACCAGATTAACGGCGCNATACAAAAATTGTAACTTAAAGGTCTACGGTCTGACATATACCGATGTCCAAGAGGACCCCACTTTGGAAACTAGGAGACGCGAACTAATTACAAGCGCGGCCATGCAACTAGACCGCACGCATATGCTGAGATACAACGAACGTACCGGAGACCTGCATATTACCGACCTCGGCCGGACGGCCAGCCACTATTACATAACATGTGAAACCATGGAGGTGTTCAACACCATGGTACGAAAATCTATGACTCAGGGTTATGTGCTGGAGATGCTGACCAGGTGTTCGGATTTCCAGCAGTTGAAAGTCAGAAAAGAAGAATTGACAGAACTGTGGAATCTGAAAGACATGTATTGTGAGTTACGAATCGAAGACGCTCCAGAGGACATACACTGGAAGATCAACATCTTATTACAGACATATCTGTCACGCGGTCGGGTCAGCGGGTCCTCGCTGCAATCAGACTTGAATTACATCAGCCAGAACGCGGTTCGTATAGTTCGCGCGCTTTTTGAAATAACGTTAAGGAAAAATAACGCATACATGGCGGGATTGTATCTCAAGATGGCGAAAATGATGGAACTTCAGCTATGGGATTTCTATAGTGATATGAGACAGTTCAACTGCTTCCCCAACGAGATATTGAAGCATATAGAGTACCCGTTACTGAAACCGGATCAACTGAGAGATATGGATTGGAAGGAAATAGGCGACCTAATACGTAACCCTAAGACAGCTCGCCATTTGAAGAAATGCGCCGAGGAATTCCCATTGCTGGACATGGAGGCTAGCTTACATCCTATAACGAGGACTGTATTGAGAATAAGACTCACCATCACACCCAACTTTAAGTGGAATGACAAGTACCACGGTAAGGCACCAGAGGCCTTTTGGATCTGGGTCGAAGATCCCGACACTGATATAATGTACTACCATGAATATTTCCTCATTACCAAAAAACAAGTCATAACAAACGAGCCTCAAGAATTAGTTATAACGATTCCTATATCTGAGCCTTTGCCTCCTCAGTACTATATCAGAGCTACTTCAGAAAGATGGCTAGGCTCAGAGAGTGTACTGCCTTTGACATTCCAACATCTAATTCTACCAGAAACTCATCCACCTCATACAGATCTGCTAGAGTTACAACCTCTGCCAGTGACAGCCCTGAACAATCCCTCCTACGAAATGCTATACAACTTCAGTCACTTCAATCCGATACAGACACAAATATTCCATGCGCTGT
ATCACACCGACCATAACATACTACTCGGAGCGCCGACCGGCTCGGGGAAGACGATAGTCGCTGAAGTGGCCATGTTCAGGGTCTTCAACCAATATCCGGGTTGTAAGGTCGTCTACATCGCGCCACTTAAAGCTCTTGTCAAAGAAAGAATAAAGGATTGGAAAGTGAGGCTGGAAGAGAAACTTGGAAAAAACGTCGTGGAATTAACGGGTGACGTTTCACCGGATATCCGGGCCATAAGACAGTCGCAAGTCATAGTGACGACTCCTGAGAAGTGGGACGGCATCAGCAGGTCCTGGCAGACGAGGAATTACGTGAGGGACGTGGCTCTCATAGTCATAGATGAGATACACCTGTTGGGAGAAGATAGGGGACCGACAAAACAAGACATCCTAGACTATCTGACTTGGACCTACTTCTTCCGGAGGTTGCTGAAGAACCCTTCGTATTACAATTTAGAGAGCATCGAGCCGCAAGACATTAACTGTTACCTTTCAAATTTAGTGCAAACCTCTTTGGACGCCTTGGCTAACGCGAATTGCATCGAGATAGAAGAGGTCCTAATTTGCACGGCGACGCTAGCTTGGGGTGTAAATTTCCCCGCGCACCTGGTCGTTATCAAGGGTACGGAGTACTTCGATGGAAAACAGAAGAGATACGTGGACATGCCCATCACTGATGTACTGCAGATGATGGGACGAGCGGGACGCCCACAAGCTATAATAGACACAACAGCGGAGAACGGTTGGTTATCAGTTTGTTTGATATCACAAATGTTGATGCAATGTATCGTTCAAGCACGATGGTATACGGAATCGGCTCTAACGACTCTACCGCATATAGAGTCACAGCATTTGTACATGTTCTTACACATGACCAGAGACACTAATAAGCCATGTTTCACATTGAATGGCTTAAAGGTAGTTTGTGCGAAAAATTATGAGCTACTAGCCAAATATATGAGACGGGAATTTGAAGAAAATCAAATTGAACATGTGTACAGGGAATACACGATCGTAATAGACATGCAAAGACGTGGTGGTAACCCAAATAACGTCTTATGTCCGCGTTTCCCCCGGGGTAAGAATGAGGGCTGGTTCATCACCCTGGGGTCTATAGAAAATGGAGAGCTACAAGCTTTAAAACGCGTTCCGCCTAAGGGTACATCAAATGTTACTTTCTACACACCATCGCAAAATGGACGCATAATATACACAATGTATGTAATGAGTGACAGTTACATGGGTCTAGATCAACAGTATGACTTACAATTTGACATCATTGGCCCTTTACCCACAGAGACGGTTGACAGAGTATACGATACCATAGACAAAGTTATCATTGAATGA

Protein sequence:

>DPOGS212017-PA
MAFENIKELNRIQSVVFQTAYNTNENLLICAPTGAGKTNIALLTVVHQLKQHIENDVIMKNKFKIIYIAPMKALASEMTASFGKRLQSLGITVRELTGDMKLTKAEVQQTQMIVTTPEKWDVVTRKGATDTELASIVKLLIIDEVHLLHGDRGPIVEAIVARTLRQVESTQNMIRIVGLSATLPNYVDVARFLRVNPNIGLFYFDSRFRPVPLEQQFIGVKEIGSGGGTHLRQIQTMNEICYDKASEMVQKGHQVMVFVHARNATHQTALILKEIAQKKGHLKYFEPEDSGGFLKAKKSIGSSPNKQLAELFSAGFACHHAGMLRSDRNMVEKYFAEGYIKVLVCTSTLAWGVNLPAHAVVIRGTEIYDQSHGTFVDLSILDVLQIFGRAGRPQFDTSGTGIIITTHDKLTHYLKSMTNQFPIESNFINLLADNLNAEVALGTVTNIDEAVEWLSYTYLFVRMRINPQVYGLTYTDVQEDPTLETRRRELITSAAMQLDRTHMLRYNERTGDLHITDLGRTASHYYITCETMEVFNTMVRKSMTQGYVLEMLTRCSDFQQLKVRKEELTELWNLKDMYCELRIEDAPEDIHWKINILLQTYLSRGRVSGSSLQSDLNYISQTARHLKKCAEEFPLLDMEASLHPITRTVLRIRLTITPNFKWNDKYHGKAPEAFWIWVEDPDTDIMYYHEYFLITKKQVITNEPQELVITIPISEPLPPQYYIRATSERWLGSESVLPLTFQHLILPETHPPHTGDVSPDIRAIRQSQVIVTTPEKWDGISRSWQTRNYVRDVALIVIDEIHLLGEDRGPVLEVIVSRTNFIESHTSRRLRIIGLSTALANAKDLANWLNIGEIGLYNFRPSVRPVPLEVHISGHAGRHYCPRMMSMNKPTFSAIRTHSPASPALVFVSSRRQTRLTAXYKNCNLKVYGLTYTDVQEDPTLETRRRELITSAAMQLDRTHMLRYNERTGDLHITDLGRTASHYYITCETMEVFNTMVRKSMTQGYVLEMLTRCSDFQQLKVRKEELTELWNLKDMYCELRIEDAPEDIHWKINILLQTYLSRGRVSGSSLQSDLNYISQNAVRIVRALFEITLRKNNAYMAGLYLKMAKMMELQLWDFYSDMRQFNCFPNEILKHIEYPLLKPDQLRDMDWKEIGDLIRNPKTARHLKKCAEEFPLLDMEASLHPITRTVLRIRLTITPNFKWNDKYHGKAPEAFWIWVEDPDTDIMYYHEYFLITKKQVITNEPQELVITIPISEPLPPQYYIRATSERWLGSESVLPLTFQHLILPETHPPHTDLLELQPLPVTALNNPSYEMLYNFSHFNPIQTQIFHALYHTDHNILLGAPTGSGKTIVAEVAMFRVFNQYPGCKVVYIAPLKALVKERIKDWKVRLEEKLGKNVVELTGDVSPDIRAIRQSQVIVTTPEKWDGISRSWQTRNYVRDVALIVIDEIHLLGEDRGPTKQDILDYLTWTYFFRRLLKNPSYYNLESIEPQDINCYLSNLVQTSLDALANANCIEIEEVLICTATLAWGVNFPAHLVVIKGTEYFDGKQKRYVDMPITDVLQMMGRAGRPQAIIDTTAENGWLSVCLISQMLMQCIVQARWYTESALTTLPHIESQHLYMFLHMTRDTNKPCFTLNGLKVVCAKNYELLAKYMRREFEENQIEHVYREYTIVIDMQRRGGNPNNVLCPRFPRGKNEGWFITLGSIENGELQALKRVPPKGTSNVTFYTPSQNGRIIYTMYVMSDSYMGLDQQYDLQFDIIGPLPTETVDRVYDTIDKVIIE-
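
The transcript and protein lengths in this record can be cross-checked with simple codon arithmetic. The sketch below assumes the transcript is a complete coding sequence whose final codon is a stop codon; the nucleotide sequence ending in TGA and the protein sequence ending in "-" are consistent with that, but the record does not state it. The function name `expected_protein_length` is hypothetical and only illustrates the check.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp nucleotides.

    Assumption: the sequence is coding from its first base, and (optionally)
    its last three bases are a stop codon that adds no residue.
    """
    coding_bp = cds_bp - 3 if has_stop_codon else cds_bp
    if coding_bp < 0 or coding_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return coding_bp // 3


transcript_bp = 5349  # length listed for DPOGS212017-TA
protein_aa = 1782     # length listed for DPOGS212017-PA

# (5349 - 3) / 3 = 1782, so the two listed lengths agree.
print(expected_protein_length(transcript_bp) == protein_aa)
```

The same arithmetic explains why the InterPro coordinates, which are given in amino-acid positions, all fall within 1 to 1782.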