Monarch geneset OGS2.0

DPOGS211135
TranscriptDPOGS211135-TA1617 bp
ProteinDPOGS211135-PA538 aa
Genomic positionDPSCF300007 - 275891-279903
RNAseq coverage212x (Rank: top 46%)
Annotation
HeliconiusHMEL0172180.068.22% 
BombyxBGIBMGA003010-TA2e-9360.65% 
Drosophilaspas-PA6e-0622.49% 
EBI UniRef50UniRef50_G6DQV10.0100.00%Werner helicase interacting protein n=5 Tax=Eumetazoa RepID=G6DQV1_DANPL
NCBI RefSeqXP_001647950.12e-15651.45%werner helicase interacting protein [Aedes aegypti]
NCBI nr blastpgi|3838618151e-15652.10%PREDICTED: ATPase WRNIP1-like [Megachile rotundata]
NCBI nr blastxgi|3838618157e-15152.10%PREDICTED: ATPase WRNIP1-like [Megachile rotundata]
Group
Gene OntologyGO:00055243.9e-16ATP binding
GO:00001661.3e-12nucleotide binding
GO:00171111.3e-12nucleoside-triphosphatase activity
GO:00036773.4e-05DNA binding
GO:00062813.4e-05DNA repair
KEGG pathway 
InterPro domain[387-534] IPR0218864.6e-49MgsA AAA+ ATPase C-terminal
[158-270] IPR0039593.9e-16ATPase, AAA-type, core
[154-274] IPR0035931.3e-12ATPase, AAA+ type, core
Orthology groupMCL16891 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211135-TA
ATGGAGGGTGGAGAATCTGAGACAAATTTTGTTTCTTGTCCTGTATGTAACAGAGAGTACGATAAAAATATGATAGAGAAACATGTAGATAAATGCATTTTCTTAAATACCTTATCAAATAATACTAAAAGAAGCGCATCACATTTAGAAGAAAGTAGTATACAGCAGAAAAAAGCAAAGGTAGAAAAGAAAAGTCCTATCAGCAACATACAGCCATCTAAAGCAAAAACCACTCAGCCAAAAAAAAGTCCTGTGAGTGGTCCATCAAAGGGAAGGGCAAGTTTTCATGAACTACAAAGTGGTGATGCTGAATCTACAAATATAAAATCAACTGCTAAAACAAGTGTCAAAAGTATACCACTAGCCGAGTCAATGCGACCAAATTGTCTTGAAGATATTGTGGGGCAGATTGAATCATTTGGTGCTGGATCAATGTTATATTCAATGCTGATAAAGAGCAAAATACCTAACATGATATTGTGGGGTCCACCAGGGTGCGGAAAAACTTCACTTGCCAATGTTGTGGCAAATATTTGCAAAGAGCAGCCCAATCTGAGATTTGTGAAGCTGTCAGCAACTATGTCTGGTATAAATGACGTCAAGGAGGTGGTGAAAGTTGCTAAAAATGAAGCACAATTTAAAAGACAGACTGTTTTGTTTATGGATGAAATTCATAGGTTCAACAAATTACAACAAGATACATTTCTGCCGCATGTTGAAAATGGAACTATAACATTGATCGGAGCCACAACTGAGAATCCATCGTTCAGTTTGAACAATGCCCTGTTGAGTCGTTGTAGAGTCGTTGTTCTTAGTAAGTTATCAGTCGATGATGTCACACAAATATTACAGAGAGCCATATTAAGAAATGATTTAGCTACTATTATTGATAATTCAAATAAAGAAGTCAACGGAGGTGTTCAAAAAAGATGCAGGATATCTCTGGAATCAATCCGTTGGCTGGCCGAGGTCAGTGATGGAGATGCACGTGTCGCTCTCAGTGCTCTGGAGTTGACACTGGCCGCTGGAACTCCTGACACAATTATTAGTTTGGAAGATTTGAAGAATGGGATCAAGCGCACGCACATGCTATACGACCGGACGGGTGAAGAGCACTACAACATAATATCCGCCATGCACAAGTCGATTCGTGCCGGTGACGACAACGCGGCCCTGTACTGGAGCACCCGCGCGTTACACGGCGGGGAAGACCCGCTGTACGTGGCCAGGAGGTTGGTGCGGGCGGCGTGCGAGGACATCGGGTTGGCTGATCCGAACGCGTTAGTGGAGGCGGTGGCGTGTCTGCAGGGTTGTCAGCTGATCGGTATGCCGGAGTGCGACGTGTTGGTGGCCCAGTGTGCGGTGCGCCTCGCCCGCGCCGACAAGAGTAGAGAAGTCTACAAGGCTATGACTCGCGTGCAGAAGGCTCTCAAGGAGGCCAGGGGTCCGCTACCCTCGGTGCCATTACATCTTAGGAACGCTCCCACCAAGCTTATGAAGGATTTAGGATATGCTAAAGGATACAACTTACATCACAAAGATGTATCGGGTCTTACATACATGCCGGAAGGAATGGAGAACGAAAACTTCTTTCGAAAGGAAGACATTTGTAATTAA

Protein sequence:

>DPOGS211135-PA
MEGGESETNFVSCPVCNREYDKNMIEKHVDKCIFLNTLSNNTKRSASHLEESSIQQKKAKVEKKSPISNIQPSKAKTTQPKKSPVSGPSKGRASFHELQSGDAESTNIKSTAKTSVKSIPLAESMRPNCLEDIVGQIESFGAGSMLYSMLIKSKIPNMILWGPPGCGKTSLANVVANICKEQPNLRFVKLSATMSGINDVKEVVKVAKNEAQFKRQTVLFMDEIHRFNKLQQDTFLPHVENGTITLIGATTENPSFSLNNALLSRCRVVVLSKLSVDDVTQILQRAILRNDLATIIDNSNKEVNGGVQKRCRISLESIRWLAEVSDGDARVALSALELTLAAGTPDTIISLEDLKNGIKRTHMLYDRTGEEHYNIISAMHKSIRAGDDNAALYWSTRALHGGEDPLYVARRLVRAACEDIGLADPNALVEAVACLQGCQLIGMPECDVLVAQCAVRLARADKSREVYKAMTRVQKALKEARGPLPSVPLHLRNAPTKLMKDLGYAKGYNLHHKDVSGLTYMPEGMENENFFRKEDICN-