Monarch geneset OGS2.0

DPOGS200891
TranscriptDPOGS200891-TA1875 bp
ProteinDPOGS200891-PA624 aa
Genomic positionDPSCF300066 - 465504-469299
RNAseq coverage493x (Rank: top 25%)
Annotation
HeliconiusHMEL0127300.089.51% 
BombyxBGIBMGA000542-TA0.085.97% 
Drosophilabor-PC0.067.39% 
EBI UniRef50UniRef50_B4GFN90.062.84%GL22252 n=2 Tax=Drosophila RepID=B4GFN9_DROPE
NCBI RefSeqXP_974479.10.069.20%PREDICTED: similar to ATPase family AAA domain-containing protein 3 [Tribolium castaneum]
NCBI nr blastpgi|910838950.069.20%PREDICTED: similar to ATPase family AAA domain-containing protein 3 [Tribolium castaneum]
NCBI nr blastxgi|1700432940.072.57%ATPase family AAA domain-containing protein 3 [Culex quinquefasciatus]
Group
Gene OntologyGO:00055241.6e-23ATP binding
GO:00001663.5e-12nucleotide binding
GO:00171113.5e-12nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain[19-281] IPR0219119.1e-96ATPase family AAA domain-containing protein 3, domain of unknown function DUF3523
[347-473] IPR0039591.6e-23ATPase, AAA-type, core
[343-476] IPR0035933.5e-12ATPase, AAA+ type, core
Orthology groupMCL11263 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200891-TA
ATGTCTTGGTTATTTGGATATAGTCGGCCTCAGCAGCCCCCTTCTGATGAACCTCCAGCTAATGGAGGAGAGCCTGCGGCGGCTGCTCCAGTTAACCTTAGTAAAGCCGAGAAGAAGGCAATGGAAGCCTACAGATTTGATTCCAGTGCTTTGGAAAGAGCTGCTCAAGCTGCCCGAGAATTAGAAAGATCAAGACATGCAAAAGATGCTTTGGAGATAAGTAAGTTACAAGAGACAACTCGGCAACAAGAGCAAATGGCTAAGATCAAAGAATATGAAGCTGCTATTGAACAAGCCAAAGTTGAGCAAAAGAGGGTTGATTATGAAGAAAGACGTAAGACATTACAGGAAGAAACAAAGCAACACCAAATGAGAGCCCAGTATCAAGATCAGCTGGCAAAAACCCGTTATGAAGAGCAATTGCTACAACATCAAAAGTCTCAAGATGAAATATTAAGAAAACAAGAAGAGAGTGTTGCTAAGCAGGAAGCATTAAGACGAGCCACCATCGAGCATGAAATGGAGTTACGTGAGAAAAATAAGTTGAAGGCAATAGAAGCTGAAGCGAGAGCTAAAGCCAAGGCAGACCGAGAGAATAGAGATATAACTCTCGAACAAATAAAACTGAAAGCTGCAGAAAATAGAACCACTATATTGGAGAGTATACAAACAGCAGGTAGTGTGATTGGGACGGGTTTAAATGCACTCGTCACAGACTGGGACAAGACCCTCGCTGCTGCTGGTGGTTTGTCTCTACTTGCCTTAGGTGTCTACTCCGCTAAAGGTGCTACATCTGTAGCTGCTAGATTCCTTGAGGCTAGGATTGGTAAACCCACGTTAGTCAATGAGACATCTAGATTTTCTCTCCTTGAAGCTGTTAAGCATCCAATCCTCACAATATCGCGTGCTGTATCAAATTTCAAGAAACCCACCGACGCTTTGGGTGGTGTAGTATTAGCTCCAAATTTAGAAAGACGCCTAAGAGATATAGCAATAGCCACTAAGAACACTAGGATGAATAAAGGTTTCTATAGAAATCTGCTCATGTATGGACCTCCCGGTACTGGAAAAACATTGTTCTCAAAGAAATTGGCTAAACATTCTGGTATGGAGTATGCTATAATGACGGGTGGTGATGTAGCCCCTATGGGCAAGCATGCTGTTGCAGCTATACATAAAATGTTCGACTGGGCCAACACCAGCCGTAAAGGCGTGTTGCTCTTCATTGATGAAGCGGACGCATTCCTCCGCAAACGGTCTTCGGAACATATTAGTGAGGACCTTCGTGCTGCTCTTAATGCATTCCTCTATAGAACCTCAGATCAAAGTAGCCGTATTATGCTTGTGTTGGCATCAAACACTCCACAACAGTTTGATTCTGCCATAAACGACCGTCTGGATAAGATGATCGAGTTTGGTCTACCAGGACTGGAGGAGAGGGAACGCTTGATCAGATTGTACTTTGATAAGTTCGTGCTTCAACCTGCTTCACAGGGTAAAAGACGTCTCAATGTGGATCAGTTTGACTACAGTCTTCTCTGTACTAAGCTAGCCGAACGTACAGCGGGTATGTCTGGCCGAGCACTCTCTAAGTTAGGGGTGGCTTGGCAGGCGGCTGCGTATGCTTCTGACGATGGAAGACTAACTGAACAGATGTGTATTGACATCTGCGATGATGCTGTGCAGGACCACAGGCAGAAGATGGAATGGTTGTCTGTTGAAGAGAAGTCAAGGACGATGATACCCTATTTATTAGACTTGCCGCCCTTAGACAAAGTAACAGATGCCAAAAAGGCTACCATCACTGAACTTCCAGAGACCAAAGAGGAACGCAAAACCAAAAAGAAAACAAAAGAAGTGGAATTAGAAAAGTGA

Protein sequence:

>DPOGS200891-PA
MSWLFGYSRPQQPPSDEPPANGGEPAAAAPVNLSKAEKKAMEAYRFDSSALERAAQAARELERSRHAKDALEISKLQETTRQQEQMAKIKEYEAAIEQAKVEQKRVDYEERRKTLQEETKQHQMRAQYQDQLAKTRYEEQLLQHQKSQDEILRKQEESVAKQEALRRATIEHEMELREKNKLKAIEAEARAKAKADRENRDITLEQIKLKAAENRTTILESIQTAGSVIGTGLNALVTDWDKTLAAAGGLSLLALGVYSAKGATSVAARFLEARIGKPTLVNETSRFSLLEAVKHPILTISRAVSNFKKPTDALGGVVLAPNLERRLRDIAIATKNTRMNKGFYRNLLMYGPPGTGKTLFSKKLAKHSGMEYAIMTGGDVAPMGKHAVAAIHKMFDWANTSRKGVLLFIDEADAFLRKRSSEHISEDLRAALNAFLYRTSDQSSRIMLVLASNTPQQFDSAINDRLDKMIEFGLPGLEERERLIRLYFDKFVLQPASQGKRRLNVDQFDYSLLCTKLAERTAGMSGRALSKLGVAWQAAAYASDDGRLTEQMCIDICDDAVQDHRQKMEWLSVEEKSRTMIPYLLDLPPLDKVTDAKKATITELPETKEERKTKKKTKEVELEK-