Monarch geneset OGS2.0

DPOGS207341
TranscriptDPOGS207341-TA1593 bp
ProteinDPOGS207341-PA530 aa
Genomic positionDPSCF300188 + 139673-154477
RNAseq coverage1872x (Rank: top 7%)
Annotation
HeliconiusHMEL0022055e-10896.43% 
BombyxBGIBMGA010271-TA0.091.69% 
Drosophilakatanin-60-PA0.059.36% 
EBI UniRef50UniRef50_F4WF410.064.73%Katanin p60 ATPase-containing subunit A-like 1 n=15 Tax=Metazoa RepID=F4WF41_ACREC
NCBI RefSeqXP_002423714.10.067.34%Katanin p60 ATPase-containing subunit, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420057280.067.34%Katanin p60 ATPase-containing subunit, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420057280.067.84%Katanin p60 ATPase-containing subunit, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055248.9e-40ATP binding
GO:00001663.4e-20nucleotide binding
GO:00171113.4e-20nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain[286-420] IPR0039598.9e-40ATPase, AAA-type, core
[282-422] IPR0035933.4e-20ATPase, AAA+ type, core
[484-528] IPR0154154.6e-07Vps4 oligomerisation, C-terminal
Orthology groupMCL11334 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207341-TA
ATGGCAGTTTCTGTTGGCGAAATTTGCGAGAATACTAAGTTGGCTAGAGAAATGGCATTAATGGGGAACTATGAATCTGCTCTTGTATACTACGAAGGAACCGTTCAGATGATCCACAGACTTCTCATAACTATAGCTGACCCCACACGAAAGTCGAAGTGGCAGTTGGTCCAGAAGCAGATGGCCAGGGAATACGAACAACTGAAAGCGACGGTAGCAACCTTACAGATGTTCCAGCATGAGGGTGAAAAAGCGATCACGCCATTAACCAGCACGTTAGAAGACCTGCCCACCCGCGACCAGATGTGGGCACCGGCGCCTCACGAGATAGACCCCGACATCTGGCCGCCCCCGCCGGACAGAGATCCCGCCTGGCCATCGCCCACCAGTGTTGAACACAAAGGCCCACCGACTATGAAATCAGCGAGGAACAATCCAAGGAACGCCAGGACGAATGATAAGAAGACGCCGGCTGGACGGGTGGCGACCACATCACACAGGAAGACCTCGGACGTCAGGAACCCCAAATTAAACACCAATAAAACACACAGCGCCAAGACGAAAGAGCAATCGAACAAGGATCACAGTACAAAGGACAAGCAGGATAGGGACAATAACAACGGAGACACGGACGAAGAGAAACACAAGGAGGACGAGAGAAGGTTCGAGCCGCCGTCAGCTGCCGACGGGGATCTGGTGGATATGCTGGAACGTGATATAGTACAAAAGAATCCCAACATCCGATGGGATGACATTGCTGACTTGGCCGAGGCCAAGAGGCTGTTGGAGGAGGCTGTGGTCCTACCAATGTGGATGCCGGACTTCTTTAAGGGTATCCGTCGTCCGTGGAAGGGTGTTCTGATGGTGGGTCCGCCTGGCACGGGGAAGACGATGCTCGCTAAGGCCGTGGCCACGGAGTGTGGAACGACATTCTTCAACGTGTCCAGCTCGACGCTCACATCCAAATACCGGGGGGAATCCGAGAAGTTGGTGCGACTACTGTTCGAAATGGCTCGTTTCTACGCTCCCAGTACTATCTTCATAGACGAGATCGATTCCCTGTGTTCCCGCCGCGGGTCAGACAGTGAACACGAGGCCTCCAGGCGAGTGAAGTCAGAACTGTTGGTGCAGATGGATGGCCTCGGCTCAGCCACGGACGAGCCGGCTAAGGTGGTGATGGTGCTAGCTGCTACAAACTTCCCTTGGGACATTGACGAAGCGTTACGGCGAAGGCTTGAAAAACGAATCTACATACCGCTGCCAACGCAAGAGGGTCGAGAGGCGTTACTGCAGATTAACCTTAGAGAGGTCAAAGTTGACCCCGAGGTAGACCTGCGACTTATAGCAAAAAAACTTGACGGATACTCTGGAGCTGATATAACTAATGTCTGCAGAGATGCCTCAATGATGTCCATGCGACGGAAAATAGCTGGCTTGAAGCCGGAGCAGATAAAACAACTTGCCAAGGAAGAACTAGACCTGCCCGTTACGAGGCAGGATTTCTTAGAAGCACTGTCGAAATGCAATAAATCAGTGTCCAAAGGTGACATACAGAAATACCTCACCTGGATGGCAGAGTTTGGATCCTCCTGA

Protein sequence:

>DPOGS207341-PA
MAVSVGEICENTKLAREMALMGNYESALVYYEGTVQMIHRLLITIADPTRKSKWQLVQKQMAREYEQLKATVATLQMFQHEGEKAITPLTSTLEDLPTRDQMWAPAPHEIDPDIWPPPPDRDPAWPSPTSVEHKGPPTMKSARNNPRNARTNDKKTPAGRVATTSHRKTSDVRNPKLNTNKTHSAKTKEQSNKDHSTKDKQDRDNNNGDTDEEKHKEDERRFEPPSAADGDLVDMLERDIVQKNPNIRWDDIADLAEAKRLLEEAVVLPMWMPDFFKGIRRPWKGVLMVGPPGTGKTMLAKAVATECGTTFFNVSSSTLTSKYRGESEKLVRLLFEMARFYAPSTIFIDEIDSLCSRRGSDSEHEASRRVKSELLVQMDGLGSATDEPAKVVMVLAATNFPWDIDEALRRRLEKRIYIPLPTQEGREALLQINLREVKVDPEVDLRLIAKKLDGYSGADITNVCRDASMMSMRRKIAGLKPEQIKQLAKEELDLPVTRQDFLEALSKCNKSVSKGDIQKYLTWMAEFGSS-