Monarch geneset OGS2.0

DPOGS214624
TranscriptDPOGS214624-TA1749 bp
ProteinDPOGS214624-PA582 aa
Genomic positionDPSCF300050 + 275116-285341
RNAseq coverage364x (Rank: top 33%)
Annotation
HeliconiusHMEL0069720.084.26% 
BombyxBGIBMGA005075-TA0.089.43% 
Drosophilaspas-PA2e-18060.25% 
EBI UniRef50UniRef50_UPI00020643C50.062.33%UPI00020643C5 related cluster n=3 Tax=unknown RepID=UPI00020643C5
NCBI RefSeqXP_393080.30.061.38%PREDICTED: similar to spastin CG5977-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3287885550.062.33%PREDICTED: spastin [Apis mellifera]
NCBI nr blastxgi|3287885550.062.37%PREDICTED: spastin [Apis mellifera]
Group
Gene OntologyGO:00055243.7e-36ATP binding
GO:00001662.1e-17nucleotide binding
GO:00171112.1e-17nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain[342-471] IPR0039593.7e-36ATPase, AAA-type, core
[17-95] IPR0073306.1e-22MIT
[338-474] IPR0035932.1e-17ATPase, AAA+ type, core
Orthology groupMCL13515 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214624-TA
ATGTCGTATATAAACAATGTAGGCCCCGGGGATCCTCTTCTTGCCAAACAAAAACATCATCATCGAAAAGCTTTTGAGTACATATCAAAAGCGTTGAAGATTGACGAAGAAAATGAGGGACAAAAAGAATTAGCAATAGAACTGTACAAGAAAGGTATCTATGAATTGGAGCGAGGGATTGCAGTAGACTGCTGGGGGGGTCGGGGCGACGCCTGGCAGAGGGCCCAAAGGCTCCATGATAAGATGAAAACCAATCTAGGCATGGCCAAGGATCGTTTACATTTCCTCGCCAACCTAGTCGCCCTCAGTAAGTTGGGGGTAGAGAGTGAGCCTGAGAGAAGTGAAAAAAGACCTACGGAGTCTCCTCTTAAAGTGAGAAGGCCATTAGAGAAGTCCAAGACAACGCTACTAGCACACACAGAAAGTAACAGTGGTCAAACGAAGCCACCAAATGAAGGGTCCGTGGGACGCGATGAGTCGGACACATTAGTTTCCGATCGAGTTGTCACCACAATGGGTTCAGGTTACGACAGACGAGCTCCGTCCGCCGATAACACGTTATTCGCAGGTCGTAAGCTGACGACCGCGGGTCGGAGGGTACCCGGCGGGGGTCCTCTGATGAAATCTCAGACCCTGCCGCGATCCATGGGCAGGTCTTCGTCACAGCCCAACAGCTCCAATGGCTACACCAGATACCCTGTGAAACCAGCATCAACACCGCCTGCTGTAAAACGACAGCTGTCGGTACCAGTGAACGGGTCTCCTGTTCGGCGTGCTGCAGGAGGGGGCTCGCAGCGCGGGACGCCCACCAGAAGTAGAACCCCGCAACCCACACTCGCAGTTCGGGGCGTGGACCCGAAACTCGTCCAATTGATATTGGACGAGATCGTTGAGGGAGGCCCTAAGGTTCATTGGGAAGATATCGCTGGGCAGGAGGCAGCAAAACAAGCGCTACAGGAAATGGTAGTGCTGCCGTCGCTCCGACCGGAACTGTTCACTGGTCTGAGATCACCGGCACGAGGTCTGCTGTTATTCGGTCCCCCCGGTAATGGTAAGACGTTGCTGGCTCGATGCGTGGCGGCGGAGTGTTCCGCCACGTTTTTCTCGATATCGGCCGCGTCACTCACCAGCAAGTATGTGGGTGACGGGGAGAAGATGGTGAGGGCGCTGTTCCAGGTGGCCAGGGAACTACAGCCATCGATAATCTTCGTGGACGAAGTGGACTCGTTGCTTTGCGAGCGATCGACGGGCGAGCATGAGGCGTCCAGGAGATTGAAGACTGAGTTCTTGGTGGAATTCGACGGCCTGCCAGCCGCCGGCGCTGACAGGGTCATTGTGATGGCGGCCACCAACCGCCCACAAGAGCTGGACGAAGCTGCTCTCAGACGGTTCCCCAAGCGTGTGTACGTATCATTACCTGATAGCCGCACACGCGGGGCCCTGCTCCGGAGGGTGTTGACGAGGGGTGCTGCGGCGGCCGCGATCAGTGACGACGAGCTGGCGCGCCTCGCCGCCCTCACCGATGGCTACTCCGGCAGCGACCTCACCGCCCTCTGCCGGGACGCCGCTCTGGGACCCATACGGGAGTTAGACCCGGAGGAAGTGAAATGCTTGGACCTGTCGCTGGTTCGTAGCATCACGTTCCAGGACTTCATGGACGCTCTCAAGCGGATCCGACCTTCGGTGTCACCTCTCAGCCTCGTGGGCTACGAGAAGTGGTCCGTGCAGTACGGGGAACTGGGAGTGTGA

Protein sequence:

>DPOGS214624-PA
MSYINNVGPGDPLLAKQKHHHRKAFEYISKALKIDEENEGQKELAIELYKKGIYELERGIAVDCWGGRGDAWQRAQRLHDKMKTNLGMAKDRLHFLANLVALSKLGVESEPERSEKRPTESPLKVRRPLEKSKTTLLAHTESNSGQTKPPNEGSVGRDESDTLVSDRVVTTMGSGYDRRAPSADNTLFAGRKLTTAGRRVPGGGPLMKSQTLPRSMGRSSSQPNSSNGYTRYPVKPASTPPAVKRQLSVPVNGSPVRRAAGGGSQRGTPTRSRTPQPTLAVRGVDPKLVQLILDEIVEGGPKVHWEDIAGQEAAKQALQEMVVLPSLRPELFTGLRSPARGLLLFGPPGNGKTLLARCVAAECSATFFSISAASLTSKYVGDGEKMVRALFQVARELQPSIIFVDEVDSLLCERSTGEHEASRRLKTEFLVEFDGLPAAGADRVIVMAATNRPQELDEAALRRFPKRVYVSLPDSRTRGALLRRVLTRGAAAAAISDDELARLAALTDGYSGSDLTALCRDAALGPIRELDPEEVKCLDLSLVRSITFQDFMDALKRIRPSVSPLSLVGYEKWSVQYGELGV-