Monarch geneset OGS2.0

DPOGS201010
Transcript:        DPOGS201010-TA   1077 bp
Protein:           DPOGS201010-PA   358 aa
Genomic position:  DPSCF300147 + 166916-170952
RNAseq coverage:   469x (Rank: top 26%)
Annotation
Source           Hit                      E-value   Identity   Description
Heliconius       HMEL013788               7e-65     85.07%
Bombyx           BGIBMGA009099-TA         1e-160    75.93%
Drosophila       nmd-PA                   4e-115    55.88%
EBI UniRef50     UniRef50_UPI00022C8E32   6e-128    60.92%     UPI00022C8E32 related cluster n=5 Tax=unknown RepID=UPI00022C8E32
NCBI RefSeq      XP_975024.1              1e-131    63.27%     PREDICTED: similar to no mitochondrial derivative CG5395-PA [Tribolium castaneum]
NCBI nr blastp   gi|91089723              2e-130    63.27%     PREDICTED: similar to no mitochondrial derivative CG5395-PA [Tribolium castaneum]
NCBI nr blastx   gi|91089723              4e-128    63.27%     PREDICTED: similar to no mitochondrial derivative CG5395-PA [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005524   3.6e-33   ATP binding
  GO:0000166   3.9e-15   nucleotide binding
  GO:0017111   3.9e-15   nucleoside-triphosphatase activity
KEGG pathway:
  pic:PICST_33873   1e-69
  K01509 (E3.6.1.3) maps-> Purine metabolism
InterPro domain:
  [112-241]   IPR003959   3.6e-33   ATPase, AAA-type, core
  [108-246]   IPR003593   3.9e-15   ATPase, AAA+ type, core
Orthology group:  MCL12514   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201010-TA
ATGGTGGAAGGTTCAGCATTTACGCGGAACGATGTGTTCCAAATGGCTATTCGTGTTGCTTTTGTATCCGCGGTTACTTATTTCTCTATAAAATGGCTCGTCAATCAAATAGATCCGACTTCTAAGAGTCGAAAGAAAGCTGAAGAAAGAGCGCGGGAACAGTTACGCAAAATTGTAGGATTGAAATGTTTTTTTTTGTTTGTAAACAATGTTAATTGGAAGGATATAGCGGGTCTAGATCATCTCATCAATGAACTCCGTGAGACTGTTATTCTGCCGATACAGAAACGGGAGCTGTTTGCCGACAGTCGACTCACACAGCCACCTAAAGGTGTACTGTTGCATGGGCCACCCGGTTGCGGTAAAACTCTGATAGCCAAAGCTACGGCCAAGGAAGCCAACATGAGCTTCATAAACCTGGACGTGTCGCTGTTGACTGACAAATGGTACGGAGAAACACAGAAGCTGGCCGCCGCCGTGTTCAGCTTGGCCGTTAAATTACAACCTTGTATAGTTTTCATCGATGAGATTGAATCCTTTCTCCGGACCCGCACGGCTCATGACCATGAGGCCACAGCCATGATGAAGACACAGTTCATGTCGCTGTGGGACGGCCTGATCACCGACAACACGTGTAACGTTATTATCATGGGCGCTACGAACCGTCCCCAGGACTTGGACAAGGCGATCCAGCGTCGTATGCCGGCCACCTTCCATGTGCCGATGCCGAATCTCCAGCAGAGAGAGCACATCCTCCAGCTGATACTCAAATCAGAGCCCACAGCTGATGATATCGACTACGCCCGTCTAGCCTCGAGCACAGATGGATTCTCAGGCTCCGATCTTCACGAGCTCTGTCGCCAGGCGGCCGTCTACAGAGTTAGAGATCTGGCCAGGGAGGAGTTACAGAGGGAACAGTCAAAAACCAACAACACAAACTCAGATTCTGACGAGGAGTACTGTGATGCTGTCAGACCCATCACGATGGAGGATTTAAGGATGTCGCTTAGCAAGCTCAAGGAATCCAAGATACAGTGCGGATCACTGGCTCCCGGGATGAGAATTGAACTCGACTAG

Protein sequence:

>DPOGS201010-PA
MVEGSAFTRNDVFQMAIRVAFVSAVTYFSIKWLVNQIDPTSKSRKKAEERAREQLRKIVGLKCFFLFVNNVNWKDIAGLDHLINELRETVILPIQKRELFADSRLTQPPKGVLLHGPPGCGKTLIAKATAKEANMSFINLDVSLLTDKWYGETQKLAAAVFSLAVKLQPCIVFIDEIESFLRTRTAHDHEATAMMKTQFMSLWDGLITDNTCNVIIMGATNRPQDLDKAIQRRMPATFHVPMPNLQQREHILQLILKSEPTADDIDYARLASSTDGFSGSDLHELCRQAAVYRVRDLAREELQREQSKTNNTNSDSDEEYCDAVRPITMEDLRMSLSKLKESKIQCGSLAPGMRIELD-
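The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the transcript is a complete coding sequence translated with the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The function names translate and lengths_consistent are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Check that a CDS length equals 3 bases per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths reported in the record: 1077 bp transcript, 358 aa protein.
print(lengths_consistent(1077, 358))   # True
# First four and last three codons of DPOGS201010-TA.
print(translate("ATGGTGGAAGGT"))       # MVEG
print(translate("CTCGACTAG"))          # LD*
```

Passing the full DPOGS201010-TA sequence to translate and replacing the final "*" with "-" gives a string that can be compared directly with the DPOGS201010-PA entry.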