Monarch geneset OGS2.0

DPOGS214668
TranscriptDPOGS214668-TA1008 bp
ProteinDPOGS214668-PA335 aa
Genomic positionDPSCF300321 + 17542-19524
RNAseq coverage1696x (Rank: top 7%)
Annotation
HeliconiusHMEL0047704e-16488.82% 
BombyxBGIBMGA001943-TA3e-17992.24% 
DrosophilaCG1598-PA4e-14776.90% 
EBI UniRef50UniRef50_B0WEV53e-14979.82%ATPase ASNA1 homolog n=17 Tax=Metazoa RepID=ASNA_CULQU
NCBI RefSeqXP_002424214.11e-15180.62%Arsenical pump-driving ATPase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420067622e-15080.62%Arsenical pump-driving ATPase, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420067621e-15180.62%Arsenical pump-driving ATPase, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055241.7e-91ATP binding
GO:00466851.7e-91response to arsenic-containing substance
GO:00154461.7e-91arsenite-transporting ATPase activity
KEGG pathway 
InterPro domain[27-324] IPR0163001.7e-91Arsenical pump ATPase, ArsA
Orthology groupMCL12113 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214668-TA
ATGGATGAATCTAATGACTTTGAACCTCTAGAACCTTCATTAAAAAATGTAATAGAACAAAAGTCCCTCCGCTGGATATTTGTAGGAGGGAAAGGTGGCGTAGGAAAAACTACGTGTAGCTGCAGTTTGGCAGTCCAGTTATCAAAAGTTCGAGAGTCGGTTCTTATAATATCCACTGATCCGGCTCATAACATATCGGATGCCTTCGATCAGAAATTTTCTAAAGTGCCAACTAAGGTAAAAGGGTTTGATAACCTGTTTGCTATGGAGATAGATCCTAATGTAGGGTTAACAGAATTGCCCGAAGAATACTTTGAAGGCGAGACCGAGGCCATGAGACTTGGAAAAGGCGTGATGCAGGAGATCGTTGGAGCATTCCCCGGCATTGATGAAGCCATGAGCTATGCGGAGGTTATGAAGCTCGTCAAAGGTATGAACTTCAGTGCAGTCGTGTTTGACACAGCACCCACTGGGCACACATTGCGTTTGTTATCATTCCCTCAGGTGGTTGAAAAGGGTCTCGGTAAATTGATGCGACTAAAATCAAAGGTGGCTCCGTTCATCAATCAAGTGGCAACACTGTTTGGACTCGCTGAATTCAATTCGGACATGTTCAGCAACAAACTGGATGAGATGTTATCGGTCATAACACAAGTTAACACACAGTTCAAAGATCCGAATCAAACGACATTTGTCTGCGTGTGTATCGCTGAATTCCTCTCGTTGTATGAAACTGAAAGACTCGTCCAGGAACTAACGAGATGTGGAATTGATACTCATAATATAATCGTTAATCAGTTGCTCCTAAGAACATCAGCACCTTGTGAACTATGTGCAGCTCGACATAAAGTACAAGAGAAATACCTTGAACAAATAGCAGATTTATATGAAGATTTCCATGTAACAAAATTACCGTTGTTGGACAGGGAGGTACGCGGGGCGGCAGCTGTTCAGTCTTTTTCAGAACACTTACTGACACCATACGTTCCACCTGCTACCAGTTCTTAA

Protein sequence:

>DPOGS214668-PA
MDESNDFEPLEPSLKNVIEQKSLRWIFVGGKGGVGKTTCSCSLAVQLSKVRESVLIISTDPAHNISDAFDQKFSKVPTKVKGFDNLFAMEIDPNVGLTELPEEYFEGETEAMRLGKGVMQEIVGAFPGIDEAMSYAEVMKLVKGMNFSAVVFDTAPTGHTLRLLSFPQVVEKGLGKLMRLKSKVAPFINQVATLFGLAEFNSDMFSNKLDEMLSVITQVNTQFKDPNQTTFVCVCIAEFLSLYETERLVQELTRCGIDTHNIIVNQLLLRTSAPCELCAARHKVQEKYLEQIADLYEDFHVTKLPLLDREVRGAAAVQSFSEHLLTPYVPPATSS-