Monarch geneset OGS2.0

DPOGS208959
TranscriptDPOGS208959-TA2454 bp
ProteinDPOGS208959-PA817 aa
Genomic positionDPSCF300009 + 787445-796257
RNAseq coverage1615x (Rank: top 8%)
Annotation
HeliconiusHMEL0157760.088.64% 
BombyxBGIBMGA002432-TA0.086.83% 
DrosophilaVha100-2-PB0.069.57% 
EBI UniRef50UniRef50_Q9VE750.069.57%Vha100-2, isoform A n=19 Tax=Coelomata RepID=Q9VE75_DROME
NCBI RefSeqXP_001657344.10.071.98%vacuolar proton atpases [Aedes aegypti]
NCBI nr blastpgi|3479662050.072.49%AGAP001587-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479662050.072.49%AGAP001587-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00159910ATP hydrolysis coupled proton transport
GO:00331770proton-transporting two-sector ATPase complex, proton-transporting domain
GO:00150780hydrogen ion transmembrane transporter activity
KEGG pathwayaag:AaeL_AAEL0140530.0 
 K02154 (ATPeVI, ATP6N1A)maps-> Collecting duct acid secretion
    Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[2-818] IPR0024900ATPase, V0/A0 complex, 116kDa subunit
Orthology groupMCL10092 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208959-TA
ATGGGGGCCATGTTCAGGAGTGAAGAGATGGCGCTCTGCCAGCTCTTCATTCAACCAGAAGCGGCGTATACCTCCGTCTCGGAGCTGGGAGAAGCAGGTTCTGTGCAGTTCAGAGACCTGAATCCAGACGTGAACGCTTTCCAACGTAAGTTTGTTAATGAGGTCCGTCGCTGCGATGAGATGGAACGAAAATTGAGGTACATAGAGGTTGAGGTACATAAGGACAAAGTTAATGTACCCGCTGTCAAAGACATGCCGAGAGCGCCTAACCCACGTGAGATCATCGATCTGGAGGCGCATCTCGAGAAAACGGAGAATGAAATTCTGGAGTTGTCACATAACGCGATCAACCTCAAACAAAACTATCTAGAGCTGACGGAATTAAAACACGTGCTGGAAAAAACAGAAGCATTCTTCGCCGCTCAAGAAGAAATCGGAATGGACTCGCTCACCAAGTCACTCATCTCTGACGAGGCCGGTCAGCAGGCGGCGACTCGCGGTCGTCTTGGGTTCGTAGCTGGCGTGGTCCAACGCGAACGCGTTCCCGCCTTCGAACGAATGTTGTGGAGAATCTCGAGAGGAAACGTATTCTTGAGACGAGCCGAACTTGACAAGCCCTTGGAGGACCCTAATACAGGCAACGAGATCTATAAAACGGTTTTCGTGGCGTTCTTCCAAGGCGAGCAGCTCAAGTCCCGCATCAAGAAAGTGTGCACCGGCTTCCATGCCTCCCTTTATCCTTGCCCGCCTTCTAACACCGAACGACTTGATATGGTCAAGGGGGTCAGAACTCGACTTGAAGACCTTAATATGGTGCTTAACCAAACCCAAGACCATAGACAACGTGTGTTGGTCAGCGTGGCAAAGGAATTGGGCAGTTGGTCGATAATGGTCCGCAAGATGAAAGCCATCTACCACACCCTCAACTTGTTCAACATGGATGTCACCAACAAATGTCTCATTGGCGAATGCTGGGTGCCAACAGCTGATCTACCAAACGTGCAAAAAGCCCTCGTTGACGGTTCCAGTGATGAAGTGCCTCCAACCTTCAACCGCACCAACAAATTCACTCGCGGATTCCAGACTCTCATCGACGCCTATGGAGTCGCCTCCTACAGGGAATGTAATCCAGCGCTGTACACCATCATCACTTTCCCGTTCCTGTTCGCGGTGATGTTTGGAGACCTGGGTCACGGCCTCATCATGGCTCTCTTCGGCCTCTGGATGGTTGTCAAGGAAGTGTCCCTCGCCGCAAAGAAATCCAACAACGAAATCTGGAACATTTTCTTCGCCGGTCGCTACATCATACTTCTCATGGGCTGCTTCTCTATGTACACCGGCTTGGTTTACAACGACATATTCTCGAAATCCATGAATATCTTCGGATCCGCTTGGTTCAATCCGTACGATAATCAGACGCTTGAAAGGTTTGAAGCTTTCACATTGGACCCTAAGGCTTCTTACGTAGACAAACCATATTTCTTTGGTATTGATCCTATCTGGCAGACTGCTGAGAATAAGATTATCTTCCTTAACTCTTACAAAATGAAACTGTCCATAATATTCGGCGTCATTCACATGATCTTCGGCGTTTGCATGAGCGTCGTCAACTACAACTTCTTCAAGCGCCGCTACTCAATCTTCCTGGAGTTTCTTCCACAAATCATTTTCCTGTTTCTCCTCTTCGCTTACATGGTATTCATGATGTTCTACAAGTGGGTGGCCTACAGCACCTTAGCTACAGATGAGGCGTATACCCAGGGTTGTGCGCCATCAGTGCTGATTCTCTTCATCAACATGATGCTGTTCTCGAGTACGGAACCCGAAGGCGGCTGCAAGGAGTACATGTTCGAGGGTCAGGAAACTCTACAGCGCGCGTTCGTTCTCGTGGCGCTTTGTTGCATACCAGTCATGTTGTTGGGCAAACCGTTGTACTTGTTGTGTGCCGCCAAAAAGAAGCATGACAAGCCGCAATCGAACGGTAGCGTGAACCAGGGCATCGAAATGCAAGAACAGACTGATATAGAGCAAGCCCCGAAGCCCGCGGCCGGCGGACACGACCATGATGATGAACCGTTCAGCGAAATCATGATCCATCAAGGAATACACACCATCGAATATGTTCTCAGTACAATCTCCCACACAGCTTCCTACCTACGACTATGGGCGTTGTCCCTCGCCCACGCTGAGTTATCTGAGGTGCTATGGAACATGGTGCTCCAACTCGGTCTCAAGGACCACAACTGGGTCGGTAGCATCAAATTGTACGTGGCCTTCATGTTCTGGTCTCTCTTCACACTGGCGATCCTCGTCATGATGGAGGGACTTTCAGCTTTCTTGCACACGCTGCGTTTGCATTGGGTGGAATTCATGAGCAAATTCTACGCTGGTTTGGGATACATCTTCCAACCGTTCTGCTTCAAGACGATCCTCGAACAAGAGGATGAAGATTAA

Protein sequence:

>DPOGS208959-PA
MGAMFRSEEMALCQLFIQPEAAYTSVSELGEAGSVQFRDLNPDVNAFQRKFVNEVRRCDEMERKLRYIEVEVHKDKVNVPAVKDMPRAPNPREIIDLEAHLEKTENEILELSHNAINLKQNYLELTELKHVLEKTEAFFAAQEEIGMDSLTKSLISDEAGQQAATRGRLGFVAGVVQRERVPAFERMLWRISRGNVFLRRAELDKPLEDPNTGNEIYKTVFVAFFQGEQLKSRIKKVCTGFHASLYPCPPSNTERLDMVKGVRTRLEDLNMVLNQTQDHRQRVLVSVAKELGSWSIMVRKMKAIYHTLNLFNMDVTNKCLIGECWVPTADLPNVQKALVDGSSDEVPPTFNRTNKFTRGFQTLIDAYGVASYRECNPALYTIITFPFLFAVMFGDLGHGLIMALFGLWMVVKEVSLAAKKSNNEIWNIFFAGRYIILLMGCFSMYTGLVYNDIFSKSMNIFGSAWFNPYDNQTLERFEAFTLDPKASYVDKPYFFGIDPIWQTAENKIIFLNSYKMKLSIIFGVIHMIFGVCMSVVNYNFFKRRYSIFLEFLPQIIFLFLLFAYMVFMMFYKWVAYSTLATDEAYTQGCAPSVLILFINMMLFSSTEPEGGCKEYMFEGQETLQRAFVLVALCCIPVMLLGKPLYLLCAAKKKHDKPQSNGSVNQGIEMQEQTDIEQAPKPAAGGHDHDDEPFSEIMIHQGIHTIEYVLSTISHTASYLRLWALSLAHAELSEVLWNMVLQLGLKDHNWVGSIKLYVAFMFWSLFTLAILVMMEGLSAFLHTLRLHWVEFMSKFYAGLGYIFQPFCFKTILEQEDED-