Monarch geneset OGS2.0

DPOGS209797
Transcript: DPOGS209797-TA (2523 bp)
Protein: DPOGS209797-PA (840 aa)
Genomic position: DPSCF300117 - 682554-699402
RNAseq coverage: 525x (Rank: top 24%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL004348        0.0      84.67%
Bombyx          BGIBMGA008019-TA  0.0      69.60%
Drosophila      Vha100-2-PB       0.0      57.09%
EBI UniRef50    UniRef50_Q9VE75   0.0      57.09%    Vha100-2, isoform A n=19 Tax=Coelomata RepID=Q9VE75_DROME
NCBI RefSeq     XP_002102943.1    0.0      58.11%    GD20170 [Drosophila simulans]
NCBI nr blastp  gi|6815279        0.0      84.09%    V-ATPase 110 kDa integral membrane subunit [Manduca sexta]
NCBI nr blastx  gi|6815279        0.0      84.29%    V-ATPase 110 kDa integral membrane subunit [Manduca sexta]
Group
Gene Ontology    GO:0015991  0  ATP hydrolysis coupled proton transport
                 GO:0033177  0  proton-transporting two-sector ATPase complex, proton-transporting domain
                 GO:0015078  0  hydrogen ion transmembrane transporter activity
KEGG pathway     dsi:Dsim_GD20170  0.0
                 K02154 (ATPeVI, ATP6N1A) maps->
                     Collecting duct acid secretion
                     Oxidative phosphorylation
                     Lysosome
                     Phagosome
                     Vibrio cholerae infection
                     Epithelial cell signaling in Helicobacter pylori infection
InterPro domain  [26-831] IPR002490  0  ATPase, V0/A0 complex, 116kDa subunit
Orthology group  MCL10092  Multiple-copy universal gene
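
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name summarize_lengths is hypothetical, and it assumes that the genomic coordinates are inclusive, that the transcript consists of coding sequence only, and that it ends in a single stop codon. Under those assumptions the transcript accounts for only part of the genomic span, and the remainder would presumably be intronic.

```python
def summarize_lengths(transcript_bp, protein_aa, span_start, span_end,
                      domain_start, domain_end):
    """Cross-check the length figures quoted in a gene record.

    Assumptions (not stated in the record itself): coordinates are
    inclusive, and the transcript is CDS only with one terminal stop codon.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3                   # includes the stop codon
    genomic_span = span_end - span_start + 1      # inclusive coordinates
    domain_len = domain_end - domain_start + 1    # InterPro match length
    return {
        "codons": codons,
        "cds_matches_protein": codons - 1 == protein_aa,
        "genomic_span_bp": genomic_span,
        "non_transcript_bp": genomic_span - transcript_bp,
        "domain_len_aa": domain_len,
        "domain_fraction": domain_len / protein_aa,
    }


# Values copied from the DPOGS209797 record.
summary = summarize_lengths(
    transcript_bp=2523, protein_aa=840,
    span_start=682554, span_end=699402,
    domain_start=26, domain_end=831,
)
for key, value in summary.items():
    print(f"{key}: {value}")
```

With the values from this record, 2523 bp corresponds to 841 codons, that is, 840 residues plus one stop codon, which agrees with the stated protein length.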

Nucleotide sequence:

>DPOGS209797-TA
ATGGAAGACGACACTGTGGAGGAAATGTACGAAGCAACAACGAATTCAATAGATGATCTTGGTCCAGGTCCTGACAGCACGATGTTTAGAAGCGAGGAAATGGTGCTATGCCAGCTTTTCGTTCAGCCAGAAGCAGCTTATGTATCCATGTACGAGCTCGGTGAAGCAGGAATAGCCCAGTTCAGAGACTTGAACCCTCACGTTAATGATTTCCAACGGCGTTATGTGTCGGAGGTGAGGCGATGCAGTGAAATGGAGAGGAAGTTGCGTTGGGTGAGCGGTGAACTACCAGAACCGCCACCACCTCCGAAACATTCACCAAGAGTATTGACGCCGAGAGAAATTAATATACTAGAGGAAAGGATTGACTACATAGAGTCGGAAATTCAAGAAATTACGAGAAACGCTCAAAATCTAAAGACAGATTATCTTGCATTGATAGAACTGAAACTATTAATTGAGAAGACGCAAACGTTTTTTCAAGATCACAGCGCTCATAGAAAGATATCCGCTTCGGTTCAAGTATATAATAACGAAGGGGCGATAGGCCACTTAGGATTTATAGCAGGTGTCGTGGCTACTCCCAGGGTTGCCTCGTTTGAGAGAATGCTATGGAGAATATCGCACGGGAATATTTTCTTTAAACAGGCACAAATTGATCAACCACTCAAAGATCCAGTAACAGGACATGAGCTGCAAAAGACAGTATTCGTTGTATTTTTTCACGGAGAACAAATTAAGTTAAGAGTGAAGAAAGTCTGTCACGGTTTCCAAGCAACGCTGTACCCATGTCCGGCTACTTACAAGGAACAACAGGAAATGATTGCAGGAATTGGATCCCGAATCAAAGATTTAGAGATGGTACTAGAACAGACGGAGCAGCACAGAAGGTTAGTGCTAGCCAATATAGGGCGAGATATAAGCACTTGGATGGTGGCAGTTCGGAAGGAGAAAGCGATTTACCACACCTTGAACATGTTCAGTATGGACATTGTGAAGAAATGCTTGATTGCTGAATGCTGGGTCCCTCGACAGGACCTGCACATCCTACAGAAAGCCTTGGACAATGGTGTGAAAGCAAGCGGCAGTCCGATTCCATCGATACTACATCATGTACCGACCAGAGAAGTCCCACCGACATTTAACAGAACCAATAAATTTACACGTGGCTTCCAAACTCTTATTGATTCCTATGGAATCGCTAGCTATAGGGAAGTTAATCCAGCGTTGTACACGATCATAACATTCCCGTTCCTTTTCGCCGTCATGTTCGGCGACATGGGCCACGGTCTCATTATAACAATATTCGCCGCAACGCTCGTCATAAACGAAAGAAACTTCGCCAAAAAGAAAACAGACAACGAAATATGGAACATATTTTTCGGTGGACGCTACATAATGCTGTTGATGGGAATATTCTCTATATACACCGGTCTGATATACAATGATTTGTTTTCGAAATCATTAAACGTATTCGGCAGCAGTTGGAAGAATGTTTACGATCTGGACACGCTGACGAACAGGAGTAATTTTGATTTGGACCCGGCTGTAGCTTACACACAGACACCGTACCCTCTCGGCTTAGATCCGGCTTGGCAGTTCGCAGCTAACAATATAATATTCCTAAACTCCTTCAAGATGAAACTGTCTATAATTTTTGGTGTCATCCATATGGCGTTCGGGGTAACATTGAGTGTGGTAAACTTCAACTTCTTTAAGAAAACTGAACTGATACTGCTACAGTATGTACCACAAATACTGTTTTTGCTTCTACTGTTCTGGTATCTCTGTATACTAATGTTCATAAAATGGTTCATGTATTCGGCGATAGCGACAGATCCAGCACTGGGCACATCCTGTGCTCCGTCAGTGTTAATCATCTTCATCAACATGATGCTCCTGAAGCCAGCAGAAACCGCTCCTCCTTGCCGGACCTTTATGTTCGACGGTCAAGACGCTATACAAAAAGCCTTCCTAGCCATAGCCTTTTTATGTGTGCC
AGTTATGCTCTTCGGAAAACCAGTTTATCAAATAATCGCTGCTAAGAAAAAAAAGCAATCCCAGCAAGGTGTCGAGAGTGGGGAGATTGAACCGAGCGAGGACGACGGCGGTCTCAGTGAAGTTCTCATCACTCAAGCGATTCACACCATAGAGTATGTGCTGGGAACCGTCTCACACACGGCCTCCTACCTACGGCTGTGGGCTCTGTCTTTGGCGCATGCGCAACTATCAGCGGTTCTGTGGCAGCGCGTTCTTCGCATGGGCCTTAGTGGTGGTTCTCCAGTCAATGCCATCATGTTGTACGTGATATTCGCGGTGTGGGCGTTCTTCACTCTCGCCATACTTGTTCTCATGGAAGGCCTATCGGCGTTCTTGCACACCCTTCGGTTACATTGGGTTGAGTTCATGAGCAAATTCTACGATGGGCAGGGATATTCGTTTTTCCCGTTCTCCTTTTCGGCCATTCTCGAAAATGATGAAGAGGAGGTTCCAGCCAAGCCGAACGGTAGACCACCTGAGTGA

Protein sequence:

>DPOGS209797-PA
MEDDTVEEMYEATTNSIDDLGPGPDSTMFRSEEMVLCQLFVQPEAAYVSMYELGEAGIAQFRDLNPHVNDFQRRYVSEVRRCSEMERKLRWVSGELPEPPPPPKHSPRVLTPREINILEERIDYIESEIQEITRNAQNLKTDYLALIELKLLIEKTQTFFQDHSAHRKISASVQVYNNEGAIGHLGFIAGVVATPRVASFERMLWRISHGNIFFKQAQIDQPLKDPVTGHELQKTVFVVFFHGEQIKLRVKKVCHGFQATLYPCPATYKEQQEMIAGIGSRIKDLEMVLEQTEQHRRLVLANIGRDISTWMVAVRKEKAIYHTLNMFSMDIVKKCLIAECWVPRQDLHILQKALDNGVKASGSPIPSILHHVPTREVPPTFNRTNKFTRGFQTLIDSYGIASYREVNPALYTIITFPFLFAVMFGDMGHGLIITIFAATLVINERNFAKKKTDNEIWNIFFGGRYIMLLMGIFSIYTGLIYNDLFSKSLNVFGSSWKNVYDLDTLTNRSNFDLDPAVAYTQTPYPLGLDPAWQFAANNIIFLNSFKMKLSIIFGVIHMAFGVTLSVVNFNFFKKTELILLQYVPQILFLLLLFWYLCILMFIKWFMYSAIATDPALGTSCAPSVLIIFINMMLLKPAETAPPCRTFMFDGQDAIQKAFLAIAFLCVPVMLFGKPVYQIIAAKKKKQSQQGVESGEIEPSEDDGGLSEVLITQAIHTIEYVLGTVSHTASYLRLWALSLAHAQLSAVLWQRVLRMGLSGGSPVNAIMLYVIFAVWAFFTLAILVLMEGLSAFLHTLRLHWVEFMSKFYDGQGYSFFPFSFSAILENDEEEVPAKPNGRPPE-
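
The relationship between the two sequences above can be checked by translating the nucleotide record with the standard genetic code. This is a minimal sketch, not the pipeline used to produce the geneset: the function name translate is hypothetical, and it assumes the standard code applies and that the trailing "-" in the protein record marks the stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; whitespace and line breaks are ignored.

    Stop codons are written as stop_symbol (assumed here to match the
    trailing "-" used in the protein record).
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven and last four codons of DPOGS209797-TA.
print(translate("ATGGAAGACGACACTGTGGAG"))
print(translate("CCACCTGAGTGA"))
```

Passing the full DPOGS209797-TA sequence (both lines joined) to translate and comparing the result with DPOGS209797-PA would confirm whether the two records agree residue by residue.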