Monarch geneset OGS2.0

DPOGS215153
Transcript        DPOGS215153-TA   1356 bp
Protein           DPOGS215153-PA   451 aa
Genomic position  DPSCF300348 - 16109-24673
RNAseq coverage   500x (Rank: top 25%)
Annotation
Heliconius       HMEL017374        2e-91   61.06%
Bombyx           BGIBMGA013964-TA  2e-175  85.39%
Drosophila       Vha44-PF          4e-138  75.48%
EBI UniRef50     UniRef50_E0VAJ3   1e-168  63.64%  Vacuolar ATP synthase subunit C, putative n=1 Tax=Pediculus humanus corporis RepID=E0VAJ3_PEDHC
NCBI RefSeq      XP_395359.3       5e-179  68.75%  PREDICTED: similar to Vacuolar H+ ATPase 44kD C subunit CG8048-PC, isoform C [Apis mellifera]
NCBI nr blastp   gi|270010252      0.0     73.23%  hypothetical protein TcasGA2_TC009631 [Tribolium castaneum]
NCBI nr blastx   gi|270010252      0.0     73.29%  hypothetical protein TcasGA2_TC009631 [Tribolium castaneum]
Group
Gene Ontology    GO:0015991  6.3e-253  ATP hydrolysis coupled proton transport
                 GO:0016820  6.3e-253  hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances
                 GO:0033180  6.3e-253  proton-transporting V-type ATPase, V1 domain
KEGG pathway     ame:411892  1e-178
                 K02148 (ATPeVC, ATP6C) maps to:
                     Collecting duct acid secretion
                     Oxidative phosphorylation
                     Phagosome
                     Vibrio cholerae infection
                     Epithelial cell signaling in Helicobacter pylori infection
InterPro domain  [1-451]  IPR004907  6.3e-253  ATPase, V1 complex, subunit C
Orthology group  MCL12654  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215153-TA
ATGAGTGAATACTGGTTAATTAGTGCCCCTGGCGACAAAACCTGCCAACAAACATGGGATACTTTGAATAATGCCACCAAATCTGGCAGCCTCAGTGTGAACTACAAATTCCCTATACCCGACCTCAAGGTGGGTACATTGGATCAACTCGTTGGTTTGTCGGATGATCTCGGCAAACTTGATACTTTTGTTGAGGGTGTTACAAGGAAGGTGGCGCAATATCTCGGAGAGGTACTTGAAGATCAACGCGACAAACTTCACGAGAATCTGACGGCAAACAATAGCATCGACTCGTATTCCGACCCGGGGGGAGGCGAGCAGCCGTCTACTCCGTCGCCGGTTTGCGACTCGGCTTTCTCTGAGCATCACGGATGTTGGCCTTGCGAGCATCATGATAATGGTGCGCCGCCAACGCCGTCTAGCGGGCGCAGCTCACCGCGCTGGGATGATGATGAGGATGAGGCGCATCACATACTCACGCAAGGCGACCTGCCCACTTATTTGACACGTTTTCAATGGGATATGGCCAAGTACCCCATAAAGCAAAGTCTGCGCAACATCGCCGATATAATCAGTAAACAGGTAGGACAAATCGACGCTGATTTGAAGATGAAGTCAGCCGCGTACAACTCGCTGAAGGGCAACTTGCAGAGCTTGGAGAAGAAACAGACCGGCAGCCTGTTGACCCGTAACCTGGCTGACCTCGTCAAGAGGGAACACTTCATCCTAGACAGCGAATATCTGACCACTCTCCTCGTCATTGTCCCCAAGGCGATGTTTAACGACTGGAACGCTAACTACGAGAAGATAACCGACATGATAGTGCCGCGCTCCACCCAGCTCGTGCATCAAGACAACGACTACGGCCTCTTCACCGTGACTCTGTTCCGTAAGGTCGTCGACGAGTTCAAGCTGCACGCTCGCGAGCGCAAGTTCATCGTGCGCGAGTTCTCATACAACGAGGCGGACCTCGCCGCCGGCAAGAACGAGATCACCAGGCTCGTCACCGACAAGAAGAAGCAGTTCGGCCCACTGGTGAGATGGTTGAAAGTGAACTTCTCTGAATGTTTCTGTGCCTGGATCCACGTGAAGGCTCTCCGGGTGTTCGTGGAGTCCGTTCTGAGATACGGCCTGCCGGTGAACTTCCTAGCGGTGGTGATGGTCCCAGCTCGCAAGAGTATGAAGAAGCTGCGCGACGTCCTGCAGCACCTGTACGCGCACCTCGACCACTCCGCACAACAGCACGGACACGCTGCACAGGATAACGCTGAGTTGGCTGGTCTAGGCTTCGGTCAATCGGATTACTTCCCGTATGTTTTCTACAAAATCAACATTGACATGGTCGAGAAAGCTTAG

Protein sequence:

>DPOGS215153-PA
MSEYWLISAPGDKTCQQTWDTLNNATKSGSLSVNYKFPIPDLKVGTLDQLVGLSDDLGKLDTFVEGVTRKVAQYLGEVLEDQRDKLHENLTANNSIDSYSDPGGGEQPSTPSPVCDSAFSEHHGCWPCEHHDNGAPPTPSSGRSSPRWDDDEDEAHHILTQGDLPTYLTRFQWDMAKYPIKQSLRNIADIISKQVGQIDADLKMKSAAYNSLKGNLQSLEKKQTGSLLTRNLADLVKREHFILDSEYLTTLLVIVPKAMFNDWNANYEKITDMIVPRSTQLVHQDNDYGLFTVTLFRKVVDEFKLHARERKFIVREFSYNEADLAAGKNEITRLVTDKKKQFGPLVRWLKVNFSECFCAWIHVKALRVFVESVLRYGLPVNFLAVVMVPARKSMKKLRDVLQHLYAHLDHSAQQHGHAAQDNAELAGLGFGQSDYFPYVFYKINIDMVEKA-
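
Consistency check (illustrative sketch): the record lists a 1356 bp transcript and a 451 aa protein, which agree if the transcript is a pure coding sequence ending in one stop codon, since 3 x (451 + 1) = 1356. The Python below is a minimal sketch of that check, not part of the database record. The function names are hypothetical, and the codon table is deliberately partial, covering only the first five and last four codons of DPOGS215153-TA as printed above. The stop codon is rendered as "-" to match the trailing "-" in the protein sequence.

```python
# Partial codon table: only the codons needed for the two short
# fragments below (standard genetic code; "-" marks a stop codon).
CODONS = {
    "ATG": "M", "AGT": "S", "GAA": "E", "TAC": "Y", "TGG": "W",
    "GAG": "E", "AAA": "K", "GCT": "A", "TAG": "-",
}


def expected_cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def translate_fragment(nt: str) -> str:
    """Translate an in-frame fragment codon by codon; trailing partial codons are ignored."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Length reported for the transcript versus length implied by the protein.
    print(expected_cds_length(451))               # 1356
    # First 15 nt and last 12 nt of DPOGS215153-TA, copied from the record.
    print(translate_fragment("ATGAGTGAATACTGG"))  # MSEYW
    print(translate_fragment("GAGAAAGCTTAG"))     # EKA-
```

The two translated fragments match the start (MSEYW) and end (EKA-) of DPOGS215153-PA. Translating the full sequence would need a complete codon table, for example from a library such as Biopython.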