Monarch geneset OGS2.0

DPOGS200799
TranscriptDPOGS200799-TA1485 bp
ProteinDPOGS200799-PA494 aa
Genomic positionDPSCF300454 + 57266-63442
RNAseq coverage17365x (Rank: top 1%)
Annotation
HeliconiusHMEL0169540.078.95% 
BombyxBGIBMGA002241-TA0.097.77% 
DrosophilaVha55-PC0.097.14% 
EBI UniRef50UniRef50_P212810.090.23%V-type proton ATPase subunit B, brain isoform n=187 Tax=root RepID=VATB2_HUMAN
NCBI RefSeqXP_002020318.10.097.35%GL13570 [Drosophila persimilis]
NCBI nr blastpgi|4013270.098.18%H(+)-transporting ATPase [Manduca sexta]
NCBI nr blastxgi|4013270.098.18%H(+)-transporting ATPase [Manduca sexta]
Group
Gene OntologyGO:00159914.7e-300ATP hydrolysis coupled proton transport
GO:00168204.7e-300hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances
GO:00331804.7e-300proton-transporting V-type ATPase, V1 domain
GO:00055243e-61ATP binding
GO:00331781.8e-15proton-transporting two-sector ATPase complex, catalytic domain
GO:00460345.1e-12ATP metabolic process
GO:00164695.1e-12proton-transporting two-sector ATPase complex
GO:00159925.1e-12proton transport
GO:00469335.1e-12hydrogen ion transporting ATP synthase activity, rotational mechanism
GO:00469615.1e-12proton-transporting ATPase activity, rotational mechanism
KEGG pathwaydpe:Dper_GL135700.0 
 K02147 (ATPeVB, ATP6B1)maps-> Collecting duct acid secretion
    Oxidative phosphorylation
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[29-493] IPR0057234.7e-300ATPase, V1 complex, subunit B
[155-383] IPR0001943e-61ATPase, F1/V1/A1 complex, alpha/beta subunit, nucleotide-binding domain
[401-485] IPR0007931.8e-15ATPase, F1/V1/A1 complex, alpha/beta subunit, C-terminal
[33-99] IPR0041005.1e-12ATPase, F1/V1/A1 complex, alpha/beta subunit, N-terminal
Orthology groupMCL11362 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200799-TA
ATGGCAAAAACTCTCTCAGCTTCCCAGGCAGACAAGGAACATGCCCTGGTCGTATCCCGCGACTTCATATCCCAACCCCGTCTGACATATAAGACGGTGTCAGGTGTAAATGGGCCACTGGTCATCCTCGACGAGGTCAAGTTCCCTAAGTTCTCAGAAATTGTACAGCTGAGACTTGCAGATGGCACCCTACGTTCGGGACAGGTGTTGGAGGTCAGCGGCAACAAGGCCGTGGTCCAGGTGTTTGAGGGTACCTCGGGTATTGATGCTAAAAACACACTGTGCGAGTTCACCGGTGACATCCTGAGAACTCCCGTGTCCGAAGATATGTTGGGTCGTGTATTCAACGGTTCCGGCAAGCCCATAGACAAGGGACCCCCAATCCTGGCCGAGGACTTCTTGGACATCCAGGGTCAGCCCATCAACCCCTGGTCACGTATATACCCTGAGGAGATGATTCAGACCGGTATATCCGCCATTGACGTGATGAACTCCATCGCCCGCGGTCAGAAGATCCCTATTTTCTCTGCCGCCGGTTTGCCTCACAACGAGATTGCGGCACAGATCTGTAGACAGGCCGGTCTTGTCAAGGTGCCTGGCAAGTCTGTCCTCGACGACCACGAGGACAACTTCGCCATAGTGTTCGCCGCTATGGGTGTGAACATGGAGACGGCTCGTTTCTTCAAGCAGGACTTCGAAGAGAACGGTTCTATGGAGAACGTGTGCCTGTTCCTTAACCTGGCCAATGACCCCACCATCGAGAGGATCATCACACCGCGTCTGGCTCTCACAGCCGCTGAGTTTTTGGCTTACCAGTGCGAGAAACACGTGTTGGTCATCCTGACTGACATGTCTTCGTACGCTGAAGCCCTGCGTGAAGTGTCCGCCGCCCGTGAAGAGGTACCCGGACGACGTGGTTTCCCCGGTTACATGTACACCGATTTGGCCACCATCTACGAGCGAGCCGGACGTGTGGAGGGTAGAAACGGATCCATCACTCAGATACCCATCCTGACTATGCCCAACGATGACATCACCCATCCCATCCCTGACTTGACGGGTTACATTACCGAGGGACAGATCTACGTCGACCGTCAGCTACACAACCGTCAGATCTACCCACCGGTGAATGTGCTGCCCTCTCTGTCCCGTCTCATGAAGTCCGCTATCGGCGAGGGCATGACCCGCAAGGATCACTCCGACGTGTCCAACCAGCTGTACGCGTGCTACGCCATCGGTAAGGACGTGCAGGCGATGAAGGCTGTAGTGGGAGAGGAAGCTCTCACGCCCGACGACTTGCTGTACTTAGAGTTCCTCACTAAGTTTGAGAAGAACTTTATCACTCAGAGTAACTACGAGAACCGCACCGTGTTCGAGTCTCTGGACATCGGCTGGCAGTTGCTGCGCATCTTCCCCAAGGAGATGCTGAAGCGTATCCCCGCCTCCATCCTCGCCGAGTTCTACCCGAGGGATTCGCGTCACTAA

Protein sequence:

>DPOGS200799-PA
MAKTLSASQADKEHALVVSRDFISQPRLTYKTVSGVNGPLVILDEVKFPKFSEIVQLRLADGTLRSGQVLEVSGNKAVVQVFEGTSGIDAKNTLCEFTGDILRTPVSEDMLGRVFNGSGKPIDKGPPILAEDFLDIQGQPINPWSRIYPEEMIQTGISAIDVMNSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKVPGKSVLDDHEDNFAIVFAAMGVNMETARFFKQDFEENGSMENVCLFLNLANDPTIERIITPRLALTAAEFLAYQCEKHVLVILTDMSSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRNGSITQIPILTMPNDDITHPIPDLTGYITEGQIYVDRQLHNRQIYPPVNVLPSLSRLMKSAIGEGMTRKDHSDVSNQLYACYAIGKDVQAMKAVVGEEALTPDDLLYLEFLTKFEKNFITQSNYENRTVFESLDIGWQLLRIFPKEMLKRIPASILAEFYPRDSRH-