Monarch geneset OGS2.0

DPOGS202679
TranscriptDPOGS202679-TA474 bp
ProteinDPOGS202679-PA157 aa
Genomic positionDPSCF300039 + 807938-810143
RNAseq coverage13147x (Rank: top 1%)
Annotation
HeliconiusHMEL0145212e-7896.82% 
BombyxBGIBMGA001302-TA9e-7489.70% 
DrosophilaVha16-1-PD5e-7593.55% 
EBI UniRef50UniRef50_G3WMU43e-6083.87%Uncharacterized protein n=1 Tax=Sarcophilus harrisii RepID=G3WMU4_SARHA
NCBI RefSeqNP_001155531.12e-7595.48%hypothetical protein LOC100162391 [Acyrthosiphon pisum]
NCBI nr blastpgi|2408492634e-7495.48%V-type proton ATPase 16 kDa proteolipid subunit [Acyrthosiphon pisum]
NCBI nr blastxgi|2408492632e-7595.48%V-type proton ATPase 16 kDa proteolipid subunit [Acyrthosiphon pisum]
Group
Gene OntologyGO:00331799.3e-59proton-transporting V-type ATPase, V0 domain
GO:00159919.3e-59ATP hydrolysis coupled proton transport
GO:00150789.3e-59hydrogen ion transmembrane transporter activity
GO:00331776.4e-20proton-transporting two-sector ATPase complex, proton-transporting domain
KEGG pathwayapi:1001623916e-75 
 K02155 (ATPeVPL, ATP6L)maps-> Collecting duct acid secretion
    Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[28-52] IPR0002459.3e-59ATPase, V0 complex, proteolipid subunit C
[11-118] IPR0115553.5e-54ATPase, V0 complex, proteolipid subunit C, eukaryotic
[83-157] IPR0023796.4e-20ATPase, F0/V0 complex, subunit C
Orthology groupMCL14889 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202679-TA
ATGGCCGAGTCAAATCCTATCTATGGACCCTTCTTTGGAGTTATGGGGGCGGCGTCTGCTATCATTTTCAGCGCCCTGGGAGCCGCCTATGGAACGGCTAAGTCAGGTACCGGTATCGCTGCTATGTCTGTGATGCGGCCTGAGCTGATCATGAAATCTATCATTCCCGTTGTCATGGCGGGTATCATTGCCATCTACGGTCTGGTCGTGGCTGTGCTGATTGCTGGATCCCTCGAACCCCCAGCAACCTACTCCCTTTTCAAAGGGTTCATCCATTTGGGTGCCGGTCTCGCTGTAGGCTTCTCTGGTCTGGCCGCTGGTTTCGCCATAGGCATTGTGGGTGATGCCGGTGTCCGCGGCACAGCCCAGCAGCCGAGGTTATTCGTGGGAATGATCCTTATCCTCATTTTCGCCGAAGTGTTGGGTCTATACGGTCTCATCGTCGCCATCTACCTTTACACGAAACAGAGTTAA

Protein sequence:

>DPOGS202679-PA
MAESNPIYGPFFGVMGAASAIIFSALGAAYGTAKSGTGIAAMSVMRPELIMKSIIPVVMAGIIAIYGLVVAVLIAGSLEPPATYSLFKGFIHLGAGLAVGFSGLAAGFAIGIVGDAGVRGTAQQPRLFVGMILILIFAEVLGLYGLIVAIYLYTKQS-