Monarch geneset OGS2.0

DPOGS203911
TranscriptDPOGS203911-TA699 bp
ProteinDPOGS203911-PA232 aa
Genomic positionDPSCF300005 - 882755-884957
RNAseq coverage24x (Rank: top 77%)
Annotation
HeliconiusHMEL0039744e-9780.00% 
BombyxBGIBMGA002432-TA1e-6052.42% 
DrosophilaVha100-2-PB1e-5749.78% 
EBI UniRef50UniRef50_Q9VE751e-5549.78%Vha100-2, isoform A n=19 Tax=Coelomata RepID=Q9VE75_DROME
NCBI RefSeqXP_396263.32e-5753.57%PREDICTED: similar to Vha100-2 CG18617-PB, isoform B isoform 1 [Apis mellifera]
NCBI nr blastpgi|3287857727e-5653.57%PREDICTED: v-type proton ATPase 116 kDa subunit a isoform 1-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|2700026246e-5550.22%hypothetical protein TcasGA2_TC004949 [Tribolium castaneum]
Group
Gene OntologyGO:00159911e-98ATP hydrolysis coupled proton transport
GO:00331771e-98proton-transporting two-sector ATPase complex, proton-transporting domain
GO:00150781e-98hydrogen ion transmembrane transporter activity
KEGG pathwayame:4128106e-57 
 K02154 (ATPeVI, ATP6N1A)maps-> Collecting duct acid secretion
    Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[1-221] IPR0024901e-98ATPase, V0/A0 complex, 116kDa subunit
Orthology groupMCL34523 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203911-TA
ATGGTTCTTTTATCAACGTCAAAACCGGTTGAAGAAGACTGTGATGCTTACATGTTCAGTAACCAGGAACGTTTTCAACGATTATTGGTCATAATTGGAGTAATTTGTGTTCCAATTCTATTATTTGGAACACCGGTATATCTCAATAAGGCCAATAAGAAAAAGAAAGCTGAAGCCCTTAAAAAGGTTAGTCAATTTCGAAGGTATCAGCGGCGTGAATCCGATAACAAGCGAGTGGAAGATAAAATACTAAAAGAAGTAGCAAAGTACTCAGTACCTTTTGGCGAACTAATGATTCATCAAGCTGTGCACACGATCGAATTTGTACTAAGCACTATATCCCACACGGCCTCCTACCTACGTCTGTGGGCGCTGTCCTTAGCACATGAACAATTGTCGGAGATGTTATGGGTAATGGTGTTTGCTAAGCTTGGTTTACGAGAATATTCAATGACTGGCGGTGTTAAAATATTTCTCATATTTGCTGTTTGGGCGGTCTTCAGTCTTTCAATCTTAGTAGTTATGGAAGGATTGTCCGCGTTCCTTCATACTTTACGATTGCATTGGGTTGAATTTATGAGCAAGTTCTATTCTGGAACAGGCTATCCGTTTAAACCCTTTAGTTTTAAAGCAATTTTAAGCGGCGAAGGCAAAGATGACAAATCTGAGGCAATGTGTAAGAAGAAGGTCGCAAATTAA

Protein sequence:

>DPOGS203911-PA
MVLLSTSKPVEEDCDAYMFSNQERFQRLLVIIGVICVPILLFGTPVYLNKANKKKKAEALKKVSQFRRYQRRESDNKRVEDKILKEVAKYSVPFGELMIHQAVHTIEFVLSTISHTASYLRLWALSLAHEQLSEMLWVMVFAKLGLREYSMTGGVKIFLIFAVWAVFSLSILVVMEGLSAFLHTLRLHWVEFMSKFYSGTGYPFKPFSFKAILSGEGKDDKSEAMCKKKVAN-