Monarch geneset OGS2.0

DPOGS208202
TranscriptDPOGS208202-TA1020 bp
ProteinDPOGS208202-PA339 aa
Genomic positionDPSCF300179 - 230255-234442
RNAseq coverage401x (Rank: top 30%)
Annotation
HeliconiusHMEL0035791e-13668.42% 
BombyxBGIBMGA002311-TA7e-14474.71% 
DrosophilaCG31030-PB5e-2026.10% 
EBI UniRef50UniRef50_Q7PXW02e-3633.43%AGAP001624-PA n=5 Tax=Culicidae RepID=Q7PXW0_ANOGA
NCBI RefSeqXP_321475.41e-3432.51%AGAP001624-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479662638e-3633.43%AGAP001624-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479662637e-3632.10%AGAP001624-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00159911.2e-21ATP hydrolysis coupled proton transport
GO:00331801.2e-21proton-transporting V-type ATPase, V1 domain
GO:00469611.2e-21proton-transporting ATPase activity, rotational mechanism
GO:00469331.2e-21hydrogen ion transporting ATP synthase activity, rotational mechanism
KEGG pathwaytca:6630295e-16 
 K03662 (ATPeVS1, ATP6S1)maps-> Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[264-339] IPR0083881.2e-21ATPase, V1 complex, subunit S1
Orthology groupMCL18269 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208202-TA
ATGAGGATATACGGCGAGCAGACCGCCATAGTCGTGTTCAACAACCAGGATTCTGTACCTCTGTCCAGTGGAATCTTCCCTCGCCTGAGGAAGATGCTGGAGGGTCAGACTACGCTGGTCCTCCCTCAGGACTTCTTAGACGTCCATCCCGATTACATTAGTAACGAAACACAAGTGTTAACACTACAAGGCCCGTGGTCTGAACGCGACGCTCAGATGACGGAGACGTTTGCGAGGCTTCAGGACATATACGGCGCTGGAAGAGTTCTAGGAGTACTAGGTAACTCAGTCTCTCAAAGTCAACCGTATTACGAGACTGAATGGACTCGAGTAAGACGTCAGGCTAAGGACGTGACCACTACGTCAGCTACCACCGAAGACGTGCAAGAGAAACCAGCCAGGAATCTCCAGAAATATGCTTTGTATAATGCAACAGGTCCCCCCGGGAAGGCGGCGCTGCTGTATTCTTCAGGCTGGCCGGAACTGCGTTGGCCTGACGGCACGGTGACAGTGTTAGACAGTCCCGTTGGCGAGCCCACCATCAAACCAACCCGACTCTACACCATACTGGTAGTGAGGTTCGCTGACGGAGACAATACCAGAGATAAGATAACTCTGGAGTTCTCGTTCAAGCAGTCTGGTTCGTGGTGGTCGGCAGTAGGTGTGGAAATCCGCCGAGGGTTGGAGACAACCGGGTTAAACATGCCCGCCCTCGATCCACCCGCCGCTGTCCTGGGAAGGGCCTTCCATTGCTCTCTACCGCTCATATATGAGGCGGATGACGCTAGACTTACCTTCCCTGATATCAGAATTCAACCGTTTATGGAGAGCACAGAGAAGTTTGCTGATGCGTTTGATTGTATCGGCTTCACGACGGTACCTATCTGGTCCGGCCTGATGGTGACGGGTCTGATGCTGGTGGTGCTGTTCGTATCCATCTGTATGATCATGGACATCAAGACGATGGACCGCTTTGAGAACAACCGCTCCAAACAACTCACTATCACCGTCTATGAATAA

Protein sequence:

>DPOGS208202-PA
MRIYGEQTAIVVFNNQDSVPLSSGIFPRLRKMLEGQTTLVLPQDFLDVHPDYISNETQVLTLQGPWSERDAQMTETFARLQDIYGAGRVLGVLGNSVSQSQPYYETEWTRVRRQAKDVTTTSATTEDVQEKPARNLQKYALYNATGPPGKAALLYSSGWPELRWPDGTVTVLDSPVGEPTIKPTRLYTILVVRFADGDNTRDKITLEFSFKQSGSWWSAVGVEIRRGLETTGLNMPALDPPAAVLGRAFHCSLPLIYEADDARLTFPDIRIQPFMESTEKFADAFDCIGFTTVPIWSGLMVTGLMLVVLFVSICMIMDIKTMDRFENNRSKQLTITVYE-