Monarch geneset OGS2.0

DPOGS210123
Transcript        DPOGS210123-TA  675 bp
Protein           DPOGS210123-PA  224 aa
Genomic position  DPSCF300017 + 1515217-1516850
RNAseq coverage   1780x (Rank: top 7%)
Annotation
Heliconius        HMEL010699        3e-86  88.54%
Bombyx            BGIBMGA000231-TA  7e-83  84.90%
Drosophila        VhaPPA1-1-PA      6e-69  67.62%
EBI UniRef50      UniRef50_E9PNL3   1e-54  66.47%  ATPase, H+ transporting, lysosomal 21kDa, V0 subunit b n=82 Tax=Eukaryota RepID=E9PNL3_HUMAN
NCBI RefSeq       NP_001040169.1    2e-81  84.90%  vacuolar ATP synthase 21 kDa proteolipid subunit [Bombyx mori]
NCBI nr blastp    gi|345532162      2e-83  88.54%  vacuolar ATP synthase 21 kDa proteolipid subunit [Heliconius numata arcuella]
NCBI nr blastx    gi|345532162      4e-91  88.54%  vacuolar ATP synthase 21 kDa proteolipid subunit [Heliconius numata arcuella]
Group
Gene Ontology     GO:0015991  7.4e-14  ATP hydrolysis coupled proton transport
                  GO:0033177  7.4e-14  proton-transporting two-sector ATPase complex, proton-transporting domain
                  GO:0015078  7.4e-14  hydrogen ion transmembrane transporter activity
                  GO:0033179  2.4e-13  proton-transporting V-type ATPase, V0 domain
KEGG pathway      ame:409074  2e-74
                  K03661 (ATPeVPF, ATP6F) maps->
                      Oxidative phosphorylation
                      Lysosome
                      Phagosome
                      Vibrio cholerae infection
                      Epithelial cell signaling in Helicobacter pylori infection
InterPro domain   [47-110]  IPR002379  7.4e-14  ATPase, F0/V0 complex, subunit C
                  [63-87]   IPR000245  2.4e-13  ATPase, V0 complex, proteolipid subunit C
Orthology group   MCL14352  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210123-TA
ATGAGATACTTTATGACCTACCTGTTTTTGCTGATCGTGGGGCTGGCGGTACCTATAGTTTCGATGTACTATCTTCTGTCTGGAAAGGGTGAGCAAATAAGTGTCGGATGGTTCTTGGAGAAGACATCCCCATATATGTGGGCATGCTTAGGAATTGCCATGGCCGTGTCTTTCTCTGTGGTCGGTGCTGCTATGGGCATTCACACGACGGGAGTGAGCATAGTCGGAGGAGGTGTCAAGGCCCCTAGGATTAAAACTAAGAACTTGATCTCCGTCATTTTCTGTGAAGCCGTAGCTATTTACGGTTTGATCACAGCAATCGTCCTTTCTGGTATCTTGGAGCAGTACAAGGAACCAGTCATCGACAAAAATATCGAAGAAGCGAATTGGATGGCTGGTTACGTGATGTTCGGAGCTGGTTTAGCTGTGGGCTTGGTAAACTTGTTCTGTGGTATCGCTGTCGGAATCGTGGGTTCCGGCGCGGCCTTGGCTGATGCTGCTAATGCTGCCCTTTTTGTCAAGATCCTCATCGTAGAAATCTTCGGATCAGCCATTGGGTTGTTTGGACTCATTGTTGTTAACAAGCATTTTCCAGTTGCCCGTGCTAAAATAACCGAGAAGCAACTTTGTTTCAATAAGTGTTGGAATTATCTTCAGCCACTGTTTGTTGTGTAA

Protein sequence:

>DPOGS210123-PA
MRYFMTYLFLLIVGLAVPIVSMYYLLSGKGEQISVGWFLEKTSPYMWACLGIAMAVSFSVVGAAMGIHTTGVSIVGGGVKAPRIKTKNLISVIFCEAVAIYGLITAIVLSGILEQYKEPVIDKNIEEANWMAGYVMFGAGLAVGLVNLFCGIAVGIVGSGAALADAANAALFVKILIVEIFGSAIGLFGLIVVNKHFPVARAKITEKQLCFNKCWNYLQPLFVV-
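
The transcript and protein records above can be cross-checked against each other: 675 bp is 225 codons, which is 224 amino acids plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) and a stop codon written as "-", as in the protein record. The helper names read_fasta and translate are hypothetical and are not part of the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Return (header, sequence) for a single-record FASTA string (hypothetical helper)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    return lines[0][1:], "".join(lines[1:]).upper()


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; stop codons become stop_symbol."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # Length check using the figures from the record: 675 bp, 224 aa.
    assert 675 % 3 == 0 and 675 // 3 - 1 == 224
    # First six codons of DPOGS210123-TA followed by its stop codon.
    _, seq = read_fasta(">demo\nATGAGATACTTTATGACC\nTAA\n")
    print(translate(seq))
```

Passing the full DPOGS210123-TA sequence to translate and comparing the result with the DPOGS210123-PA record is the natural next step; any mismatch would point to a different translation table or a frame problem rather than to this sketch being authoritative.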