Monarch geneset OGS2.0

DPOGS205422
TranscriptDPOGS205422-TA849 bp
ProteinDPOGS205422-PA282 aa
Genomic positionDPSCF300407 + 353897-355100
RNAseq coverage19x (Rank: top 80%)
Annotation
HeliconiusHMEL0223225e-11971.48% 
BombyxBGIBMGA001592-TA4e-10163.29% 
DrosophilaVha36-3-PA3e-3939.62% 
EBI UniRef50UniRef50_Q1HPT67e-9963.29%Vacuolar ATP synthase subunit D n=2 Tax=Obtectomera RepID=Q1HPT6_BOMMO
NCBI RefSeqNP_001040532.11e-9963.29%vacuolar ATP synthase subunit D [Bombyx mori]
NCBI nr blastpgi|1140531472e-9863.29%vacuolar ATP synthase subunit D [Bombyx mori]
NCBI nr blastxgi|1140531476e-9363.29%vacuolar ATP synthase subunit D [Bombyx mori]
Group
Gene OntologyGO:00331784.2e-46proton-transporting two-sector ATPase complex, catalytic domain
GO:00159914.2e-46ATP hydrolysis coupled proton transport
GO:00426264.2e-46ATPase activity, coupled to transmembrane movement of substances
GO:00469614.2e-46proton-transporting ATPase activity, rotational mechanism
KEGG pathwayphu:Phum_PHUM6039507e-38 
 K02149 (ATPeVD, ATP6M)maps-> Collecting duct acid secretion
    Oxidative phosphorylation
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[15-210] IPR0026994.2e-46ATPase, V1/A1 complex, subunit D
Orthology groupMCL26600 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205422-TA
ATGAATTCGGAAAACCGTTATCCTGTAACGGCATCTTTATTTATGTTACGTGAAATAAAAAATCGACAAGAAAAGGTCAGCAGAGGATATCAACTTCTTAAGAAAAAAGCAGAAGCTCTCCGTATCCGCGGCCGTCAGGCAGCTGCTGAATTGGCTACGACACAAGCGATTTTGGGACATGCTTTAAGAGAAGCTTATATATCTTTGGCTGCTATTAAATTCACCAACGGCGAATCCAATGCCTTAGTTTTAGAAAATGTTGGCGAAGCTCAAATCCGAGTACAACGTGTGCCGGAAAATATTTCGGGAGTTGCTACCGTGTCTTTGCAGGTATTGGAAGATACTACAGCAAACGATTCATTGCGATATGCAGGACTGGGTGCTGGAGGGCATCGTACCACCGAAACCAAAAAGGCTTTTCGAGAAGCCATTAAGATATTAATAAGATTTGCTTCCCTGAGAAGTAACTGTGTGTTGCTAGATGAAGCTATAAAATCTGCCTTGAGAAAAGTTAATGGCATAGAAAAAGTAATAATGCCTAAGCTACGAAATACTGAAAATTACATTTTAATGGAAATGGATGAAAGGGAACGTGAAGAATTTCATAGACTAAAAATGGTGAAAGCTAAGAAAAACCTTGGACAGCCACTTTTAAAGTCGAAGTCGAACAGGAAATTTCCCTTTGGTGACAGTGATAAAGCTAAATCCTCTTTGGAGTCTATAGATAATCATCTGGAATCTTTGGAGATCTGTTCTTGCCCAACATTATCGACAACCACTGTGTCAGCTGGAGACTTTAAACCAGTGTGCTATCCTCATAATTGGGATGACGAGGATTTACTATTTTAA

Protein sequence:

>DPOGS205422-PA
MNSENRYPVTASLFMLREIKNRQEKVSRGYQLLKKKAEALRIRGRQAAAELATTQAILGHALREAYISLAAIKFTNGESNALVLENVGEAQIRVQRVPENISGVATVSLQVLEDTTANDSLRYAGLGAGGHRTTETKKAFREAIKILIRFASLRSNCVLLDEAIKSALRKVNGIEKVIMPKLRNTENYILMEMDEREREEFHRLKMVKAKKNLGQPLLKSKSNRKFPFGDSDKAKSSLESIDNHLESLEICSCPTLSTTTVSAGDFKPVCYPHNWDDEDLLF-