Monarch geneset OGS2.0

DPOGS214027
Transcript: DPOGS214027-TA, 750 bp
Protein: DPOGS214027-PA, 249 aa
Genomic position: DPSCF300238 - 190805-194276
RNAseq coverage: 2408x (Rank: top 5%)
Annotation
Source           Hit               E-value  Identity  Description
Heliconius       HMEL004779        6e-126   95.58%
Bombyx           BGIBMGA008542-TA  8e-108   94.42%
Drosophila       Vha36-1-PA        1e-118   84.74%
EBI UniRef50     UniRef50_Q9Y5K8   1e-97    72.80%    V-type proton ATPase subunit D n=33 Tax=Eutheria RepID=VATD_HUMAN
NCBI RefSeq      XP_001600508.1    1e-122   87.95%    PREDICTED: similar to vacuolar ATP synthase subunit D [Nasonia vitripennis]
NCBI nr blastp   gi|156542568      2e-121   87.95%    PREDICTED: V-type proton ATPase subunit D 1-like [Nasonia vitripennis]
NCBI nr blastx   gi|114053249      4e-122   94.38%    vacuolar ATP synthase subunit D [Bombyx mori]
Group
Gene Ontology:
GO:0033178  3.6e-158  proton-transporting two-sector ATPase complex, catalytic domain
GO:0015991  3.6e-158  ATP hydrolysis coupled proton transport
GO:0042626  3.6e-158  ATPase activity, coupled to transmembrane movement of substances
GO:0046961  3.6e-158  proton-transporting ATPase activity, rotational mechanism
KEGG pathway: nvi:100119832  4e-122
K02149 (ATPeVD, ATP6M) maps to:
    Collecting duct acid secretion
    Oxidative phosphorylation
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain: [2-250]  IPR002699  3.6e-158  ATPase, V1/A1 complex, subunit D
Orthology group: MCL13821  Single-copy universal gene

Nucleotide sequence:

>DPOGS214027-TA
ATGTCTGGGAAGGATAAATTAGCGATTTTCCCTTCTCGGGGAGCCCAAATGTTAATAAAAGGCCGTTTGGCTGGAGCCCAGAAAGGGCATGGTCTCTTGAAAAAGAAGGCTGATGCCTTACAAGTCAGATTCCGTATGATCTTAAGCAAAATCATTGAGACAAAAACTCTGATGGGTGAAGTGATGAAAGAGGCTGCATTTTCCCTGGCGGAAGCTAAGTTTACAACCGGTGACTTCAACCAAGTGGTGCTCCAAAACGTTACTAAAGCACAAATTAAAATTCGCTCCAAGAAAGACAATGTCGCTGGTGTAACTCTGCCGATATTTGAGTCCTACCAGGATGGTTCTGACACATATGAGTTGGCGGGCCTTGCTCGTGGTGGACAACAGCTCTCCAAGCTGAAGAAGAACTTCCAGAGCGCTGTCAAGTTACTGGTGGAGCTGGCTTCCCTACAGACCTCATTCGTCACCCTGGACGAGGTCATCAAGATAACCAACCGTCGTGTCAATGCTATCGAGCATGTCATTATTCCCCGTCTGGAGCGCACCTTGGCCTACATCATCTCGGAGCTGGACGAGCTGGAGCGTGAGGAGTTCTACCGCCTCAAGAAGATCCAGGACAAGAAGAAGATCATCAAGGACAAAGCTGAAGCGAGAAAAGCTCAAATGTTGGCAGCCAACCGTGACCAGGACATGAGAGACAGTGTTGCCAACCTGTTGGACGAAGGGGATGAAGATTTGCTCTTCTAA

Protein sequence:

>DPOGS214027-PA
MSGKDKLAIFPSRGAQMLIKGRLAGAQKGHGLLKKKADALQVRFRMILSKIIETKTLMGEVMKEAAFSLAEAKFTTGDFNQVVLQNVTKAQIKIRSKKDNVAGVTLPIFESYQDGSDTYELAGLARGGQQLSKLKKNFQSAVKLLVELASLQTSFVTLDEVIKITNRRVNAIEHVIIPRLERTLAYIISELDELEREEFYRLKKIQDKKKIIKDKAEARKAQMLAANRDQDMRDSVANLLDEGDEDLLF-
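
As a consistency check on the two sequences above, the following minimal sketch translates a coding sequence with the standard genetic code and writes the stop codon as `-`, the convention this record appears to use. The names `translate`, `CODON_TABLE`, and `stop_symbol` are illustrative and not part of any database tooling. To keep the example short, only the first 18 and the last 12 nucleotides of DPOGS214027-TA are used.

```python
# Standard genetic code in TCAG order.
# Assumption: the record was translated with the standard nuclear code.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))  # "*" marks a stop codon


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 18 and last 12 nucleotides of DPOGS214027-TA, copied from the record.
head = "ATGTCTGGGAAGGATAAA"
tail = "TTGCTCTTCTAA"

print(translate(head))  # MSGKDK
print(translate(tail))  # LLF-
```

Both fragments match the start (`MSGKDK`) and end (`LLF-`) of DPOGS214027-PA. The listed lengths also agree with each other: 750 bp is 250 codons, that is, 249 residues plus the stop codon.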