Monarch geneset OGS2.0

DPOGS207382
TranscriptDPOGS207382-TA1698 bp
ProteinDPOGS207382-PA565 aa
Genomic positionDPSCF300267 + 34320-39774
RNAseq coverage1389x (Rank: top 9%)
Annotation
HeliconiusHMEL0122380.084.74% 
BombyxBGIBMGA008879-TA0.085.29% 
DrosophilaVhaSFD-PB0.060.47% 
EBI UniRef50UniRef50_D6WVU70.063.16%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WVU7_TRICA
NCBI RefSeqXP_310211.40.061.59%AGAP009486-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2700121700.063.16%hypothetical protein TcasGA2_TC006281 [Tribolium castaneum]
NCBI nr blastxgi|1582883330.062.93%AGAP009486-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00159912e-227ATP hydrolysis coupled proton transport
GO:00002212e-227vacuolar proton-transporting V-type ATPase, V1 domain
GO:00469612e-227proton-transporting ATPase activity, rotational mechanism
GO:00054885.8e-109binding
GO:00168201.2e-48hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances
KEGG pathwayaga:AgaP_AGAP0094860.0 
 K02144 (ATPeV54kD)maps-> Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[1-547] IPR0049082e-227ATPase, V1 complex, subunit H
[269-426] IPR0119895.8e-109Armadillo-like helical
[270-541] IPR0160246.8e-107Armadillo-type fold
[424-541] IPR0119871.2e-48ATPase, V1 complex, subunit H, C-terminal
Orthology groupMCL12401 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207382-TA
ATGGCTAACATTGAAGAAAATGTAAGCAAACTAATAGGCGATGAAAAGATTGACATGATCGCTGCTACCAGCGTTCTACAAATAAGGGCCAGTGAAATCCGGCAAAGCCAAATTAATTGGCAGTCATATCTGCAATCGCAGATGATTACTCAGCGTGATCATGATTTCATTGTGAACCTGGACCAGAGAGGTCAAAAGGATCTGCCGGACAGAAACCCAGAAGGATGTGCCGATGTATTTCTAAACTTGGTAGCGCACATCAGCAAAGATAACACCATTCAATATGTCCTTGTCCTGATTGACGATATTCTATCAGAGGATAAGTCGAGAGTGAAGATCTTCCGTAATGCTCGTCATGGGAATGTCTGGCAGCCTTTCCTCAGCCTGCTTAACCGTCAGGATGAATTTGTCCAGCATATGTCGTCACGCATCATCGCCAAGCTTGCCTGCTGGCATCCGCAGCTCATGGAGAAGAGCGACCTGCACCATTACCTGTCCTGGCTCAAGGATGAGCTTAAAATGAATCTGTGTGTCATTGTGTACGTAATGATGAAAATTGAATCGATGGCCGAGAGGAGTGCTGTAACATTGAGAGGTTTGTTTTCTGGTAAGGCGGAGAAAGTGTCTGCAGAGTTAGAACATGCCGAGCAGGTTATGTTGGAGCATGAGAAACAAAATTTCTCTGATAAACCAACTAAAACGCACAAGGAAAAGGATGATTCAAAGGATTCCGAGAAGTATAAGAGCCTGTACAGCTCTTTTGATGTCTTAAAATCTCATGAGGACCTCCCTGATGGTCTTGTAAAGAACAACGATTACATCCAGTCGGTGGCACGCTGTCTTCAGATGATGCTTCGCGTTGACGAGTACCGCTTCGCTTTCCTATCCGTTGACGGCATCTCGACTCTCCTTTCTATCCTCGCTTCGAGAGTCAACTTCCAGGTCCAATACCAGTTGGTGTTCTGCTTGTGGGTGCTGACTTTCAATCCTCTGTTGGCCGAAAAAATGAACAAGTTCAATGCTATTCCCATCCTGGCCGACATCCTCAGTGACTCTGTCAAGGAGAAGGTCACTCGCATCGTGCTGGCCGTGTTCCGCAACCTCATTGAGAAACCCGAAGACCAGCAGGTGGCCAAGGAGCATTGCATAGCTATGGTTCAGTGCAAGGTCCTGAAGCAGCTTTCTATACTAGAACAGAAGCGGTCTGACGACGAGGACATCATGAATGATGTCGACTTCTTGAACGAACGTCTGCAGGCCTCCGTACAAGACCTCAGCTCCTTCGATCAGTATGCTACCGAAGTGAAGAGTGGACGTCTGGAATGGTCACCGGTGCACAAATCAGCTAAGTTCTGGCGCGAGAACGCCGCCCGTCTGAACGAGCGCGGTCAGGAGCTGCTACGGACGTTGGTACATCTGTTGGAGAAGAGCCGGGACCCCGTAGTACTCGCCGTAGCCTGCTACGACGTTGGCGAATACGTGCGTCACTACCCACGCGGCAAGCACATCATTGAACAACTGGGTGGAAAACAACGTGTCATGTATTTACTGAGCCACGAGGACCCCAATGTTCGGTATGAGGCACTACTCGCTGTACAGAAACTCATGGTTCACAACTGGGAGTATCTCGGCAAGCAATTGGAGAAGGAACAAATCGACAAACAGTCTGGTGGAACAGCCGTCGGTGCTAAAGCTTAA

Protein sequence:

>DPOGS207382-PA
MANIEENVSKLIGDEKIDMIAATSVLQIRASEIRQSQINWQSYLQSQMITQRDHDFIVNLDQRGQKDLPDRNPEGCADVFLNLVAHISKDNTIQYVLVLIDDILSEDKSRVKIFRNARHGNVWQPFLSLLNRQDEFVQHMSSRIIAKLACWHPQLMEKSDLHHYLSWLKDELKMNLCVIVYVMMKIESMAERSAVTLRGLFSGKAEKVSAELEHAEQVMLEHEKQNFSDKPTKTHKEKDDSKDSEKYKSLYSSFDVLKSHEDLPDGLVKNNDYIQSVARCLQMMLRVDEYRFAFLSVDGISTLLSILASRVNFQVQYQLVFCLWVLTFNPLLAEKMNKFNAIPILADILSDSVKEKVTRIVLAVFRNLIEKPEDQQVAKEHCIAMVQCKVLKQLSILEQKRSDDEDIMNDVDFLNERLQASVQDLSSFDQYATEVKSGRLEWSPVHKSAKFWRENAARLNERGQELLRTLVHLLEKSRDPVVLAVACYDVGEYVRHYPRGKHIIEQLGGKQRVMYLLSHEDPNVRYEALLAVQKLMVHNWEYLGKQLEKEQIDKQSGGTAVGAKA-