Monarch geneset OGS2.0

DPOGS205239
Transcript: DPOGS205239-TA (1203 bp)
Protein: DPOGS205239-PA (400 aa)
Genomic position: DPSCF300265 + 286926-289164
RNAseq coverage: 2695x (Rank: top 5%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL002733        5e-158   66.58%
Bombyx          BGIBMGA014403-TA  2e-113   51.59%
Drosophila      VhaAC45-PA        8e-35    30.37%
EBI UniRef50    UniRef50_D6W7U7   2e-44    34.00%    Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W7U7_TRICA
NCBI RefSeq     XP_974187.1       5e-45    34.00%    PREDICTED: similar to vacuolar ATP synthase subunit S1 [Tribolium castaneum]
NCBI nr blastp  gi|91092800       9e-44    34.00%    PREDICTED: similar to vacuolar ATP synthase subunit S1 [Tribolium castaneum]
NCBI nr blastx  gi|91092800       3e-48    33.50%    PREDICTED: similar to vacuolar ATP synthase subunit S1 [Tribolium castaneum]
Group
Gene Ontology
GO:0015991  3.6e-27  ATP hydrolysis coupled proton transport
GO:0033180  3.6e-27  proton-transporting V-type ATPase, V1 domain
GO:0046961  3.6e-27  proton-transporting ATPase activity, rotational mechanism
GO:0046933  3.6e-27  hydrogen ion transporting ATP synthase activity, rotational mechanism
KEGG pathway: tca:663029 (1e-44)
  K03662 (ATPeVS1, ATP6S1) maps to:
    Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain: [321-400] IPR008388 (3.6e-27) ATPase, V1 complex, subunit S1
Orthology group: MCL16035 (Insect specific)

Nucleotide sequence:

>DPOGS205239-TA
ATGGCGTTTTGCCGTTTAGTGTTCCCTATATTAGTGCTAAGTGTCGTGTCCTGCTTCGCCAACCTGCAGGTACCTGTATTTCTGTGGGGTGATTTGAAAACATCTATAAAATCGAATCCTCTCGTTCGCGTAACTGAAAGCGAATTCGAGGACACGCTTAAACAAGAAATCAAGGACCATTTTACGGTCATTTTCGTCGATGAATCTCTTTCTGTTGAAGATTTTTCACGCAAAAATGACGATGGCGAAACTTCGTTCCCATATCTGCATGCTAACATCGGTAACTCCGTGTACTTACCAGCCGTCGATAACCCGATAGCCGCATTGGACAATTTAGCGGACCCGGAAAAGGTAGATCGAGTCAAGCTAACCGAAAATGGCCTGTCCGCTGAATTTGAACCGGAAAGTGTGAAAATTTTGTTCATAACCCTGAAGGACGCGCGCGTTGGTGAGTCAAGGACCAGCTTGCTACGGAGACACAATGACTTCATGGAAGAAATGTTTACTAAACTCCAAAATCAATATGGAAATGTGGTTGCCATATACACGGCTGACTTCCCTTCCTGGGTCGTCCCGGAAAGCCACTCCAGACTTCGACGTCAAGCTGAACCCTTAGCCCTTAACCAGTACTCAATCAACGGCCTCAAACTGTATGTCCAAGACCTGATCCTCTCGGTTAACAGCGAGAAAACACACTTGAACACTGTATCAAGTCAGAGCTCGACATTCAACGGCACAGACATGCTAACGACCATCGGCTTCGGAGAAAACACTCTGACATTGAGCTTTTCACAGAAAATGGGCTACTGGTATTTTAAAACGGTCACTCTGGAACAGAAGGCACCATCTCAAGTCACCGAGATACTTTACCCTAAGGAAGAGGTGTTCTCCTTCATGGATATGTCATACCGCTGCGGACAGGACGTCTCATTCACCAGCATCAATGACACCAATGTGTACAGCGTCACGTTCTCAAACATGAAGGTTCAACCATTCTTCAAAGATACCAATAGTTCGATAGTCTTCGGTGATTCGGTCAACTGCGTTGGATTCTTCAGTGTTCCTATCTGGTCTGGGCTTTTCGTGGTTTTCATACTCCTAGCGATCACCTTCTATGGTATCCTGATGATGATGGACATCCGCACCATGGACCGCTTCGACGACCCCAAGGGAAAAACCATAACAATTAACGCGAACGAGTAA

Protein sequence:

>DPOGS205239-PA
MAFCRLVFPILVLSVVSCFANLQVPVFLWGDLKTSIKSNPLVRVTESEFEDTLKQEIKDHFTVIFVDESLSVEDFSRKNDDGETSFPYLHANIGNSVYLPAVDNPIAALDNLADPEKVDRVKLTENGLSAEFEPESVKILFITLKDARVGESRTSLLRRHNDFMEEMFTKLQNQYGNVVAIYTADFPSWVVPESHSRLRRQAEPLALNQYSINGLKLYVQDLILSVNSEKTHLNTVSSQSSTFNGTDMLTTIGFGENTLTLSFSQKMGYWYFKTVTLEQKAPSQVTEILYPKEEVFSFMDMSYRCGQDVSFTSINDTNVYSVTFSNMKVQPFFKDTNSSIVFGDSVNCVGFFSVPIWSGLFVVFILLAITFYGILMMMDIRTMDRFDDPKGKTITINANE-
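
The transcript and protein entries above can be cross-checked against each other: 1203 bp is 401 codons, that is, 400 amino acids plus one stop codon. The sketch below illustrates such a check. It assumes the standard nuclear genetic code (NCBI translation table 1) and that the transcript is a complete coding sequence from start to stop codon. `translate` is a hypothetical helper written for this illustration, not part of any database tooling, and the stop codon is printed as `-` only to match the convention used in the protein record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are emitted as `stop_symbol` (an assumption chosen to
    match the trailing '-' in the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons of DPOGS205239-TA.
print(translate("ATGGCGTTTTGCCGT"))   # MAFCR
# Last five codons of DPOGS205239-TA, including the stop codon.
print(translate("AACGCGAACGAGTAA"))   # NANE-
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.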