Monarch geneset OGS2.0

DPOGS203910
TranscriptDPOGS203910-TA1239 bp
ProteinDPOGS203910-PA412 aa
Genomic positionDPSCF300005 - 886497-891862
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0157762e-9742.34% 
BombyxBGIBMGA000489-TA7e-12960.68% 
DrosophilaVha100-2-PB6e-9843.17% 
EBI UniRef50UniRef50_Q9VE759e-9643.17%Vha100-2, isoform A n=19 Tax=Coelomata RepID=Q9VE75_DROME
NCBI RefSeqXP_001358543.18e-9843.28%GA15015 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1257745692e-9643.28%GA15015 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1951456751e-9443.38%GL23189 [Drosophila persimilis]
Group
Gene OntologyGO:00159917.3e-145ATP hydrolysis coupled proton transport
GO:00331777.3e-145proton-transporting two-sector ATPase complex, proton-transporting domain
GO:00150787.3e-145hydrogen ion transmembrane transporter activity
KEGG pathwaydpo:Dpse_GA150152e-97 
 K02154 (ATPeVI, ATP6N1A)maps-> Collecting duct acid secretion
    Oxidative phosphorylation
    Lysosome
    Phagosome
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[2-406] IPR0024907.3e-145ATPase, V0/A0 complex, 116kDa subunit
Orthology groupMCL44277 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203910-TA
ATGGGATGCATGTTGAGAAGTGATCTTATGACATTTTGTGATATATTTATCCAGCCTGAAACAGCCTTCGAAATTGTCGCCCATTTCGGAGAGATGGGCTGCGCACAATTCGTAGATGAGGTTAAAAGGAACCTCATTAGTTGCAGTGGTGGTATATTTAAAGCTAGGATGACACCAGATGTTAAGGCCTTTCAAAGAAATTATGTCACGGAAGTTTGTCGTTGTGCTGAGATGGAACGAAAACTCCTATATATGGAATCAGAAATGCTAAAGGATAACATCGAGATAGTCATGTATGATAGTCTAAAGCCAGCAGCGCTGCCTCTTAATGAATTGAGTGCATTGGAAAATATTATTGACAAATGGGAAAGTGATGTCATTGACATGTCAGAAAATCAAACTACCCTTTTGAAGAATTATTTAGAATTGACAGAAATGAACTACGTTCTGAATAATATTGGACCAATGTTAGGGGAATCAGAAATGACAGAGGAGGCCATATTTGGAAAGACCGCCGCAGGTGATACAGGCTTACAAGGTCGGCTATTCGTAATTAGCGGCGTTGTGAAACGATCGAGATCTTTCCCTTTTGAGATGATGATGTGGAGGGTATCTCATGGAAATATTTACTATCGATTAGCATCGCAGGATACAATATTACAAGACCCAGTAACCGGTCAAGACATTCGAAAAGTCGCTTTTCTAGCGATATTCCAGGGTGAGCAGTTATCTGCTAGACTTGAGAAGGTCTGCTCCGGCTTTCATGTCAATATGTACACATGTCCGCAATCATATAATGATCGGATGGATATGATGATACAGCTAGGAACTAGGATCGGTGACTTGGAACAAGTGATGAGTAAAACCAAATATTATCGGTGTAAAGCTCTACGGACAGTGAGCAAACAATGGGATACTTGGATGGTGCAAATCAAAAAGTCTAAAGCTGTTTATCACACAATGAATATGTTTACTTTAGATATTACGAGAAAGTGTCTGATTGGGCAGTGTTGGGTTCCGGATACTGATCTGCAAAAGGTTGAAGATATGTTAGCACGTATAACGGAAAAAGAAGGGTCTAACGTTGAATCTTTTATATTGAAGTCCGACGATGCCGACGAACCACCCACTTACCATCGCACTAACAAGTTCACTAAAGGTTTCCAGGCGCTCATCAACGCCTATGGCGACTCCACTTATAGGGAATTGAATCCCGGCAAGAATTTTTTAACATTTTAA

Protein sequence:

>DPOGS203910-PA
MGCMLRSDLMTFCDIFIQPETAFEIVAHFGEMGCAQFVDEVKRNLISCSGGIFKARMTPDVKAFQRNYVTEVCRCAEMERKLLYMESEMLKDNIEIVMYDSLKPAALPLNELSALENIIDKWESDVIDMSENQTTLLKNYLELTEMNYVLNNIGPMLGESEMTEEAIFGKTAAGDTGLQGRLFVISGVVKRSRSFPFEMMMWRVSHGNIYYRLASQDTILQDPVTGQDIRKVAFLAIFQGEQLSARLEKVCSGFHVNMYTCPQSYNDRMDMMIQLGTRIGDLEQVMSKTKYYRCKALRTVSKQWDTWMVQIKKSKAVYHTMNMFTLDITRKCLIGQCWVPDTDLQKVEDMLARITEKEGSNVESFILKSDDADEPPTYHRTNKFTKGFQALINAYGDSTYRELNPGKNFLTF-