Monarch geneset OGS2.0

DPOGS203862
Transcript        DPOGS203862-TA    5913 bp
Protein           DPOGS203862-PA    1970 aa
Genomic position  DPSCF300010 + 3564174-3575659
RNAseq coverage   35x (Rank: top 74%)
Annotation
Heliconius      HMEL008486        2e-28  57.02%
Bombyx          BGIBMGA009669-TA  8e-80  40.42%
Drosophila      Vha36-1-PA        4e-21  30.59%
EBI UniRef50    UniRef50_Q9XGM1   2e-18  30.29%  V-type proton ATPase subunit D n=32 Tax=Eukaryota RepID=VATD_ARATH
NCBI RefSeq     XP_002010380.1    6e-21  29.63%  GI15892 [Drosophila mojavensis]
NCBI nr blastp  gi|195131895      1e-19  29.63%  GI15892 [Drosophila mojavensis]
NCBI nr blastx  gi|270289851      4e-26  19.22%  predicted protein [Pediococcus acidilactici 7_4]
Group
Gene Ontology    GO:0033178  2e-26  proton-transporting two-sector ATPase complex, catalytic domain
                 GO:0015991  2e-26  ATP hydrolysis coupled proton transport
                 GO:0042626  2e-26  ATPase activity, coupled to transmembrane movement of substances
                 GO:0046961  2e-26  proton-transporting ATPase activity, rotational mechanism
KEGG pathway     dmo:Dmoj_GI15892  2e-20
                 K02149 (ATPeVD, ATP6M)  maps-> Collecting duct acid secretion
                                                Oxidative phosphorylation
                                                Phagosome
                                                Vibrio cholerae infection
                                                Epithelial cell signaling in Helicobacter pylori infection
InterPro domain  [27-218]  IPR002699  2e-26  ATPase, V1/A1 complex, subunit D
Orthology group  MCL30680  Lepidoptera specific

Nucleotide sequence:

>DPOGS203862-TA
ATGTTTAAGGATGACGATGACAAAGACGAAGGAGCAGCCTTCGGCATCACCAGGTATCAGTGCATGCCGTCGTTGGTGGCGTTGCAGCAGATGAGGAACCGTCTCCACCTGGCTTTCCTCGGCAAGAAGCTGATGAAATGGACTGCCTTGGCTACCGGCAGGGAGATGAGGCGGCTCGCCCAGGAGTTATACGACGTGTACGAGGGTTTCGGTGAGGAGTTGAGAAATGCCTTCATTCTGCTCGCCAGATGTCGCTACTTCTCTCCGCCATTAAACAGCATAGTCTTAGAAGATATTGGACAAAAGGCTGGTGTGTTAGTACATCAGGCTACAAAGACGGTGTCGGCAGTGAAGATAGTCCAATTTGACATATTCGAGACTGAATATCCCCCTTATCCTCACTTGGGATTGGAGAAAGGGGGACAGACCATATACGAGACGAAGAAAGCCTGGCTGGATCTGCTCAAGCGGCTTATTATGATGATGCAGCTCAGAGCCAGTTTCCTCATGGTGGAGCTAGCCCATAAAAATGCCAATAAGAAAATGAACGTTTTGGGAAAAGTTGTCATACCGAGAACAAATGTAACAATGGATTATATAAATAACGAGCTGGAGGAATATGCGAGAGAGGAGTTTTTTAGATTAAAAAAGGTTTTGGAAATTAAAAGGAAAATGTATGGAGATAAGGATGGGAAGAAAGAAAAGCCGAAGGACCCTGGGGATCTATCATTGTGTCCAGGTTGTGTTAAGGACGATGAAGATTCAAAAGAAGATAAGATCAAGGAACTACAGATTAAAACTCCCGAAGAGGACATGTCGGCGTATGGCGGGTTGATGCAAGAGAGCCAACTGAATGAACTAATAAAACAAATAAGCGAGGTCGTGGATACCACTAAGGATATGGATAAAAAGGGTCTGGTCTCAGCCAGTGTTGAGGATTTTATAAAGACCGCAGAGGAGTTGAAAGCGAAGTTGGCACAAAAATCGGAATTAGACGAAAAAGAAGATGTGAAAAATGTAGAAGATGAGCTGGAGAAGGTAGCAAAGGAACTAGAAGACTTGGATGAAAACCCTAGCATAGACCCGCAGATGGAAGACGATATGGAGTATGGTAAATTATGCGACTCGTGTAAAAATAAATGTATAATGAAAGATAGAGGTAAAGATAAAGATGTCCAATCCGTCTTGTTTGACGATGAAGATTCAATAAAATCTACGGCACCAAGGACCGCGACGAACTCTACGCCAGGGTCAGGACTATCTGAAGAAACTGTACAACTAAAGCAATCAAGCCAAGAGGAATTTCTTGATGATTCTGATTCCGATTTGGGGGCTTTTGAGAAGGAAGTAAAAGAAATCACCACAATAACAAGAACAAGAAACGAAGATGGATCCGTTAGTGTGCAAAAGAAAATTGTAAAAATTGAAAGACAATTTAGACCCGTTCGAAACCAAGGCTCTCAAGGTCCTAGCCGATTCGATTCTGGCAATCCATCGAGGAACGCATCCAATCAGCCATCGACTACTTCATCCCGGAAATCAAATCCATATTCGGGTTCAAAAACTGTTGGTTCTATAAAAGTTCCAATACCTTCTTATAAATCTATATTTATACCAAATAAAAATATAAGGAAGGCTAAGTCGGAAACATTTAATCATTACACCCCGATGTCTGATCCCATAGAAAGACAATGTTTGTCGTCAGGAACATTGCTCTCAGCAGATTTAGATTTCAATATTAACCCAATGAGAAGTAATAATTCTGTATTCGGAGAAAATAGATCGAATTATCAAAATGGATTTTACAATCAAAGTCACCTAACAAAAGTTAATAGCTACCAACAGAAGAATGTTACATACGAAAGTTTATCGGATCCTGACAACAATAAAACGCTGTCGTCATCTGACGAGGATTTGTTTAGAGATAGCGGCAATAGTAATTCAATTTCCGATGCCACAGTTAATCCGAAATGTAATCAGAAACTAGGGTGGTTATCAGC
AGCTCCGACAAAACCTGAACCAAATATTAAGTCAAGATTTTATATCCGCAGGAAATGGAATGTCCTGGGGAAAGTGGTTGTACCAAGAATCAAATTAACAAGTGCTTATATAAATTCAGAGCTAGAGGAAATTGAAAGGGAGGACAACTTTAGATTGAAAAGATTCAAAGAGTTAAAGATTAAGAAAAAAAAGGAAGTTGAGGGACCAGCAGAAAAACCGGGACCGAAAGAATCAATAACGAATAAAAATGAGGATTCCTCTTTGACCTCTTATAAATCAGTAAGATCGGACAAATTTCCTGTGTGTGATCAAAGTTCTGTCTGCGTTTGTAAACCTGCTTCAGTTGGAGGCGATGAACTGGTGTCGTTGAAGAATATTGACTTTATAGACCAAAGAGACTTCAGCTTAGACTCTGTCAGAGCGAACAAAAACAAGAATGAAAATAAAACTGACAGTATATTAAAATTATTCCAAAACGAGAGTAGAAATTCAAAAATAACAACAATCAGGGAACTTAAAGATTTTAAAGAGGAAGTTTATGATGATGCAGGCACAATACCTTGTTCGTGTCAATCTAAATATAGAAAGAGTAGTATAACTACGGGTGAAGAAGTATGCTGTAGAAGAGTAAGTGAATCACAAAGCGAATTGTGGAGGCAAGCGTCGGGAGCGATCCCATGTGTTTGTAAAACTAAAGCAATAGAATTTATAGACCAAAGGGAAAAAGGAAATGATGGCAGCGTAAATGAACCGTTATTTTTAGAGCCAGATACCAAAAGTGCTTGTTGTAAGTCAAAAACACCGTCAATTGAATTTATAGATCAAAGAGATGCCCGTTTTGAAAGCTTTTCGAATCCAACAAACGGAGGCTCTCAAAATGACAACAGAACAACAAATTATTTCATTTACTGCACCATATCCATTGTAAAAGGAGGACGAAAATTTATGCAACAAGCAAAACCAAGTTGTTGTCATAAATCACAACAAATTGAATTCATAGACCAAAGAGAGACAAACAGTGAAACTAAATTTATACCCGACTCAAAAGCTGATTGTTGTAATAAATCACGACAGATTGAATTTATAGATCAAAGAGAGGTAAATAATGATATGAAGTTTGTTCCCGACGCGAAAGTGGATTATTGTAAATCACAACACATAGAGTTCATTGATCAAAGAGAGATAGATAATGAAATGAAGTCTGTTCCTGACTCGAAAGAAATTGAATTTATTGACCAAAGAGAGATAGATAGTGGAATGATGGTTGATCCTGAGTCAGAAGTGAATTCTTATAAAACGCAAAAAATTGATTTCATAGACCAAAGAGAGATAGATAACGAAATGAAATTTATTCCCGACTCAATACTAAATGATGGTAAAACACAAAAAATTGAATTCATAGACCAAAGAGACCATAGACTAAATGATAATATAAAATATATTTTTGACTCTTGTAAATCACAAAAAATTGATTTTATAGACCAAAGAATGAGAAATAATGTAACCGATGTTATAGATGACTCGAAAACCTCCTGTTGCAAATCTAGCTCGCCGACAATTGAATTCGTAGATCAGAGGGATCCAGGAAGAGTAGTCAATTTGAATACAAATAGCCCATGCTGCGGTGAAACAGACAAGGAAAGGGGTCGTAGAGAACAAACAAAACATAAGATAGACTCAAAAACTTCCTGCTGTAAATCAATATCACAATCAATTGAATTTGTAGATCAAAGGGAAATAGAAAAAAATGGTAATTCAACAAACAGTGAAATGAATTCAAAAGAGGAAATGAAATATAAGCCAGATTCAGATACTTCCTGTTGCAAATCTAAATCGCATTCAAGAGAATATATAGACAGAGAAAATAAAAGAAACGAAGGTAATTTAAACCAAAATAATGTATGTTGTAAAGAAGAAAGTGAACTTAATACGAGGGATGTGAATTTAAAAGATAAAATTAAATTTAAGTCCAATTCAAAAACGCCCTGTTGCAAATCTA
AGTCGTTAGATTTCATAGATGAAAGAAAATTATTAAAAAAAGATTCCAGCGATTTTAAAAATCGTGATACAGGAAATTATGTGTCCTTATTAAAAAATAACAGAGATATTTGTTCGTCTTGTAAATCAAGTAGTTCGCAGTACGCTGAAAATAGTTCAGATGTAACACAGTCTTTAGACATTAATACAGATTTTAGTGGTTCCGTTTTTCAAAATCCAGAATCAAATATAGAGAGTTGTACTCAAAGTATTTGCAGCCGCTGTTCACTCGCAAATAAAAACTTAACAAGCGATGACGCACTTAAGTCTCTGTCTTCTGATAAAAATTGTACTAACATATGTGAAAAACAATTACAGGATAAAAAAGGCAGTTCAAAACATATTTGTAAGGGACGTAAGGCAAAAAGTGAAAATAACGATATCGGCTTTAAACAAACTGTAAAAGAAATAAACGAAATGAAGATAACCTGCAAGGACGGCAGCTTCAAAGCTGAGAAGCGAATTAAGACAATAACAAATGAGAAAAACAACGTACCATGCGGAACCCAAACAAACGATCAGGCAAAAAAGGTTCAGACATGTTCAAGTTTAAGTGGTGGGAGACCATCAAAAGTGAGTTCCAAATCTAGTATCAATATTAGAATAATCGCTAAGAGTGGCACCGTTACACCAAAGGAGGGCACAAATGACATTGAGGTCACACCCAGTGACTTTAGCATACTCAAGAGAATACTTATACGGGCTAGCAAGTCTTCAGCGTGTATTACGAAATCAAAAAAGTCGAAATCTAGTTCTAAGGGCAGTTGCAGTTCTAGTTTAAGTGCGAAATCGTCAAGTTCCACAAAAAGGGACCTATCTTCTTGCCCTAATGACAAGTGTACACGAAAAAAACCAAAGAAACACAATTGCCAAATCGAGGATAAAGAACCTTCTATGAAGAGTAGTAAGACGACAATGACCTCAAAATGCACTTGTTGTTCTGGGTGGAAAACTAAATCTTCCACTTCGATAGAAAGAGAACCCCCTCCCTGCCACCAAGAAATGCGTAAAGGTAGCCAAGATCAAACAAAAAAATTCAACTATTTTAATAGAAAAGTGAAAAGCGAAACAGTTCAATTTGGGCCGTCATGTTGTAGTGTCCGGAGTTCTGTGAAAAACGTGTCCACTGAGTCCGTTGCCGATTCATGTAGTAAGCTAATTTCTAGAATCGTAAAAAAACATTCATGCGTATCAAATACCAAAACAAAATCGTCGCCCGCAACTAAACTTGATAGTAAACTCTGCACTTGTAATGACAAAAAGAGCGCTGATGCAAAGGAATGTATATGCAGTAAATCGAAAAGCACGCCACAGAAGACGTGTACGTGTAGTAAAACAAAGGAAAAAGATAAATGCTCATGTAATGTTAGGGAGAAGAATAAAAGTAAATGTACTTGTTCAACACCGAAGCAGTGCACTTGTAAAAGGTTACCACCGAAACGATGTCCAAGTGACTGTGACAGAATTTCAAACAGGCCAACAGAGTACTTAGACAAGCTTGAAACAAAATACTTCAGCAAGAACTGCTGCAAATCAGCAACCCCGAGTGACGTGCTATCATCTGGTAGCAACGATTCGAAAACAAATAATGCAGCCACACAAAAAAGTTCAAAGCATTCCTGCGACTCAGTCTGCTCTCAGGATAAATTTGTCGTTGAAGTTCCTTCTCAATGCTCGTTCTGTGGAATGAAAGAGGCTTCGACTATGACCAGAAGATCTGATAGGTATGATGTTAGGCCATTAAGATTACTTTCAAGAAGGAAGCCATACTTCTGCTGTCAATCTGATACGGGGTTACTATCGCGCGCTAAGGCCTTAAAGATTAAAATTACAAGATGTCGTTCCGCTGACGATAGGAGACGATATTGCTACTGA

Protein sequence:

>DPOGS203862-PA
MFKDDDDKDEGAAFGITRYQCMPSLVALQQMRNRLHLAFLGKKLMKWTALATGREMRRLAQELYDVYEGFGEELRNAFILLARCRYFSPPLNSIVLEDIGQKAGVLVHQATKTVSAVKIVQFDIFETEYPPYPHLGLEKGGQTIYETKKAWLDLLKRLIMMMQLRASFLMVELAHKNANKKMNVLGKVVIPRTNVTMDYINNELEEYAREEFFRLKKVLEIKRKMYGDKDGKKEKPKDPGDLSLCPGCVKDDEDSKEDKIKELQIKTPEEDMSAYGGLMQESQLNELIKQISEVVDTTKDMDKKGLVSASVEDFIKTAEELKAKLAQKSELDEKEDVKNVEDELEKVAKELEDLDENPSIDPQMEDDMEYGKLCDSCKNKCIMKDRGKDKDVQSVLFDDEDSIKSTAPRTATNSTPGSGLSEETVQLKQSSQEEFLDDSDSDLGAFEKEVKEITTITRTRNEDGSVSVQKKIVKIERQFRPVRNQGSQGPSRFDSGNPSRNASNQPSTTSSRKSNPYSGSKTVGSIKVPIPSYKSIFIPNKNIRKAKSETFNHYTPMSDPIERQCLSSGTLLSADLDFNINPMRSNNSVFGENRSNYQNGFYNQSHLTKVNSYQQKNVTYESLSDPDNNKTLSSSDEDLFRDSGNSNSISDATVNPKCNQKLGWLSAAPTKPEPNIKSRFYIRRKWNVLGKVVVPRIKLTSAYINSELEEIEREDNFRLKRFKELKIKKKKEVEGPAEKPGPKESITNKNEDSSLTSYKSVRSDKFPVCDQSSVCVCKPASVGGDELVSLKNIDFIDQRDFSLDSVRANKNKNENKTDSILKLFQNESRNSKITTIRELKDFKEEVYDDAGTIPCSCQSKYRKSSITTGEEVCCRRVSESQSELWRQASGAIPCVCKTKAIEFIDQREKGNDGSVNEPLFLEPDTKSACCKSKTPSIEFIDQRDARFESFSNPTNGGSQNDNRTTNYFIYCTISIVKGGRKFMQQAKPSCCHKSQQIEFIDQRETNSETKFIPDSKADCCNKSRQIEFIDQREVNNDMKFVPDAKVDYCKSQHIEFIDQREIDNEMKSVPDSKEIEFIDQREIDSGMMVDPESEVNSYKTQKIDFIDQREIDNEMKFIPDSILNDGKTQKIEFIDQRDHRLNDNIKYIFDSCKSQKIDFIDQRMRNNVTDVIDDSKTSCCKSSSPTIEFVDQRDPGRVVNLNTNSPCCGETDKERGRREQTKHKIDSKTSCCKSISQSIEFVDQREIEKNGNSTNSEMNSKEEMKYKPDSDTSCCKSKSHSREYIDRENKRNEGNLNQNNVCCKEESELNTRDVNLKDKIKFKSNSKTPCCKSKSLDFIDERKLLKKDSSDFKNRDTGNYVSLLKNNRDICSSCKSSSSQYAENSSDVTQSLDINTDFSGSVFQNPESNIESCTQSICSRCSLANKNLTSDDALKSLSSDKNCTNICEKQLQDKKGSSKHICKGRKAKSENNDIGFKQTVKEINEMKITCKDGSFKAEKRIKTITNEKNNVPCGTQTNDQAKKVQTCSSLSGGRPSKVSSKSSINIRIIAKSGTVTPKEGTNDIEVTPSDFSILKRILIRASKSSACITKSKKSKSSSKGSCSSSLSAKSSSSTKRDLSSCPNDKCTRKKPKKHNCQIEDKEPSMKSSKTTMTSKCTCCSGWKTKSSTSIEREPPPCHQEMRKGSQDQTKKFNYFNRKVKSETVQFGPSCCSVRSSVKNVSTESVADSCSKLISRIVKKHSCVSNTKTKSSPATKLDSKLCTCNDKKSADAKECICSKSKSTPQKTCTCSKTKEKDKCSCNVREKNKSKCTCSTPKQCTCKRLPPKRCPSDCDRISNRPTEYLDKLETKYFSKNCCKSATPSDVLSSGSNDSKTNNAATQKSSKHSCDSVCSQDKFVVEVPSQCSFCGMKEASTMTRRSDRYDVRPLRLLSRRKPYFCCQSDTGLLSRAKALKIKITRCRSADDRRRYCY-