Monarch geneset OGS2.0

DPOGS207818
TranscriptDPOGS207818-TA2961 bp
ProteinDPOGS207818-PA986 aa
Genomic positionDPSCF300042 + 722420-731061
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0119730.060.89% 
BombyxBGIBMGA000725-TA0.037.80% 
DrosophilaMdr50-PA2e-13146.65% 
EBI UniRef50UniRef50_G3HRY00.042.27%Multidrug resistance protein 1 n=14 Tax=cellular organisms RepID=G3HRY0_CRIGR
NCBI RefSeqXP_001641800.10.038.87%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|3442511040.042.27%Multidrug resistance protein 1 [Cricetulus griseus]
NCBI nr blastxgi|3442511040.042.18%Multidrug resistance protein 1 [Cricetulus griseus]
Group
Gene OntologyGO:00068105.1e-47transport
GO:00550855.1e-47transmembrane transport
GO:00055245.1e-47ATP binding
GO:00426265.1e-47ATPase activity, coupled to transmembrane movement of substances
GO:00160215.1e-47integral to membrane
GO:00168872.5e-23ATPase activity
GO:00001662.9e-20nucleotide binding
GO:00171112.9e-20nucleoside-triphosphatase activity
KEGG pathwaybta:2815850.0 
 K05658 (ABCB1)maps-> ABC transporters
InterPro domain[2-504] IPR0115275.1e-47ABC transporter, transmembrane domain, type 1
[3-196] IPR0011409.3e-45ABC transporter, transmembrane domain
[792-919] IPR0034392.5e-23ABC transporter-like
[279-465] IPR0035932.9e-20ATPase, AAA+ type, core
Orthology groupMCL10075 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207818-TA
CAGATATTTCGGATACGAATTTCATATTTAAGGGCCGCTCTTAATCAAGATTTTGCATATTTTGACCTTCACCAGACTGGAGATTTCGCTTCAAGGATAGCTGAAGACATGATCAAATTGGAGGAAGGTATCGGAGATAAGGTATCTTCTTTGGTACACAACGCGGCCGTTTCATTAAGTTGTATAATAATGGCTCTGATAAAAGGTTGGAAACTAGCTTTATTGTGCCTTAGTACAGCTCCAATAACATTTTTTCTAGTTGGTGTTACTGGTAAGATTGCCAATAACTTATATAAGAAACAAGCCAAAGCGAAAGCCCAGGCTAGTGCCGTAGCCGAAGAGGTTCTCGGTTCTATTAAAACAGTTTATGCCTTTAATGCTCAACAATATGAAATAAAACGGTATAAGAAACATCTGGCCAATGCAAGAAGGATATTTATTAGAAAGGAAACTTTCACTGGAATGTCGATGGGCCTATTGTATTTATGCGTTTTTAGTTCCTACGCCATGGCGTTCTATATAGGAATATACTTAATAATAAACGAGCCCGAAAAATACAATGCCGACGTCATGTTTTCTGTGTTTTTTGGAGTTATGACTGCCTTAACATATGTGGGTATGATAGGATCTCTAATGTCGTCTTTCGGATCAGCTCAGGGGGCAGGTGCCCAAGTCTTCCACATTCTGGATAACGTTCCCACTATAAACCCATTACTTGATCGGGGTATTAGGCCCGATGGTATAAATGGCGTTATAGAGTTGAAAGACGTAGTTTTCCACTATCCTTCTCGACCTTCAGTCCTTGTATTGGATTCGATAAATATAGATGTACGTAGCGGGCAGACTATCGCCTTGGTTGGAAATTCCGGTTGCGGAAAAACTACCATTATTCAACTTATATCAAGATTCTACGACGTGGATCGTGGAAGCGTTCGCATTGACGGCCGTGACGTTCGCGAACTATCTGTGAGGTGGCTACGTCATCAAATAGGCCTTGTTAGACAGGAACCGGTGCTGTTCAATACCAGCATATTTGAGAACATTCGCCTAGGAAGCGTGGATGTTTCTTATGATGACGTCATCACCGCTTCTAAACAAGCAAATGCCCATGAATTCATTATGGAGCTCCCTTCAGGTTACGAAACGCTAGTAGGAGATCGAGGAGCATCTTTATCAGGTGGACAAAAGCAGAGGGTGGCGATAGCGAGAGCCCTCGTTAGGAATCCCCGCATATTATTACTCGATGAAGCAACAAGTGCCCTTGACACTGTATCAGAAACAAAAGTTCAGGAAGCGTTAAATCGGGCAGCTAAAGGCCGCACAACTATAGTCATCGCACATCGCTTGTCAACCATTCGAAACGTTGACAAAATATTTGTGATGCAAAAGGGACGTGTTGTAGAAACTGGGAATCATGAGGAATTAATAAAAAAGGGGGGCGAGTACTATCACATGTTCACGACATCTGAGCAGCTACCACTAAATGAAGAGTTACAAGTAGATGACGAGCCTTCCAGAGAGCGTTCAAATATATCAAAGGAGACTGTGGATTTAAAAAAGACGCTATCTTTCGGTGCCGCGGGCGCGTACCTCACGGAACGCCTGCGTATGCGCATGTTCAAAAATCTTCTGGTTCAGGATGTGGCGTTTTATGACGAACGGGAGAATTCACCAGGAGCTTTGTGTGCCAGACTGTCTGCTGAAGCGGCTTATGTGCAAGGAGCCACAGGTCAACGTATTGGTATAATTTTACAAGGAGTTGGATCCATTGGCTTAGCGCTGTTTCTAGCTATGTGGTTCGAATGGCGAGTCGGTATGGTCGCCCTTGCTTTCTTGCCATTAGTTGTTATAGTAATCTGGCAACAAACAAAAGCCACAGATAAGGAATCACAAGGATACGCGAAAGCTCTCGAAAATAGCACAAAGATTGCTGTCGAGGCCTTATCTAATATACGCACCGTTGCGTGTCTCGGCCGAGAGCCTGCGATGGTGGTCGAGTATGCTCACTGCCTCAGACCTGCGCGGAGGCCTGCGGTGCTCGCGGCCCACTGGCGGGGGGTGCTGTCAGGCCTGTCGCGGTCTATGTTTAACTTTATTAACGCAGCTGCGCTCACCTACGGCGGGCACGTGGTCGCCGACGGAGTACCCTACCAAGATATACTCATAACGACGCAATCTTTACAAATGGCGTCATCCCAGGCCCAGAGCGCGTTCGCTTACGCACCAGACTTCCAAAGAGGAATTAACGCAGCTGCCAGAATTGTTAACCTTATAAATATGAAACCCACTATAGTAGACCCCGAGGAGCCAACTAGGAACTTTCTATCAGAAGGAGAGACCGTTGTTTTAGTGGGTGAAAGCGGATGTGGGAAAAGTACTGTTATTCAACTCCTACAAAGATACTACGATCCTGATTCGGGCACTATTACTTTAGAAAACAAACCCCTAACACATTTACGAGTAGACGAAGTCCGTGCGAACTTCGCACTAGTATCTCAAGAACCGACACTCTTCGAACGCAGCATCCGCGAAAATGTTGAGTATGGAGACATCTCCAGACCGGTCACTATGAAAGAGATCGTGGATGCCACCAAGCTTGCAAACATTCATGACTTTATAGTTTCCCTACCGCAGGGTTATGAGACCAACATCGGTTCGAAGGGCATACAACTGTCTGGAGGACAGAAACAGAGAGTCGCAATAGCGAGAGCGCTGATAAGGCAGCCAAAGATCTTGCTCTTAGATGAAGCCACCAGCGCTCTCGATGGTGAAAACGAGAAGGTGGTGCTGTCATCTTGCCGTGCTGGCCGTACATGTATACTGGTCTCTCACCGGCCGCGTGTGATAGCTTCGTCGTTGATACACGTGCTGGCTGCGGGCCGTGTACTGGAGCGAGGGACACACGAACAGCTCATGGGGAAACGGGGCCTATACTACACTTTAAATGCTAAAGGACAATGA

Protein sequence:

>DPOGS207818-PA
QIFRIRISYLRAALNQDFAYFDLHQTGDFASRIAEDMIKLEEGIGDKVSSLVHNAAVSLSCIIMALIKGWKLALLCLSTAPITFFLVGVTGKIANNLYKKQAKAKAQASAVAEEVLGSIKTVYAFNAQQYEIKRYKKHLANARRIFIRKETFTGMSMGLLYLCVFSSYAMAFYIGIYLIINEPEKYNADVMFSVFFGVMTALTYVGMIGSLMSSFGSAQGAGAQVFHILDNVPTINPLLDRGIRPDGINGVIELKDVVFHYPSRPSVLVLDSINIDVRSGQTIALVGNSGCGKTTIIQLISRFYDVDRGSVRIDGRDVRELSVRWLRHQIGLVRQEPVLFNTSIFENIRLGSVDVSYDDVITASKQANAHEFIMELPSGYETLVGDRGASLSGGQKQRVAIARALVRNPRILLLDEATSALDTVSETKVQEALNRAAKGRTTIVIAHRLSTIRNVDKIFVMQKGRVVETGNHEELIKKGGEYYHMFTTSEQLPLNEELQVDDEPSRERSNISKETVDLKKTLSFGAAGAYLTERLRMRMFKNLLVQDVAFYDERENSPGALCARLSAEAAYVQGATGQRIGIILQGVGSIGLALFLAMWFEWRVGMVALAFLPLVVIVIWQQTKATDKESQGYAKALENSTKIAVEALSNIRTVACLGREPAMVVEYAHCLRPARRPAVLAAHWRGVLSGLSRSMFNFINAAALTYGGHVVADGVPYQDILITTQSLQMASSQAQSAFAYAPDFQRGINAAARIVNLINMKPTIVDPEEPTRNFLSEGETVVLVGESGCGKSTVIQLLQRYYDPDSGTITLENKPLTHLRVDEVRANFALVSQEPTLFERSIRENVEYGDISRPVTMKEIVDATKLANIHDFIVSLPQGYETNIGSKGIQLSGGQKQRVAIARALIRQPKILLLDEATSALDGENEKVVLSSCRAGRTCILVSHRPRVIASSLIHVLAAGRVLERGTHEQLMGKRGLYYTLNAKGQ-