Monarch geneset OGS2.0

DPOGS207748
Transcript: DPOGS207748-TA (4290 bp)
Protein: DPOGS207748-PA (1429 aa)
Genomic position: DPSCF300042 - 651393-660790
RNAseq coverage: 141x (Rank: top 55%)
Annotation
Heliconius  HMEL011973  0.0  71.43%
Bombyx  BGIBMGA007494-TA  0.0  68.57%
Drosophila  Mdr50-PA  0.0  49.14%
EBI UniRef50  UniRef50_E9LP50  0.0  68.93%  ATP-binding cassette sub-family B member 1 n=4 Tax=Obtectomera RepID=E9LP50_TRINI
NCBI RefSeq  XP_001810982.1  0.0  51.52%  PREDICTED: similar to Multi drug resistance 50 CG8523-PA [Tribolium castaneum]
NCBI nr blastp  gi|319894762  0.0  68.93%  ATP-binding cassette sub-family B member 1 [Trichoplusia ni]
NCBI nr blastx  gi|319894762  0.0  68.93%  ATP-binding cassette sub-family B member 1 [Trichoplusia ni]
Group
Gene Ontology
GO:0006810  4.6e-59  transport
GO:0055085  4.6e-59  transmembrane transport
GO:0005524  4.6e-59  ATP binding
GO:0042626  4.6e-59  ATPase activity, coupled to transmembrane movement of substances
GO:0016021  4.6e-59  integral to membrane
GO:0016887  2.5e-25  ATPase activity
GO:0000166  2.1e-18  nucleotide binding
GO:0017111  2.1e-18  nucleoside-triphosphatase activity
KEGG pathway
tca:659847  0.0
K05658 (ABCB1)  maps-> ABC transporters
InterPro domain
[841-1174]  IPR011527  4.6e-59  ABC transporter, transmembrane domain, type 1
[867-1133]  IPR001140  3.4e-45  ABC transporter, transmembrane domain
[138-263]  IPR003439  2.5e-25  ABC transporter-like
[123-309]  IPR003593  2.1e-18  ATPase, AAA+ type, core
Orthology group: MCL10075 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207748-TA
ATGGGCTTCATTTATTTCTGCTTGTTCGGTGCTTACTCTCTCTGCTACTGGTTTGGATATAAACTAATAGTTGATGAACCAGAGACGTACGACGTGGACACGATGATGGCTGTATTATTCGGTGTTTTGATGGGTTCAACCAACTTCGGCATATCTTCTACTCTCATGGATGCTTTTGGAGTCGCACGTGGAGCAGGAGCACAAATCTTTAACTTAATTGATAACGTTCCTAAAATTAATCCATCACTCAATCGTGGGATTACTCCCAAAAGTATAGATGGAGATATTGAGTTCAAAAACGTATTCTTTCATTACCCTTCTAGACCAGATGTGCCAATATTAAAAGGCATTAATATCAGTGTCAAAAAAGGCCAATCGGTGGCGTTAGTTGGACATTCAGGCAGTGGCAAGTCCACTATTATCCAGCTAATATCAAGAAATTACGATGTAATAGATGGAAATGTACTAATTGATGGGACAAACGTCAAAAAATTGTCTGTTAGATGGTTGAGAGCTCAAATTGGCCTAGTTGGTCAAGAGCCAATACTTTTCGATACAACAGTTCGCGAAAATATAAGATACGGCCGTGAGGATGCTTCAGACCTAGATATTGAAGAAGCTGCCAGAGAAGCCAATGCACACGAATTTATCACGAAGCTTCCTTTAGGATATGATACATTAGTTGGAGAGCGGGGAGCATCATTGTCAGGAGGACAAAAACAGAGAATTGCCATAGCTCGGGCGCTTGTCCGTAACCCACGGATATTATTGCTAGATGAAGCAACTAGCGCATTAGATACATCCTCGGAGGCCAAAGTACAGAAGGCCCTAGACAAAGCCCAAGAAGGTCGTACAACGATCGTTGTAGCGCATAGACTCTCAACCATAAGAAATGTCGACAAAATCTATGTTTTTAAAGAAGGGAATGTGGTAGAAAGTGGAAGTCATGACGAACTTCTTTCTAAAAAAGGTCACTTTTACGACATGCTAATGCTACAAGCTGCACCACATTTAAACGAGACTGATCAGGGCACACAATTGGAACTGTCTGAATCCGTTCTGAATGAGAAAGAAGAAGAGCTTATCGAAATGAGAGACCAAGATTGCGAAGAAACACAAGAGGAACCTAAGATTTCATTCTTCCAAGTACTTAAACTGAACTCACCAGAATGGAATGATGTCATAAAACTAGAAGACGGGATAGGAGAAAAGCTGGCTACGTTCGTATATTATCAAGTGACATTTTTAAGTTCTGTTATAATGGCCCTTGTGAAAGGCTGGAAACTTTGTCTTCTATGTTTAATTTCGTTTCCAATTACCTTAGTTTTAATAGGTATTGCGGGCTTTATGGCATCAAGATTGTCTTACAAAGAAGCCGTTGCTTCTGCAAAAGCTGGATCTGTGGCTGAGGAAGTATTGTCATCGATAAGAACAGTATTTGCTTTTAGTGGGCAAAAAAAGGAAACTGAAAGATATGAAAAATACCTAATCGAAGCTAGGAGTATCAATATTAAAAAAGGTATTTTTAACGGAATTATAATGGGCTTCATTTATTTCTGCTTGTTCGGTGCTTACTCTCTCTGCTACTGGTTTGGATATAAACTAATAGTTGATGAACCAGAGACGTACGACGTGGACACGATGATGGCTGTATTATTCGGTGTTTTGATGGGTTCAACCAACTTCGGCATATCTTCTACTCTCATGGATGCTTTTGGAGTCGCACGTGGAGCAGGAGCACAAATCTTTAACTTAATTGATAACGTTCCTAAAATTAATCCATCACTCAATCGTGGGATTACTCCCAAAAGTATAGATGGAGATATTGAGTTCAAAAACGTATTCTTTCATTACCCTTCTAGACCAGATGTGCCAGTACTAATTGATGGGACAAACGTCAAAAAATTGTCTGTTAGATGGTTGAGAGCTCAAATTGGCCTAGTTGGTCAAGAGCCAATACTTTTCGATACAACAGTTCGCGAAAATATAAGATACGGCCG
TGAGGATGCTTCAGACCTAGATATTGAAGAAGCTGCCAGAGAAGCCAATGCACACGAATTTATCACGAAGCTTCCTTTAGGATATGATACATTAGTTGGAGAGCGGGGAGCATCATTGTCAGGAGGACAAAAACAGAGAATTGCCATAGCTCGGGCGCTTGTCCGTAACCCACGGATATTATTGCTAGATGAAGCAACTAGCGCATTAGATACATCCTCGGAGGCCAAAGTACAGAAGGCCCTAGACAAAGCCCAAGAAGGTCGTACAACGATCGTTGTAGCGCATAGACTCTCAACCATAAGAAATGTCGACAAAATCTATGTTTTTAAAGAAGGGAATGTGGTAGAAAGTGGAAGTCATGACGAACTTCTTGCTAAAAAAGGTCACTTTTACGACATGCTAATGCTTCAAGCTGCACCACATTTAAACGAGACTGATCAGGGCACACAATTGGAACTGTCTGAATCCGTTCTGAATGAGAAAGAAGAAGAGCTTATCGAAATGAGAGACCAAGATTGCGAAGAAACACAAGAGGAACCTAAGATTTCATTCTTCCAAGTACTTAAACTGAACTCACCAGAATGGAAGTCAATCACTGCGGCCAGCGTCTGCGCTATTCTAAACGGTTTCGCAATGCCACTTCTAGCAGTTGTTATGGGAGACTTTATGGGTGTGCTTTCTAATAATGATCCAGGTTGGGTTAGAGCTGAAGTTATTAAATATGTATTAATTTTTTTGGCAATTGGTATTTTCTCTGGACTCACAAACTTCGTTACGGTGTTTATGTATGGTATAGCAGGAGAATATCTTACTGCCCGTTTGCGTAAACTGTTATTTGTGCATATGCTCCAACAGGAGGTTGCTTTCTTTGATGATAAAAATAACTCAACGGGAGCTCTTTGCGCTCGGTTATCAGGCGATGCTGCATCAGTTCAAGGGGCAACAGGTCAAAGAATTGGGACAGTGTTGCAGGCTCTTAGTACATTTAGCGTCGCATTAGGAATCTCATTATATTATGAATGGCGTTTAGGATTAGTCGCTTTATCTTTGGCACCTATCATGGGAGCTGTGCTGTACAAGCAAGGGAGAATGATAACAGCACAAACTTTTGGAACAGCTAAAACAATGGAAGACAGTTCCAAGATAGCCGTAGAAGCGGTGGCGAATGTTCGCACAGTTGCATCACTAGGTCGTGAGCAAATTATTCTCAATAACTACGCAACTCAGCTTCTGCCAGCACTAGTGGCTGCAAAACGAACCGCACATTGGCGAGGTGTGGTCTTTGGGCTCTCTAGAGGGCTTTTTAACTTCGTGTACTCCATAGCCATGTTTTATGGAGGCAATCTAATGGTGTATCAGGGAGTGTCATATGAAATAGTACTCAAGTCAGCTCAAACATTATTAATGGGTTCCACTTCCGCAGCTCAAGCCTTTGCTTTTGCTCCTAATTTCCAAAATGGTATTAAAGCAGCTGCTAGAATTATTGTTACATTGAGAAGGCAGTCAAAAATCGTTGATCCGGCTAAACCTGCTGTCAAAAACTTTAAAGGTGCAGGCGTAGCAAATATTAGAAATGTTCAATTTACATATCCAACGAGGCCTCTAATACAAGTACTAAAAAACTGTAGCCTAGAAATCGAAAAAGGACAAACAATAGCCCTAGTCGGTTCAAGTGGGTGCGGCAAGAGTACCATCATACAGCTGTTAGAAAGATATTACGATCCTGATGTCGGCACAGTGGATCAAAGAGGTATTCCAATAAGAAAACTCAAATTGGCAGATGTGAGACAGTCGATAGGCTTCGTACAACAAGAACCTATTCTCTTTGATCGTACCATAGAAGAAAATATTGCCTACGGTGACAATTCACGGCAGCCTAGCATGGATGAAATAATTGAAGCAGCTAAACAAGCTAATATTCACAGCTTTATCGTATCTTTACCGATGGGTTATGAGACCAACATCGGTTCAAAGGGCACCCAGCTTTCTGGAGGACAAA
AACAAAGGGTTGCTATAGCAAGAGCTTTAATAAGACGACCAAAGATGTTACTGCTGGATGAAGCAACTAGTGCTTTGGACACAGAAAGTGAAAAGGTGGTTCAAGCGGCCCTGGAAGCAGCTAAAGCGGGTCGTACGTGTGTCATGATCGCTCACCGACTGAGTACGGTGCGTGACGCTGACGTCATATGTGTCCTCAACAATGGAAGTGTCGCAGAGCGAGGAACACACGCAGAGCTATTAGAACTCAAAGGACTTTATTATAATCTGTATACGAAAGGAAGTGCATGA

Protein sequence:

>DPOGS207748-PA
MGFIYFCLFGAYSLCYWFGYKLIVDEPETYDVDTMMAVLFGVLMGSTNFGISSTLMDAFGVARGAGAQIFNLIDNVPKINPSLNRGITPKSIDGDIEFKNVFFHYPSRPDVPILKGINISVKKGQSVALVGHSGSGKSTIIQLISRNYDVIDGNVLIDGTNVKKLSVRWLRAQIGLVGQEPILFDTTVRENIRYGREDASDLDIEEAAREANAHEFITKLPLGYDTLVGERGASLSGGQKQRIAIARALVRNPRILLLDEATSALDTSSEAKVQKALDKAQEGRTTIVVAHRLSTIRNVDKIYVFKEGNVVESGSHDELLSKKGHFYDMLMLQAAPHLNETDQGTQLELSESVLNEKEEELIEMRDQDCEETQEEPKISFFQVLKLNSPEWNDVIKLEDGIGEKLATFVYYQVTFLSSVIMALVKGWKLCLLCLISFPITLVLIGIAGFMASRLSYKEAVASAKAGSVAEEVLSSIRTVFAFSGQKKETERYEKYLIEARSINIKKGIFNGIIMGFIYFCLFGAYSLCYWFGYKLIVDEPETYDVDTMMAVLFGVLMGSTNFGISSTLMDAFGVARGAGAQIFNLIDNVPKINPSLNRGITPKSIDGDIEFKNVFFHYPSRPDVPVLIDGTNVKKLSVRWLRAQIGLVGQEPILFDTTVRENIRYGREDASDLDIEEAAREANAHEFITKLPLGYDTLVGERGASLSGGQKQRIAIARALVRNPRILLLDEATSALDTSSEAKVQKALDKAQEGRTTIVVAHRLSTIRNVDKIYVFKEGNVVESGSHDELLAKKGHFYDMLMLQAAPHLNETDQGTQLELSESVLNEKEEELIEMRDQDCEETQEEPKISFFQVLKLNSPEWKSITAASVCAILNGFAMPLLAVVMGDFMGVLSNNDPGWVRAEVIKYVLIFLAIGIFSGLTNFVTVFMYGIAGEYLTARLRKLLFVHMLQQEVAFFDDKNNSTGALCARLSGDAASVQGATGQRIGTVLQALSTFSVALGISLYYEWRLGLVALSLAPIMGAVLYKQGRMITAQTFGTAKTMEDSSKIAVEAVANVRTVASLGREQIILNNYATQLLPALVAAKRTAHWRGVVFGLSRGLFNFVYSIAMFYGGNLMVYQGVSYEIVLKSAQTLLMGSTSAAQAFAFAPNFQNGIKAAARIIVTLRRQSKIVDPAKPAVKNFKGAGVANIRNVQFTYPTRPLIQVLKNCSLEIEKGQTIALVGSSGCGKSTIIQLLERYYDPDVGTVDQRGIPIRKLKLADVRQSIGFVQQEPILFDRTIEENIAYGDNSRQPSMDEIIEAAKQANIHSFIVSLPMGYETNIGSKGTQLSGGQKQRVAIARALIRRPKMLLLDEATSALDTESEKVVQAALEAAKAGRTCVMIAHRLSTVRDADVICVLNNGSVAERGTHAELLELKGLYYNLYTKGSA-