Monarch geneset OGS2.0

DPOGS213261
Transcript         DPOGS213261-TA  5436 bp
Protein            DPOGS213261-PA  1811 aa
Genomic position   DPSCF300124 + 470950-485998
RNAseq coverage    302x (Rank: top 37%)
Annotation
Heliconius      HMEL021130        0.0  72.79%
Bombyx          BGIBMGA009452-TA  0.0  80.35%
Drosophila      Mdr49-PA          0.0  49.42%
EBI UniRef50    UniRef50_E9LP51   0.0  77.81%  ATP-binding cassette sub-family B member 2 n=10 Tax=cellular organisms RepID=E9LP51_TRINI
NCBI RefSeq     XP_623564.2       0.0  56.51%  PREDICTED: similar to Multidrug resistance protein homolog 49 (P-glycoprotein 49) [Apis mellifera]
NCBI nr blastp  gi|319894764      0.0  77.81%  ATP-binding cassette sub-family B member 2 [Trichoplusia ni]
NCBI nr blastx  gi|319894764      0.0  77.73%  ATP-binding cassette sub-family B member 2 [Trichoplusia ni]
Group
Gene Ontology    GO:0006810  4.8e-60  transport
                 GO:0055085  4.8e-60  transmembrane transport
                 GO:0005524  4.8e-60  ATP binding
                 GO:0042626  4.8e-60  ATPase activity, coupled to transmembrane movement of substances
                 GO:0016021  4.8e-60  integral to membrane
                 GO:0016887  2.9e-24  ATPase activity
                 GO:0000166  5.9e-19  nucleotide binding
                 GO:0017111  5.9e-19  nucleoside-triphosphatase activity
KEGG pathway     ame:551167  0.0
                 K05658 (ABCB1) maps-> ABC transporters
InterPro domain  [835-1171]  IPR011527  4.8e-60  ABC transporter, transmembrane domain, type 1
                 [859-1137]  IPR001140  2.5e-52  ABC transporter, transmembrane domain
                 [580-705]   IPR003439  2.9e-24  ABC transporter-like
                 [565-751]   IPR003593  5.9e-19  ATPase, AAA+ type, core
Orthology group  MCL10075  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS213261-TA
ATGAAACGAAATGAGTCGGTGAAGAGCGTCCAAAGATTTTCGCGGCAAAGTCGCAGCGAGCAGCGACCGGCGCTAGCGCTTAGCTTTCGCCAGAATTCGCTTACAATCAGCTCGCTAGCTCAGCAAGCTGCTTTTATAGCTGAAAAAGCTTATAAGGAAGAAGAGTCGGATCCAAATCGCCAGACTGAAGCCGTTTCATATTTCAAGCTATTCAGATTCGCTCAGCGATGGGAGTTTGTGATGTTGTTCGCTGGTATTATATTCGCGTGTCTCAACGGACTATTCGTTCCCGTGGGTGTGATCATCTACGGAGAGTTCACATCCCTCCTCATAGATCGTACCGTCATGAACGGAACCTCTACTCCTACGCTTACCATCAATTGGTTCGGCGGTGGTAGGATTCTCACAAACGCCAGTCCAGAGGAAAATCGTCGTGCTTTGATAGAAGACTCACAAGCTTTTGGTATAGGATGCACCGTGTTCTCTGTGCTTCAGTTTCTGTGCGGTGTCATCAGCGTAGATCTATTTAATTATGCCGCCTTAAGACAAATTGAGAGGGTTAAGGAAAGATTTTTACAATCAGTCCTCCGTCAAGACATAACTTGGTATGATCTTAACACGTCGATGAATTTTGCTACTAAAGTTTTACTAGTTTTATTAGTACCCGTTAAGCTCTGGCTTTGGCTAGGGTTGGGATTGGGGCTGGGCACAAACGCCAGTCCAGAGGAAAATCGTCGTGCTTTGATAGAAGACTCACAAGCTTTTGGTATAGGATGCACCGTGTTCTCTGTGCTTCAGTTTCTGTGCGGTGTCATCAGCGTAGATCTATTTAATTATGCCGCCTTAAGACAAATTGAGAGGGTTAAGGAAAGATTTTTACAATCAGTCCTCCGTCAAGACATAACTTGGTATGATCTTAACACGTCGATGAATTTTGCTACTAAAGTATCTGATGACGTTGAAAAGTATCGCGAAGGTATCGGTGAGAAAGTACCTATGTTAATATACCTCGTGATGTCCTTTGTGACGGCTGTACTTATATCCCTGGCGTACGGCTGGGAACTGACCCTTGTTATATTGTCATGTGCTCCTGTTATCATTGCTACCACGGCTGTTGTTGCTAAGGTTCAATCATCGCTCACTACACAAGAACTGAAAGCGTATAGCATAGCTGGAGTTATCGCGGAAGAAGTCTTGGCTTCTATCCGGACAGTGGTTGCGTTTGGAGGAGAGGAAAAAGAGATTGAGAGATACCAGGAACGTTTAGCTCCGGCTAAGAAAACAGGAGTGAAGAAAGGAATATATTCTGGTATCGGCAGCGGCGTGATGTGGTTTATTATTTACGCTACATATGCACTCTCCTTTTGGTACGGGGTCGGTTTGATCCTAGACAGCCGGCATCTGCCCACTCCAGTTTATACCCCAGCTGTTCTTATGATTGTATTCTTTAGTATCCTCCAAGGTGCACAAAACGTTGGTCTGACTGCACCCCATCTTGAAGCCATAGCAAATGCAAGAGCATCAGCGGGGGCGATATTCTCTGTTCTTGATAGAAAACCAGCCATAGACAGCCTTTCAACTGAAGGAACAACTCCTGTCCTAGACGGAGATTTAGAACTAAAAGATGTTTACTTCAGATATCCAGCTAGGAAAGACGTTCAGGTACTAGACGGTCTTAGTCTCAAAATAAATCGCAATGAGACTGTTGCGTTAGTGGGTGCGAGTGGTAGCGGTAAGTCGACGGTATTACAACTATTGCAAAGGATGTACGACCCGGATGTTGGCTCAGTTACCGCTTCCGGTCACGATCTACGTGATATTAACGTCAGGCACTTTAGGAACCACATCGCTGTCGTGGGACAAGAACCGGTCCTGTTTGCGGGAAGTATTAAAGAAAATATAAGGATGAGTAATCCGACTTGTACCGACGAAGAGATTATAATGGCATCTAAACAAGCTTACTGTCACAGTTTCATTAAACACTTGCCAAATGGCTACGA
CACAATGATCGGTGAGCGCGGTGCTCAATTATCTGGGGGTCAAAAACAACGAATAGCCATCGCTAGAGCTTTAGTTCGAAAGCCCAAAATACTTATTTTAGATGAGGCTACATCAGCGTTGGACTCTCAGAGCGAGGCCAAAGTACAGCGAGCTTTGGACGCAGCGGCTCATGGACGAACCACGATCATGGTCAGCCACAGATTAGCCACAGTACTGAATGCAAACCGAATCGTATTCATTGAGAAAGGCGAAGTTTTAGAAGAAGGAACTCATGAAGAACTTTTGAGTCTAAGAGGTCGCTACTACCAGCTGGTGCTGGAAAATGAACCCAGCATAGCACCGAGTTCAGCGGATACGGACACTCCTGGAAAACCTAATAATCAAACTGTCACAGATACTAAATTTCGAAGATCGAAATTGACCAAAATGGTATCTCTCGATTCTATGAAGAGTGATTCAATAGACGAAGACTCGGCTTCAGAAGACAGCGTTGTAATAGAAGAGAAGGAAGAAAGAGAATTTGAACCGACCACGTGGCAAATTCTCAAATTATGCAAACCAGAAAAATATCTCATGTGTATTGGAATCTTTGCAGCCTTTGCCGTCGGATCATCTTTTCCGTGTTTCGCTATACTGTTTGGTGAGACCTATGGTCTCTTGGAAAGCAAGAATGAAGATTACGTTCGTCAAGGTACTAACTATATAGCAATCTTTTTTCTGATGGTAGGAATTTACACTGGGATTGGCATCTTCTTTCAAATATTTATTTTCAACTTGACCGGAGTCCGCCTCACTGCTCGGTTAAGAGTGGCAGCTTTCCGTGCGATGCTCCGTCAAGAGATTGGCTGGTTTGACGACGCGGTGAACGGCGTCGGCGCTCTGTGTTCCCGACTGGCGGCGGACGCGGCCGCCGTGCAGGGAGCAACAGGCACGAGAATAGGTGCTTTAATGCAAGCATCAGCTACGATCCTCATAGGCATTCTAGTGTCGATGTATTACACGTGGAAAATGACCCTCGTGTCCCTGGTGTCTGTGCCCATGGTGATTATAGCGGTGGTGCTTGAAGGACGGGTGCTGGCAGAAGGTATCGCGGCCATCAGAGAGGCCTCCAACAAAGCTACGACGATCGCCACCGAGGCCATCACTAACATAAGGACGGTGTGTGCTTTCTGCGGCGAGGAGGGGACGTTGTCGCGGTACAAAGACGCTGGGGGAGCAGCTCGGGTCGCAGCTCGCTCCAGCTTGAGGTGGCGGGGAGCTGTGTTCGCTTTCGGACAAACCGCGCCTGTAGCGGGCTACGCGCTCGCTCTGTGGTACGGCGGAGTGTTGGTCGCTAATGGAGAAGTTCCTTATAAAGATGTCATTAAGGTGTCTGAAGCTTTGATATTTGGAGCGTGGATGATGGGTCAAGCGCTGGCTTTCGCACCCAATTTCGGAGCTGCAGTACTGGCGGCGGGGCGAGTCATGACATTACTTGCAAGACAACCGCTCGTCGCTGATACTCACGCGCCCTCCGTTCCTGAAGCTTACGTAGCTGAAGGTAAAATCCAATACAAAAACATAAAATTCCGATATCCGACTCGGCGTGAGGTCCAGGTACTTCGCGGGTTGTCCCTCTCAGTGTCCATGGGCCGTCGGGTAGCACTCGTGGGCCCCAGCGGTTGTGGGAAGTCCACGCTCATACAGCTACTGCAGAGACTATACGATCCCGACGACGGAAATGTGTACTTAGACGACCACAGCATAGTAAGCGACATGCGTCTCTCAACTCTTCGCCGTAACCTCAGTATAGTATCGCAAGAGCCAGTGCTGTTCGACCGGACGATCGCCGAGAACATCGCCTATGGAGACAACACCAGAAACGTCTCAATTGAAGACATAGTCGCTGCCGCGAAAGCCGCTAACGTACATTCGTTTATTGCCGCTTTACCTAACGGTTACGAGACACGGATCGGCGCTCGTGCGTCTCAACTGTCCGGTGGCCAGAAGCAGCGTATTG
CAATAGCGAGAGCACTCGTTCGTGACCCGCGCGTGTTACTTCTAGATGAGGCGACCTCCGCCCTCGACACACACAGCGAGAGGGTTGTCCAAGAAGCTTTGGACCGTGCAAGTGAAGGCAGAACATGCCTCATAATAGCCCATCGACTAGCCACGATACAAAACGCTGACGTCATTTGCGTCATAGACCAAGGAGTCGTCGCTGAAATGGGGACCCATAGAGAACTCATAGCATTGAAGAAGATCTACGCGCGACTGTACGAGTTGCAGTGCGGGTTCATAGAGGAAAGCGGCGAGGAGGGGACGTTGTCGCGGTACAAAGACGCTGGGGGAGCAGCTCGGGTCGCAGCTCGCTCCAGCTTGAGGTGGCGGGGAGCTGTGTTCGCTTTCGGACAAACCGCGCCTGTAGCGGGCTACGCGCTCGCTCTGTGGTACGGCGGAGTGTTGGTCGCTAATGGAGAAGTTCCTTATAAAGATGTCATTAAGGTGTCTGAAGCTTTGATATTTGGAGCGTGGATGATGGGTCAAGCGCTGGCTTTCGCACCCAATTTCGGAGCTGCAGTACTGGCGGCGGGGCGAGTCATGACATTACTTGCAAGACAACCGCTCGTCGCTGATACTCACGCGCCCTCCGTTCCTGAAGCTTACGTAGCTGAAGGTAAAATCCAATACAAAAACATAAAATTCCGATATCCGACTCGGCGTGAGGTCCAGGTACTTCGCGGGTTGTCCCTCTCAGTGTCCATGGGCCGTCGGGTAGCACTCGTGGGCCCCAGCGGTTGTGGGAAGTCCACGCTCATACAGCTACTGCAGAGACTATACGATCCCGACGACGGAAATGTGTACTTAGACGACCACAGCATAGTAAGCGACATGCGTCTCTCAACTCTTCGCCGTAACCTCAGTATAGTATCGCAAGAGCCAGTGCTGTTCGACCGGACGATCGCCGAGAACATCGCCTATGGAGACAACACCAGAAACGTCTCAATTGAAGACATAGTCGCTGCCGCGAAAGCCGCTAACGTACATTCGTTTATTGCCGCTTTACCTAACGGTTACGAGACACGGATCGGCGCTCGTGCGTCACAACTATCCGGTGGCCAGAAGCAGCGTATTGCAATAGCGAGAGCACTCGTTCGTGACCCGCGTGTGTTACTTCTGGATGAAGCGACCTCCGCCCTCGACACACACAGCGAGAGGGTTGTCCAAGAAGCTTTGGACCGTGCAAGTGAAGGCAGAACATGCCTCATAATAGCCCATCGACTAGCCACGATACAAAACGCTGACGTCATTTGCGTCATAGACCAAGGAGTCGTCGCTGAAATGGGGACCCATAGAGAACTCATAGCATTGAAGAAGATCTACGCGCGACTGTACGAGTTGCAGTGCGGGTTCATAGAGGAAAGTGAAGAAAACTTGCCCGAAGAACCCGAGTAA

Protein sequence:

>DPOGS213261-PA
MKRNESVKSVQRFSRQSRSEQRPALALSFRQNSLTISSLAQQAAFIAEKAYKEEESDPNRQTEAVSYFKLFRFAQRWEFVMLFAGIIFACLNGLFVPVGVIIYGEFTSLLIDRTVMNGTSTPTLTINWFGGGRILTNASPEENRRALIEDSQAFGIGCTVFSVLQFLCGVISVDLFNYAALRQIERVKERFLQSVLRQDITWYDLNTSMNFATKVLLVLLVPVKLWLWLGLGLGLGTNASPEENRRALIEDSQAFGIGCTVFSVLQFLCGVISVDLFNYAALRQIERVKERFLQSVLRQDITWYDLNTSMNFATKVSDDVEKYREGIGEKVPMLIYLVMSFVTAVLISLAYGWELTLVILSCAPVIIATTAVVAKVQSSLTTQELKAYSIAGVIAEEVLASIRTVVAFGGEEKEIERYQERLAPAKKTGVKKGIYSGIGSGVMWFIIYATYALSFWYGVGLILDSRHLPTPVYTPAVLMIVFFSILQGAQNVGLTAPHLEAIANARASAGAIFSVLDRKPAIDSLSTEGTTPVLDGDLELKDVYFRYPARKDVQVLDGLSLKINRNETVALVGASGSGKSTVLQLLQRMYDPDVGSVTASGHDLRDINVRHFRNHIAVVGQEPVLFAGSIKENIRMSNPTCTDEEIIMASKQAYCHSFIKHLPNGYDTMIGERGAQLSGGQKQRIAIARALVRKPKILILDEATSALDSQSEAKVQRALDAAAHGRTTIMVSHRLATVLNANRIVFIEKGEVLEEGTHEELLSLRGRYYQLVLENEPSIAPSSADTDTPGKPNNQTVTDTKFRRSKLTKMVSLDSMKSDSIDEDSASEDSVVIEEKEEREFEPTTWQILKLCKPEKYLMCIGIFAAFAVGSSFPCFAILFGETYGLLESKNEDYVRQGTNYIAIFFLMVGIYTGIGIFFQIFIFNLTGVRLTARLRVAAFRAMLRQEIGWFDDAVNGVGALCSRLAADAAAVQGATGTRIGALMQASATILIGILVSMYYTWKMTLVSLVSVPMVIIAVVLEGRVLAEGIAAIREASNKATTIATEAITNIRTVCAFCGEEGTLSRYKDAGGAARVAARSSLRWRGAVFAFGQTAPVAGYALALWYGGVLVANGEVPYKDVIKVSEALIFGAWMMGQALAFAPNFGAAVLAAGRVMTLLARQPLVADTHAPSVPEAYVAEGKIQYKNIKFRYPTRREVQVLRGLSLSVSMGRRVALVGPSGCGKSTLIQLLQRLYDPDDGNVYLDDHSIVSDMRLSTLRRNLSIVSQEPVLFDRTIAENIAYGDNTRNVSIEDIVAAAKAANVHSFIAALPNGYETRIGARASQLSGGQKQRIAIARALVRDPRVLLLDEATSALDTHSERVVQEALDRASEGRTCLIIAHRLATIQNADVICVIDQGVVAEMGTHRELIALKKIYARLYELQCGFIEESGEEGTLSRYKDAGGAARVAARSSLRWRGAVFAFGQTAPVAGYALALWYGGVLVANGEVPYKDVIKVSEALIFGAWMMGQALAFAPNFGAAVLAAGRVMTLLARQPLVADTHAPSVPEAYVAEGKIQYKNIKFRYPTRREVQVLRGLSLSVSMGRRVALVGPSGCGKSTLIQLLQRLYDPDDGNVYLDDHSIVSDMRLSTLRRNLSIVSQEPVLFDRTIAENIAYGDNTRNVSIEDIVAAAKAANVHSFIAALPNGYETRIGARASQLSGGQKQRIAIARALVRDPRVLLLDEATSALDTHSERVVQEALDRASEGRTCLIIAHRLATIQNADVICVIDQGVVAEMGTHRELIALKKIYARLYELQCGFIEESEENLPEEPE-