Monarch geneset OGS2.0

DPOGS202687
TranscriptDPOGS202687-TA5199 bp
ProteinDPOGS202687-PA1732 aa
Genomic positionDPSCF300438 + 43347-66738
RNAseq coverage422x (Rank: top 29%)
Annotation
HeliconiusHMEL0141850.063.96% 
BombyxBGIBMGA011220-TA0.061.81% 
DrosophilaCG5789-PA0.039.15% 
EBI UniRef50UniRef50_B0WL150.039.58%Multidrug resistance protein 2 n=2 Tax=Eukaryota RepID=B0WL15_CULQU
NCBI RefSeqXP_002137937.10.039.98%GA27495 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1984547390.039.98%GA27495 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1571322649e-17342.40%ATP-binding cassette transporter [Aedes aegypti]
Group
Gene OntologyGO:00068102.6e-47transport
GO:00550852.6e-47transmembrane transport
GO:00055242.6e-47ATP binding
GO:00426262.6e-47ATPase activity, coupled to transmembrane movement of substances
GO:00160212.6e-47integral to membrane
GO:00168875.3e-17ATPase activity
GO:00001668.8e-10nucleotide binding
GO:00171118.8e-10nucleoside-triphosphatase activity
KEGG pathwaytgu:1002323632e-175 
 K05673 (ABCC4)maps-> ABC transporters
InterPro domain[1019-1402] IPR0115272.6e-47ABC transporter, transmembrane domain, type 1
[1128-1362] IPR0011405.7e-27ABC transporter, transmembrane domain
[1479-1602] IPR0034395.3e-17ABC transporter-like
[1464-1648] IPR0035938.8e-10ATPase, AAA+ type, core
Orthology groupMCL10003 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202687-TA
ATGGATCCCGGATACTTCGATGTGGAGAAAAAACAGAATCCTCGTGAGACGGCAAACATATTTTCAAAAATAACGTTCTGGTATATTCTTCCAATCTTCGCTAAGGGAAGGAAGGGACAGCTCAGTATATCTGATGTTTACAGATGTTCTCCAGAACACAGGTCTACACCTGGGGGAGACGCTCTCGGTGCGAAATGGCAGGAACAACTCAATAAACAAAGTATAGGAAAGAAGCCATCCTTGATGAAAGCTATATTTGGTGTTTTCGGCTTGGAGTTCACTATAGTGAACTTTCTGTATTCTGTCGTGGACACATCAGTCAAAATAATTACACCCACCTGCCTCGAAGGCTTGATAAATTATTTTTCTTCAACATCCGGTGTCACCAAATTCGATGCCTACATGTACGCTGTTGGTGTTGTCGGTCTAACTTTCATTAGCGCCGTCATAATGCAGCCCACTATATTATATTTGTACGATATTTCCATGAGGGTCCGAGTAGCGACGTCTTCACTTATATATAGGAAGATTCTCCGTCTCGATATAAACGCTGGCGGCAAAGCCTCCGAAGGCCTCGCCGGTCATGCCATCAATCTTTTGACCACTGATGCCCAAAGGTTCGACATGGCCTCTTTGTTTATTATGGACCTCATAAGGACCCCGATAGAGGGCGTGGTTGTGACTTATCTGATGTACAACCAAGTTGGTGCTTCTACGTTTATTGGAGTTTCATTGCTGTTCCTGATCATACCACTGCAGGCTTACTTGGGTAAGATATCGAGTAGACTACGACACCAGACAGCGATACGAACCGACAATCGCATACGATTGATGAACGAGGTTATACAAGGGATAGAGGCTATCAAGATGTACGCCTGGGAGAAAGCGTTCGCAAGGATCATAGGCGAAGCCAGGAGGAAAGAAATGAACATTATCAAAAAGATATCCTGGCTGCGTGCGATCATGATATCTTGCGTGAAGCTCAATTCCAAGATCGCGATATTTCTGACCCTGGTGTCGTTCATATCGTTCAAGAACGAGCTGACAGCCTCCAAGGTGTTCGTGGTGTTTTCCTATTACAATATCTTGAAGTATTCCCTGGTGGACTTCCTGTCTCTCGCCATCTCCTTCACCCTGGAGGCGCAGGTGAGCGTGCACAGGATGCAGGAGTTCCTGATGCTACCGGAGGTCGATAACTGTGACGGAACCGACCTCGTCACCATCGAGGGGACCAAAACTGTAGCTATGAGCGGAATCTTCGAGAAAATTGGCAACGGACAACAGGAGTACATTAAGTCTGAGTCTAACCTGGAACAGCTAAAACCAGCTGTACTCGTTAGTTTCAAAGATTACACCGCGCACTGGAAGAGCTCAGAAGACGAAGACATGAAGAATAGAGTGCTAGCGCTCGACGACATTAACTTAAACATCAAACCGGAAACGTTGACCACTATAGTAGGTACCGTTGGTTCGGGGAAGTCGACGCTCCTGCTCGCCATGTTGCGGGAACTGTCTCCTACCTCCGGTCACCTCGCCGTCCGAGGGAGCATCGCCTACGCTGCTCAGGAGCCTTGGCTGTTTGAAGCTTCTGTCCGCCAGAACATCCTGTTCGGTCAAGACCTGGACTTGCGCAGATACAAGCAGGTCATCAAGTGTTGCCAGCTGAAAAGTGACCTGGAAATACTGCCTTATGGAGATAAGACGGTTGTAGGCGAGCGGGGTGTGAAAGAAATGAACATTATCAAAAAGATATCCTGGCTGCGTGCGATCATGATATCTTGCGTGAAGCTCAATTCCAAGATCGCGATATTTCTGACCCTGGTGTCGTTCATATCGTTCAAGAACGAGCTGACAGCCTCCAAGGTGTTCGTGGTGTTTTCCTATTACAATATCTTGAAGTATTCCCTGGTGGACTTCCTGTCTCTCGCCATCTCCTTCACCCTGGAGGCGCAGGTGAGCGTGCACAGGATGCAGGAGTTCCTGATGCTGCCGGAGGTCGATAACTGCGACGGAACCGACCTCGTCACCATCGAGGGGACCAAGACTGTAGCTATGAGCGGAATCTTCGAGAAAATTGGCAACGGACAACAGGAGTACATTAAGTCTGAGTCTAACCTGGAACAGCTAAAACCAGCTGTACTCGTTAGTTTCAAAGATTACACCGCGCACTGGAAGAGCTCAGAAGACGAAGACATGAAGAATAGAGTGCTAGCGCTCGACGACATTAACTTAAACATCAAACCGGAAACGTTGACCACTATAGTAGGTACCGTTGGTTCGGGGAAGTCGACCCTCCTGCTCGCCATGCTGCGGGAACTGTCTCCTACCTCCGGTCACCTCGCCGTCCGAGGGAGCATCGCCTACGCTGCTCAGGAGCCTTGGCTGTTTGAGGCTTCGGTCCGCCAGAACATCCTGTTCGGTCAAGACCTGGACTTGCGCAGATACAAGCAGGTCATCAAGTGTTGCCAGCTGAAAAGTGACCTGGAAATACTGCCTTATGGAGATAAGACGGTTGTAGGCGAGCGGGGTGCGTCGTTATCCGGGGGACAGAGAGCTCGTATCGCGCTAGCTCGCTGCGTCTACCAACACGCAGACGTCTACTTGTTGGACGACCCACTAGCAGCTGTGGACGCTAAGGTGGCTCAGGCTATATACGAGGAGTGTATCCGATCTTTCCTCCGCGAGAAGGCCACGGTTCTGGTCACCCACCACGTGCAGTACGCCAGGCACGCCGCGCAAGTGTGCGTCATGAAACTGGGCAAGGCGAGTGCGTCGTTATCCGGGGGACAGAGAGCTCGTATCGCGCTAGCTCGCTGCGTCTACCAACACGCAGACGTCTACTTGTTGGACGACCCCCTAGCAGCTGTGGACGCTAAGGTGGCTCAGGCTATATACGAGGAGTGTATCCGATCTTTCCTCCGCGAGAAGGCCACGGTTCTGGTCACCCACCACGTGCAGTACGCCAGGCACGCCGCGCAAATCGTAGCTCAAGGGACCTACAACGACTTGAAGAATGACGTCAAGGAGTTTGAGAAACTGATCGAGATGGAGGCCAAGGACGAGGAGAAGAAGATGAAGCAGAAGGTGGCCTCGTACGAGAACCAGGACAGTCTGGAGCACGGCCACAAGCTCCGCTCGCAGCGGTCGCTGTCCGAGGTGTCCCAGCTGTCCTTCAACACGGACGGCGAAACTAATATCGGTCCGGAGTTTGAAGGTGAAAAACAGAGCGAGGGTTCCGTAGACTTCGGAGTGTACTTCTCCTTCATAAAGAGCGGTGGGACTCAGTTCACGATGTTGGTTCTGGGGTCGCTCTTCCTCCTAGCTCAGTTCTTCTACTCCGCGACGGACGTGTGGCTCAAGGATTGCCAGCTCAGGTACGAGGATCTTCCTACCAACCATTTCCACCTGTCCCGGGAACACTGCATCTATATCTACGCGGCTCTCATCGTGGTGTCGATGTTCTTGACCTGGAACAAACTGCTGGTGTTCTACAACACCTGCATCAAGGCCTCCGTAGTGCTGCACGATACTATGTTCAGGGGTGTGACGAACGCTCCGATGTGGTTCTTCCACCACAACCCGTCAGGTCGAATCTTGAACCGCTTCTCCAAAGACATGGGCCAGATAGACACGTTACTTCCCGTGGCGTTGGTCGACTGTCTCGGGTTCTTCCTGGAGGTGATCGCCATTCTGGCTGTAGTGTGTATCGTCAACTGGTGGCTACTCCTGCCGACCGCCGTCCTGGCGTTCCTGCTTCACCTCATGAGATCTCTCTACTTGTGCACGAGCCGCGAGCTGAAGAGAGCGGAGGCTATCGCTCGTAGTCCGTCGTTGTCTCACTGCGGCGGCACCGTCCACGGCCTGACCACCATCAGGTCGTGCCACAAGCAGGTGACGCTCGCCAAGGAGTTCGACAAACTGCAAGACCTGCACTCCTCGGCCTGGACCTTAGTCATCACCACTAACAGCGCCTTCGGCTACTGGATGGACATGGCCTGCTGCTTATACTTAGCTGCCGTGACCTTTAGCTTCTTCTTCTTCAGCGAAGAAGACTCGAGCGGCGGTAACGTTGGGCTCGCGATCACCCAGGTGATGGGTCTAGTGGGGATGTGCCAGTACGGCATGAGACAGACGGCTGAAGTGGAGAACCAAATGACATCAGTTGAAAGAATTTTAGAATACATCAAACTGCCAGCGGAACACCCTGTAGAACCTGACAGCAGGGCGCTGAAGAATGAACATCCTGGGTTTGACTTCAGCAAGTGGCCCAGTGAAGGTGAGATAGTATTTGAAAATGTATCTCTGGAATACGAAAAGCCTCCGAAGAGTGACGGAAAAGCCGACACCGAGCCGGCTTTCGCGATAAGGGGGGTGAACTTTAAGATAAGGGCCGGGGAAAAGGTCGCTGTGGTGGGGAGAACCGGCGCCGGCAAGAGCTCGCTGATAGCTGCCTTATTCAAACTGAGCAAGATCACCGGTCGTGTGACGGTGGACAACATCACGTCGGACGAGGCTGGTCTCCGGGCGTGGCGGGCCCGTCTCTGTGCTCTGCCGCAGCGACCCGCGCTGTTCGCGGCCTCCCTCAGAGACAACTTGGACCCTGAGAGAAAATACACAGACGCAGAAATATATACAGCGCTTAATGAGGTGGAGCTCCAGGAAACCGTGGCCAGCATGCCCGGGGGCCTCGCGTCCAAGGTCGGGGACGGCGGCGGCAACCTGTCCAGCGGGCAAAGACAGCTGGTGTGTTTGGCTCGAGCGGCGCTGGCGAGGCGCGCCGTGCTCATACTAGACGAGGCCACCGCTAACGTGGACACGGAGACGGACAAACACATCCAGAGAACGATACGGACAAAGTTCGCAGACTCCACCGTGCTCACGATCGCCCACAGACTCAACACGGTCATGGACTACGACCGAGTCATCGTCATGGACAATCAGTTAACCTCACATCTATTCCAGGCTCCCCTGGCGCTCCCTGAGGACTTCCTGGCGGCTCCCGAGAGACTGGACGAGGATGAGGAGGCGTTGGTGACGTCACGCGACCGACACAGAACATACTCCAACAGGTCGGAGGCTCCCGAATACTTGGGGCTGTTCCGGTCCCTGGTGGAGCAGACGGGGCGAGCCACGGCCGACCAGCTCATGACACTCGCCAAGGAGAGTTACGAGCGTATGTTGGAGAAGAAGACGCGCTGA

Protein sequence:

>DPOGS202687-PA
MDPGYFDVEKKQNPRETANIFSKITFWYILPIFAKGRKGQLSISDVYRCSPEHRSTPGGDALGAKWQEQLNKQSIGKKPSLMKAIFGVFGLEFTIVNFLYSVVDTSVKIITPTCLEGLINYFSSTSGVTKFDAYMYAVGVVGLTFISAVIMQPTILYLYDISMRVRVATSSLIYRKILRLDINAGGKASEGLAGHAINLLTTDAQRFDMASLFIMDLIRTPIEGVVVTYLMYNQVGASTFIGVSLLFLIIPLQAYLGKISSRLRHQTAIRTDNRIRLMNEVIQGIEAIKMYAWEKAFARIIGEARRKEMNIIKKISWLRAIMISCVKLNSKIAIFLTLVSFISFKNELTASKVFVVFSYYNILKYSLVDFLSLAISFTLEAQVSVHRMQEFLMLPEVDNCDGTDLVTIEGTKTVAMSGIFEKIGNGQQEYIKSESNLEQLKPAVLVSFKDYTAHWKSSEDEDMKNRVLALDDINLNIKPETLTTIVGTVGSGKSTLLLAMLRELSPTSGHLAVRGSIAYAAQEPWLFEASVRQNILFGQDLDLRRYKQVIKCCQLKSDLEILPYGDKTVVGERGVKEMNIIKKISWLRAIMISCVKLNSKIAIFLTLVSFISFKNELTASKVFVVFSYYNILKYSLVDFLSLAISFTLEAQVSVHRMQEFLMLPEVDNCDGTDLVTIEGTKTVAMSGIFEKIGNGQQEYIKSESNLEQLKPAVLVSFKDYTAHWKSSEDEDMKNRVLALDDINLNIKPETLTTIVGTVGSGKSTLLLAMLRELSPTSGHLAVRGSIAYAAQEPWLFEASVRQNILFGQDLDLRRYKQVIKCCQLKSDLEILPYGDKTVVGERGASLSGGQRARIALARCVYQHADVYLLDDPLAAVDAKVAQAIYEECIRSFLREKATVLVTHHVQYARHAAQVCVMKLGKASASLSGGQRARIALARCVYQHADVYLLDDPLAAVDAKVAQAIYEECIRSFLREKATVLVTHHVQYARHAAQIVAQGTYNDLKNDVKEFEKLIEMEAKDEEKKMKQKVASYENQDSLEHGHKLRSQRSLSEVSQLSFNTDGETNIGPEFEGEKQSEGSVDFGVYFSFIKSGGTQFTMLVLGSLFLLAQFFYSATDVWLKDCQLRYEDLPTNHFHLSREHCIYIYAALIVVSMFLTWNKLLVFYNTCIKASVVLHDTMFRGVTNAPMWFFHHNPSGRILNRFSKDMGQIDTLLPVALVDCLGFFLEVIAILAVVCIVNWWLLLPTAVLAFLLHLMRSLYLCTSRELKRAEAIARSPSLSHCGGTVHGLTTIRSCHKQVTLAKEFDKLQDLHSSAWTLVITTNSAFGYWMDMACCLYLAAVTFSFFFFSEEDSSGGNVGLAITQVMGLVGMCQYGMRQTAEVENQMTSVERILEYIKLPAEHPVEPDSRALKNEHPGFDFSKWPSEGEIVFENVSLEYEKPPKSDGKADTEPAFAIRGVNFKIRAGEKVAVVGRTGAGKSSLIAALFKLSKITGRVTVDNITSDEAGLRAWRARLCALPQRPALFAASLRDNLDPERKYTDAEIYTALNEVELQETVASMPGGLASKVGDGGGNLSSGQRQLVCLARAALARRAVLILDEATANVDTETDKHIQRTIRTKFADSTVLTIAHRLNTVMDYDRVIVMDNQLTSHLFQAPLALPEDFLAAPERLDEDEEALVTSRDRHRTYSNRSEAPEYLGLFRSLVEQTGRATADQLMTLAKESYERMLEKKTR-