Monarch geneset OGS2.0

DPOGS208066
TranscriptDPOGS208066-TA4266 bp
ProteinDPOGS208066-PA1421 aa
Genomic positionDPSCF300282 - 99215-127066
RNAseq coverage806x (Rank: top 16%)
Annotation
HeliconiusHMEL0033510.087.89% 
BombyxBGIBMGA011220-TA0.035.49% 
DrosophilaCG7627-PB0.043.73% 
EBI UniRef50UniRef50_G3JWY50.047.20%ABC transporter family C protein ABCC2 n=6 Tax=Ditrysia RepID=G3JWY5_PLUXY
NCBI RefSeqXP_001600523.10.043.01%PREDICTED: similar to ATP-dependent bile acid permease [Nasonia vitripennis]
NCBI nr blastpgi|3362395150.047.20%ABC transporter family C protein ABCC2 [Plutella xylostella]
NCBI nr blastxgi|2964278260.046.70%ABC transporter family C protein ABCC2 [Heliothis subflexa]
Group
Gene OntologyGO:00068101.1e-45transport
GO:00550851.1e-45transmembrane transport
GO:00055241.1e-45ATP binding
GO:00426261.1e-45ATPase activity, coupled to transmembrane movement of substances
GO:00160211.1e-45integral to membrane
GO:00168872.2e-15ATPase activity
GO:00001662.3e-08nucleotide binding
GO:00171112.3e-08nucleoside-triphosphatase activity
KEGG pathwaydme:Dmel_CG317930.0 
 K01509 (E3.6.1.3)maps-> Purine metabolism
InterPro domain[778-1157] IPR0115271.1e-45ABC transporter, transmembrane domain, type 1
[111-377] IPR0011401.3e-27ABC transporter, transmembrane domain
[1207-1326] IPR0034392.2e-15ABC transporter-like
[1192-1372] IPR0035932.3e-08ATPase, AAA+ type, core
Orthology groupMCL10003 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208066-TA
ATGGTCAGTGGAAACAAAAACGCCGGCGATCCCATTAAAATACCGCCCAAAAATCGCGAAAAAAAAACTAATAACTATAAAAGAGCCAACATATTTTCAAAAGCATGGTTTGTATGGATGATGCCAACTTTTTGGCGAGGATTCAAAAGAGACCTGTACGATGCGGACCTCACCAAACCCAAAGACAACCATCTGTCTGATAAACTTGGAGATCGTCTCGAAAAGAAATGGTTGGAAGAAATAGCATCGGCAAACAAGCAAGGAAGGAAACCATCGCTACTCCGAGCAATGACTAAAACATTTTGGTTAAGCTACGCACCTTCCGGAGTCATGTTCCTTATACAAGCTCTCATACTCAAGCCGTTCCAGCCTGTTGCCTTGAGTATACTGTTGACATACTGGGAGCCTGGTTCAAATATGACGTACGATCAAGCGGTGTACTGCGCTTTCACCGTAATACTGATGTCTCTACTGATAGCATTCCTCAACCACCATGGCACTTACTCTACTCAACAGTTTGGAATGAAAGTCCGTATTGCCGCCTGCTCCCTTATCTATAGAAAGGTGATGCGGATGAGCTCCGGCGCGCTAGCCCAAACAACTGCCGGTCAAGTAGTAAATCTGTTGTCCAATGATGTCAATCGATTTGACTACGCTTTCATTTACACTCACTTCATCTGGCTTTTGCCACTCCAAGTGATAATTGTCTGTTACCTCATATACATCAAAATTGGGTATGCAGCGATTGTTGGAGTCATTGGCATCGTCTTGCAAACAATTCCTGTACAATCTTATATGAGCAAATTGGCAGCGAGACTGCGAATGAAAACAGCATCTAGAACTGACGAACGTGTCCGAATTATGGATGAAATAATAAGCGGAATGCAGGTCATCAAAATGTACGCTTGGGAGAAACCGTTCGAGCAGGTTGTGGCATTGGCTCGTAAAAATGAAATCAACTGCATTACATCAGCGTCTTATCTTCGAGGAGTTTACCTTAGCTTTATGGTGTTCACTGAACGGCTAACACTTTATATCACCCTTTTATCCTATTCACTATTTGGATTTCAAGTTACCGCTGACATTGTTTTTCCATTGGCGCAATTCTTCAATACCCTTCAAGGCACACTGTCAATTATCATGTCAAATGCCGTATCATTTTTAGCTGAAGCATTGATTTCAGTTCAACGACTTGAGGCGTTCATGTTATTGGGTACAAAGTCTGTACCTGTAGTTAAAAGCAAGACGTTCACACTAAGACGCACCTATATAGAAATTATTATGGATCAAACAGAAAGGAGTTCCAGTGATGAAAGGGAAGATCTTCGAAGTGTAGCGGGTGTAGACATTGCAAAACTTGTTGCCAGTGTTGAAAAGAAAAATAAAAATCTTCGAGAAATTGACCTTTGTGCAAATAAATTCAAAAAAATTGATGAAGAAGGAATTTACAATCCAGGTTTTGATTGCGACGAAAAAGGTTTGATGACTCCAGCGATAAGTACGACTCCTATCAGCAATGATGTTGGCATACTTATACAGAATGTGAGCGCTAGCTGGACCGAAGATGGTCCTATTACCTTGCGCAATATTAATATTACAGTTCCTAAAGGGAAACTATGTGCTATAATTGGATCAGTAGGATCAGGAAAGAGTTCTATTTTACAACTTTTGCTAAATGAATTACGCTCCAGTGCTGGACGGATACACCTCTGCGGGCCCCTTTCATATGCTAGCCAAGAACCTTGGTTATTTGTGGCAACAGTCCGTCAAAATATTTTATTCGGTTTGCCATATAATCCGAAAAAATATAAGGAAGTAGTCAGAGTATGTGCACTGCAGAAGGACTTTCTCCAATTCCCCCACGGAGATCAAACTCTTGTCGGAGAAAGAGGTGCCTCTTTATCTGGAGGGCAAAGAGCCCGTATAAATCTAGCAAGAGCTGTATACCGCCAGGCGGATGTTTATCTACTGGATGATCCATTGTCTGCAGTAGATGCACACGTTGGTCGACAACTCTTCGATGAGTGCATAACGGGTTATTTAAGGCACACCACTAGAGTTCTTGTTACTCATCAACTTCACTACCTTAAAGTCGCTGATTACATTGTTATCATGAATAATGGCATCATTGAAGCAAAAGGAACCTATGAGGAATTACTTACTTCGGGTAAAGACTTTGCTAAGTTGTTGTGCTCCGTTCAGGAAGATGATAATACGGAGGCTGAAAAGCCACTACCGATATCACGAAGGACTTCAGCAAGGCTATCCACGACAAGGAGGCCCTCGCTGGCAGAATCTACCACAGGCTGTGATATTCCTGCTCAAGAGATGGAAGAAGAAGAAAGAGAATCTGGCTCAATGGGATGGCATGTATACGGAGCTTATTTGAAAGCTGGAGGTCGTACCGCCAGAATAGTTTTCATGGTTTTACTTTTAGTAATAGGACAACTTTCTGCTACTCTCTGTGATTATTGGGTTACTTTCTGGACGAATGAAGTTACCCTTCTTAAAGAAAGACAATCAAATGTGACTAAGGATTATGAACAAGATATAAAACCAGTAAATTCTACATTCAGCCTAAATAGTTACTTCTCTGGAATTAAAGCTACACCAAATTTAAATATTCACGCCTACATTGGACCGTTAGACACATCGCAATATCTTTATGTATATACAGCACTTATTGTTTGTTGTATCTTCTTTATCACGGCCCGAGCATTCATGTTCTTCAAGGTTTGCATGACAGCCTCTAGGAACCTCCACAACGACATGTTCCATGCTATGCTACGTGGGGTTATGAGATTCTTCGATATGAATTCGTCAGGTCGTATTCTCAACCGCTTCTCAAAAGATATTGGTGCCTTAGACGAACTGCTGCCGCGTTTTCTTCTCGAATGCATACAAATCTACCTCGTGATGTTCAGTATTCTCACGTTGAATGCTGCTGCACTCGTCTGGACTTTATTGCCTACGACTATCATCTTACTACTCTTCTATGTTATTCTTCAGATATACTTAAAATCAGCTCAATCTATTAAAAGATTAGAGGGGACAACGAGAAGTCCAGTCTTTTCACACATGTCGGCCACCCTCAATGGCATCAGCACTATTAGATCCTCAGGAGCACAACAACGTTTAATACAAGAATTCGATCATTTCCAAGACATTCACACGTCTACTTGGAGCAGCTACCTTGCGAGCGGCGTAACTCTAGGATTCTGGCTTGACTTTATTTGTGTGATCTACCTTGCAATTGTCATTGTGGCTTTCCTTGTTATAGATAGCAAAACCATTTTCTCCGGTAACGTGGGACTTGCGATATCACAAACTCTTATCTTGACCGGTATGCTGCAATTCGGTGTACGACAAACAGCTGAAGTTATCTCACAAATGACCAGCGTTGAGCGCATCCTGCAATACACTAATATTGAAAAGGAATCGCAATGGGAGAAAGGAGCTAAAGAAACCCCTAAAGGATGGCCTTGGAGGGGTCGCATAGAGTTCCGTAATTGCTATATGAAGTACACACCTGAAGATTTACCTGTATTGAAGAATCTAAATGTAGTTATAGAGAGTGGTTGGAAGGTTGGAATAGTCGGTAGAACGGGTGCTGGAAAATCATCACTAATCTCTTCTCTGTTTCGTCTTTCCATATTAGAGGGTGAAGTCCTTATTGACGATGTAGATACCGCCTACCTTGCACTTCAGGAAGTAAGGTCTAAAATATCGATTATACCGCAAGAACCGGTACTGTTCTCCGCTACAATACGATACAATTTAGATCCATTCAACTGTTATGATGACGAACAACTTTGGAAAGCGCTTGAAGCGGTCGATTTGAAGGCTGCGATACCTGCACTAGATTTCAAAGTTTCCGAAGGTGGTTCAAACTTTTCTCTTGGACAGAGACAACTCGTGTGCCTTGCTCGTGCTATCCTTCGCGGTAACCGAATTCTTGTGTTGGATGAGGCCACAGCAAACGTGGATCCAAAAACGGACGAGTTTATCCAAAGAACAATAAGAAAGCGTTTTTCAGACTGCACAGTTTTGACAGTTGCACATCGACTCAACACTATTATGGACTCCGATCGCGTGATGGTAATGGATACCGGACGATTAGTTGAGTTTGATCATCCTTACAAATTGCTCTGCAACCCTGAAGGTCATTTCACAAAAATGGTTAAAGAAACTAGTGACAAATTGGCAGCTCAGCTCTACGAAATAGCAAAAAGAACATATCTCGAAAACGGAAACGTAATAGAGTGA

Protein sequence:

>DPOGS208066-PA
MVSGNKNAGDPIKIPPKNREKKTNNYKRANIFSKAWFVWMMPTFWRGFKRDLYDADLTKPKDNHLSDKLGDRLEKKWLEEIASANKQGRKPSLLRAMTKTFWLSYAPSGVMFLIQALILKPFQPVALSILLTYWEPGSNMTYDQAVYCAFTVILMSLLIAFLNHHGTYSTQQFGMKVRIAACSLIYRKVMRMSSGALAQTTAGQVVNLLSNDVNRFDYAFIYTHFIWLLPLQVIIVCYLIYIKIGYAAIVGVIGIVLQTIPVQSYMSKLAARLRMKTASRTDERVRIMDEIISGMQVIKMYAWEKPFEQVVALARKNEINCITSASYLRGVYLSFMVFTERLTLYITLLSYSLFGFQVTADIVFPLAQFFNTLQGTLSIIMSNAVSFLAEALISVQRLEAFMLLGTKSVPVVKSKTFTLRRTYIEIIMDQTERSSSDEREDLRSVAGVDIAKLVASVEKKNKNLREIDLCANKFKKIDEEGIYNPGFDCDEKGLMTPAISTTPISNDVGILIQNVSASWTEDGPITLRNINITVPKGKLCAIIGSVGSGKSSILQLLLNELRSSAGRIHLCGPLSYASQEPWLFVATVRQNILFGLPYNPKKYKEVVRVCALQKDFLQFPHGDQTLVGERGASLSGGQRARINLARAVYRQADVYLLDDPLSAVDAHVGRQLFDECITGYLRHTTRVLVTHQLHYLKVADYIVIMNNGIIEAKGTYEELLTSGKDFAKLLCSVQEDDNTEAEKPLPISRRTSARLSTTRRPSLAESTTGCDIPAQEMEEEERESGSMGWHVYGAYLKAGGRTARIVFMVLLLVIGQLSATLCDYWVTFWTNEVTLLKERQSNVTKDYEQDIKPVNSTFSLNSYFSGIKATPNLNIHAYIGPLDTSQYLYVYTALIVCCIFFITARAFMFFKVCMTASRNLHNDMFHAMLRGVMRFFDMNSSGRILNRFSKDIGALDELLPRFLLECIQIYLVMFSILTLNAAALVWTLLPTTIILLLFYVILQIYLKSAQSIKRLEGTTRSPVFSHMSATLNGISTIRSSGAQQRLIQEFDHFQDIHTSTWSSYLASGVTLGFWLDFICVIYLAIVIVAFLVIDSKTIFSGNVGLAISQTLILTGMLQFGVRQTAEVISQMTSVERILQYTNIEKESQWEKGAKETPKGWPWRGRIEFRNCYMKYTPEDLPVLKNLNVVIESGWKVGIVGRTGAGKSSLISSLFRLSILEGEVLIDDVDTAYLALQEVRSKISIIPQEPVLFSATIRYNLDPFNCYDDEQLWKALEAVDLKAAIPALDFKVSEGGSNFSLGQRQLVCLARAILRGNRILVLDEATANVDPKTDEFIQRTIRKRFSDCTVLTVAHRLNTIMDSDRVMVMDTGRLVEFDHPYKLLCNPEGHFTKMVKETSDKLAAQLYEIAKRTYLENGNVIE-