Monarch geneset OGS2.0

DPOGS203074
Transcript: DPOGS203074-TA, 3072 bp
Protein: DPOGS203074-PA, 1023 aa
Genomic position: DPSCF300294, 4632-25201
RNAseq coverage: 111x (Rank: top 59%)
Annotation
Heliconius      HMEL011731        0.0  53.48%
Bombyx          BGIBMGA010849-TA  0.0  67.55%
Drosophila      CG4562-PA         0.0  43.21%
EBI UniRef50    UniRef50_E2B3S7   0.0  47.69%  Probable multidrug resistance-associated protein lethal(2)03659 n=9 Tax=Formicidae RepID=E2B3S7_HARSA
NCBI RefSeq     XP_001600523.1    0.0  47.85%  PREDICTED: similar to ATP-dependent bile acid permease [Nasonia vitripennis]
NCBI nr blastp  gi|345498054      0.0  47.94%  PREDICTED: probable multidrug resistance-associated protein lethal(2)03659-like [Nasonia vitripennis]
NCBI nr blastx  gi|307214744      0.0  47.37%  Probable multidrug resistance-associated protein lethal(2)03659 [Harpegnathos saltator]
Group
Gene Ontology
  GO:0006810  1.8e-45  transport
  GO:0055085  1.8e-45  transmembrane transport
  GO:0005524  1.8e-45  ATP binding
  GO:0042626  1.8e-45  ATPase activity, coupled to transmembrane movement of substances
  GO:0016021  1.8e-45  integral to membrane
  GO:0000166  2.7e-10  nucleotide binding
  GO:0017111  2.7e-10  nucleoside-triphosphatase activity
KEGG pathway
  dpo:Dpse_GA18260  0.0
  K05673 (ABCC4)  maps-> ABC transporters
InterPro domain
  [695-1022]  IPR011527  1.8e-45  ABC transporter, transmembrane domain, type 1
  [151-406]   IPR001140  9.2e-34  ABC transporter, transmembrane domain
  [550-670]   IPR003593  2.7e-10  ATPase, AAA+ type, core
Orthology group: MCL10003, Multiple-copy universal gene
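
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that the genomic and InterPro coordinates are 1-based and inclusive (the record does not say), and the names `span_length` and `DOMAINS` are made up for the example.

```python
# Values copied from the record above.
TRANSCRIPT_LENGTH_BP = 3072
PROTEIN_LENGTH_AA = 1023
GENOMIC_START, GENOMIC_END = 4632, 25201

# InterPro domain coordinates on the protein (start, end).
DOMAINS = {
    "IPR011527": (695, 1022),
    "IPR001140": (151, 406),
    "IPR003593": (550, 670),
}


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# A 3072 bp coding transcript holds 1024 codons: 1023 residues plus one stop.
codons = TRANSCRIPT_LENGTH_BP // 3

# Span of the gene model on the scaffold, introns included.
genomic_span = span_length(GENOMIC_START, GENOMIC_END)

# Length of each annotated domain in residues.
domain_lengths = {acc: span_length(*pos) for acc, pos in DOMAINS.items()}

print(codons, genomic_span, domain_lengths)
```

Under these assumptions the genomic span is much longer than the transcript, which would be consistent with a multi-exon gene model, although the record itself does not list exon boundaries.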
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203074-TA
ATGGAATCAAACAAAAAAAAAGGCCGGCCGCCGCATCCGAGAGCAAAGGCAAATCCATTTTCAGCATTAACATTTGGATGGACACTACCTATGTTTTGGAGTGGATTACGGAAAGAACTAGAAGAATCAGATTTATACCAGCCTCTGGAAGAGCATGCGTCTGGCCCTCTGGGTGATAAATTCGCTCGTCTATGGGAGGAGGAAGTAGCCAGGGCTGAAGGCAAGCGCACTCCAAGTCTCCTCAGAGTAATTCTAAGAGCTTACGCAGCGAGATGCATGCTGTATGGGTTCGTGCTATTCTTCATGGAGTGTGGGATACGCATACAGCAGCCGCGCATGCTGGGTCTTTTTATCGGCTATTTCGGACAGGACGATCAAGTGTTGATGCACGATCGTCTCTCAGAGTTGAAGGACAAAGTGATGCGAGCCAGATCGAATATGACACAGCCCATCATAGCCCAGCCGGTTTTCCTAGGCAAGCTCGTGGAATATTACAGTCCAGATCAGAAGACCATGAAACCTCAAGAGGCGTATATGTACGCTGGTGCTGTAGTGCTGTGTTCCGCTTTGAATGTGTTTGTGGTCCATCCATACATGATGGCGATATTACACATGGGCATGAAATTCAGGGTCGCTTGCTGTTCGCTTATATACAGGAAGTCTCTCAGATTGTCGAAAACAGCTCTAGGTGAGACAACAATAGGTCAAGTAGTCAACCTGCTATCAAACGACGTGAATAGATTCGACGTGGCCGTTATATTCCTTCACTACCTATGGATCGGACCCCTCGCTACGGTCATCGTGACGTACTTCATGTGGCTCGAGATAAGCTGGGCGGCTGTGGTCGGTGTTGGGTTCATGCTGGCTTTCATACCTTTGCAAGCGTACTTGGGGAAACGTACGTCGGTATTGCGTTTGAAGACAGCCATCCGTACTGACGAGAGGGTGAGATTGATGAACGAGATACTCTCCGGTATCCAGGTCATCAAGATGTACACGTGGGAGAAGCCGTTCGCTGATCTGGTCGCTAAGGCCAGGAAGCAGGAAATAAAACAGATCCGCGCTACGTCATATATACGGGGTGTCCTGACTTCTTTCATAATGTTTACCACGAGGATATGTCTGCTGGTGTCAATACTGGCTTTTGTGCTGGAGAATAATGTGATCAGCGCCAAACAGGTGTTTGTGGTCACCAGCTTCTATAACATACTGAGACAGACCATGACAGTATTTTTCCCACAAGGTATAGCACAAGTGGCTGAAGCGACAATATCCATCCAAAGGTTACAGAATTTCATGTTGTATGAAGATACAAGCAAGCCGGTCCCAGGTCTGGCTGAGATACAGACCTCGACCAAACCTAAAGCGAAGGAGGTCAAAGAAGAACCGAGACCTTCGATGGAATCCAAAGAGGATCTTGAAACTAAAGACTCAAAGCCAGTGTTAGATGAACCGGAAAATAAGGTTGCAGAAGCCAAAGGAAATGGTAAAGGTGGTCCTACAATGGAGTCTGCGGAGGAAGATGACGAGGAGTTGGCGACGAGAGTGGAACAGGACTCAAGGGGCATCAGGCTGAAGTACGCCACCGCCAAGTGGATAGCATCACACACAGAGAATACACTCACGGATCTATCACTCACTGTGAAACCTGGCAAATTAATAGCAGTGATAGGTCCAGTTGGTGCTGGCAAGTCATCTCTGCTGCATGTGCTGCTGAAGGAGGTAGCGTACGTCAAAACATTCTGTTCGGACAAGCGATGGATCGTCCTCGGTACAACGCTGTGGTACGGAGGTCAGCGTGCTCGCATCTCCCTGGCGCGTGCTGTGTATAAGCGCGCGGATATCTATCTGTTGGATGATCCTCTGTCCGCCGTGGACGCGCACGTCGGCCGCCATCTCTTCGAGTCGTGTGTGGTGGGCTACCTCAAGAACACCACCAGGGTGCTTGTAACACATCAGCTGCAGTTCCTGAGAGACGTCGATCAGATTATCATATTAAA
GAATGGTTCTATAGCGGCTGCGGGTGATTTCGAAACGCTCAGCGCTTCCGGAATGGACTTCGCCACTTTACTGGCGAGGGGAGAGGAGGAAGAGAGACCGGCTCCGGAAGAAAAATCCATTGTGGAGGCAGAGGAATCAATGCTCCAAGGCAGTTTCAGGAAACGTCAGATGAGCATACATTCGGTCAGTTCGGTGGATAACCTAACAGCCACGGCGCCACCAGAGGGCGGTAGGGAGGAAGCGGAAATGCGATCAGCTGGTGCAGTTTCCGGTGCTGTGTACGGCGCCTATCTGGGTGCAAGCGGACATCCGCTGATGGTTGCTCTTATGGTACTGGTGGCTGTGCTGGCGCAGTTGCTAGGATCTGGCTCTGATTGGTGGACCAGTTATTGGGTGAATCAAGAGGAGGATCATCCACAGACGGTGTTAAGGACACTAGACTCGAGTAACACGTCAGGTCCGCTACAGTACTCCTCAAACTTCACACAGGCTCTGCTTGAAAACGCACACTTCAGTTCCGGTCTAACCAGATACGACTGCATTTATATTTATACTGGTATGGTGGTGTCGCTGGTGGTGATATCTCTGCTGCGGTCATTCATGTTCTTCTCTATGGCGATGCGAGCGTCGACTCGGCTACACAACAACATGTTCAGTTCCATAACGCGGGCGCCGATGAGATTCTTCCACACCAACCCATCAGGGAGAATCCTCAACAGATTCTCGAAGGACATGGGAGCGGTCGACGAGGTACTCCCGGCTGCCTTGCTTGATGTACTGCAAATCGGTCTATCCCTGATCGGTATAGTGGTGGTGGTGGCGGTGGTGAACTTCTGGCTGCTGGTCCCCACACTCTTCATAGGTCTGATCTTCTACGGTCTTCGCATATTCTACCTGTCGTCCAGCCGCAGCATCAAGCGCCTCGAGGGTGTGACTCGCAGTCCAGTGTTCTCTCACCTGAACGCGTCTCTTCAAGGCATCACCACTATTCGTGCGTTCGGTGCCCAGGAAGCTCTCATCAGAGAGTTTGATAACCATCAGGACCTACATAGCTCTGCCTGGTATGTATAA

Protein sequence:

>DPOGS203074-PA
MESNKKKGRPPHPRAKANPFSALTFGWTLPMFWSGLRKELEESDLYQPLEEHASGPLGDKFARLWEEEVARAEGKRTPSLLRVILRAYAARCMLYGFVLFFMECGIRIQQPRMLGLFIGYFGQDDQVLMHDRLSELKDKVMRARSNMTQPIIAQPVFLGKLVEYYSPDQKTMKPQEAYMYAGAVVLCSALNVFVVHPYMMAILHMGMKFRVACCSLIYRKSLRLSKTALGETTIGQVVNLLSNDVNRFDVAVIFLHYLWIGPLATVIVTYFMWLEISWAAVVGVGFMLAFIPLQAYLGKRTSVLRLKTAIRTDERVRLMNEILSGIQVIKMYTWEKPFADLVAKARKQEIKQIRATSYIRGVLTSFIMFTTRICLLVSILAFVLENNVISAKQVFVVTSFYNILRQTMTVFFPQGIAQVAEATISIQRLQNFMLYEDTSKPVPGLAEIQTSTKPKAKEVKEEPRPSMESKEDLETKDSKPVLDEPENKVAEAKGNGKGGPTMESAEEDDEELATRVEQDSRGIRLKYATAKWIASHTENTLTDLSLTVKPGKLIAVIGPVGAGKSSLLHVLLKEVAYVKTFCSDKRWIVLGTTLWYGGQRARISLARAVYKRADIYLLDDPLSAVDAHVGRHLFESCVVGYLKNTTRVLVTHQLQFLRDVDQIIILKNGSIAAAGDFETLSASGMDFATLLARGEEEERPAPEEKSIVEAEESMLQGSFRKRQMSIHSVSSVDNLTATAPPEGGREEAEMRSAGAVSGAVYGAYLGASGHPLMVALMVLVAVLAQLLGSGSDWWTSYWVNQEEDHPQTVLRTLDSSNTSGPLQYSSNFTQALLENAHFSSGLTRYDCIYIYTGMVVSLVVISLLRSFMFFSMAMRASTRLHNNMFSSITRAPMRFFHTNPSGRILNRFSKDMGAVDEVLPAALLDVLQIGLSLIGIVVVVAVVNFWLLVPTLFIGLIFYGLRIFYLSSSRSIKRLEGVTRSPVFSHLNASLQGITTIRAFGAQEALIREFDNHQDLHSSAWYV-
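
As a rough consistency check between the two sequences above, the transcript can be translated and compared with the listed protein. The following is a minimal sketch, not the pipeline used to build the gene set: it assumes the standard genetic code, assumes the coding sequence starts at the first base of the transcript, and writes the stop codon as "-" to match the record. The names `translate` and `CODON_TABLE` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks inside the sequence are ignored, and codons
    containing unexpected characters are reported as "X".
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# First nine and last five codons of DPOGS203074-TA as listed above.
print(translate("ATGGAATCAAACAAAAAAAAAGGCCGG"))
print(translate("GCCTGGTATGTATAA"))
```

Passing the full DPOGS203074-TA sequence to `translate` and comparing the result with DPOGS203074-PA would extend the same check to the whole record.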