DPGLEAN22391 in OGS1.0

New model in OGS2.0  DPOGS203074
Genomic Position  scaffold47:- 2976-23545
CDS Length  2940
Paired RNAseq reads  3384
Single RNAseq reads  8577
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010849 (0.0)
Best Drosophila hit  CG4562, isoform A (0.0)
Best Human hit  multidrug resistance-associated protein 4 isoform 2 (7e-81)
Best NR hit (blastp)  PREDICTED: similar to ATP-dependent bile acid permease [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to ATP-dependent bile acid permease [Nasonia vitripennis] (0.0)
GeneOntology terms
GO:0043190 ATP-binding cassette (ABC) transporter complex
GO:0042626 ATPase activity, coupled to transmembrane movement of substances
GO:0005215 transporter activity
GO:0055085 transmembrane transport
GO:0005524 ATP binding
GO:0016021 integral to membrane
InterPro families
IPR009147 Cystic fibrosis transmembrane conductance regulator
IPR003593 ATPase, AAA+ type, core
IPR001140 ABC transporter, transmembrane domain
IPR003439 ABC transporter-like
IPR017940 ABC transporter, integral membrane type 1
IPR011527 ABC transporter, transmembrane domain, type 1
Orthology group  MCL10006

Nucleotide sequence:

ATGGAATCAAACAAAAAAAAAGGCCGGCCGCCGCATCCGAGAGCAAAGGCAAATCCATTT
TCAGCATTAACATTTGGATGGACACTACCTATGTTTTGGAGTGGATTACGGAAAGAACTA
GAAGAATCAGATTTATACCAGCCTCTGGAAGAGCATGCGTCTGGCCCTCTGGGTGATAAA
TTCGCTCGTCTATGGGAGGAGGAAGTAGCCAGGGCTGAAGGCAAGCGCACTCCAAGTCTC
CTCAGAGTAATTCTAAGAGCTTACGCAGCGAGATGCATGCTGTATGGGTTCGTGCTATTC
TTCATGGAGTGTGGGATACGCATAGCCCAGCCGGTTTTCCTAGGCAAGCTCGTGGAATAT
TACAGTCCAGATCAGAAGACCATGAAACCTCAAGAGGCGTATATGTACGCTGGTGCTGTA
GTGCTGTGTTCCGCTTTGAATGTGTTTGTGGTCCATCCATACATGATGGCGATATTACAC
ATGGGCATGAAATTCAGGGTCGCTTGCTGTTCGCTTATATACAGGAAGTCTCTCAGATTG
TCGAAAACAGCTCTAGGTGAGACAACAATAGGTCAAGTAGTCAACCTGCTATCAAACGAC
GTGAATAGATTCGACGTGGCCGTTATATTCCTTCACTACCTATGGATCGGACCCCTCGCT
ACGGTCATCGTGACGTACTTCATGTGGCTCGAGATAAGCTGGGCGGCTGTGGTCGGTGTT
GGGTTCATGCTGGCTTTCATACCTTTGCAAGCGTACTTGGGGAAACGTACGTCGGTATTG
CGTTTGAAGACAGCCATCCGTACTGACGAGAGGGTGAGATTGATGAACGAGATACTCTCC
GGTATCCAGGTCATCAAGATGTACACGTGGGAGAAGCCGTTCGCTGATCTGGTCGCTAAG
GCCAGGAAGCAGGAAATAAAACAGATCCGCGCTACGTCATATATACGGGGTGTCCTGACT
TCTTTCATAATGTTTACCACGAGGATATGTCTGCTGGTGTCAATACTGGCTTTTGTGCTG
GAGAATAATGTGATCAGCGCCAAACAGGTGTTTGTGGTCACCAGCTTCTATAACATACTG
AGACAGACCATGACAGTATTTTTCCCACAAGGTATAGCACAAGTGGCTGAAGCGACAATA
TCCATCCAAAGGTTACAGAATTTCATGTTGTATGAAGATACAAGCAAGCCGGTCCCAGGT
CTGGCTGAGATACAGACCTCGACCAAACCTAAAGCGAAGGAGGTCAAAGAAGAACCGAGA
CCTTCGATGGAATCCAAAGAGGATCTTGAAACTAAAGACTCAAAGCCAGTGTTAGATGAA
CCGGAAAATAAGGTTGCAGAAGCCAAAGGAAATGGTAAAGGTGGTCCTACAATGGAGTCT
GCGGAGGAAGATGACGAGGAGTTGGCGACGAGAGTGGAACAGGACTCAAGGGGCATCAGG
CTGAAGTACGCCACCGCCAAGTGGATAGCATCACACACAGAGAATACACTCACGGATCTA
TCACTCACTGTGAAACCTGGCAAATTAATAGCAGTGATAGGTCCAGTTGGTGCTGGCAAG
TCATCTCTGCTGCATGTGCTGCTGAAGGAGGTAGCGTACGTCAAAACATTCTGTTCGGAC
AAGCGATGGATCGTCCTCGGTACAACGCTGTGGTACGGAGGTCAGCGTGCTCGCATCTCC
CTGGCGCGTGCTGTGTATAAGCGCGCGGATATCTATCTGTTGGATGATCCTCTGTCCGCC
GTGGACGCGCACGTCGGCCGCCATCTCTTCGAGTCGTGTGTGGTGGGCTACCTCAAGAAC
ACCACCAGGGTGCTTGTAACACATCAGCTGCAGTTCCTGAGAGACGTCGATCAGATTATC
ATATTAAAGAATGGTTCTATAGCGGCTGCGGGTGATTTCGAAACGCTCAGCGCTTCCGGA
ATGGACTTCGCCACTTTACTGGCGAGGGGAGAGGAGGAAGAGAGACCGGCTCCGGAAGAA
AAATCCATTGTGGAGGCAGAGGAATCAATGCTCCAAGGCAGTTTCAGGAAACGTCAGATG
AGCATACATTCGGTCAGTTCGGTGGATAACCTAACAGCCACGGCGCCACCAGAGGGCGGT
AGGGAGGAAGCGGAAATGCGATCAGCTGGTGCAGTTTCCGGTGCTGTGTACGGCGCCTAT
CTGGGTGCAAGCGGACATCCGCTGATGGTTGCTCTTATGGTACTGGTGGCTGTGCTGGCG
CAGTTGCTAGGATCTGGCTCTGATTGGTGGACCAGTTATTGGGTGAATCAAGAGGAGGAT
CATCCACAGACGGTGTTAAGGACACTAGACTCGAGTAACACGTCAGGTCCGCTACAGTAC
TCCTCAAACTTCACACAGGCTCTGCTTGAAAACGCACACTTCAGTTCCGGTCTAACCAGA
TACGACTGCATTTATATTTATACTGGTATGGTGGTGTCGCTGGTGGTGATATCTCTGCTG
CGGTCATTCATGTTCTTCTCTATGGCGATGCGAGCGTCGACTCGGCTACACAACAACATG
TTCAGTTCCATAACGCGGGCGCCGATGAGATTCTTCCACACCAACCCATCAGGGAGAATC
CTCAACAGATTCTCGAAGGACATGGGAGCGGTCGACGAGGTACTCCCGGCTGCCTTGCTT
GATGTACTGCAAATCGGTCTATCCCTGATCGGTATAGTGGTGGTGGTGGCGGTGGTGAAC
TTCTGGCTGCTGGTCCCCACACTCTTCATAGGTCTGATCTTCTACGGTCTTCGCATATTC
TACCTGTCGTCCAGCCGCAGCATCAAGCGCCTCGAGGGTGTGACTCGCAGTCCAGTGTTC
TCTCACCTGAACGCGTCTCTTCAAGGCATCACCACTATTCGTGCGTTCGGTGCCCAGGAA
GCTCTCATCAGAGAGTTTGATAACCATCAGGACCTACATAGCTCTGCCTGGTATGTATAA

Protein sequence:

MESNKKKGRPPHPRAKANPFSALTFGWTLPMFWSGLRKELEESDLYQPLEEHASGPLGDK
FARLWEEEVARAEGKRTPSLLRVILRAYAARCMLYGFVLFFMECGIRIAQPVFLGKLVEY
YSPDQKTMKPQEAYMYAGAVVLCSALNVFVVHPYMMAILHMGMKFRVACCSLIYRKSLRL
SKTALGETTIGQVVNLLSNDVNRFDVAVIFLHYLWIGPLATVIVTYFMWLEISWAAVVGV
GFMLAFIPLQAYLGKRTSVLRLKTAIRTDERVRLMNEILSGIQVIKMYTWEKPFADLVAK
ARKQEIKQIRATSYIRGVLTSFIMFTTRICLLVSILAFVLENNVISAKQVFVVTSFYNIL
RQTMTVFFPQGIAQVAEATISIQRLQNFMLYEDTSKPVPGLAEIQTSTKPKAKEVKEEPR
PSMESKEDLETKDSKPVLDEPENKVAEAKGNGKGGPTMESAEEDDEELATRVEQDSRGIR
LKYATAKWIASHTENTLTDLSLTVKPGKLIAVIGPVGAGKSSLLHVLLKEVAYVKTFCSD
KRWIVLGTTLWYGGQRARISLARAVYKRADIYLLDDPLSAVDAHVGRHLFESCVVGYLKN
TTRVLVTHQLQFLRDVDQIIILKNGSIAAAGDFETLSASGMDFATLLARGEEEERPAPEE
KSIVEAEESMLQGSFRKRQMSIHSVSSVDNLTATAPPEGGREEAEMRSAGAVSGAVYGAY
LGASGHPLMVALMVLVAVLAQLLGSGSDWWTSYWVNQEEDHPQTVLRTLDSSNTSGPLQY
SSNFTQALLENAHFSSGLTRYDCIYIYTGMVVSLVVISLLRSFMFFSMAMRASTRLHNNM
FSSITRAPMRFFHTNPSGRILNRFSKDMGAVDEVLPAALLDVLQIGLSLIGIVVVVAVVN
FWLLVPTLFIGLIFYGLRIFYLSSSRSIKRLEGVTRSPVFSHLNASLQGITTIRAFGAQE
ALIREFDNHQDLHSSAWYV