DPGLEAN19552 in OGS1.0

New model in OGS2.0  DPOGS208634
Genomic Position  scaffold1445:- 94419-108258
CDS Length  3525
Paired RNAseq reads  5513
Single RNAseq reads  12048
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010849 (8e-13)
Best Drosophila hit  CG4562, isoform A (0.0)
Best Human hit  multidrug resistance-associated protein 5 isoform 1 (8e-163)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC010434 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010434 [Tribolium castaneum] (0.0)
Gene Ontology terms
GO:0043190 ATP-binding cassette (ABC) transporter complex
GO:0042626 ATPase activity, coupled to transmembrane movement of substances
GO:0005215 transporter activity
GO:0055085 transmembrane transport
GO:0005524 ATP binding
GO:0016021 integral to membrane
InterPro families
IPR003439 ABC transporter-like
IPR017940 ABC transporter, integral membrane type 1
IPR003593 ATPase, AAA+ type, core
IPR017871 ABC transporter, conserved site
IPR009147 Cystic fibrosis transmembrane conductance regulator
IPR011527 ABC transporter, transmembrane domain, type 1
IPR001140 ABC transporter, transmembrane domain
Orthology group  MCL10006

Nucleotide sequence:

ATGTGGCTCGAGATAAGCTGGGCGGCTGTGGTCGGTGTTGGGTTCATGCTGGCTTTCATA
CCTTTGCAAGCGTACTTGGGGAAACGTACGTCGGTATTGCGTTTGAAGACAGCCATCCGT
ACTGACGAGAGGGTGAGATTGATGAACGAGATACTCTCCGGTATCCAGGTCATCAAGATG
TACACGTGGGAGAAGCCGTTCGCTGATCTGGTCGCCAAGGCCAGGAAGCAGGAAATAAAA
CAGATCCGCGCTACGTCATATATACGGGGTGTCCTGACTTCTTTCATAATGTTTACCACG
AGGATATGTCTGCTGGTGTCAATACTGGCTTTTGTGCTGGAGAATAATGTGATCAGCGCC
AAACAGGTGTTTGTGGTCACCAGCTTCTATAACATACTGAGACAGACCATGACAGTATTT
TTCCCACAAGGTATAGCACAAGTGGCTGAAGCGACAATATCCATCCAAAGGTTACAGAAT
TTCATGTTGTATGAAGATACAAGCAAGCCGGTCCCAGGTCTGGCTGAGATACAGACCTCG
ACCAAACCTAAAGCGAAGGAGGTCAAAGAAGAACCGAGACCTTCGATGGAATCCAAAGAG
GATCTTGAAACTAAAGACTCAAAGCCAGTGTTAGATGAACCGGAAAATAAGGTTGCAGAA
GCCAAAGGAAATGGTAAAGGTGGTCCTACAATGGAGTCTGCTGAGGAAGATGATGAGGAG
TTGGCAACGAGAGTGGAACAGGACTCGAGGGGCATCAGGCTGAAGTACGCCACCGCCAAG
TGGATAGCATCACACACAGAGAATACACTCACAGATCTATCACTAACTGTGAAACCTGGC
AAATTAATAGCAGTGATAGGTCCAGTTGGCGCTGGCAAGTCATCTCTGCTACATGTGCTG
CTGAAGGAACTGCCGCTCCACTCGGGCAGCGTACACGTCGGCGGTACTATTTCGTATGCT
AGTCAGGAACCCTGGCTTTTCGCAGGTAGCGTACGTCAAAACATTCTGTTCGGACAAGCG
ATGGATCGTCCCCGGTACAACGCTGTGGTACGGAGGTGTGCTCTGGACAGAGACTTCACA
CTATTTCCTCACGGTGACAAGACGGTGGTGGGGGAGAGGGGAGTCAGTCTCAGTGGAGGT
CAGCGTGCTCGCATCTCCCTGGCGCGTGCTGTGTACAAGCGCGCGGATATCTATCTGTTG
GATGATCCTCTGTCCGCCGTGGACGCGCACGTCGGCCGCCATCTCTTCGAGTCGTGTGTG
GTCGGCTACCTCAAGAACACCACCAGGGTGCTTGTAACACATCAGCTGCAGTTCCTGAGA
GACGTCGATCAGATTATCATACTAAAGAATGGTTCTATAGCGGCTGCGGGTGATTTCGAA
ACGCTCAGCGCTTCCGGAATGGACTTCGCCACTTTACTGGCGAGGGGAGAGGAGGAAGAG
AGACCGGCTCCGGAAGAAAAATCCATTGTGGAGGCAGAGGAATCAATGCTGCAAGGCAGT
TTCAGGAAACGTCAGATGAGCATACATTCGGTCAGTTCGGTGGATAACCTGACAGCCACG
GCGCCACCAGAGGGCGGTAGGGAGGAAGCGGAAATGCGATCAGCTGGTGCAGTTTCCGGT
GCTGTGTACGGCGCCTATCTGGGTGCAAGCGGACATCCGCTGATGGTTGCTCTTATGGTA
CTGGTGGCTGTGCTGGCGCAGTTGCTAGGATCTGGCTCCGATTGGTGGACCAGTTATTGG
GTGAATCAAGAGGAGGATCATCCACAGACGGTGTTAAGGACACTAGACTCGAGTAACACG
TCAGGTCCGCTACAGTACTCCTCAAACTTCACACAGGCTCTGCTTGAAAACGCACACTTC
AGTTCCGGTCTGACCAGATACGACTGCATTTATATTTATACTGGTATGGTGGTGTCGCTG
GTGGTGATATCTCTGCTGCGGTCATTCATGTTCTTCTCTATGGCGATGCGAGCGTCGACT
CGGCTACACAACAACATGTTCAGTTCCATAACGCGGGCCCCGATGAGATTCTTCCACACC
AACCCATCGGGGAGAATCCTCAATAGATTCTCGAAGGACATGGGAGCGGTCGACGAGGTG
CTCCCGGCTGCCTTGCTTGATGTGCTGCAAATCGGTCTATCCCTGATCGGTATAGTGGTG
GTGGTGGCGGTGGTGAACTTCTGGCTGCTGGTCCCCACACTCTTCATAGGTCTGATCTTC
TACGGTCTTCGCATATTCTACCTGTCGTCCAGCCGCAGCATCAAGCGCCTCGAGGGTGTG
ACTCGCAGTCCAGTGTTCTCTCACCTGAACGCGTCTCTTCAAGGCATCACCACTATTCGT
GCGTTCGGTGCCCAGGAAGCTCTCATCAGAGAGTTTGATAACCATCAGGACCTACATAGC
TCTGCCTGGTACTTATTCATAGCCAGCTCCCGTGCCTTCGGTTTCTGGTTGGATCTTGTT
TGCGTTATTTACATTGCTGTAGTAACGCTCAGTTTCCTCGTGTTTGGACAAGAGGAATAT
GGTGGCAAGGTCGGTTTAGCCATCACTCAAGCTATGGGGTTGACCGGTATGTTCCAGTGG
GGAATGAGACAGAGCACGGAACTGGAAAATCAGATGACCAGCGTTGAGAGGATACAAGAA
TATTCGAACATAGAATCTGAGCCGCCTCTCGAGTCTGAACCGAGTAAGAAGCCACCGCCG
TCATGGCCAGACGCTGGTCGCTTGGAGTTCAGACACGTGTTCCTGTACTACACGCCCGGG
GAACCCCCGGTGTTGAAGGACCTCAACTTACTGGTGCTGCCGAAGGAAAAAGTCGGCATC
GTGGGTAGAACGGGAGCGGGGAAGTCCTCGCTTATTAACGCCCTCTTCAGATTAGCTAAA
ATTGAGGGGGAGATAATAATAGATGGTAGGGAGACGTCACAGCTCGGTCTCCATGAACTG
AGGAGTCAGATTTCCATCATACCACAGGAGCCTGTTCTGTTCTCCGGCACCATGAGACAC
AATCTGGATCCCTTCGACGAATATCCGGATCAAGTACTTTGGAGGGCTCTAGAGGAGGTG
GAGTTAAAGGAAGCCGTGAACGAGTTACCCGCGGGTCTCAGTTCACGTATGTCTGAGGGG
GGTGGGAACTTCTCAGTGGGGCAGAGACAGTTGGTGTGCCTGGCGCGGGCCATCGTGAGA
CGGAACAGACTCCTGGTGTTGGACGAGGCCACCGCGAACGTTGACCCACAAACCGATGCT
CTTATACAAACTACAATAAGAAACAAATTCGCCGACTGCACCGTCCTCACCATCGCCCAC
AGACTACACACAGTCATGGATTCAGATAAGGTGTTGGTGATGGACGCCGGCAGCATGGTC
GAGTTCGACCATCCTCACAAGCTGCTGCAGCATATTGGCGGAGTGCTGCGGGGGATGGCG
GAGCAGACGGGCAAGGCCACGTACGACGCGCTGGCGCGACAGGCTAAGACGGTGAGGATA
TACTGTTGTGTACTGGCTGGCATGGTTATACTGACAACATGGTAA

Protein sequence:

MWLEISWAAVVGVGFMLAFIPLQAYLGKRTSVLRLKTAIRTDERVRLMNEILSGIQVIKM
YTWEKPFADLVAKARKQEIKQIRATSYIRGVLTSFIMFTTRICLLVSILAFVLENNVISA
KQVFVVTSFYNILRQTMTVFFPQGIAQVAEATISIQRLQNFMLYEDTSKPVPGLAEIQTS
TKPKAKEVKEEPRPSMESKEDLETKDSKPVLDEPENKVAEAKGNGKGGPTMESAEEDDEE
LATRVEQDSRGIRLKYATAKWIASHTENTLTDLSLTVKPGKLIAVIGPVGAGKSSLLHVL
LKELPLHSGSVHVGGTISYASQEPWLFAGSVRQNILFGQAMDRPRYNAVVRRCALDRDFT
LFPHGDKTVVGERGVSLSGGQRARISLARAVYKRADIYLLDDPLSAVDAHVGRHLFESCV
VGYLKNTTRVLVTHQLQFLRDVDQIIILKNGSIAAAGDFETLSASGMDFATLLARGEEEE
RPAPEEKSIVEAEESMLQGSFRKRQMSIHSVSSVDNLTATAPPEGGREEAEMRSAGAVSG
AVYGAYLGASGHPLMVALMVLVAVLAQLLGSGSDWWTSYWVNQEEDHPQTVLRTLDSSNT
SGPLQYSSNFTQALLENAHFSSGLTRYDCIYIYTGMVVSLVVISLLRSFMFFSMAMRAST
RLHNNMFSSITRAPMRFFHTNPSGRILNRFSKDMGAVDEVLPAALLDVLQIGLSLIGIVV
VVAVVNFWLLVPTLFIGLIFYGLRIFYLSSSRSIKRLEGVTRSPVFSHLNASLQGITTIR
AFGAQEALIREFDNHQDLHSSAWYLFIASSRAFGFWLDLVCVIYIAVVTLSFLVFGQEEY
GGKVGLAITQAMGLTGMFQWGMRQSTELENQMTSVERIQEYSNIESEPPLESEPSKKPPP
SWPDAGRLEFRHVFLYYTPGEPPVLKDLNLLVLPKEKVGIVGRTGAGKSSLINALFRLAK
IEGEIIIDGRETSQLGLHELRSQISIIPQEPVLFSGTMRHNLDPFDEYPDQVLWRALEEV
ELKEAVNELPAGLSSRMSEGGGNFSVGQRQLVCLARAIVRRNRLLVLDEATANVDPQTDA
LIQTTIRNKFADCTVLTIAHRLHTVMDSDKVLVMDAGSMVEFDHPHKLLQHIGGVLRGMA
EQTGKATYDALARQAKTVRIYCCVLAGMVILTTW
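
Consistency check (illustrative sketch, not part of the original record): the CDS length of 3525 nt corresponds to 1175 codons, i.e. 1174 residues plus a stop codon, which agrees with the protein sequence listed above. The Python below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example. It translates the first and last lines of the nucleotide sequence and compares them with the corresponding ends of the protein sequence.

```python
# Minimal sketch: translate a CDS fragment with the standard genetic code.
# Assumption: NCBI translation table 1 applies to this nuclear gene.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons enumerated in TCAG order line up with the amino-acid string above.
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTGGCTCGAGATAAGCTGGGCGGCTGTGGTCGGTGTTGGGTTCATGCTGGCTTTCATA"
last_line = "TACTGTTGTGTACTGGCTGGCATGGTTATACTGACAACATGGTAA"

print(translate(first_line))  # start of the protein sequence
print(translate(last_line))   # end of the protein sequence, then the stop

# Length bookkeeping: 3525 nt = 1175 codons = 1174 residues + 1 stop.
cds_length = 3525
print(cds_length % 3 == 0, cds_length // 3 - 1)
```

Running the same function on the full nucleotide sequence (with line breaks removed) should reproduce the complete protein sequence followed by a single trailing `*`.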