New model in OGS2.0 | DPOGS203074  |
---|---|
Genomic Position | scaffold47:- 2976-23545 |
See gene structure | |
CDS Length | 2940 |
Paired RNAseq reads   | 3384 |
Single RNAseq reads   | 8577 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010849 (0.0) |
Best Drosophila hit   | CG4562, isoform A (0.0) |
Best Human hit | multidrug resistance-associated protein 4 isoform 2 (7e-81) |
Best NR hit (blastp)   | PREDICTED: similar to ATP-dependent bile acid permease [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to ATP-dependent bile acid permease [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0043190 ATP-binding cassette (ABC) transporter complex GO:0042626 ATPase activity, coupled to transmembrane movement of substances GO:0005215 transporter activity GO:0055085 transmembrane transport GO:0005524 ATP binding GO:0016021 integral to membrane |
InterPro families    | IPR009147 Cystic fibrosis transmembrane conductance regulator IPR003593 ATPase, AAA+ type, core IPR001140 ABC transporter, transmembrane domain IPR003439 ABC transporter-like IPR017940 ABC transporter, integral membrane type 1 IPR011527 ABC transporter, transmembrane domain, type 1 |
Orthology group | MCL10006 |
Nucleotide sequence:
ATGGAATCAAACAAAAAAAAAGGCCGGCCGCCGCATCCGAGAGCAAAGGCAAATCCATTT
TCAGCATTAACATTTGGATGGACACTACCTATGTTTTGGAGTGGATTACGGAAAGAACTA
GAAGAATCAGATTTATACCAGCCTCTGGAAGAGCATGCGTCTGGCCCTCTGGGTGATAAA
TTCGCTCGTCTATGGGAGGAGGAAGTAGCCAGGGCTGAAGGCAAGCGCACTCCAAGTCTC
CTCAGAGTAATTCTAAGAGCTTACGCAGCGAGATGCATGCTGTATGGGTTCGTGCTATTC
TTCATGGAGTGTGGGATACGCATAGCCCAGCCGGTTTTCCTAGGCAAGCTCGTGGAATAT
TACAGTCCAGATCAGAAGACCATGAAACCTCAAGAGGCGTATATGTACGCTGGTGCTGTA
GTGCTGTGTTCCGCTTTGAATGTGTTTGTGGTCCATCCATACATGATGGCGATATTACAC
ATGGGCATGAAATTCAGGGTCGCTTGCTGTTCGCTTATATACAGGAAGTCTCTCAGATTG
TCGAAAACAGCTCTAGGTGAGACAACAATAGGTCAAGTAGTCAACCTGCTATCAAACGAC
GTGAATAGATTCGACGTGGCCGTTATATTCCTTCACTACCTATGGATCGGACCCCTCGCT
ACGGTCATCGTGACGTACTTCATGTGGCTCGAGATAAGCTGGGCGGCTGTGGTCGGTGTT
GGGTTCATGCTGGCTTTCATACCTTTGCAAGCGTACTTGGGGAAACGTACGTCGGTATTG
CGTTTGAAGACAGCCATCCGTACTGACGAGAGGGTGAGATTGATGAACGAGATACTCTCC
GGTATCCAGGTCATCAAGATGTACACGTGGGAGAAGCCGTTCGCTGATCTGGTCGCTAAG
GCCAGGAAGCAGGAAATAAAACAGATCCGCGCTACGTCATATATACGGGGTGTCCTGACT
TCTTTCATAATGTTTACCACGAGGATATGTCTGCTGGTGTCAATACTGGCTTTTGTGCTG
GAGAATAATGTGATCAGCGCCAAACAGGTGTTTGTGGTCACCAGCTTCTATAACATACTG
AGACAGACCATGACAGTATTTTTCCCACAAGGTATAGCACAAGTGGCTGAAGCGACAATA
TCCATCCAAAGGTTACAGAATTTCATGTTGTATGAAGATACAAGCAAGCCGGTCCCAGGT
CTGGCTGAGATACAGACCTCGACCAAACCTAAAGCGAAGGAGGTCAAAGAAGAACCGAGA
CCTTCGATGGAATCCAAAGAGGATCTTGAAACTAAAGACTCAAAGCCAGTGTTAGATGAA
CCGGAAAATAAGGTTGCAGAAGCCAAAGGAAATGGTAAAGGTGGTCCTACAATGGAGTCT
GCGGAGGAAGATGACGAGGAGTTGGCGACGAGAGTGGAACAGGACTCAAGGGGCATCAGG
CTGAAGTACGCCACCGCCAAGTGGATAGCATCACACACAGAGAATACACTCACGGATCTA
TCACTCACTGTGAAACCTGGCAAATTAATAGCAGTGATAGGTCCAGTTGGTGCTGGCAAG
TCATCTCTGCTGCATGTGCTGCTGAAGGAGGTAGCGTACGTCAAAACATTCTGTTCGGAC
AAGCGATGGATCGTCCTCGGTACAACGCTGTGGTACGGAGGTCAGCGTGCTCGCATCTCC
CTGGCGCGTGCTGTGTATAAGCGCGCGGATATCTATCTGTTGGATGATCCTCTGTCCGCC
GTGGACGCGCACGTCGGCCGCCATCTCTTCGAGTCGTGTGTGGTGGGCTACCTCAAGAAC
ACCACCAGGGTGCTTGTAACACATCAGCTGCAGTTCCTGAGAGACGTCGATCAGATTATC
ATATTAAAGAATGGTTCTATAGCGGCTGCGGGTGATTTCGAAACGCTCAGCGCTTCCGGA
ATGGACTTCGCCACTTTACTGGCGAGGGGAGAGGAGGAAGAGAGACCGGCTCCGGAAGAA
AAATCCATTGTGGAGGCAGAGGAATCAATGCTCCAAGGCAGTTTCAGGAAACGTCAGATG
AGCATACATTCGGTCAGTTCGGTGGATAACCTAACAGCCACGGCGCCACCAGAGGGCGGT
AGGGAGGAAGCGGAAATGCGATCAGCTGGTGCAGTTTCCGGTGCTGTGTACGGCGCCTAT
CTGGGTGCAAGCGGACATCCGCTGATGGTTGCTCTTATGGTACTGGTGGCTGTGCTGGCG
CAGTTGCTAGGATCTGGCTCTGATTGGTGGACCAGTTATTGGGTGAATCAAGAGGAGGAT
CATCCACAGACGGTGTTAAGGACACTAGACTCGAGTAACACGTCAGGTCCGCTACAGTAC
TCCTCAAACTTCACACAGGCTCTGCTTGAAAACGCACACTTCAGTTCCGGTCTAACCAGA
TACGACTGCATTTATATTTATACTGGTATGGTGGTGTCGCTGGTGGTGATATCTCTGCTG
CGGTCATTCATGTTCTTCTCTATGGCGATGCGAGCGTCGACTCGGCTACACAACAACATG
TTCAGTTCCATAACGCGGGCGCCGATGAGATTCTTCCACACCAACCCATCAGGGAGAATC
CTCAACAGATTCTCGAAGGACATGGGAGCGGTCGACGAGGTACTCCCGGCTGCCTTGCTT
GATGTACTGCAAATCGGTCTATCCCTGATCGGTATAGTGGTGGTGGTGGCGGTGGTGAAC
TTCTGGCTGCTGGTCCCCACACTCTTCATAGGTCTGATCTTCTACGGTCTTCGCATATTC
TACCTGTCGTCCAGCCGCAGCATCAAGCGCCTCGAGGGTGTGACTCGCAGTCCAGTGTTC
TCTCACCTGAACGCGTCTCTTCAAGGCATCACCACTATTCGTGCGTTCGGTGCCCAGGAA
GCTCTCATCAGAGAGTTTGATAACCATCAGGACCTACATAGCTCTGCCTGGTATGTATAA
Protein sequence:
MESNKKKGRPPHPRAKANPFSALTFGWTLPMFWSGLRKELEESDLYQPLEEHASGPLGDK
FARLWEEEVARAEGKRTPSLLRVILRAYAARCMLYGFVLFFMECGIRIAQPVFLGKLVEY
YSPDQKTMKPQEAYMYAGAVVLCSALNVFVVHPYMMAILHMGMKFRVACCSLIYRKSLRL
SKTALGETTIGQVVNLLSNDVNRFDVAVIFLHYLWIGPLATVIVTYFMWLEISWAAVVGV
GFMLAFIPLQAYLGKRTSVLRLKTAIRTDERVRLMNEILSGIQVIKMYTWEKPFADLVAK
ARKQEIKQIRATSYIRGVLTSFIMFTTRICLLVSILAFVLENNVISAKQVFVVTSFYNIL
RQTMTVFFPQGIAQVAEATISIQRLQNFMLYEDTSKPVPGLAEIQTSTKPKAKEVKEEPR
PSMESKEDLETKDSKPVLDEPENKVAEAKGNGKGGPTMESAEEDDEELATRVEQDSRGIR
LKYATAKWIASHTENTLTDLSLTVKPGKLIAVIGPVGAGKSSLLHVLLKEVAYVKTFCSD
KRWIVLGTTLWYGGQRARISLARAVYKRADIYLLDDPLSAVDAHVGRHLFESCVVGYLKN
TTRVLVTHQLQFLRDVDQIIILKNGSIAAAGDFETLSASGMDFATLLARGEEEERPAPEE
KSIVEAEESMLQGSFRKRQMSIHSVSSVDNLTATAPPEGGREEAEMRSAGAVSGAVYGAY
LGASGHPLMVALMVLVAVLAQLLGSGSDWWTSYWVNQEEDHPQTVLRTLDSSNTSGPLQY
SSNFTQALLENAHFSSGLTRYDCIYIYTGMVVSLVVISLLRSFMFFSMAMRASTRLHNNM
FSSITRAPMRFFHTNPSGRILNRFSKDMGAVDEVLPAALLDVLQIGLSLIGIVVVVAVVN
FWLLVPTLFIGLIFYGLRIFYLSSSRSIKRLEGVTRSPVFSHLNASLQGITTIRAFGAQE
ALIREFDNHQDLHSSAWYV