DPGLEAN03658 in OGS1.0

New model in OGS2.0DPOGS206453 
Genomic Positionscaffold599:- 42799-45585
See gene structure
CDS Length2787
Paired RNAseq reads  1007
Single RNAseq reads  2796
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005466 (0.0)
Best Drosophila hit  alpha-Adaptin, isoform A (0.0)
Best Human hitAP-2 complex subunit alpha-2 (0.0)
Best NR hit (blastp)  PREDICTED: similar to AGAP009538-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  adaptor-related protein complex 2, alpha 2 subunit [Nasonia vitripennis] (0.0)
GeneOntology terms




  
GO:0003674 molecular_function
GO:0005886 plasma membrane
GO:0005905 coated pit
GO:0008565 protein transporter activity
GO:0015031 protein transport
GO:0048488 synaptic vesicle endocytosis
InterPro families








  
IPR008152 Clathrin adaptor, alpha/beta/gamma-adaptin, appendage, Ig-like subdomain
IPR017104 Adaptor protein complex AP-2, alpha subunit
IPR002553 Clathrin/coatomer adaptor, adaptin-like, N-terminal
IPR003164 Clathrin adaptor, alpha-adaptin, appendage, C-terminal subdomain
IPR016024 Armadillo-type fold
IPR009028 Clathrin/coatomer adaptor, adaptin-like, appendage, C-terminal subdomain
IPR013041 Clathrin/coatomer adaptor, adaptin-like, appendage, Ig-like subdomain
IPR011989 Armadillo-like helical
IPR013038 Clathrin adaptor, alpha-adaptin, appendage, Ig-like subdomain
IPR015873 Clathrin alpha-adaptin/coatomer adaptor, appendage, C-terminal subdomain
Orthology groupMCL11661

Nucleotide sequence:

ATGCCTGCCGTCAGAGGAGATGGAATGCGAGGACTCGCAGTTTTTATATCAGATATACGG
AATTGTAAGAGTAAAGAAGCAGAAATAAAAAGAATTAATAAAGAACTGGCTAACATACGT
AGTAAATTTAAAGGTGACAAAACCTTAGATGGATATCAGAAGAAGAAGTATGTCTGCAAA
CTCCTGTTTATCTTTCTGTTGGGTCATGATATTGATTTCGGTCATATGGAAGCAGTGAAC
TTACTGTCCTCTAATAAATACTCAGAGAAGCAAATTGGATATCTTTTTATATCAGTTTTG
GTAAATACTAATAGTGATCTCATAAAACTTATCATACAGAGCATTAAAAATGACTTGCAG
TCTCGAAACCCTATTCACGTAAATCTTGCACTACAATGTATAGCTAATATTGGTAGTAAG
GATATGGCTGAGGCTTTTGGAACCGAGATTCCAAAATTACTAGTCTCTGGTGACACCATG
GATGTAGTCAAGCAGTCAGCAGCACTATGTCTTCTCAGATTGTTTCGTAAGTGTCCAGAA
ATTATTCCAGGAGGAGAATGGACTTCAAGAATCATACATTTGCTGAATGACCCACATATG
GGGGTAGTCACTGCTGCCACATCTCTTATAGATGCACTTGTTAAGAAAAATCCAGAAGAA
TATAAAGGATGTGTCACACTGGCTGTGGCCCGCCTTAGTAGGATTGTTACAGCAAGTTAC
ACTGATTTACAGGATTATACATATTATTTTGTGCCAGCCCCTTGGTTATCTGTTAAACTT
TTGCGCTTACTCCAGAACTACACCCCACCTTCAGAAGAGCCTGGAGTTCGTGGACGTTTA
TCTGAGTGCTTGGAAACCATATTTAATAAAGCTCAAGAGCCACCTAAGTCAAAGAAAGTG
CAACATTCGAATGCAAAGAATGCTGTTCTCTTTGAAGCAATAAGTCTTATAATTCACAAT
GATAGTGAACCTAATTTACTAGTTAGGGCATGCAATCAGCTTGGACAATTTTTAAGTAAC
AGAGAAACTAATTTGAGATACTTGGCACTTGAGTCAATGTGCCACCTTGCAACATCAGAA
TTTTCCCACGAAGCTGTTAAAAAACATCAGGAAGTTGTAATTTTATCTATGAAAATGGAA
AAGGATGTGTCTGTTAGACAGCAAGCTGTAGATTTACTATATGCCATGTGTGATAAAACA
AATGCTGAAGAAATTGTCCAAGAAATGTTGGCTTATCTTGAAACAGCTGATTATTCAATC
AGAGAAGAAATGGTCCTCAAAGTTGCCATATTATCTGAAAAATATGCTACTGACTTCACT
TGGTATGTTGATGTCATATTAAACCTTATAAGAATAGCTGGAGATTATGTTTCAGAGGAA
GTGTGGTATAGAGTTATTCAAATTGTCATAAACAGAGATGAAGTCCAAGGATATGCAGCA
AAAACTGTTTTTGAAGCCCTTCAAGCTCCAACTTGTCATGAAAATATGGTAAAAGTGGGT
GGATACATACTGGGTGAATTTGGAAACTTAATTGCTGGTGATACAAGATCTTCACCACAA
GTCCAATTTGAACTGCTACATTCAAAGTATCATCTTTGTTCTGCAGCTACAAGAGCACTG
TTATTGTCAACTTACATTAAACTGGTAAATCTCTTCCCTGAAATTAAAAACAGAGTGCAA
GAAGTTTTCCGTGCTGACTCAAATTTGCGATCTGCTGATGTAGAATTACAACAAAGGGCA
TCAGAATATCTGCAATTAAGTATAGTTGCCAGTTCAGACGTATTAGCAACAGTTTTGGAA
GAAATGCCTGCATTCCCAGAACGGGAATCATCAATTTTGGCTGTACTGAAAAAGAAGAAA
CCAGGTCGCATACCTGATGATGTAAAGGAGTCTAAGAGTCCTCAACCCAGTATCACACCA
GCTCCAGTTATTAATAATTCTATAAACAGCAATAATTCCAGTGCTGATCTTCTTGGTTTA
TCAACTCCTCCTGGTACTAATGCCACTACAGGAAATGGTTTATTAGATGTTCTTGGAGAC
TTATACTCTACACCCAAGAAAAGCCCAATCACTGTACAACAAAATAATATTAAGAAATTC
TTGTTTAAGAATAATGGAGTACTCTTTGAAAATGATCTCATACAAATTGGCGTTAAAAGT
GAATTCAGACAGAATTTGGGAAGAATCGGACTATTTTATGGTAATAAGACACAATCTGCT
ATTCAAAATGTCCATCCTGAACTACATTGGACTGATTTGCACAAACTGAATGTGCAGATG
AAACCTATGGAACCTGTTCTGGAAGCAGGTGCTCAAATTCAACAAATGCTAACAGCTGAG
TGCATTGAAGACTTTGCTGATGCACCAAGTATGTCAGTGTCATTCCTGTACAACAATGTT
CCACAGAAAATCTCAATGAAACTGCCCTTAACACTAAATAAATTCTTTGAACCAACTGAA
ATGAATGGAGAATCATTTTTCGCTAGGTGGAAGAATTTAGGTGGTGAACAACAAAGGGCG
CAAAAAATTTTCAAAGCTCAAGGCGCAATAGATATCCCAGCCACCCGAACTAAACTGGCT
GGTTTCGGTATGCAATTATTAGATGGTATTGATCCCAATCCTGACAACTTTGTGTGTGCA
GGAATTGTACATACAAGAGTTCAGCAGGTAGGATGCTTAATGAGATTGGAACCTAACAAA
CAAGCTCAAATGTTTAGACTTACTGTTAGATCAAGTAAAGAAACGGTCTCACAGGAAATA
TGTAATTTGCTAGCTGATCAATTCTAA

Protein sequence:

MPAVRGDGMRGLAVFISDIRNCKSKEAEIKRINKELANIRSKFKGDKTLDGYQKKKYVCK
LLFIFLLGHDIDFGHMEAVNLLSSNKYSEKQIGYLFISVLVNTNSDLIKLIIQSIKNDLQ
SRNPIHVNLALQCIANIGSKDMAEAFGTEIPKLLVSGDTMDVVKQSAALCLLRLFRKCPE
IIPGGEWTSRIIHLLNDPHMGVVTAATSLIDALVKKNPEEYKGCVTLAVARLSRIVTASY
TDLQDYTYYFVPAPWLSVKLLRLLQNYTPPSEEPGVRGRLSECLETIFNKAQEPPKSKKV
QHSNAKNAVLFEAISLIIHNDSEPNLLVRACNQLGQFLSNRETNLRYLALESMCHLATSE
FSHEAVKKHQEVVILSMKMEKDVSVRQQAVDLLYAMCDKTNAEEIVQEMLAYLETADYSI
REEMVLKVAILSEKYATDFTWYVDVILNLIRIAGDYVSEEVWYRVIQIVINRDEVQGYAA
KTVFEALQAPTCHENMVKVGGYILGEFGNLIAGDTRSSPQVQFELLHSKYHLCSAATRAL
LLSTYIKLVNLFPEIKNRVQEVFRADSNLRSADVELQQRASEYLQLSIVASSDVLATVLE
EMPAFPERESSILAVLKKKKPGRIPDDVKESKSPQPSITPAPVINNSINSNNSSADLLGL
STPPGTNATTGNGLLDVLGDLYSTPKKSPITVQQNNIKKFLFKNNGVLFENDLIQIGVKS
EFRQNLGRIGLFYGNKTQSAIQNVHPELHWTDLHKLNVQMKPMEPVLEAGAQIQQMLTAE
CIEDFADAPSMSVSFLYNNVPQKISMKLPLTLNKFFEPTEMNGESFFARWKNLGGEQQRA
QKIFKAQGAIDIPATRTKLAGFGMQLLDGIDPNPDNFVCAGIVHTRVQQVGCLMRLEPNK
QAQMFRLTVRSSKETVSQEICNLLADQF