Monarch geneset OGS2.0

DPOGS206453
TranscriptDPOGS206453-TA2787 bp
ProteinDPOGS206453-PA928 aa
Genomic positionDPSCF300070 - 153540-156326
RNAseq coverage312x (Rank: top 36%)
Annotation
HeliconiusHMEL0129430.096.99% 
BombyxBGIBMGA005466-TA0.094.94% 
Drosophilaalpha-Adaptin-PB0.082.28% 
EBI UniRef50UniRef50_O949730.069.59%AP-2 complex subunit alpha-2 n=254 Tax=Opisthokonta RepID=AP2A2_HUMAN
NCBI RefSeqXP_971368.10.084.75%PREDICTED: similar to AGAP009538-PA [Tribolium castaneum]
NCBI nr blastpgi|910926800.084.75%PREDICTED: similar to AGAP009538-PA [Tribolium castaneum]
NCBI nr blastxgi|3320246830.085.21%AP-2 complex subunit alpha [Acromyrmex echinatior]
Group
Gene OntologyGO:00160200membrane
GO:00085650protein transporter activity
GO:00150310protein transport
GO:00054887.7e-213binding
GO:00068861.1e-154intracellular protein transport
GO:00301171.1e-154membrane coat
GO:00161921.1e-154vesicle-mediated transport
GO:00301313.8e-45clathrin adaptor complex
KEGG pathwaytca:6600110.0 
 K11824 (AP2A)maps-> Huntington's disease
    Endocytosis
InterPro domain[1-929] IPR0171040Adaptor protein complex AP-2, alpha subunit
[6-617] IPR0119897.7e-213Armadillo-like helical
[30-587] IPR0025531.1e-154Clathrin/coatomer adaptor, adaptin-like, N-terminal
[8-593] IPR0160245.4e-112Armadillo-type fold
[816-928] IPR0158731.4e-47Clathrin alpha-adaptin/coatomer adaptor, appendage, C-terminal subdomain
[815-928] IPR0090287e-46Clathrin/coatomer adaptor, adaptin-like, appendage, C-terminal subdomain
[693-815] IPR0130383.8e-45Clathrin adaptor, alpha-adaptin, appendage, Ig-like subdomain
[815-923] IPR0031647.6e-43Clathrin adaptor, alpha-adaptin, appendage, C-terminal subdomain
[684-814] IPR0130412e-36Clathrin/coatomer adaptor, adaptin-like, appendage, Ig-like subdomain
[698-809] IPR0081522.5e-26Clathrin adaptor, alpha/beta/gamma-adaptin, appendage, Ig-like subdomain
Orthology groupMCL11387 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206453-TA
ATGCCTGCCGTCAGAGGAGATGGAATGCGAGGACTCGCAGTTTTTATATCAGATATACGGAATTGTAAGAGTAAAGAAGCAGAAATAAAAAGAATTAATAAAGAACTGGCTAACATACGTAGTAAATTTAAAGGTGACAAAACCTTAGATGGATATCAGAAGAAGAAGTATGTCTGCAAACTCCTGTTTATCTTTCTGTTGGGTCATGATATTGATTTCGGTCATATGGAAGCAGTGAACTTACTGTCCTCTAATAAATACTCAGAGAAGCAAATTGGATATCTTTTTATATCAGTTTTGGTAAATACTAATAGTGATCTCATAAAACTTATCATACAGAGCATTAAAAATGACTTGCAGTCTCGAAACCCTATTCACGTAAATCTTGCACTACAATGTATAGCTAATATTGGTAGTAAGGATATGGCTGAGGCTTTTGGAACCGAGATTCCAAAATTACTAGTCTCTGGTGACACCATGGATGTAGTCAAGCAGTCAGCAGCACTATGTCTTCTCAGATTGTTTCGTAAGTGTCCAGAAATTATTCCAGGAGGAGAATGGACTTCAAGAATCATACATTTGCTGAATGACCCACATATGGGGGTAGTCACTGCTGCCACATCTCTTATAGATGCACTTGTTAAGAAAAATCCAGAAGAATATAAAGGATGTGTCACACTGGCTGTGGCCCGCCTTAGTAGGATTGTTACAGCAAGTTACACTGATTTACAGGATTATACATATTATTTTGTGCCAGCCCCTTGGTTATCTGTTAAACTTTTGCGCTTACTCCAGAACTACACCCCACCTTCAGAAGAGCCTGGAGTTCGTGGACGTTTATCTGAGTGCTTGGAAACCATATTTAATAAAGCTCAAGAGCCACCTAAGTCAAAGAAAGTGCAACATTCGAATGCAAAGAATGCTGTTCTCTTTGAAGCAATAAGTCTTATAATTCACAATGATAGTGAACCTAATTTACTAGTTAGGGCATGCAATCAGCTTGGACAATTTTTAAGTAACAGAGAAACTAATTTGAGATACTTGGCACTTGAGTCAATGTGCCACCTTGCAACATCAGAATTTTCCCACGAAGCTGTTAAAAAACATCAGGAAGTTGTAATTTTATCTATGAAAATGGAAAAGGATGTGTCTGTTAGACAGCAAGCTGTAGATTTACTATATGCCATGTGTGATAAAACAAATGCTGAAGAAATTGTCCAAGAAATGTTGGCTTATCTTGAAACAGCTGATTATTCAATCAGAGAAGAAATGGTCCTCAAAGTTGCCATATTATCTGAAAAATATGCTACTGACTTCACTTGGTATGTTGATGTCATATTAAACCTTATAAGAATAGCTGGAGATTATGTTTCAGAGGAAGTGTGGTATAGAGTTATTCAAATTGTCATAAACAGAGATGAAGTCCAAGGATATGCAGCAAAAACTGTTTTTGAAGCCCTTCAAGCTCCAACTTGTCATGAAAATATGGTAAAAGTGGGTGGATACATACTGGGTGAATTTGGAAACTTAATTGCTGGTGATACAAGATCTTCACCACAAGTCCAATTTGAACTGCTACATTCAAAGTATCATCTTTGTTCTGCAGCTACAAGAGCACTGTTATTGTCAACTTACATTAAACTGGTAAATCTCTTCCCTGAAATTAAAAACAGAGTGCAAGAAGTTTTCCGTGCTGACTCAAATTTGCGATCTGCTGATGTAGAATTACAACAAAGGGCATCAGAATATCTGCAATTAAGTATAGTTGCCAGTTCAGACGTATTAGCAACAGTTTTGGAAGAAATGCCTGCATTCCCAGAACGGGAATCATCAATTTTGGCTGTACTGAAAAAGAAGAAACCAGGTCGCATACCTGATGATGTAAAGGAGTCTAAGAGTCCTCAACCCAGTATCACACCAGCTCCAGTTATTAATAATTCTATAAACAGCAATAATTCCAGTGCTGATCTTCTTGGTTTATCAACTCCTCCTGGTACTAATGCCACTACAGGAAATGGTTTATTAGATGTTCTTGGAGACTTATACTCTACACCCAAGAAAAGCCCAATCACTGTACAACAAAATAATATTAAGAAATTCTTGTTTAAGAATAATGGAGTACTCTTTGAAAATGATCTCATACAAATTGGCGTTAAAAGTGAATTCAGACAGAATTTGGGAAGAATCGGACTATTTTATGGTAATAAGACACAATCTGCTATTCAAAATGTCCATCCTGAACTACATTGGACTGATTTGCACAAACTGAATGTGCAGATGAAACCTATGGAACCTGTTCTGGAAGCAGGTGCTCAAATTCAACAAATGCTAACAGCTGAGTGCATTGAAGACTTTGCTGATGCACCAAGTATGTCAGTGTCATTCCTGTACAACAATGTTCCACAGAAAATCTCAATGAAACTGCCCTTAACACTAAATAAATTCTTTGAACCAACTGAAATGAATGGAGAATCATTTTTCGCTAGGTGGAAGAATTTAGGTGGTGAACAACAAAGGGCGCAAAAAATTTTCAAAGCTCAAGGCGCAATAGATATCCCAGCCACCCGAACTAAACTGGCTGGTTTCGGTATGCAATTATTAGATGGTATTGATCCCAATCCTGACAACTTTGTGTGTGCAGGAATTGTACATACAAGAGTTCAGCAGGTAGGATGCTTAATGAGATTGGAACCTAACAAACAAGCTCAAATGTTTAGACTTACTGTTAGATCAAGTAAAGAAACGGTCTCACAGGAAATATGTAATTTGCTAGCTGATCAATTCTAA

Protein sequence:

>DPOGS206453-PA
MPAVRGDGMRGLAVFISDIRNCKSKEAEIKRINKELANIRSKFKGDKTLDGYQKKKYVCKLLFIFLLGHDIDFGHMEAVNLLSSNKYSEKQIGYLFISVLVNTNSDLIKLIIQSIKNDLQSRNPIHVNLALQCIANIGSKDMAEAFGTEIPKLLVSGDTMDVVKQSAALCLLRLFRKCPEIIPGGEWTSRIIHLLNDPHMGVVTAATSLIDALVKKNPEEYKGCVTLAVARLSRIVTASYTDLQDYTYYFVPAPWLSVKLLRLLQNYTPPSEEPGVRGRLSECLETIFNKAQEPPKSKKVQHSNAKNAVLFEAISLIIHNDSEPNLLVRACNQLGQFLSNRETNLRYLALESMCHLATSEFSHEAVKKHQEVVILSMKMEKDVSVRQQAVDLLYAMCDKTNAEEIVQEMLAYLETADYSIREEMVLKVAILSEKYATDFTWYVDVILNLIRIAGDYVSEEVWYRVIQIVINRDEVQGYAAKTVFEALQAPTCHENMVKVGGYILGEFGNLIAGDTRSSPQVQFELLHSKYHLCSAATRALLLSTYIKLVNLFPEIKNRVQEVFRADSNLRSADVELQQRASEYLQLSIVASSDVLATVLEEMPAFPERESSILAVLKKKKPGRIPDDVKESKSPQPSITPAPVINNSINSNNSSADLLGLSTPPGTNATTGNGLLDVLGDLYSTPKKSPITVQQNNIKKFLFKNNGVLFENDLIQIGVKSEFRQNLGRIGLFYGNKTQSAIQNVHPELHWTDLHKLNVQMKPMEPVLEAGAQIQQMLTAECIEDFADAPSMSVSFLYNNVPQKISMKLPLTLNKFFEPTEMNGESFFARWKNLGGEQQRAQKIFKAQGAIDIPATRTKLAGFGMQLLDGIDPNPDNFVCAGIVHTRVQQVGCLMRLEPNKQAQMFRLTVRSSKETVSQEICNLLADQF-