Monarch geneset OGS2.0

DPOGS206301
TranscriptDPOGS206301-TA1323 bp
ProteinDPOGS206301-PA440 aa
Genomic positionDPSCF300082 - 1206288-1211605
RNAseq coverage2532x (Rank: top 5%)
Annotation
HeliconiusHMEL0171170.099.09% 
BombyxBGIBMGA009359-TA4e-9138.83% 
DrosophilaAP-50-PC0.095.23% 
EBI UniRef50UniRef50_O625300.095.23%AP-50, isoform A n=133 Tax=Opisthokonta RepID=O62530_DROME
NCBI RefSeqXP_001845524.10.097.05%clathrin coat assembly protein AP50 [Culex quinquefasciatus]
NCBI nr blastpgi|3323759190.097.50%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323759190.097.50%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00068862.6e-282intracellular protein transport
GO:00301322.6e-282clathrin coat of coated pit
GO:00301312.6e-282clathrin adaptor complex
GO:00161922.6e-282vesicle-mediated transport
GO:00055159.8e-89protein binding
GO:00068104.7e-43transport
KEGG pathwaycqu:CpipJ_CPIJ0036970.0 
 K11826 (AP2M1)maps-> Huntington's disease
    Endocytosis
InterPro domain[1-440] IPR0156292.6e-282Clathrin coat associated protein AP-50
[1-440] IPR0013922.4e-202Clathrin adaptor, mu subunit
[157-440] IPR0089689.8e-89Clathrin adaptor, mu subunit, C-terminal
[1-141] IPR0110124.7e-43Longin-like
[2-126] IPR0227752.1e-06AP complex, mu/sigma subunit
Orthology groupMCL15366 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206301-TA
ATGATTGGGGGGCTGTTCGTATACAATCACAAAGGGGAGGTGTTGATATCTCGGGTGTACCGTGACGATATCGGTCGTAATGCCGTGGATGCGTTCAGGGTGAACGTGATCCACGCTCGTCAGCAGGTTCGCTCGCCTGTCACCAACATCGCTAGGACATCATTTTTTCATATAAAGAGAGCCAACATTTGGCTAGCGGCGGTCACGAAGCAGAATGTCAACGCCGCTATGGTGTTTGAATTCCTATTGAAGATCATTGATGTGATGCAGTCTTACTTCGGAAAGATATCCGAAGAAAACATCAAGAATAACTTCGTGCTCATCTACGAACTGCTGGATGAAATCCTGGACTTTGGCTACCCTCAGAACTCGGATACCGGAGTCCTGAAGACGTTTATCACTCAGCAAGGAATCAAGTCGGCCACCAAGGAAGAACAGGCTCTCATTACGTCACAGGTGACTGGCCAGATCGGCTGGCGTCGTGAAGGCATCAAGTATCGACGTAATGAGTTGTTCCTCGATGTACTGGAGTATGTCAATTTATTGATGTCACCTCAAGGTCAAGTTTTGTCGGCTCATGTTGCCGGTAAGGTGGTGATGAAGTCTTACCTGTCAGGAATGCCGGAATGCAAGTTTGGCATTAATGATAAAATTGTCATGGAAGCCAAGGGCAAAGGCAACGGCGGCATCTCCGGCAACACGGACAGTGACCCAGCTCGCTCCGGTAAACCTGTAGTCGTAATCGATGACTGTCAGTTCCACCAGTGCGTGAAGCTCAGCAAGTTTGAGACGGAGCACTCTATATCGTTCATACCGCCCGATGGAGAGTTTGAACTCATGAGATACCGTACAACAAAGGATATATCCCTGCCGTTCCGTGTGATCCCCTTGGTGCGTGAAGTCGGTCGCACCAAGATGGAAGTGAAGGTAGTCCTGAAGTCAAACTTCAAGCCGTCGCTCCTGGGGCAGAAGATCGAAGTGAAGATCCCGACACCGTTGAACACGAGCGGGGTGCAGTTGATCTGTCTGAAGGGGAAGGCCAAGTACAAGCCCTCGGAGAACGCTATTGTGTGGAAGATCAAGCGTATGGCTGGTATGAAGGAGACCCAGTTGTCCGCTGAGATCGAGCTCCTGGAGACTGACACCAAGAAGAAGTGGACGCGGCCACCGATCTCTATGGGATTCGAAGTGCCCTTCGCACCTTCCGGCTTCAAGGTCCGTTATCTGAAGGTGTTCGAGCCCAAGCTGAACTACTCTGATCACGATGTTATTAAATGGGTCCGGTACATCGGACGTTCCGGGCTGTACGAGACGCGATGTTAA

Protein sequence:

>DPOGS206301-PA
MIGGLFVYNHKGEVLISRVYRDDIGRNAVDAFRVNVIHARQQVRSPVTNIARTSFFHIKRANIWLAAVTKQNVNAAMVFEFLLKIIDVMQSYFGKISEENIKNNFVLIYELLDEILDFGYPQNSDTGVLKTFITQQGIKSATKEEQALITSQVTGQIGWRREGIKYRRNELFLDVLEYVNLLMSPQGQVLSAHVAGKVVMKSYLSGMPECKFGINDKIVMEAKGKGNGGISGNTDSDPARSGKPVVVIDDCQFHQCVKLSKFETEHSISFIPPDGEFELMRYRTTKDISLPFRVIPLVREVGRTKMEVKVVLKSNFKPSLLGQKIEVKIPTPLNTSGVQLICLKGKAKYKPSENAIVWKIKRMAGMKETQLSAEIELLETDTKKKWTRPPISMGFEVPFAPSGFKVRYLKVFEPKLNYSDHDVIKWVRYIGRSGLYETRC-