Monarch geneset OGS2.0

DPOGS214857
TranscriptDPOGS214857-TA1950 bp
ProteinDPOGS214857-PA649 aa
Genomic positionDPSCF300091 + 30534-42781
RNAseq coverage838x (Rank: top 15%)
Annotation
HeliconiusHMEL0150100.064.43% 
BombyxBGIBMGA010067-TA5e-9658.54% 
Drosophilalap-PD2e-14063.88% 
EBI UniRef50UniRef50_E2ANE58e-14973.18%Phosphatidylinositol-binding clathrin assembly protein LAP n=3 Tax=Formicidae RepID=E2ANE5_CAMFO
NCBI RefSeqXP_001664041.16e-15657.12%phosphatidylinositol-binding clathrin assembly protein [Aedes aegypti]
NCBI nr blastpgi|1571378021e-15457.12%phosphatidylinositol-binding clathrin assembly protein [Aedes aegypti]
NCBI nr blastxgi|3504115377e-17754.31%PREDICTED: phosphatidylinositol-binding clathrin assembly protein-like [Bombus impatiens]
Group
Gene OntologyGO:00055431.9e-53phospholipid binding
GO:00302761.1e-52clathrin binding
GO:00482681.1e-52clathrin coat assembly
GO:00055451.1e-521-phosphatidylinositol binding
GO:00301181.1e-52clathrin coat
KEGG pathway 
InterPro domain[2-202] IPR0114171.9e-53ANTH
[79-206] IPR0147121.1e-52Clathrin adaptor, phosphoinositide-binding, GAT-like
[1-65] IPR0089421.1e-13ENTH/VHS
Orthology groupMCL11570 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214857-TA
CAGAGATTCACGCAGTATTTAGCGTCGAGCAATTCGACGTTTCAATTAAGCAACTTCCATGACAAAAGCGGGGTTCAAGGAGCGGCCGGTGCCAGGATCGGATATGACATGTCGCCGTTCATCAGGAGGTACGCCAAGTACCTCAACGAGAAGGCCCTGTCCTATAGGACTGTCGCCTTTGACTTCTGCAAGGTCAAGAGGGGTAAAGAAGAGGGCTCCCTCCGTATGATGAACGCGGAGAAGCTGTTAAAGACTCTGCCGGTGCTACAAGCACAGCTCGACGCGCTGCTGGAGTTCGATTGTACAGCCAATGACCTCACCAACGGGGTCATCAATATGTGCTTCATGCTGTTATTCAGAGACCTTATCAGGCTGTTCGCTTGCTATAACGATGGCATCATCAACTTATTGGAGAAGTACTTTGATATGAACAAGAAGAACTGCCGGGATGCTCTCGACCTCTATAAGAAATTCCTCATCAGGATGGATAGAGTCGGAGAGTTCTTGAAGGTGGCAGAAAATGTCGGCATCGATAAAGGCGACATCCCTGACCTGACCAAAGCTCCCAGCAGTCTTCTGGATGCCCTTGAAGGACACCTCGCAACCCTTGAAGGCAAAAAAGGTTCCGCAGCGAACACCCCCACGCAAACCGCCAGCGCCCAAAAGAACGTGGCCAGTGTGATGGGTGCTTTATCATCAACGTCATCGTCATTCGGCAACGCGGCCGCCTCGACCAGGCTGGACGCTTCCAACGGGAGCATGTTCATAGACGACTCCCTCAAACAGCAGGCCCTGGCCGAGGAAGAAGCCGCCATGAACCAATACAAGGTCCACTTATATCAAGGGAACGAATTTACAGGTGATGAGGATATCACGCAGATAACGGAGAGATTTAACAAGGTGCAGTCTCCGACAAACTCAGCCAATAATCCATTCTTGTCTACGGGTCCGACTGTAGTGGACTTATTCGCTCCTGCACCGGAGCCCCAGCAGCCTCCTGCGGGGGGCAAAGCCTCCGACGATCTCCTTTCACTGGGCAATCCCTTCGCAGACATGTTTGGAGCCCCGCAACCCGCGCCCGCCGCCTGGCAGAGCAACGGGTTCGCGGCCTTCCCGCAACCGACGCAACCATCGCAACCAGCCCAACCCGCTCACACCAACAACACCTTCGTATCCGAGGCTAATTTCACTAACGTCTTCAACACTGACTCCGCAGAAAATTTTGATAATTTGTATGTCAAGGAGATCGATGATTTTGGTGTCGGTCATCAATGTATACATATTTATAATGAAAATCTTGTGCAAGCTGCCGCTAACCCGTTCATGGGTATGGGAGACCCAACCCAGCCGATATCGGCATCCCCAGCCCGCCCCCCACCACCGAATCCCCAAAAATCCGCCTTCGACGATCTCGAGGACGCCATGAGAATCTCACTCGGAGGCTCCCCAGCGAAACAGAGCGCACCCATCACACAGCAGCCGCAGCCGATGACGCAGCACTTCGGTGACGTCATGATGTCTCAGCCAATGATGTTCGGATCGCCAGCGAGGCAGCCGATGATGGGGATGGCCATGGGTCAACAACAACAACCACAACAGAATAAAGTACTAACCGGGGATCTAGATTCCTCGCTGGCTCAGCTCGCCAACAATCTCACTATCAACAAAGCCGCTCCCAAAGCGATGCAGTGGAGCCCGAAGAGTGGCTGCAAACCTGGGGGTGCCTGGAGTCCGCAGCCGATGCAGGCCACAACCGGAGCGGGTTACAGACCCATGGGCCAAGGCATGACTCTACCGCCAATGCCCCATACATTACCCCATACGTACCATCCCCCGCATTTTGTTCTGCAACAGCCGCCCATGGTAATGGGCATGAACGCGCCCATGATGCCCATGGGGTCGATGCCGATGCGCCCCTCCATACCGCCCACCACAAACCAATTCAGTTAA

Protein sequence:

>DPOGS214857-PA
QRFTQYLASSNSTFQLSNFHDKSGVQGAAGARIGYDMSPFIRRYAKYLNEKALSYRTVAFDFCKVKRGKEEGSLRMMNAEKLLKTLPVLQAQLDALLEFDCTANDLTNGVINMCFMLLFRDLIRLFACYNDGIINLLEKYFDMNKKNCRDALDLYKKFLIRMDRVGEFLKVAENVGIDKGDIPDLTKAPSSLLDALEGHLATLEGKKGSAANTPTQTASAQKNVASVMGALSSTSSSFGNAAASTRLDASNGSMFIDDSLKQQALAEEEAAMNQYKVHLYQGNEFTGDEDITQITERFNKVQSPTNSANNPFLSTGPTVVDLFAPAPEPQQPPAGGKASDDLLSLGNPFADMFGAPQPAPAAWQSNGFAAFPQPTQPSQPAQPAHTNNTFVSEANFTNVFNTDSAENFDNLYVKEIDDFGVGHQCIHIYNENLVQAAANPFMGMGDPTQPISASPARPPPPNPQKSAFDDLEDAMRISLGGSPAKQSAPITQQPQPMTQHFGDVMMSQPMMFGSPARQPMMGMAMGQQQQPQQNKVLTGDLDSSLAQLANNLTINKAAPKAMQWSPKSGCKPGGAWSPQPMQATTGAGYRPMGQGMTLPPMPHTLPHTYHPPHFVLQQPPMVMGMNAPMMPMGSMPMRPSIPPTTNQFS-