Monarch geneset OGS2.0

DPOGS200145
TranscriptDPOGS200145-TA1197 bp
ProteinDPOGS200145-PA398 aa
Genomic positionDPSCF300128 - 284038-288028
RNAseq coverage7021x (Rank: top 2%)
Annotation
HeliconiusHMEL0056542e-16499.63% 
BombyxBGIBMGA002788-TA2e-14384.84% 
DrosophilaPax-PI9e-15263.18% 
EBI UniRef50UniRef50_E2C8K42e-15863.84%Paxillin n=6 Tax=Formicidae RepID=E2C8K4_HARSA
NCBI RefSeqXP_002064422.11e-16166.20%GK23838 [Drosophila willistoni]
NCBI nr blastpgi|1954328363e-16066.20%GK23838 [Drosophila willistoni]
NCBI nr blastxgi|3838638791e-16764.60%PREDICTED: paxillin-like [Megachile rotundata]
Group
Gene OntologyGO:00082702.9e-23zinc ion binding
KEGG pathwaydwi:Dwil_GK238384e-161 
 K05760 (PXN)maps-> Chemokine signaling pathway
    Regulation of actin cytoskeleton
    Leukocyte transendothelial migration
    Bacterial invasion of epithelial cells
    Focal adhesion
    VEGF signaling pathway
InterPro domain[140-205] IPR0017812.9e-23Zinc finger, LIM-type
Orthology groupMCL13321 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200145-TA
ATGAGACTATTTAATTTGATCTATGGAAGAATAATTCAATTAACTATGATTTGGTTATGTCCAACATGCCGTTCGTCTGTATCTCCGATCCCATCCGGTACTCTCCCGCGACCTGGCACCAAACAGGTGACGGTGACCGTTCAAGAGACGGTGGTCGAACCAGCCCAGGCACCACCACCACAGGCCACCACCGTGAGACACCATCACGCCTCCAGCGCTACTAAGGAGCTGGACGACTTGATGGCATCCCTCTCAGACTTTAAGGTAAGCGGCGGTGCAGGTCCAGGCGAACAAGGGACCCACGTGTACAGAGAGAGGAAAGCCTGGGAGGAACATTACCGCAGCCCGCAACCGGAGGCCGCTTCGCTGGAACACATGCTTGGCTCTCTTCGAGCAGACATGAGCCGCCAAGGAGTACAAACACCCCAGAAGGGATGCTGCAACGCCTGCGAGAAACCGATCGTCGGACAGGTCATCACAGCGCTGGGACGCACGTGGCATCCCGAGCACTTCACGTGTGCTCATTGTAACCAAGAGCTCGGCACCAGGAACTTCTTCGAGCGCGACGGCCACCCGTACTGCGAGCCCGACTACCACAACCTGTTCTCACCGAGATGCGCCTACTGCAACGGACCGATCCTGGACAAATGCGTGACGGCGCTGGAGAAGACCTGGCACACGGAGCACTTCTTCTGCGCTCAGTGCGGCCAGCAGTTCGGGGAAGAAGGATTCCACGAGAGGGACGGGAAACCGTACTGTAGGGCCGATTACTTCGACATGTTCGCGCCGAAGTGCGGCGGCTGCAACAAGCCGATCATGGAGAACTACATCTCCGCCCTGAACACACAGTGGCATCCTGACTGCTTCGTCTGCAAGGATTGTCAGATGGCTGTTAAGGGAAAAACCTTCTATGCGATGGAGGGTAAGCCGGAATGCCGCGAGCCTTTCCACGGCGGTTCATTCTTCGAGCACGAGGGCCAACCGTACTGCGAGACTCACTACCACGGGAAGCGAGGGTCTCTGTGCGCCGGGTGTCACAAGCCCATAGCCGGGAGATGTATCACGGCGATGTTCAGGAAGTTCCACCCGGAACACTTCGTCTGCGCGTTCTGCCTCCGCCAGCTCAACAAGGGCACCTTCAAAGAACAGAACGACAAACCCTACTGTCACGCCTGCTTCGATAAACTCTTCGGCTGA

Protein sequence:

>DPOGS200145-PA
MRLFNLIYGRIIQLTMIWLCPTCRSSVSPIPSGTLPRPGTKQVTVTVQETVVEPAQAPPPQATTVRHHHASSATKELDDLMASLSDFKVSGGAGPGEQGTHVYRERKAWEEHYRSPQPEAASLEHMLGSLRADMSRQGVQTPQKGCCNACEKPIVGQVITALGRTWHPEHFTCAHCNQELGTRNFFERDGHPYCEPDYHNLFSPRCAYCNGPILDKCVTALEKTWHTEHFFCAQCGQQFGEEGFHERDGKPYCRADYFDMFAPKCGGCNKPIMENYISALNTQWHPDCFVCKDCQMAVKGKTFYAMEGKPECREPFHGGSFFEHEGQPYCETHYHGKRGSLCAGCHKPIAGRCITAMFRKFHPEHFVCAFCLRQLNKGTFKEQNDKPYCHACFDKLFG-