Monarch geneset OGS2.0

DPOGS202157
TranscriptDPOGS202157-TA1476 bp
ProteinDPOGS202157-PA491 aa
Genomic positionDPSCF300162 - 75619-81555
RNAseq coverage426x (Rank: top 29%)
Annotation
HeliconiusHMEL0037015e-11164.50% 
BombyxBGIBMGA003431-TA6e-10962.70% 
DrosophilaWASp-PB1e-6241.88% 
EBI UniRef50UniRef50_E2BEB97e-7544.74%Wiskott-Aldrich syndrome protein n=8 Tax=Formicidae RepID=E2BEB9_HARSA
NCBI RefSeqXP_002432217.13e-7541.73%Neural Wiskott-Aldrich syndrome protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838552603e-8339.86%PREDICTED: uncharacterized protein LOC100882400 [Megachile rotundata]
NCBI nr blastxgi|1565379525e-10442.56%PREDICTED: hypothetical protein LOC100124250 [Nasonia vitripennis]
Group
Gene OntologyGO:00055154.4e-44protein binding
GO:00081543.3e-23actin polymerization or depolymerization
GO:00050833.3e-23small GTPase regulator activity
GO:00156293.3e-23actin cytoskeleton
GO:00064613.3e-23protein complex assembly
KEGG pathwayphu:Phum_PHUM5773101e-74 
 K05747 (WAS)maps-> Shigellosis
    Chemokine signaling pathway
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Bacterial invasion of epithelial cells
    Fc gamma R-mediated phagocytosis
    Adherens junction
InterPro domain[5-139] IPR0119934.4e-44Pleckstrin homology-type
[22-127] IPR0006971.3e-27EVH1
[208-312] IPR0110263.3e-23Wiscott-Aldrich syndrome, C-terminal
[203-259] IPR0000952.5e-11PAK-box/P21-Rho-binding
Orthology groupMCL11619 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202157-TA
ATGCCAAAGGGAGAGAACAGGCCGAGCGTCCTGTTGACTCCGGAAGAGAACGATCTGGTGTTTAGCCTCATCGGAGCTAAATGTCAGAGTCTAGCGACAGCTGTAGTACAATTATTCACTACCGAGGGACCGGATCATTCAGAATGGAAGAAGAAAGACACGGGGGTGCTGTGCCTTATAAAAGATAATAGCAAACGTTCATACTTCTTCCGGATCTACTGCCTCTATCGGAGGTCGTTGATTTGGGAACATGAAGTCTATCTGCAGATTGAATACAAAAATCCCAGACCGTATTTACATACGTTTGAAGCCGAGGAATACATGACGGCATTTAATTTCGCAAATGAAATGGAGGCGACGGTGCTAAGGAATATTCTTTTAGAGAAAATTGAACTGCGTAAACAAAGACGGCAAGTTCGTAACAATCGTTCGATGATGGTCCCCCGTAATAACTCGACGGTTCATGAGTCTTCGTCGCGGTACAACGGCGCCCCTCCCCCGCCGCCGCTCACCACCACCACCGCCACCACCAATACTAAGACTAACACCCTCAATTCCTTGAAAGGCTCGGGGAGGAAACCGAAAGCGCGCAAACTGACCAAGGCTGACATCGGCATGCCGAAGGACTTCAAGCACGTGTCACACGTCGGATGGGACGCCAACAAAGGGTTCGACGTGGATCTGCCGGAGGAGAAGCTCCGCTGGTTCTTCGACAAGGCGGGCGTGTCGGAGACGCAGCTCAACGACCAGGAGACGAGGATGTTCATATACGACTTCATCATCAAGAACGGCGGAGCGGACGCGGTCAACGAGGACCTCACGGACGAACCGCCGCCGCCATACTCGGAGTCCCGGAGCCCCGCGCCGCCTGTCCCCGCCCGCGCCCCGCACCCCCCCGCGCCCCCGTCTCGTGCTCCGCCCCCGCCGCCGGCGCGGTCCGTACCCCCTCCGCCGCCGCCGCCAGCGACCCTCGCGCCGCGGAACCCTCCGCCGCCCAGACCGACACAACCCCCGGCCCCGGCGCCGCCGTCCATGCCTCCCCCTCCTCCCCCGCCGTCGCTGGCTCCGCCTCCGCCGCCGCCTCCCCCCGCACCGCCCGCGCCTCCCCCGCCGAGCTCCGAGGACAGGTCAGAGCTGCCCGCCGCCAACAGTGACCCGCGGGCCGCTCTCATGGCGAGCATACGGAGCGGCAACAAGAACTTGAGGCCCGTGGATTCCGTATCTAAGTCTTCGGCCAGCACGGACGACAGCAGGAACAACTTATTGAGCGAAATCCGTCAAGGGATCACATTGAAATCGGTGCGGCGGGAGAGTGTCACCGCGGGCGACGAGAAGACTACCAACAACGTAGAAAACGCGAGTGGCCTCGCCGGCGCGCTGGCCCGGGCGCTCAAGGAGCGGGCGAGGGCGATACACTCCTCGGACGACGAAGACGACACCGACAACACCACCAGCGACGGAGAGTGGGACTTCTAG

Protein sequence:

>DPOGS202157-PA
MPKGENRPSVLLTPEENDLVFSLIGAKCQSLATAVVQLFTTEGPDHSEWKKKDTGVLCLIKDNSKRSYFFRIYCLYRRSLIWEHEVYLQIEYKNPRPYLHTFEAEEYMTAFNFANEMEATVLRNILLEKIELRKQRRQVRNNRSMMVPRNNSTVHESSSRYNGAPPPPPLTTTTATTNTKTNTLNSLKGSGRKPKARKLTKADIGMPKDFKHVSHVGWDANKGFDVDLPEEKLRWFFDKAGVSETQLNDQETRMFIYDFIIKNGGADAVNEDLTDEPPPPYSESRSPAPPVPARAPHPPAPPSRAPPPPPARSVPPPPPPPATLAPRNPPPPRPTQPPAPAPPSMPPPPPPPSLAPPPPPPPPAPPAPPPPSSEDRSELPAANSDPRAALMASIRSGNKNLRPVDSVSKSSASTDDSRNNLLSEIRQGITLKSVRRESVTAGDEKTTNNVENASGLAGALARALKERARAIHSSDDEDDTDNTTSDGEWDF-