Monarch geneset OGS2.0

DPOGS213541
TranscriptDPOGS213541-TA1839 bp
ProteinDPOGS213541-PA612 aa
Genomic positionDPSCF300033 - 419300-424821
RNAseq coverage293x (Rank: top 38%)
Annotation
HeliconiusHMEL0054810.089.09% 
BombyxBGIBMGA011823-TA0.090.53% 
Drosophilap130CAS-PD3e-8637.05% 
EBI UniRef50UniRef50_E0VL442e-12643.04%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VL44_PEDHC
NCBI RefSeqXP_002426838.14e-12743.04%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3071846183e-12842.05%Breast cancer anti-estrogen resistance protein 1 [Camponotus floridanus]
NCBI nr blastxgi|3071846184e-12742.52%Breast cancer anti-estrogen resistance protein 1 [Camponotus floridanus]
Group
Gene OntologyGO:00055152.5e-16protein binding
KEGG pathwaydre:5683095e-50 
 K05726 (BCAR1, CAS)maps-> Chemokine signaling pathway
    Regulation of actin cytoskeleton
    Leukocyte transendothelial migration
    Bacterial invasion of epithelial cells
    Focal adhesion
InterPro domain[382-571] IPR0219015.6e-50CAS family, C-terminal domain of unknown function
[223-380] IPR0149281.3e-47Serine rich protein interaction
[37-130] IPR0014522.5e-16Src homology-3 domain
Orthology groupMCL14773 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213541-TA
ATGGTCATCGTAATGGCGAGTAAGCTGTATCTGACCAGATTCTTACGGAAAATAAGAAGATCGCGTTGCGTTGTCTGTTCTAGCAATGAAAATAAGGCTTTTCAAAATGGATCAAATTATGGGTGTATGGCGCGTGCGTTGTACGACAATATAGCGGAGTCGCCGGACGAGTTGGCGTTCAGACGAGGCGACCTCCTCACAGTCCTGGAGCAGAACACTGGCGGCAGCGAGGGCTGGTGGCTCTGCTCGCTCAGAGGAAGACAGGGAATATGCCCCGGAAACAGGCTGCGTATAGTTGCCGGGGTTTTCGACGCAAGTTCCGCTCTACAGAGGCGACGCACGCGCACCCCCGCCCCCGTCGCCCACCAACCACTCCCCTCACAACAGTCACCCGCCTTCACAAAAATCGAGGAATGCTCTCATTACGACGTCCCTCGGGCTCCGATGCCGGTTCAACGTATTATCGGTACGTACGATTGTCCCCGGTCGCAAGGGGACTGGTACGACGCCCCTCGTGCGCCGCGGCCGGCCAGCGCGGACTCCGCCTGCAGTGGCACGGGTTCCCTGACGTCGGCCACATCCAGCGCCTCCGCCAACTCGGGCAGTTCAGCGAACTCAGCTTCCAGCACTTATGATGTACCCAGATCCCGAGCCTTGCCGCTGCCGTGCGACGCCGCCATGGAGGCCTTAGAACGGTTACAGGAGGAGGCGTCCACGGCGGTTTCCCGCCTGCTGTCCTATGTCACACCGGGCTGGCGGCGGCGCGGGGCGTTGCGACCGCGCGTGCTAGACGTGCGTGTTGCGGGCGCTCGTCTGCGAGCAGCCTTACACGACCTCGCCGTGTTCGCTGACGCTACACTGGCCAACGCTCATGATGCACAAGACAAAGGTATCGCAGTAAAGCTACGGCCACTAGTGAAGGCTTTAAAGGACGCCGAGCGGATCACACACGAGGCGACCAGCGCGCTCGACGCCGGCGACTGGGCCCCGGAGCGGCTGGAGCGCGACAGGGAGCCCACGGACGGCACGCACGACGCGCTCGACCAGCTCGTCGCGTGTGCGCGCTCCCTCACCGAGGACGTGCGCCGAGCCGCCTCCTTCATACACGGAAACGCCTCACTACTGTTCAGGCGTTCCGCGACAGTTCCTGAACACGAGTGGACTGAGGAGTACGATTACGTGAGGTTGGAGTCCAGGAGTGCCGTTGGTCGGAGGAACGCCGAGATCCGGGCAGCTTTGCCGGACAAACTCAGGGCCTCCTTCGACGCGCTCGTCCGCGACGCGGACCATGCGGGCGAGGTGAGTGCTGTAGCAGCTGCAACCCGCCTTCCAGCGGATGATCGCCAGCTGGCCGCGTTCTACGCCGCGCAGACGGCTACGTACGGAGCGCACCTCTCGACCGCCGTGGAAGCCTTCCTCAGGACCATCGACATGGGACAACCGCCTGACGTGTTCCTCGCACACGGCAAGTTCGTGGTGCTCAGCGCGCACAGGATCGTACACGTCGGGGACACCGTGCACAGGAGCGCCCAGCACTCGGGGCTGAAGTCAAAAATACTAAGGTGTTCGGACGCACTATCGGACTCGTTAGCGGCGACCGTGGCCAAAACTAAAGCGGCGGCGCTGCAGTTCCCTTGCGCGAGCGCGGTGGCCGAGATGGCTGAGGCGGCGCGGACCTTGGCCGCCAGGGCGCAGGAGTTGAGACGAGCCCTAGTGAGAGCGGCCGAACCACCTCAAGACACACCCTCTACCACGGTGCCGCCGTCCTCCACCACGACACCTCTCACCCCACTCACGCCGCTCGCACCTCACCCCACCACCACCCTCCCCGTATTATAA

Protein sequence:

>DPOGS213541-PA
MVIVMASKLYLTRFLRKIRRSRCVVCSSNENKAFQNGSNYGCMARALYDNIAESPDELAFRRGDLLTVLEQNTGGSEGWWLCSLRGRQGICPGNRLRIVAGVFDASSALQRRRTRTPAPVAHQPLPSQQSPAFTKIEECSHYDVPRAPMPVQRIIGTYDCPRSQGDWYDAPRAPRPASADSACSGTGSLTSATSSASANSGSSANSASSTYDVPRSRALPLPCDAAMEALERLQEEASTAVSRLLSYVTPGWRRRGALRPRVLDVRVAGARLRAALHDLAVFADATLANAHDAQDKGIAVKLRPLVKALKDAERITHEATSALDAGDWAPERLERDREPTDGTHDALDQLVACARSLTEDVRRAASFIHGNASLLFRRSATVPEHEWTEEYDYVRLESRSAVGRRNAEIRAALPDKLRASFDALVRDADHAGEVSAVAAATRLPADDRQLAAFYAAQTATYGAHLSTAVEAFLRTIDMGQPPDVFLAHGKFVVLSAHRIVHVGDTVHRSAQHSGLKSKILRCSDALSDSLAATVAKTKAAALQFPCASAVAEMAEAARTLAARAQELRRALVRAAEPPQDTPSTTVPPSSTTTPLTPLTPLAPHPTTTLPVL-