Monarch geneset OGS2.0

DPOGS211015
TranscriptDPOGS211015-TA1002 bp
ProteinDPOGS211015-PA333 aa
Genomic positionDPSCF300004 + 1253440-1256053
RNAseq coverage1013x (Rank: top 13%)
Annotation
HeliconiusHMEL0060453e-16186.19% 
BombyxBGIBMGA006378-TA0.093.99% 
Drosophilastck-PC6e-16177.78% 
EBI UniRef50UniRef50_Q7Z4I73e-11965.08%LIM and senescent cell antigen-like-containing domain protein 2 n=97 Tax=Bilateria RepID=LIMS2_HUMAN
NCBI RefSeqXP_001358747.22e-16078.08%GA20717 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1984523824e-15978.08%GA20717 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|2897409914e-17177.78%focal adhesion protein PINCH-1 [Glossina morsitans morsitans]
Group
Gene OntologyGO:00082705.7e-22zinc ion binding
KEGG pathwayaga:AgaP_AGAP0085322e-33 
 K05760 (PXN)maps-> Chemokine signaling pathway
    Regulation of actin cytoskeleton
    Leukocyte transendothelial migration
    Bacterial invasion of epithelial cells
    Focal adhesion
    VEGF signaling pathway
InterPro domain[1-334] IPR0173513.9e-210PINCH
[196-257] IPR0017815.7e-22Zinc finger, LIM-type
Orthology groupMCL11653 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211015-TA
ATGTCTTTGGACAATATGTTCTGCACTCGGTGCGGAGACGGATTTGAGACCAATGAAAAGATAGTTAATTCTAATGGAGAGCTCTGGCATACAGGATGTTTTGTGTGTGCCCAATGCTTCCGTGTATTTCCTGATGGCGTCTTCTTTGAATTTGAAGGTCGTAAATACTGCGAGCGTGATTTCCAAGTTTTATTTGCACCTTGCTGTGGCAAATGTCGGGAATTCATAATTGGTCGTGTGATAAAAGCAATGAATTCCAACTGGCATCCAGCTTGTTTCCGTTGTGAGGAATGTAACGCTGAACTAGCTGATGCAGGGTTTATTAAACACGCCGGTCGTGCTTTGTGCCACGCTTGTAACGCAAGAATCAAGGCGGACGGACTTCAGAACTATATATGCCACAAATGCCATGGTGTGATCGACGGGGAGCCGCTCCGTTACCGAGGTGAGGTATACCACGGTTATCATTTCACGTGTGCAACCTGCGGCCTAGAGTTGGACCACACCGCCCGCGAGGTGAAAAACCGGCCTGGATACGCTGCCAATGACGTGAATAATCTATTCTGCCTCCGTTGTCATGATAAAATGGGTATACCCATTTGTGGAGCTTGTCGCCGCCCTATCGAAGAAAGAATTGTTACTGCTTTGGGAAAACATTGGCATGTTGAGCACTTCGTGTGTGCCAAATGTGAGAAACCGTTCCACGGTCATAGACACTACGAGAAGAAGGGACTTGCGTACTGCGAACAACACTATCACCAGCTGTTCGGCAACCTGTGCTATGTTTGCAACCAAGTTATCGCAGGAGATGTTTTCACCGCATTGAACAAAGCATGGTGCGTTCATCATTTCGCGTGCGCGGTCTGTGACACCGCGCTCAGCACTCGTAGTAAGTTCTACGAGTATGATGAGCGTCCAGCGTGTAGAAGGTGCTATGAGCGGCTACCATCAGAGCTACGGCGTAGACTACGACGAGCTCACCATTATACTATGCGACGGTGA

Protein sequence:

>DPOGS211015-PA
MSLDNMFCTRCGDGFETNEKIVNSNGELWHTGCFVCAQCFRVFPDGVFFEFEGRKYCERDFQVLFAPCCGKCREFIIGRVIKAMNSNWHPACFRCEECNAELADAGFIKHAGRALCHACNARIKADGLQNYICHKCHGVIDGEPLRYRGEVYHGYHFTCATCGLELDHTAREVKNRPGYAANDVNNLFCLRCHDKMGIPICGACRRPIEERIVTALGKHWHVEHFVCAKCEKPFHGHRHYEKKGLAYCEQHYHQLFGNLCYVCNQVIAGDVFTALNKAWCVHHFACAVCDTALSTRSKFYEYDERPACRRCYERLPSELRRRLRRAHHYTMRR-