Monarch geneset OGS2.0

DPOGS205083
TranscriptDPOGS205083-TA2328 bp
ProteinDPOGS205083-PA775 aa
Genomic positionDPSCF300074 + 185662-192978
RNAseq coverage2570x (Rank: top 5%)
Annotation
HeliconiusHMEL0057440.089.86% 
BombyxBGIBMGA006878-TA0.081.17% 
Drosophilamys-PA0.057.94% 
EBI UniRef50UniRef50_B3MRJ40.059.17%Integrin beta n=1 Tax=Drosophila ananassae RepID=B3MRJ4_DROAN
NCBI RefSeqNP_001161754.10.082.35%integrin beta subunit 1 [Bombyx mori]
NCBI nr blastpgi|3201304470.083.57%integrin beta 1 subunit [Spodoptera exigua]
NCBI nr blastxgi|3201304470.083.70%integrin beta 1 subunit [Spodoptera exigua]
Group
Gene OntologyGO:00083057.6e-158integrin complex
GO:00071607.6e-158cell-matrix adhesion
GO:00071557.6e-158cell adhesion
GO:00048727.6e-158receptor activity
GO:00072297.6e-158integrin-mediated signaling pathway
GO:00054887.6e-158binding
KEGG pathwayaga:AgaP_AGAP0008150.0 
 K05719 (ITGB1)maps-> Axon guidance
    Leishmaniasis
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Pathways in cancer
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
    Bacterial invasion of epithelial cells
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    ECM-receptor interaction
    Small cell lung cancer
    Dilated cardiomyopathy
    Cell adhesion molecules (CAMs)
InterPro domain[8-775] IPR0158120Integrin beta subunit
[1-431] IPR0023692.1e-202Integrin beta subunit, N-terminal
[729-775] IPR0148367e-26Integrin beta subunit, cytoplasmic
[618-704] IPR0128964.1e-15Integrin beta subunit, tail
Orthology groupMCL10164 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205083-TA
ATGGAAGGCGGCACCGCTGGCTGTGACGAAGCCTACATATTTAATCCGGATAACCAACAGTCAATCGACGCTTCTTTTAACAGAGAATTGACTAGAGCCAAAGGTCGTATGGGCGTTGGGATGGAGTCCTCTTACTACGAAGAATCTATGAGTAGCAGCAGCAGCAGCAGCAGCAGTAGTAGCAGTAGCAGCAGTAGCATCAAGGGCGGTGGTTACATGGGAGCAGCGGGCGGTGAAAACCTGGTGCAGATTAAACCACAAAGAGTTAAACTACAATTACGGATGAATCAAATGCAAAAGATGTCATTCTCGTACGCTCAAGCCCAAGACTATCCGGTTGACCTCTACTACCTCATGGATCTCAGTAGGTCTATGAAGAACGACAAGGAGAAGCTCAGTACACTAGGAAGTCTACTGTCTAGTACTATGAGAAATATCACATCCAACTTTAGAATTGGTTTTGGTTCCTTCGTGGATAAACTAGTCATGCCATATGTGTCCACAGTACCAAAAAATTTGATATCGCCGTGTGACGGTTGTGCTGCCCCCTACGGTTACAAGAATCAAATGTCTCTGAGCAACGACACGGACTTTTTCGATAAAGCTGTAGCCCGAGCTGACGTATCCGGTAACTTGGACGCCCCGGAAGGAGGCTTTGACGCTATCATGCAGGCTGTGGTTTGTAAGAGAGAGATCGGCTGGAGGGAACATGCAAGAAAACTACTCGTGTTTTCAACTGATGCTGGGTTCCATTACGCTGGTGACGGAAAGTTGGGAGGTATAGTCCAGCCTAATGATGGTGAATGTCACATGGAAGATAATTCATACACACACTCAACAAAACAGGATTATCCAAGCATTTCACAGATCAATTTGAAGGTTAAAGAGCACGCTATAAACGTGATATTCGCTGTGACAGCTGAGCAGATCAATGTGTATGAGCAGCTCAGCAAACATATTGAGGGTTCCAGCTCGGGTATACTGAGTGAGGACTCGGACAATGTGGTTGACCTTGTTAGAGAGCAATACAATAAAATTACTTCAGCTGTTGAGATGAAGGATACATCTAGTGACGCGGTACAGATATTGTATTACTCATCATGTTTGGGCGGCAAGCTGCAGCAGACTAACAAGTGTGAGGGGCTTAAGGTTGGAGATGTTGTGGAATTCACAGCGGAAATAACATTAAAGGAATGTCCCAAGGACCGTAATAAGTGGAAACAGATGTTTACTATTTATCCTGTCGGTGTGTCTGAAAGTTTAACTGTAGAGTTGGAAATGCTCTGTGACTGTCCCTGTGAACATCCCGGGCACCATGCGTACAACGACAGTCCATTAGTATGTAGCGGGCAGGGTATCTCTCAGTGTGGAGTCTGTGTGTGTCCTCCGGGGCGCTTCGGTAAGAACTGCGAGTGTTCAGCCCACGGCGGCGTGTCCGTGGAACAGGAGCGAGGCTGCAGACCGACCAACGCTAGCTCTGGACCTATGTGTTCCAACAGAGGCATGTGCATCTGCGGTATATGCGAGTGTAACAAGATGGACGACCCGCTGAAGGTGATATCTGGTCCGTTCTGCGAGTGCGATAACTTCACGTGCGACATGAACAAGGGTCAGCTGTGTTCTGGCCCTGATCACGGGGAGTGCGTGTGCGGAAAGTGTTCCTGTCACCCGCAGTACAGCGGCCCCGCCTGCCAGTGTCTTAAGGACCAGGCGCCTTGCATGTCACCCGAAAACAACAAAATCTGCAGCGGCAATGGTAAATGCGTTTGCGGTCAGTGCGTGTGCAATGTTGATGAAGATAGACACTACTACGGAAAATATTGTGAGAACTGTCCGACGTGCCCCGGCCGTTGTGAGGACTTCAAGCAGTGCGTGTTGTGTGAAGTACACAAGCGCGGGCCCCTGTACCGGGACGAGGCGCCGGAGTGCGGGGACTGCGCCCTCTACCCCGAGGTCGTCGAGGGAAAGATAGAAGCAAACGAGACGCTGAATGAACATTTGTGTAGTTTCTACGACGACGAAGACTGTTTGTACGTTTACGTGTATTCATACAACGAAACAAGGCATTTGCACATCAGAGCACAGAAAGAACACGAATGTCCTAAAAAGGTTTATATCCTGGGTATCGTGTTGGGTGTGATAGCGGCCATCGTGCTGGTGGGGCTGGCGCTGCTGATGCTGTGGAAGATGGTCACCACCATACACGACAGGAGAGAGTTCGCACGCTTCGAGAAGGAACGCATGATGGCTAAGTGGGACACGGGTGAAAATCCGATTTACAAGCAAGCGACATCTACATTCAAAAATCCGACCTACGCCGGGAAATAG

Protein sequence:

>DPOGS205083-PA
MEGGTAGCDEAYIFNPDNQQSIDASFNRELTRAKGRMGVGMESSYYEESMSSSSSSSSSSSSSSSSIKGGGYMGAAGGENLVQIKPQRVKLQLRMNQMQKMSFSYAQAQDYPVDLYYLMDLSRSMKNDKEKLSTLGSLLSSTMRNITSNFRIGFGSFVDKLVMPYVSTVPKNLISPCDGCAAPYGYKNQMSLSNDTDFFDKAVARADVSGNLDAPEGGFDAIMQAVVCKREIGWREHARKLLVFSTDAGFHYAGDGKLGGIVQPNDGECHMEDNSYTHSTKQDYPSISQINLKVKEHAINVIFAVTAEQINVYEQLSKHIEGSSSGILSEDSDNVVDLVREQYNKITSAVEMKDTSSDAVQILYYSSCLGGKLQQTNKCEGLKVGDVVEFTAEITLKECPKDRNKWKQMFTIYPVGVSESLTVELEMLCDCPCEHPGHHAYNDSPLVCSGQGISQCGVCVCPPGRFGKNCECSAHGGVSVEQERGCRPTNASSGPMCSNRGMCICGICECNKMDDPLKVISGPFCECDNFTCDMNKGQLCSGPDHGECVCGKCSCHPQYSGPACQCLKDQAPCMSPENNKICSGNGKCVCGQCVCNVDEDRHYYGKYCENCPTCPGRCEDFKQCVLCEVHKRGPLYRDEAPECGDCALYPEVVEGKIEANETLNEHLCSFYDDEDCLYVYVYSYNETRHLHIRAQKEHECPKKVYILGIVLGVIAAIVLVGLALLMLWKMVTTIHDRREFARFEKERMMAKWDTGENPIYKQATSTFKNPTYAGK-