Monarch geneset OGS2.0

DPOGS214093
TranscriptDPOGS214093-TA2283 bp
ProteinDPOGS214093-PA760 aa
Genomic positionDPSCF300014 - 2243367-2248098
RNAseq coverage141x (Rank: top 55%)
Annotation
HeliconiusHMEL0114260.046.90% 
BombyxBGIBMGA006878-TA5e-8527.48% 
Drosophilamys-PA5e-9127.92% 
EBI UniRef50UniRef50_E0VEZ03e-9934.22%Integrin beta n=1 Tax=Pediculus humanus corporis RepID=E0VEZ0_PEDHC
NCBI RefSeqXP_002424702.16e-10034.22%myospheroid protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420077821e-9834.22%myospheroid protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420077824e-10934.37%myospheroid protein, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00048725.7e-144receptor activity
GO:00054885.7e-144binding
GO:00071605.7e-144cell-matrix adhesion
GO:00083055.7e-144integrin complex
GO:00071555.7e-144cell adhesion
GO:00072295.7e-144integrin-mediated signaling pathway
GO:00160207e-07membrane
GO:00072757e-07multicellular organismal development
KEGG pathwaydme:Dmel_CG15604e-89 
 K05719 (ITGB1)maps-> Axon guidance
    Leishmaniasis
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Pathways in cancer
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
    Bacterial invasion of epithelial cells
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    ECM-receptor interaction
    Small cell lung cancer
    Dilated cardiomyopathy
    Cell adhesion molecules (CAMs)
InterPro domain[1-758] IPR0158125.7e-144Integrin beta subunit
[27-428] IPR0023694.5e-97Integrin beta subunit, N-terminal
[16-70] IPR0162017e-07Plexin-like fold
Orthology groupMCL35009 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214093-TA
ATGCGAGTGATATATTTTGTCGCAATTTGTTTCGTTTTTCGATGTTCGTCATCTATAGACCTGTGTAAACAGCACAAAACCTGCCACAACTGCATTAGGGATGCCAATAAGTGCGTTTGGTGCGACAGTCACAAATTTAATGACACCAAATGTAAATCGGCGGCAGAACAATTGGAAAACTGGTGCCCACACACTATAATTAACCCAAAAAGTGTTGTTAAAGTTATTACAAACAATGATTTTTCATCAGAAAGGGGTAAAGTGATTCATATGAAACCACAGCAGATACAAATAAATATCAGACCTGGAGATGTCATTGATTTTGATTTCTCGTTTAGAAAAGCGCAAAACTATCCAGTGGATTTATATTTTCTTTTAGACGGTTCGGCTTCAATGGCATCTGTTAAGAATGAAATTGTTAAGCAGACTGAAAGTATATATCAGATGATGAAAAGTATGACTGATAATGTTTATTTGGGAATGGGATCTTTTGTAGACAAAAATTTGTTGCCATTCACTAATGTACTAAACAGCACAAACACTTACTCATTCCGCAATAGACTAAAGCTCATTAATGATCCCGAGGCCTTTAAAAAAACCATTAATGATACAGCATTCGGTTATAATTATGACGAACAAGAGGGTACATTAGATGCTCTCGCACAAGTTATTGTATGTAAAGAACAGATTGGTTGGCGAGAAGAATCTAGAAAAATCATTTTAGTTTTCACTGGAGCATCGTTCCACGCGGCTAGTGATGGAATTTTCGGAGGTGTCGTTGAACCGTATGATGGCAAATGTTATCTAGAAAATGATGTATATTCTAAAGAAACTGTAATGGACTATCCTTCAGTCGGTATTATCAATAAACTGGCATCTGAAGATGAAAAAATTATAATTTTTGCCATCGATGAAGAAGCAAAGAACATTTATAAATCACTTACGAACTTTATTACTGGCTCTAAAGTAACAAAATACGGTGGCGATCTAATTGCAAACATGCTTAAAACTGTTTATGAGGAAATTTCTCAAAATTTAAAACTGAAAGTAAACATGGACGCTGAACATCGAAAGAATTTTGAATTTTTCTTCAATCCAGATTGTTATAATGCTTATAGACAATATGAAGACTGCGAAGTAGTACATTACGAAGAGAAACATTTTACTGGGACCATAAAACTTCTGTCGTACATTGAAAATGATAGCGTTAAGATGGATATTGTTTTTGAAGGTATCAAGGAAAAAATTGAATTAGACATATCTATTATAAAACGATGTGACTGCAAAAGGGAAGAAAATTCAACGTCCTGCAATTACCACGGCTCCCTGTATTGTGGAATATGTGAATGTGAAGAAAACAGATATTACGGTGATAGTTGTCAATGCCAAAACACTACAAGCGCCTTAAATCCAAGTGATGAAGCTACTTGCATCGCTCCCGGTAGTAACTCTACATGTCATAACCGTGGTTCTTGCAAGTGTGGAATGTGCAAGTGTAGGAATGGATACAAAGGTAAGTTCTGTGAATGCAGTGATCACAGCTGTGAGCGGGGCGCTGATAACGAGTTGTGTTCTGGTCCTCTCAGAGGTGTATGTGATTGTGGAAAGTGCAACTGTAAGAGTGGCTGGACTGGTTCGGTATGCGATTGTTCCACTTCCAAGACGGAGTGCTTAAGTAATGACCAGACATTATGTAGTAATCGTGGCGTTTGCATATGCGGTCGATGCAAATGCAACGATATCTCCGATTGGGATGTTAGGACTAAAGAGCAACCAGACTGCCAACTTTCCTGTCCCCAAGATAATCAGGATCCATCATCATGTCGCCACAGACAATGCATCAACATAGAACCAATCGTTTTATGCCACCTTAATAGCGACGATTGCCAGCCTGTAGACAACCTAAATATCACACTAATAAAGAATCTGACCATGGTCCAAGCGGAAGGAGATTGGTACCATTGCAGTAGAGTCATAGTTGATGTGGGATGTTATACTAAATTCCTGTACAGATATTCAAAAGACAAATATGGAATTGAAATCGTTATGGACAAGAATGTTGATTGCATCGAAGCAAATTATATCCGAGGTTTCATATGCTTGTTTACTCTGATCTTCATCGGAGTGGGAACTCTGATCGCGTGGAAATATTGGACGGACCGGAGAGATCGTTTGGAATACGAAAAATTGTTCCAACAAGTAAACGAAACTGAGACTGAGAACGTCCTCTTCGTACCACCGCTACATAGTTACAGAAACCCCTCGTATCAAGGACACTTATAA

Protein sequence:

>DPOGS214093-PA
MRVIYFVAICFVFRCSSSIDLCKQHKTCHNCIRDANKCVWCDSHKFNDTKCKSAAEQLENWCPHTIINPKSVVKVITNNDFSSERGKVIHMKPQQIQINIRPGDVIDFDFSFRKAQNYPVDLYFLLDGSASMASVKNEIVKQTESIYQMMKSMTDNVYLGMGSFVDKNLLPFTNVLNSTNTYSFRNRLKLINDPEAFKKTINDTAFGYNYDEQEGTLDALAQVIVCKEQIGWREESRKIILVFTGASFHAASDGIFGGVVEPYDGKCYLENDVYSKETVMDYPSVGIINKLASEDEKIIIFAIDEEAKNIYKSLTNFITGSKVTKYGGDLIANMLKTVYEEISQNLKLKVNMDAEHRKNFEFFFNPDCYNAYRQYEDCEVVHYEEKHFTGTIKLLSYIENDSVKMDIVFEGIKEKIELDISIIKRCDCKREENSTSCNYHGSLYCGICECEENRYYGDSCQCQNTTSALNPSDEATCIAPGSNSTCHNRGSCKCGMCKCRNGYKGKFCECSDHSCERGADNELCSGPLRGVCDCGKCNCKSGWTGSVCDCSTSKTECLSNDQTLCSNRGVCICGRCKCNDISDWDVRTKEQPDCQLSCPQDNQDPSSCRHRQCINIEPIVLCHLNSDDCQPVDNLNITLIKNLTMVQAEGDWYHCSRVIVDVGCYTKFLYRYSKDKYGIEIVMDKNVDCIEANYIRGFICLFTLIFIGVGTLIAWKYWTDRRDRLEYEKLFQQVNETETENVLFVPPLHSYRNPSYQGHL-