POCA_Becker

Readme (Becker et al.)

Software Dependencies (p. 2) Installation (p. 3) Demo: Volcano plot (p. 4–5) Demo: Spearman correlation and multiple linear regression analyses (p. 6–7)

Software Dependencies RStudio 2023.06.1+524 "Mountain Hydrangea" Release (547dcf861cac0253a8abb52c135e44e02ba407a1, 2023-07-07) for windows Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) RStudio/2023.06.1+524 Chrome/110.0.5481.208 Electron/23.3.0 Safari/537.36

Windows 64-bit, v10.0.22631 Build 22631

Microsoft® Excel® for Microsoft 365 MSO (Version 2407 Build 16.0.17830.20056) 64-bit

This software has also been tested on: RStudio 2021.09.2+382 "Ghost Orchid" Release (fc9e217980ee9320126e33cdf334d4f4e105dc4f, 2022-01-04) for macOS Mozilla/5.0 (Macintosh; Intel Mac OS X 10_14_6) AppleWebKit/537.36 (KHTML, like Gecko) QtWebEngine/5.12.10 Chrome/69.0.3497.128 Safari/537.36

MacOS v10.14.6 (“Mojave”)

No non-standard hardware are required.

Installation (10 min) Navigate to “https://dailies.rstudio.com/version/2023.06.1+524.pro1/” and download the correct installer (for Windows 10+: “https://s3.amazonaws.com/rstudio-ide-build/electron/windows/RStudio-2023.06.1-524.exe”)

If these links are no longer active, versions can also be found at https://cran.r-project.org/.

Go through the prompts to download the software to your machine. Open Rstudio when it is done.

Run the following script to install the required packages: install.packages(c(“pbapply”,”stringr”,”ggplot2”,”reshape2”,”ggrepel”,”RColorBrewer”,”readxl”,”tidyverse”,”dplyr”,“purr”,“broom”,”msigdbr”)

Volcano plot demo (R markdown file)

Take the imputed and log2 transformed data from Perseus, saved as a .csv file. It has nine columns and looks like this:

Place the data, as a .csv, in a new folder “data” nested within a folder you will use for the analysis. In this example, the shown path is: “C:\Users\andbp\Documents\R_data\Readme_example\data”

Open Data File S1. In line 18, change the working directory path to your folder. (C:\Users\andbp\Documents\R_data\Readme_example)

Define the fnames function by running “names<-list.files.()” and then “fnames” in the console.

In line 19, set the number within [c(#)] to the number corresponding to the “data” folder after fnames has been run (will be ‘1’ if there are no other files in the folder)

Change line 26 to specify the name of the file after the “fnames” argument. In this example, it would be: “files = lapply(paste(fnames,"Example_data_volcano.csv", sep= "/"), read.csv, sep = ",")”

Run the rest of the code chunk (through line 37).

If this does not run, check that your input data file is formatted the same way with the correct column headers (ProteinID, Gene, Description)

Run the next code chunk (line 42) to merge the data.

Run the next code chunk (lines 47-49). A dataframe named “df” should populate in the R environment.

Check the dataframe to ensure the data are displaying properly (will look analogous to the .csv file in Excel)

Run the next code chunk (lines 54-79).

Run the next code chunk (lines 84-89).

Run the next code chunk (lines 94-108).

Run the next code chunk (lines 113-113) to generate the volcano plot. It should display in the R markdown file and should look like this:

To save the volcano plot or results file, run line 138 or 139, respectively.

Lines 142-170 have ways to label the “results2” dataframe with different lists of proteins to either color specific data points on the plots differently or label specific data points with black circles on the plot.

Spearman correlation and multiple linear regression analysis (R markdown file)

Take the imputed and log2 transformed data from Perseus for both the labeling and expression protein intensities of each replicate, saved as .csv files. They have nine columns and look like this:

Expression data:

Labeling data:

Note the column headers: they begin with ”parental” or “SRB1” for the expression data or “control” or “treatment” for the labeling data.

Place the .csv files in a new folder In this example, the shown path is: “C:\Users\andbp\Documents\R_data\Readme_example\data”

Save Data File S2 into the same folder and open it. In line 17, change the working directory path to your folder. (C:\Users\andbp\Documents\R_data\Readme_example)

In line 18, specify the name of the file that contains the protein intensities for the labeling data.

In line 19, specify the name of the file that contains the protein intensities for the expression data.

Run the chunk (lines 6–20)

In lines 27 and 39, check that the values in the “cols = matches()” arguments match the column headers in your data.

In line 109, change the values in the msigdbr() argument to match the desired database (GO Molecular Function in this example) to be used for the Spearman correlation.

Run the chunk (lines 24–125). A preview of the results of the Spearman correlation will appear.

Run the next chunk (lines 128–193) to visualize the results of the multiple linear regression and the Spearman correlation analyses.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Data_File_S1.Rmd		Data_File_S1.Rmd
Data_File_S2.Rmd		Data_File_S2.Rmd
Example_data_volcano.csv		Example_data_volcano.csv
README.md		README.md
Readme_example_results_Spearman_correlation.csv		Readme_example_results_Spearman_correlation.csv
Readme_example_results_linear_models.csv		Readme_example_results_linear_models.csv
Readme_example_volcano_results2.csv		Readme_example_volcano_results2.csv
Readme_revision.pdf		Readme_revision.pdf
example_data_expression_FP.csv		example_data_expression_FP.csv
example_data_labeling_FP.csv		example_data_labeling_FP.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

POCA_Becker

About

Uh oh!

Releases

Packages

BackusLab/POCA_Becker

Folders and files

Latest commit

History

Repository files navigation

POCA_Becker

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages