R/proteome.R
ccleProteome2SummarizedExperiment.RdFunctionality for storing the protein expression data
from the Cancer Cell Line Encyclopedia (CCLE) in a
SummarizedExperiment.
ccleProteome2SummarizedExperiment(df, cell.line = "HCT116")a data.frame storing the CCLE protein expression data with
one measurement in each row. Typically obtained from ExperimentHub.
See examples.
character. One or more cell line IDs such as "HCT116"
(human colon cancer cell line 116). Use NULL to not subset by cell line.
Defaults to "HCT116", which will then subset the df to measurements for
HCT116 only.
A SummarizedExperiment storing protein expression
data for the specified cell line(s).
CCLE proteomics: https://gygi.hms.harvard.edu/publications/ccle.html
# Connect to ExperimentHub
eh <- ExperimentHub::ExperimentHub()
#> snapshotDate(): 2022-10-31
# Obtain CCLE proteome data frame
AnnotationHub::query(eh, c("gygi", "depmap"))
#> ExperimentHub with 1 record
#> # snapshotDate(): 2022-10-31
#> # names(): EH3459
#> # package(): depmap
#> # $dataprovider: Broad Institute
#> # $species: Homo sapiens
#> # $rdataclass: tibble
#> # $rdatadateadded: 2020-05-19
#> # $title: proteomic_20Q2
#> # $description: Quantitative profiling of 12399 proteins in 375 cell lines, ...
#> # $taxonomyid: 9606
#> # $genome:
#> # $sourcetype: CSV
#> # $sourceurl: https://gygi.med.harvard.edu/sites/gygi.med.harvard.edu/files/...
#> # $sourcesize: NA
#> # $tags: c("ExperimentHub", "ExperimentData", "ReproducibleResearch",
#> # "RepositoryData", "AssayDomainData", "CopyNumberVariationData",
#> # "DiseaseModel", "CancerData", "BreastCancerData", "ColonCancerData",
#> # "KidneyCancerData", "LeukemiaCancerData", "LungCancerData",
#> # "OvarianCancerData", "ProstateCancerData", "OrganismData",
#> # "Homo_sapiens_Data", "PackageTypeData", "SpecimenSource",
#> # "CellCulture", "Genome", "Proteome", "StemCell", "Tissue")
#> # retrieve record with 'object[["EH3459"]]'
ccle.prot <- eh[["EH3459"]]
#> snapshotDate(): 2022-10-31
#> see ?depmap and browseVignettes('depmap') for documentation
#> loading from cache
ccle.prot <- as.data.frame(ccle.prot)
# Turn into a SummarizedExperiment
se <- ccleProteome2SummarizedExperiment(ccle.prot)