Table of contents
General notes
- Relative to the data accessible online, the data available for bulk download have undergone additional filtering to make them compatible with global analyses.
For example, in the 2022-10-25 haploid/homozygous diploid dataset, 46 screens reported data for 100-652 essential genes, even if the screens were performed using haploid or homozygous diploid YKO collections that should not include essential genes.
For consistency, we removed all essential genes from the data matrix. The data, however, remains in the database and is accessible/downloadable from the corresponding screen's page.
Similarly, we removed 394 dubious ORFs and 214 "rare" ORFs, i.e. those that have reported values in less than 25 publications.
If you're interested in a more unfiltered version of the data, please reach out to abaryshk@gmail.com.
- We did our best at recovering a list of tested strains for as many screens as possible.
Whenever the list was not released as part of the original publication, we contacted the authors via email.
If the authors were unable to provide the list, we estimated the screen’s tested space from the tested spaces of all other screens.
The estimate was based on a consensus list, i.e. the list of strains that have been tested in at least 50% of all screens that did declare their tested space (screens were excluded from the estimate if their declared tested space was considerably lower than other screens).
If a screen reported values outside of the consensus tested space, those values were retained.
Haploid and homozygous diploid data
Screen meta-data and phenotypic values
- A README file
- A list of 14,484 haploid and homozygous diploid screens with relevant metadata
- A gene x screen matrix (4,554 x 14,484) of harmonized but not normalized phenotypic
values
- A gene x screen matrix (4,554 x 14,484) of harmonized and normalized phenotypic values
(NPVs)
Download (608 MB)
Gene-gene phenotypic similarities
- A gene x gene matrix (4,554 x 4,554) of phenotypic similarity estimates (+ a matrix of their corresponding standard deviations) as described here
Download (316 MB)
Screen-screen phenotypic similarities
- A screen x screen matrix (14,484 x 14,484) of phenotypic similarity estimates (+ a matrix of their corresponding standard deviations) as described here
Download (3.1 GB)
Screen meta-data and phenotypic values
- A README file
- A list of 14,372 haploid/homozygous diploid screens with relevant metadata
- A gene x screen matrix (4,554 x 14,372) of harmonized but not normalized phenotypic
values
- A gene x screen matrix (4,554 x 14,372) of harmonized and normalized phenotypic values
(NPVs)
Download
Heterozygous diploid data
Format: tab-delimited
Contents:
- A README file
- A list of 7,011 heterozygous diploid screens with relevant metadata
- A gene x screen matrix (5,639 x 7,011) of harmonized but unnormalized phenotypic
values
- A gene x screen matrix (5,639 x 7,011) of harmonized and normalized phenotypic values
(NPVs)
Download