Normalisr is a parameter-free normalization and statistical association testing framework that unifies single-cell differential expression, co-expression, and pooled single-cell CRISPR screen analyses with linear models. By systematically detecting and removing nonlinear confounders arising from library size at mean and variance levels, Normalisr achieves high sensitivity, specificity, speed, and generalizability across multiple scRNA-seq protocols and experimental conditions with unbiased p-value estimation.
Normalisr first removes confounding technical noise from raw read counts to recover the biological variation. Then, linear association testing provides a unified inferential framework with several advantages: (i) exact P-value estimation without permutation, (ii) native removal of covariates (e.g. batches, housekeeping programs, and untested gRNAs) as fixed effects, (iii) robustness against read count distribution distortions given enough (> 100) cells, and (iv) computational efficiency.
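As a generic illustration of points (i) and (ii), and not Normalisr's own implementation, the sketch below removes one covariate as a fixed effect by least-squares residualization and then computes the partial correlation and its t statistic. The function names and the toy data are hypothetical. The P-value would follow from a Student's t distribution with `dof` degrees of freedom (for example with `scipy.stats.t.sf`, which is not called here), so no permutation is needed.

```python
import math

def residualize(values, covariate):
    """Remove the least-squares fit of values on one covariate (with intercept)."""
    n = len(values)
    mean_v = sum(values) / n
    mean_c = sum(covariate) / n
    scc = sum((ci - mean_c) ** 2 for ci in covariate)
    scv = sum((ci - mean_c) * (vi - mean_v) for ci, vi in zip(covariate, values))
    slope = scv / scc
    return [(vi - mean_v) - slope * (ci - mean_c) for vi, ci in zip(values, covariate)]

def partial_association(x, y, covariate):
    """Return (partial correlation, t statistic, degrees of freedom)."""
    rx = residualize(x, covariate)
    ry = residualize(y, covariate)
    sxy = sum(a * b for a, b in zip(rx, ry))
    sxx = sum(a * a for a in rx)
    syy = sum(b * b for b in ry)
    r = sxy / math.sqrt(sxx * syy)
    # Residual degrees of freedom: n minus intercept, covariate, and tested predictor.
    dof = len(x) - 3
    t = r * math.sqrt(dof / (1.0 - r * r))
    return r, t, dof

# Hypothetical toy data: x is a group indicator, c a batch label, y an expression level.
x = [0, 0, 0, 0, 1, 1, 1, 1]
c = [0, 1, 0, 1, 0, 1, 0, 1]
y = [1.0, 2.1, 0.9, 2.0, 2.0, 3.2, 1.8, 2.9]
r, t, dof = partial_association(x, y, c)
```

Here the batch label shifts `y` in both groups, and residualizing on it leaves only the variation associated with the group indicator to be tested.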
Normalisr is written in Python and provides a command-line interface and a Python functional interface. Normalisr is published in Nature Communications (2021).
Normalisr is on PyPI and can be installed with pip: `pip install normalisr`. You can also install Normalisr from GitHub: `pip install git+https://github.com/lingfeiwang/normalisr.git`. Make sure you have added Normalisr's install path to your PATH environment variable before using the command-line interface (see FAQ). Normalisr's installation should take less than a minute.
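If you are unsure whether the install path is on PATH, a small check like the following may help. It is only a sketch using the Python standard library: the helper name `scripts_dir_on_path` is hypothetical, and it assumes pip placed the `normalisr` launcher in the default scripts directory of the running interpreter. User-level installs can use a different directory, such as `~/.local/bin` on Linux.

```python
import os
import shutil
import sysconfig

def scripts_dir_on_path():
    """Return (scripts directory, whether that directory is listed in PATH)."""
    scripts = sysconfig.get_path("scripts")
    entries = os.environ.get("PATH", "").split(os.pathsep)

    def canonical(path):
        return os.path.normcase(os.path.abspath(path))

    found = any(entry and canonical(entry) == canonical(scripts) for entry in entries)
    return scripts, found

scripts, found = scripts_dir_on_path()
# None means the shell would report 'command not found'.
launcher = shutil.which("normalisr")
print("scripts directory:", scripts)
print("listed in PATH:", found)
print("normalisr launcher:", launcher)
```

Run it with the same Python interpreter you used for `pip install normalisr`, since each interpreter has its own scripts directory.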
There are more advanced installation methods but if you want that, most likely you already know how to do it. If not, give me a shout (See Issues).
Normalisr provides a command-line interface and a Python functional interface, both described below. You can use the provided examples to guide yourself through Normalisr's use. Sphinx-based documentation is underway.
- Command-line interface
You can run Normalisr by typing `normalisr` on the command line. Normalisr uses submodules for different analysis steps. Type `normalisr` or `normalisr -h` for general help, and for example `normalisr de -h` for help on the submodule 'de' for differential expression.
Normalisr uses the tsv (tab-separated values) file format for input and output matrices, and text files for row and column names, such as cells and genes, one per line. For initial input, Normalisr also accepts the sparse mtx format (Cell Ranger output) for the raw read count matrix. Gzipped input/output files are automatically recognized if the file name suffix '.gz' is present.
- Python functional interface
Normalisr's Python functional interface is more flexible than the command line, but requires knowledge of Python programming. Documentation of any function can be obtained with `?` in IPython or Jupyter notebook, such as `import normalisr.normalisr as norm` followed by `?norm.de`.
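The file conventions described for the command-line interface above (tab-separated matrices, one name per line, optional '.gz' suffix) can be prepared or inspected with the Python standard library alone. The following is a minimal sketch and not part of Normalisr: the helper names and file names are hypothetical, and it assumes matrices are purely numeric with no header row or column.

```python
import csv
import gzip
import os
import tempfile

def open_text(path, mode="r"):
    """Open a text file, using gzip when the file name ends with '.gz'."""
    if path.endswith(".gz"):
        return gzip.open(path, mode + "t", newline="")
    return open(path, mode, newline="")

def write_matrix(path, rows):
    """Write a numeric matrix as tab-separated values, one row per line."""
    with open_text(path, "w") as f:
        csv.writer(f, delimiter="\t", lineterminator="\n").writerows(rows)

def read_matrix(path):
    """Read a tab-separated numeric matrix into a list of rows of floats."""
    with open_text(path) as f:
        return [[float(v) for v in row] for row in csv.reader(f, delimiter="\t")]

def write_names(path, names):
    """Write row or column names, one per line."""
    with open_text(path, "w") as f:
        f.write("".join(name + "\n" for name in names))

def read_names(path):
    """Read names, one per line, skipping empty lines."""
    with open_text(path) as f:
        return [line.rstrip("\r\n") for line in f if line.strip()]

# Round trip in a temporary directory with hypothetical file names.
with tempfile.TemporaryDirectory() as tmp:
    matrix_path = os.path.join(tmp, "reads.tsv.gz")
    genes_path = os.path.join(tmp, "genes.txt")
    write_matrix(matrix_path, [[0, 3, 1], [2, 0, 5]])
    write_names(genes_path, ["GeneA", "GeneB"])
    matrix = read_matrix(matrix_path)
    genes = read_names(genes_path)
```

Check the respective example folder for the exact matrix orientation and file names each submodule expects before reusing helpers like these.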
The example jupyter notebooks also illustrate the scope of functions Normalisr provides.
Documentation is available as HTML and PDF.
You can find several examples in the 'examples' folder, to cover all functions Normalisr currently provides. The example datasets have been scaled down to run on a 16GB-memory personal computer. Although they only serve as demonstrations of work here, the pipelines should be transferable to a full-scale, different dataset. Since Normalisr is non-parametric, the only adjustable parameters are for quality control and final cutoffs of differential or co-expression. You can change down-sampling parameters in the examples to run the full datasets on a larger computer.
You can find more details in the respective examples.
Please raise an issue on GitHub.
- Single-cell normalization and association testing unifying CRISPR screen and gene co-expression analyses with Normalisr, Lingfei Wang, Nature Communications 2021. https://doi.org/10.1038/s41467-021-26682-1
- I installed Normalisr but typing `normalisr` says 'command not found'.
- See below.
- How do I use a specific python version for Normalisr's command-line interface?
- You can always use the python command to run Normalisr, such as `python3 -m normalisr` to replace the command `normalisr`. You can also use a specific path or version for python, such as `python3.7 -m normalisr` or `/usr/bin/python3.7 -m normalisr`. Make sure you have installed Normalisr for this python version.
- Why don't the examples work?
- Please make sure you followed every step in the README.md of the respective example folder with an Internet connection, and then submit an issue report detailing the executed line at which the error occurred, along with its input and output.
- Does Normalisr run on Windows?
- I have not tested Normalisr on Windows. However, it is written purely in Python and should function properly.
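The 'command not found' and python-version answers above can be combined in a script that picks a working way to launch Normalisr. This is a sketch under assumptions: the helper `normalisr_command` is hypothetical, and it relies only on the `normalisr` command and the `python -m normalisr` form described in this FAQ.

```python
import shutil
import sys

def normalisr_command(python=None):
    """Return an argument list prefix that launches Normalisr's command-line interface."""
    if python is not None:
        # An explicit interpreter, e.g. "python3.7" or "/usr/bin/python3.7".
        return [python, "-m", "normalisr"]
    launcher = shutil.which("normalisr")
    if launcher is not None:
        # The launcher script is on PATH, so use it directly.
        return [launcher]
    # Fall back to the interpreter running this script.
    return [sys.executable, "-m", "normalisr"]

explicit = normalisr_command("/usr/bin/python3.7")
default = normalisr_command()
print(" ".join(default + ["de", "-h"]))
```

The resulting list can be passed to `subprocess.run`, for example `subprocess.run(normalisr_command() + ["de", "-h"])`. As noted above, Normalisr must be installed for whichever python version ends up being used.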