DeepLINCS: A framework for deep learning with LINCS L1000 Data

DeepLINCS offers a variety of utilities for working with L1000 data and training deep learning models.

The class Dataset, built on pandas dataframes, offers methods to load, sample, filter, subset, and explore these data in memory.

DeepLINCS also provides a wrapper around around TensorFlow to efficiently load a Dataset and train a variety of deep learning models on the GPU.

You can find the software on github.

Installation

pip install deep-lincs

Downloading the data

The data for 1.3 million L1000 profiles are availabe on GEO. The script load_files.sh fetches the Level 3 data along with all metadata available. The largest file is quite big (~50Gb) so please be patient.

git clone https://github.com/manzt/deep_lincs.git && cd deep_lincs
source load_files.sh # download raw data from GEO

Getting started

Navigate to the example notebooks and try building a model!

cd notebooks
jupyter lab # easiest for plotting with altair