Installation

pVACtools is written for Linux but some users have been able to run it successfully on Mac OS X. If you are using Windows you will need to set up a Linux environment, for example by setting up a virtual machine.

pVACtools requires Python 3.6 or above. Before running any installation steps, check the Python version installed on your system:

python -V

If you don’t have Python 3 installed, we recommend using Conda to emulate a Python 3 environment. We’ve encountered problems with users that already have Python 2.x installed when they also try to install Python 3. The defaults will not be set correctly in that case. If you already have Python 2.x installed we strongly recommmend using Conda instead of installing Python 3 locally.

Once you have set up your Python 3 environment correctly you can use pip to install pVACtools. Make sure you have pip installed. pip is generally included in python distributions, but may need to be upgraded before use. See the instructions for installing or upgrading pip.

After you have pip installed, type the following command on your Terminal:

pip install pvactools

You can check that pvactools has been installed under the default environment like so:

pip show pvactools

pip will fetch and install pVACtools and its dependencies for you. After installing, each tool of the pVACtools suite is available in its own command line tree directly from the Terminal.

If you have an old version of pVACtools installed you might want to consider upgrading to the latest version:

pip install pvactools --upgrade

Installing MHCflurry

If you wish to run the MHCflurry prediction algorithm, you will need to install the mhcflurry python package on your system. This package is set as a dependency for the pvactools package so it should be installed automatically when you download or upgrade the pvactools package. You can install it manually by running:

pip install mhcflurry

Note

The mhcflurry package needs to be installed in the same Python 3 environment as the pvactools package.

Next, you will need to download the download the MHCflurry datasets and trained models:

mhcflurry-downloads fetch

Note

The mhcflurry-downloads fetch command will need to be run manually, even if the mhcflurry package was already installed automatically as a dependency with the pvactools package.

You can check that the mhcflurry package was installed successfully by running:

mhcflurry-predict -h

This should pull up the help page for the MHCflurry predictor.

Please note that MHCflurry depends on tensorflow, which will automatically be installed as a dependency to the mhcflurry package. Newer versions of tensorflow might not be compatible with older CPUs. In that case you will see a core dump failure. Downgrading tensorflow manually to version 1.5.0 should solve this problem:

pip install tensorflow==1.5.0

Installing MHCnuggets

If you wish to run the MHCnuggets prediction algorithm, you will need to install the mhcnuggets python package on your system. This package is set as a dependency for the pvactools package so it should be installed automatically when you download or upgrade the pvactools package. You can install it manually by running:

pip install mhcnuggets

Note

The mhcnuggets package needs to be installed in the same Python 3 environment as the pvactools package.

You can check that the mhcnuggets package was installed successfully by running:

pip show mhcnuggets

This should show information about the mhcnuggets package.

Please note that MHCnuggets depends on tensorflow, which will automatically be installed as a dependency to the mhcnuggets package. Newer versions of tensorflow might not be compatible with older CPUs. In that case you will see a core dump failure. Downgrading tensorflow manually to version 1.5.0 should solve this problem:

pip install tensorflow==1.5.0

Installing BigMHC

If you wish to run the BigMHC_EL or BigMHC_IM prediction algorithms, you will need to install BigMHC on your system. This package not a direct dependency of the the pvactools packages and needs to be installed manually by running:

pip install git+https://github.com/griffithlab/bigmhc.git#egg=bigmhc

Note

BigMHC needs to be installed in the same python 3 environment as the pvactools package.

You can check that BigMHC was installed successfully by running:

pip show bigmhc

This should show information about the BigMHC installation.

Installing DeepImmuno

If you wish to run the DeepImmuno prediction algorithm, you will need to install DeepImmuno on your system. This package not a direct dependency of the the pvactools packages and needs to be installed manually by running:

pip install git+https://github.com/griffithlab/deepimmuno.git#egg=deepimmuno

Note

DeepImmuno needs to be installed in the same python 3 environment as the pvactools package.

You can check that DeepImmuno was installed successfully by running:

pip show deepimmuno

This should show information about the DeepImmuno installation.

Installing BLAST

To run the reference proteome similarity step, standalone BLAST may be used. To install BLAST please see the official documentation. The BLAST tool needed is Protein BLAST (blastp). Please make note of the installation path of blastp (retrievable by calling which blastp), as that path is needed for the --blastp-path argument in the various pVACtools commands.

You will also need to install either the refseq_select_prot or the refseq_protein BLAST reference proteome databases. You can do so by running the update_blastdb.pl script provided with your BLAST installation (located in the bin subdirectory). You will need to set the BLASTDB to point to the installation directory of your BLAST reference proteome databases.

Downlaoding Reference Proteome FASTA file

As an alternative to BLAST, a reference proteome fasta file may be used for the reference proteome similarity step and specified as an input via the --peptide-fasta command. Any proteome fasta may be used. Ensembl provides reference proteome fastas for many species. For example, the latest reference proteome fasta for human can be downloaded from this link.

Docker and CWL

Versioned Docker containers for pVACtools are available on DockerHub using the griffithlab/pvactools repo. The Docker container contains pVACtools as well as installations of the standalone IEDB MHC Class I and Class II software. These are installed at /opt/iedb (--iedb-install-directory /opt/iedb).

An example on how to run pVACseq using Docker can be found on the Getting Started page.

Common Workflow Language (CWL) tool wrappers for pVACseq, pVACfuse, and pVACvector can be downloaded using the pvactools download_cwls command.

Download CWL tool wrappers

usage: pvactools download_cwls [-h] destination_directory

Download pVACtools CWLs for each tool's main pipeline

positional arguments:
  destination_directory
                        Directory for downloading CWLs

optional arguments:
  -h, --help            show this help message and exit