.. image:: ../images/pVACvector_logo_trans-bg_sm_v4b.png :align: right :alt: pVACvector logo .. _pvacvector_run: Usage ==================================== .. warning:: Using a local IEDB installation is strongly recommended for larger datasets or when the making predictions for many alleles, epitope lengths, or prediction algorithms. More information on how to install IEDB locally can be found on the :ref:`Installation ` page. It may be necessary to explore the parameter space a bit when running pVACvector. As binding predictions for some sites vary substantially across algorithms, the most conservative settings may result in no valid paths, often due to one "outlier" prediction. Carefully choosing which predictors to run may help ameliorate this issue as well. In general, setting a lower binding threshold (e.g., 500nM) and using the median binding value (``--top-score-metric median``) will lead to greater possibility of a design, while more conservative settings of 1000nM and lowest/best binding value (``--top-score-metric lowest``) will give more confidence that there are no junctional neoepitopes. When running pVACvector with a ``--percentile-threshold`` the ``--percentile-threshold-strategy`` parameter specifies how to evaluate junctional epitopes. The ``conservative`` option fails a junction if a junctional epitope fails EITHER the binding threshold OR the percentile threshold (default). The ``exploratory`` option fails a junction only if a junctional epitope fails BOTH the binding threshold AND the percentile threshold. The latter will increase the odds of a successful run (since a junction is less likely to be invalidated) but also increase the odds of a true junctional epitope. Running pVACvector with spacer amino acid sequences may help eliminate junctional epitopes. The list of spacers to be tested is specified using the ``--spacers`` parameter. Peptide combinations without a spacer can be tested by including ``None`` in the list of spacers. The default spacer amino acid sequences are "None", "AAY", "HHHH", "GGS", "GPGPG", "HHAA", "AAL", "HH", "HHC", "HHH", "HHHD", "HHL", "HHHC". Peptide junctions are tested with each spacer in the order that they are specified. If a tested spacers results in a valid junction without any well-binding junction epitopes, that junction will not be tested with any other spacers, even if a different spacer could potentially result in better junction scores. This reduces runtime. If a tested spacer for a junction doesn't yield a valid junction (i.e., there are well-binding junction epitopes) the junction is tested wtih the next spacer in the input list. If, after testing all spacers, no valid path is found, clipped versions of peptides are tested by removing leading and/or trailing amino acids and constructing junctions with the clipped peptides. The maximum number of amino acids to clip is controlled by the ``--max-clip-length`` argument. In some cases, the (core) neoantigen candidate of a peptide sequence may be located toward the beginning or end of the sequence. In these cases, clipping may accidentially remove amino acids of the core neoantigen. To prevent this, the ``--max-clip-length`` should be set to the shortest number of flanking amino acids of any of the peptides to include in the vector. Alternatively, pVACvector also supports specifying the core neoantigen in the FASTA header when using a FASTA file as the input to pVACvector. If the core neoantigens for each sequence are specified in the input FASTA file, pVACvector will not clip into these neoantigens, even if the flanking sequence is smaller than the ``--max-clip-length``. The core neoantigen should be specified like so: .. code-block:: none >Peptide1 {"Best Peptide": "LYYSYGLLHI"} WLYYSYGLLHIYGSGGYALYF In this example ``Peptide1`` is the ID of the sequence, ``LYYSYGLLHI`` is the core neoantigen candidate, and ``WLYYSYGLLHIYGSGGYALYF`` is the peptide sequence to include in the vector. The Best Peptide information will already be included in the FASTA headers if the FASTA file is created by using the ``pvacseq generate_protein_fasta`` command in conjunction with an aggregated report TSV as the ``--input-tsv`` parameter. If no solution is found after testing all spacers and after clipping peptides, pVACvector will attempt to find a partial solution by excluding peptide sequences. The number of peptide sequences that are allowed to be removed is controlled via the ``--allow-n-peptide-exclusion`` parameter. Partial solutions will be written to their own result subdirectory. The subdirectory name reflects which peptide(s) were removed from the partial solution. Our current recommendation is to run pVACvector several different ways, and choose the path resulting from the most conservative set of parameters. .. program-output:: pvacvector run -h .. .. argparse:: :module: tools.pvacvector.run :func: define_parser :prog: pvacvector run