Skip to content

Commit

Permalink
Incorporate DSL2 features into docs (nextflow-io#3793)
Browse files Browse the repository at this point in the history

Signed-off-by: Ben Sherman <bentshermann@gmail.com>
Signed-off-by: Paolo Di Tommaso <paolo.ditommaso@gmail.com>
Co-authored-by: Phil Ewels <phil@seqera.io>
Co-authored-by: Paolo Di Tommaso <paolo.ditommaso@gmail.com>
  • Loading branch information
3 people authored and abhi18av committed Oct 28, 2023
1 parent 4084b0a commit 42b269a
Show file tree
Hide file tree
Showing 16 changed files with 945 additions and 831 deletions.
2 changes: 1 addition & 1 deletion docs/azure.md
Original file line number Diff line number Diff line change
Expand Up @@ -221,7 +221,7 @@ The pool name can only contain alphanumeric, hyphen and underscore characters.
:::

:::{warning}
If the pool name includes a hyphen, make sure to wrap it with single quotes. For example::
If the pool name includes a hyphen, make sure to wrap it with single quotes. For example:

```groovy
azure {
Expand Down
8 changes: 1 addition & 7 deletions docs/cli.md
Original file line number Diff line number Diff line change
Expand Up @@ -1322,13 +1322,7 @@ The `run` command is used to execute a local pipeline script or remote pipeline
$ nextflow run nextflow-io/hello -qs 4
```

- Execute the pipeline with DSL-2 syntax.

```console
$ nextflow run nextflow-io/hello -dsl2
```

- Execute a pipeline with a specific workflow as the entry-point, this option is meant to be used with DSL-2. For more information on DSL-2, please refer to {ref}`dsl2-page`
- Invoke the pipeline with a specific workflow as the entry-point.

```console
$ nextflow run main.nf -entry workflow_A
Expand Down
7 changes: 6 additions & 1 deletion docs/conf.py
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,7 @@
extensions = [
'sphinx.ext.mathjax',
'sphinxcontrib.mermaid',
'sphinxext.rediraffe',
'sphinx_rtd_theme',
'myst_parser'
]
Expand All @@ -38,6 +39,10 @@

myst_heading_anchors = 3

rediraffe_redirects = {
'dsl2.md': 'dsl1.md'
}

# Add any paths that contain templates here, relative to this directory.
templates_path = ['_templates']

Expand Down Expand Up @@ -75,7 +80,7 @@

# List of patterns, relative to source directory, that match files and
# directories to ignore when looking for source files.
exclude_patterns = ['_build']
exclude_patterns = ['_build', '**README.md']

# The reST default role (used for this markup: `text`) to use for all documents.
#default_role = None
Expand Down
4 changes: 2 additions & 2 deletions docs/container.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ If your Apptainer installation support the "user bind control" feature, enable t

### How it works

The integration for Apptainer follows the same execution model implemented for Docker. You won't need to modify your Nextflow script in order to run it with Apptainer. Simply specify the Apptainer image file from where the containers are started by using the `-with-apptainer` command line option. For example::
The integration for Apptainer follows the same execution model implemented for Docker. You won't need to modify your Nextflow script in order to run it with Apptainer. Simply specify the Apptainer image file from where the containers are started by using the `-with-apptainer` command line option. For example:

```bash
nextflow run <your script> -with-apptainer [apptainer image file]
Expand Down Expand Up @@ -72,7 +72,7 @@ Nextflow no longer mounts the home directory when launching an Apptainer contain

### Multiple containers

It is possible to specify a different Apptainer image for each process definition in your pipeline script. For example, let's suppose you have two processes named `foo` and `bar`. You can specify two different Apptainer images specifying them in the `nextflow.config` file as shown below::
It is possible to specify a different Apptainer image for each process definition in your pipeline script. For example, let's suppose you have two processes named `foo` and `bar`. You can specify two different Apptainer images specifying them in the `nextflow.config` file as shown below:

```groovy
process {
Expand Down
187 changes: 187 additions & 0 deletions docs/dsl1.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,187 @@
(dsl1-page)=

# Migrating from DSL 1

In Nextflow version `22.03.0-edge`, DSL2 became the default DSL version. In version `22.12.0-edge`, DSL1 support was removed, and the Nextflow documentation was updated to use DSL2 by default. Users who are still using DSL1 should migrate their pipelines to DSL2 in order to use the latest versions of Nextflow. This page describes the differences between DSL1 and DSL2, and how to migrate to DSL2.

In Nextflow versions prior to `22.03.0-edge`, you must enable DSL2 explicitly in order to use it. You can either set the feature flag in your pipeline script:

```groovy
nextflow.enable.dsl=2
```

Or set the environment variable where you launch Nextflow:

```bash
export NXF_DEFAULT_DSL=2
```

## Processes and workflows

In DSL1, a process definition is also the process invocation. Process inputs and outputs are connected to channels using `from` and `into`. Here is the {ref}`getstarted-first` example written in DSL1:

```groovy
nextflow.enable.dsl=1
params.str = 'Hello world!'
process splitLetters {
output:
file 'chunk_*' into letters
"""
printf '${params.str}' | split -b 6 - chunk_
"""
}
process convertToUpper {
input:
file x from letters.flatten()
output:
stdout result
"""
cat $x | tr '[a-z]' '[A-Z]'
"""
}
result.view { it.trim() }
```

To migrate this code to DSL2, you need to move all of your channel logic throughout the script into a `workflow` definition. Additionally, you must call each process explicitly, passing any input channels as arguments (instead of `from ...`) and receiving any output channels as return values (instead of `into ...`).

Refer to the {ref}`workflow-page` page to learn how to define a workflow. The DSL2 version of the above script is duplicated here for your convenience:

```{literalinclude} snippets/your-first-script.nf
:language: groovy
```

## Channel forking

In DSL1, a channel can be used as an input only once; to use a channel multiple times, the channel must be forked using the `into` operator.

In DSL2, channels are automatically forked when connecting two or more consumers.

For example, this would not work in DSL1 but is not a problem in DSL2:

```groovy
Channel
.from('Hello','Hola','Ciao')
.set{ cheers }
cheers
.map{ it.toUpperCase() }
.view()
cheers
.map{ it.reverse() }
.view()
```

Similarly, process outputs can be consumed by multiple consumers automatically, which makes workflow scripts much easier to read and write.

## Modules

In DSL1, the entire Nextflow pipeline must be defined in a single file (e.g. `main.nf`). This restriction becomes quite cumbersome as a pipeline becomes larger, and it hinders the sharing and reuse of pipeline components.

DSL2 introduces the concept of "module scripts" (or "modules" for short), which are Nextflow scripts that can be "included" by other scripts. While modules are not essential to migrating to DSL2, nor are they mandatory in DSL2 by any means, modules can help you organize a large pipeline into multiple smaller files, and take advantage of modules created by others. Check out the {ref}`module-page` to get started.

:::{note}
With DSL2, the Groovy shell used by Nextflow also imposes a 64KB size limit on pipeline scripts, so if your DSL1 script is very large, you may need to split your script into modules anyway to avoid this limit.
:::

## Deprecations

### Processes

- The `set` process input type is no longer supported, use {ref}`tuple <process-input-tuple>` instead.

- The `set` process output type is no longer supported, use {ref}`tuple <process-out-tuple>` instead.

- The `mode flatten` option for process outputs is no longer available. Use the {ref}`operator-flatten` operator on the corresponding output channel instead.

- Unqualified value and file elements in a tuple declaration are no longer allowed. Use an explicit `val` or `path` qualifier.

For example:

```groovy
process foo {
input:
tuple X, 'some-file.sam'
output:
tuple X, 'some-file.bam'
script:
"""
your_command --in $X some-file.sam > some-file.bam
"""
}
```

Use:

```groovy
process foo {
input:
tuple val(X), path('some-file.sam')
output:
tuple val(X), path('some-file.bam')
script:
"""
your_command --in $X some-file.sam > some-file.bam
"""
}
```

### Channels

- Channel method `bind` has been deprecated in DSL2.
- Channel method `<<` has been deprecated in DSL2.
- Channel factory `create` has been deprecated in DSL2.

### Operators

- Operator `choice` has been deprecated in DSL2. Use {ref}`operator-branch` instead.
- Operator `close` has been deprecated in DSL2.
- Operator `countBy` has been deprecated in DSL2.
- Operator `into` has been deprecated in DSL2, as it is no longer needed.
- Operator `fork` has been renamed to {ref}`operator-multimap`.
- Operator `groupBy` has been deprecated in DSL2. Use {ref}`operator-grouptuple` instead.
- Operators `print` and `println` have been deprecated in DSL2. Use {ref}`operator-view` instead.
- Operator `route` has been deprecated in DSL2.
- Operator `separate` has been deprecated in DSL2.
- Operator `spread` has been deprecated in DSL2. Use {ref}`operator-combine` instead.

### DSL2 Preview

An early preview of DSL2 was available in 2020. Note that some of that early DSL2 syntax has since changed.

- The `nextflow.preview.dsl=2` (and `nextflow.enable.dsl=1`) feature flags are no longer needed.

- Anonymous and unwrapped includes are no longer supported. Use an explicit module inclusion instead.

For example:

```groovy
include './some/library'
include bar from './other/library'
workflow {
foo()
bar()
}
```

Should be replaced with:

```groovy
include { foo } from './some/library'
include { bar } from './other/library'
workflow {
foo()
bar()
}
```
Loading

0 comments on commit 42b269a

Please sign in to comment.