The Illumina NovaSeq 6000 Integration v2.5.0 supports the integration between Clarity LIMS and the NovaSeq 6000 instrument. This integration also provides support for Clarity LIMS v6.2.0 and includes updates to the technology stack and third party libraries and utilities.
Clarity LIMS is automatically stopped during installation of the RPM and will need to be restarted. The file-based and API-based integrations cannot run at the same time. When the API-based integration has finished installing, shut down Clarity LIMS and uninstall the file-based integration.
This document provides instructions for installing NovaSeq 6000 Integration v2.5.0. It also describes the components that are installed in the default configuration.
NovaSeq 6000 Integration v2.5.0 has the following prerequisites:
Mount run data network-attached storage (NAS) share
Secret Util is installed
IPP is installed
Prerequisite 1: Mount Run Data NAS Share
Mounting the NAS share of run data is needed to capture and generate files associated with the sequencing run. To mount NAS shares that contain data from the Clarity LIMS server, set up the following Write privileges as the glsai user:
Sample sheet directory — This directory is where the Prepare Files for NovaSeq script writes sample sheets.
Run recipe directory — This directory is where the Prepare Files for NovaSeq script writes run recipe files in the JSON format.
Set up the Read privileges as the glsjboss and glsai users (or as the entire claritylims group) for the run output directory. This directory is where the sequencing service looks for run folders. The run folders must be in this directory. The service does not search sub-directories.
The sample sheet, run recipe, and run output directories can be configured independently and do not have to be on the same mount or share.
Prerequisite 2: Secret Util Installation
The Clarity LIMS installation tooling configures the Secret Util installation. No additional configuration is necessary. For more information on Secret Utility configuration, refer to the [Clarity LIMS (Clarity & LabLink Reference Guide) documentation](../../../clarity-lims/clarity-and-lablink.md).
Installation
NovaSeq 6000 Integration v2.5.0 supports both on-premise and cloud integrations. This integration is distributed as the following RPM packages:
BaseSpaceLIMS-novaseq-extensions
BaseSpaceLIMS-novaseq-sequencing-service (depends on BaseSpaceLIMS Automation Worker v1.x)
The BaseSpaceLIMS-novaseq-extensions RPM installs the following items:
Protocols and steps
Validation workflow
Database properties that configure the service
novaseq-extensions.jar file that provides sample sheet generation and other utility scripts
The BaseSpaceLIMS-novaseq-sequencing-service RPM installs the following items:
Bash scripts to run the novaseq_seqservice through systemctl
novaseq-sequencing.jar
novaseq-remote-extensions.jar file that contains scripts that need access to the run folders on the NAS
On-Premise Installation
Use the following instructions to install the BaseSpaceLIMS-novaseq-extensions and BaseSpaceLIMS-novaseq-sequencing-service RPMs on the Clarity LIMS server.
Install the RPMs
On the Clarity LIMS server, log in as the root user.
Run the following yum commands to install the RPMs:
yum install BaseSpaceLIMS-novaseq-extensions --enablerepo=<< repo name info from support >>
yum install BaseSpaceLIMS-novaseq-sequencing-service --enablerepo=<< repo name info from support >>
Enter y to confirm that you want to proceed with the RPM installations.
If Secret Util is not installed or configured, you must configure it before proceeding. As the glsjboss user, run the following script:
When prompted, enter the channel name for the Automation Worker and sequencing service.
When upgrading from NovaSeq Integration v2.2.0 to v2.5.0, a conflict is expected during the workflow import with the Flowcell Type analyte custom field due to SP being added to the list of preset values. You can manually add SP as one of the values of the Flowcell Type custom field after you import the workflow.
Configure the Database Service Properties
To enable capture and generation of files associated with the sequencing run, configure the properties using the omxProps-ConfigTool utility.
Start the Sequencing Service
Run the following command to start the sequencing service:
systemctl start novaseq_seqservice
Configurable Database Properties
This version of the integration only supports a single active search-replace pair of each type. The sampleSheetPathPrefixSearchReplaceSuffixes and netPathPrefixSearchReplaceSuffixes properties must contain a single number instead of a comma-separated list of numbers.
The sample sheet, run recipe, and run output folders can all be configured independently and do not have to be on the same mount or share. For examples of valid configuration mount points and properties, refer to Example Network Folder Structure and Database Properties.
Cloud Installation
The BaseSpaceLIMS-novaseq-extensions RPM must be installed on the Clarity LIMS server. The BaseSpaceLIMS-novaseq-sequencing-service RPM can be installed remotely on another server within the network.
Specifications
The following hardware, operating system, and network specifications must be met to install the BaseSpaceLIMS-novaseq-sequencing-service RPM:
Hardware requirements:
64-bit processor (dual core 2.0 GHz)
OS requirements, plus at least an additional 2 GB RAM
When prompted, enter the channel name for the Automation Worker and sequencing service. The recommended channel name is remotens. For more information, refer to Update Channel Name.
When upgrading from NovaSeq Integration v2.2.0 to v2.5.0, a conflict is expected during the workflow import with the Flowcell Type analyte custom field due to SP being added to the list of preset values. You can manually add SP as one of the values of the Flowcell Type custom field after you import the workflow.
Update Channel Name
The configure_extensions_novaseq.sh script prompts for a channel name and sets the name on the following preconfigured automations:
Prepare Files for NovaSeq (configured on Dilute and Denature (NovaSeq 6000 v2.3) and Load to Flowcell (NovaSeq 6000 v2.3) steps)
Read InterOp Metrics (configured on AUTOMATED NovaSeq Run (NovaSeq 6000 v2.3) step)
If you need to change the channel name, you must update it manually on each automation. The import script does not update this setting.
Configure the Database Service Properties
To enable capture and generation of files associated with the sequencing run, configure the properties using the omxProps-ConfigTool utility.
Install Sequencing Service RPM on Remote Server
On the applicable server, log in as the root user.
The configure.sh script is provided by BaseSpaceLIMS-AutomationWorker.
Start the Sequencing Service
Run the following command to start the sequencing service:
systemctl start novaseq_seqservice
Configurable Database Properties
This version of the integration only supports a single active search-replace pair of each type. The sampleSheetPathPrefixSearchReplaceSuffixes and netPathPrefixSearchReplaceSuffixes properties must contain a single number instead of a comma-separated list of numbers.
The sample sheet, run recipe, and run output folders can all be configured independently and do not have to be on the same mount or share. For examples of valid configuration mount points and properties, refer to Example Network Folder Structure and Database Properties.
Workflows, Protocols, and Steps Installed
NovaSeq 6000 v2.3 Workflow
Protocol 1: Run Format (NovaSeq 6000 v2.3)
Steps:
Define Run Format (NovaSeq 6000 v2.3)
Protocol 2: NovaSeq Standard (NovaSeq 6000 v2.3)
Steps:
Make Bulk Pool for NovaSeq Standard (NovaSeq 6000 v2.3)
Dilute and Denature (NovaSeq 6000 v2.3)
Protocol 3: NovaSeq Xp (NovaSeq 6000 v2.3)
Steps:
Make Bulk Pool for NovaSeq Xp (NovaSeq 6000 v2.3)
Dilute, Denature & ExAmp (NovaSeq 6000 v2.3)
Load to Flowcell (NovaSeq 6000 v2.3)
Protocol 4: AUTOMATED - NovaSeq Run (NovaSeq 6000 v2.3)
For detailed descriptions of the steps and automations included in each protocol, and details on other components in the configuration, refer to NovaSeq 6000 Integration v2.5.0 Configuration.
The NovaSeq instrument software includes the following components:
NovaSeq Control Software (NVCS) — Contains the user interface for setting up the sequencing run. Responsible for controlling the instrument and acquiring the images.
Real-Time Analysis v3 (RTA3) — Takes the images generated by the first module, processes, and analyzes them. Makes sure that data files are created and copied to the final destination folder.
Sequencing Analysis Viewer (SAV) — Displays the important quality metrics generated by the RTA3 software.
NovaSeq recipes — Provides system operation instructions for use with NovaSeq 6000 reagent kits for SP, S1, S2, and S4 flow cells.
Universal Copy Service — Copies output files to destinations such as final destination folder and/or BaseSpace Sequence Hub (when the instrument is configured for use with BaseSpace Sequence Hub).
Instrument Integration Configuration
For the NovaSeq API integration to work, the NovaSeq instrument must be able to communicate with Clarity LIMS through the API. Complete the following steps to configure the NVCS and confirm that you can access Clarity LIMS from the instrument.
Launch NVCS and wait for the initialization process to complete.
On the NVCS Settings page, do the following:
Under Mode Selection, select File-Based.
In the adjacent field, enter the network folder location that NVCS retrieves the JSON recipe from.
In the Output Folder field, enter the default output folder location.
Select Save to complete the configuration.
On the NVCS home page, you can now select Sequence to log into Clarity LIMS and start a run.
Example Network Folder Structure and Database Properties
Instrument Control Software Configuration
Refer to the following examples for the configuration of the instrument control software:
The following examples assume the same configuration and mount points from the previous sections.
After completing the Dilute and Denature (NovaSeq 6000 v2.3) or Load to Flowcell (NovaSeq 6000 v2.3) step, an automated script copies the run recipe JSON file and sample sheet to the appropriate network locations. For example, a library tube or flow cell with the barcode AA12345-NNN would have the following directory content:
When the NovaSeq run starts, the instrument control software creates a new run folder named after the run ID. Then, the software copies the run recipe JSON file to a Clarity LIMS subfolder inside it. For example, a run ID of 170301_A001_AHHCC would have the following directory content:
The NovaSeq 6000 sequencing service detects the run folder based on the presence of the three files inside it and starts the AUTOMATED NovaSeq Run (NovaSeq 6000 v2.3) step automatically.
After the sequencing run has completed, the Universal Copy Service copies the run data and InterOp files to the network drive. Then, the service creates a CopyComplete.txt file in the run folder. At this point, the directory would have the following content:
The NovaSeq 6000 sequencing service detects the completed sequencing run based on the presence of the CopyComplete.txt file, executes InterOp metrics parsing for the run, and automatically completes the AUTOMATED - NovaSeq Run (NovaSeq 6000 v2.3) step.
You must include the --enablerepo command line argument. Illumina Support provides the repo file and appropriate name to use.
For the on-premise installation, the Illumina Vault server is not available for public access.
If you are running the sequencing service and Automation Worker on the same instance, make sure that the instance is running a compatible version of Oracle Linux. For compatibility, refer to NovaSeq 6000 Integration v2.5.0 Release Notes.
You must include the --enablerepo command line argument. Illumina Support provides the repo file and appropriate name to use.
The configuration provided with this integration has been established to support NovaSeq 6000 lab processes. Any configuration changes to protocols or workflows, including renaming protocols, steps, and fields, could break processes.