Attention: Confluence is not suitable for the storage of highly confidential data. Please ensure that any data classified as Highly Protected is stored using a more secure platform.
If you have any questions, please refer to the University's data classification guide or contact ict.askcyber@sydney.edu.au

Skip to end of banner
Go to start of banner

Trinity

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Current »

Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, performs de novo reconstruction of transcriptomes from RNA-seq data. If you run this on Artemis, you will achieve best performance if you specify the --workdir option to use local node storage instead of /project, /scratch, or /home. This is because large assemblies generate millions of small files that degrade the performance of Artemis's Lustre filesystem. Node local storage performs much better than Lustre for this workload.

The way to tell Trinity to write to node local storage is to use the PBS variable "$TMPDIR". $TMPDIR contains the path to the compute node's local node storage. You need to additionally specify an working directory with the word "trinity" in it for the job to run. To set up the Trinity working directory, add the following two lines to your PBS script before you run Trinity:

export TRINITY_WORKDIR=${TMPDIR}/trinity_workdir
mkdir -p ${TRINITY_WORKDIR}

You can then add the option --workdir ${TRINITY_WORKDIR} to your Trinity command. An example PBS script to run the sample job provided with Trinity version 2.6.6 is provided below. In this job, I copied /usr/local/trinity/2.6.6/sample_data/test_Trinity_Assembly to /project/Project/abcd1234/trinity/input/test_Trinity_Assembly before submitting the job.

#!/bin/bash
#PBS -P Project
#PBS -l select=1:ncpus=4:mem=16gb
#PBS -l walltime=1:00:00
#PBS -j oe

module load python/2.7.13
module load bowtie2/2.3.3.1
module load samtools/1.8
module load jellyfish/2.2.6
module load salmon/0.11.0
module load trinity/2.6.6

export TRINITY_WORKDIR=${TMPDIR}/trinity_workdir
export OUTDIR=/project/Project/abcd1234/trinity/trinity_output
mkdir -p ${TRINITY_WORKDIR}

cd /project/Project/abcd1234/trinity/input/test_Trinity_Assembly

#######################################################
##  Run Trinity to Generate Transcriptome Assemblies ##
#######################################################

Trinity --seqType fq --max_memory 4G \
              --left reads.left.fq.gz \
              --right reads.right.fq.gz \
              --output ${OUTDIR} \
              --workdir ${TRINITY_WORKDIR} \
              --SS_lib_type RF \
              --CPU 4

  • No labels