Ansys Learning Forum Forums Discuss Simulation Fluids Running ANSYS Fluent on a HPC Cluster Reply To: Running ANSYS Fluent on a HPC Cluster

michaelsalas
Subscriber

launch process still times out 

here is the batch script:

#!/bin/bash
#SBATCH -J ansysjob              # job name
#SBATCH -e ansysjob.%j.err       # error file name
#SBATCH -o ansysjob.%j.out       # output file name
#SBATCH -N 2                     # request 2 nodes
#SBATCH -n 256                   # request 128 cores
#SBATCH -t 0:20:00               # designate max run time
#SBATCH -A DDM23001              # charge job to myproject
#SBATCH -p development           # designate queue

# Load necessary modules
module load python3/3.9.7
module load ansys

srun hostname -s | uniq -c | sort -k2 -V | awk '{printf("%s:%d\n",$2,$1)}' > hosts.$SLURM_JOB_ID

# Wait for finishing

wait

# set library path for Fluent shared libraries
export LD_LIBRARY_PATH=/scratch/tacc/apps/ANSYS/2023R2/v232/fluent/lib/lnamd64:$LD_LIBRARY_PATH

# give permissions to the pyfluent script
chmod 700 /scratch/10223/mjs7392/jcat/intake_script.py

# pre-create blank output file
touch /scratch/10223/mjs7392/jcat/fluent_output.log

# change to the directory where the Slurm job was submitted
cd $SLURM_SUBMIT_DIR

# run Fluent
/scratch/tacc/apps/ANSYS/2023R2/v232/fluent/bin/fluent 3ddp -g -mpi=openmpi -pib -cnf=hosts.$SLURM_JOB_ID -t $SLURM_NTASKS -driver > /scratch/10223/mjs7392/jcat/fluent_output.log 2>&1 &

wait

# run the script
python3 /scratch/10223/mjs7392/jcat/intake_script.py


here is the error:

Warning:  DISPLAY environment variable is not set.
  Graphics and GUI will not operate correctly
  without this being set properly.
Warning:  DISPLAY environment variable is not set.
  Graphics and GUI will not operate correctly
  without this being set properly.
pyfluent.launcher ERROR: Exception caught - TimeoutError: The launch process has timed out.
Traceback (most recent call last):
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/launcher/standalone_launcher.py", line 253, in __call__
    raise ex
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/launcher/standalone_launcher.py", line 233, in __call__
    _await_fluent_launch(
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/launcher/launcher_utils.py", line 59, in _await_fluent_launch
    raise TimeoutError("The launch process has timed out.")
TimeoutError: The launch process has timed out.

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/scratch/10223/mjs7392/jcat/intake_script.py", line 17, in
    solver = pyfluent.launch_fluent(
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/utils/deprecate.py", line 49, in wrapper
    return func(*args, **kwargs)
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/utils/deprecate.py", line 49, in wrapper
    return func(*args, **kwargs)
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/launcher/launcher.py", line 285, in launch_fluent
    return launcher()
  File "/home1/10223/mjs7392/.local/lib/python3.9/site-packages/ansys/fluent/core/launcher/standalone_launcher.py", line 296, in __call__
    raise LaunchFluentError(self._launch_cmd) from ex
ansys.fluent.core.launcher.error_handler.LaunchFluentError: 
Fluent Launch string: nohup /scratch/tacc/apps/ANSYS/2023R2/v232/fluent/bin/fluent 3ddp -t256 -cnf=c306-005:128,c306-006:128 -gu -sifile=/tmp/serverinfo-0g7hxku6.txt -nm &

the pyfluent script is the same for both batch filesÂ